CADEX

Cadex Solutions Corporation is a holding company formed by Trivest Partners LP to build the premier provider of commercial order-to-cash management solutions. With a history spanning nearly 100 years, Cadex is uniquely positioned with in-depth experience that builds relationships alongside results. Our team of industry experts brings innovation and data insight, improves your processes with hands-on help, and provides custom solutions based on specific needs.

Cadex has approximately 800 employees serving over 1,000 clients across all industries from locations including the United States, Colombia, Brazil, Romania, Italy, India, Singapore, and South Africa. Since 2019, Cadex has been putting together a strong portfolio of ARM companies, including:

- A.G. Adjustments, formed in 1974 and headquartered in Melville, NY
- D&S Global Solutions, formed in 1997 and fully remote
- ABC-Amega, formed in 1929 and headquartered in Buffalo, NY
- TranSubro, formed in 2012 and headquartered in Oceanside, NY
- DAL, formed in 1974 and headquartered in Clifton Heights, PA
- RCC, formed in 1970 and headquartered in Maple Grove, MN
- IRG, formed in 1997 and headquartered in Marlborough, MA

Site Reliability Engineer

DevOps Engineer | Full Time | Remote | Mid Level | Team 51-200

Location

Romania

Posted

3 days ago

Salary

Not specified

Seniority

Mid Level

Job Description

Role Description

We are looking for a mid‑level Site Reliability Engineer focused on GCP to help us transition from a traditional IT Support model to a modern SRE operating model. You will design and implement our GCP‑based platform (GKE, Terraform, Prometheus, Grafana, GCP Operations Suite) and act as a hands‑on guide for our existing team as we adopt SRE ways of working, with a strong focus on automation and tooling in Python.

Responsibilities:

- Maintain GCP infrastructure using Terraform, including GKE clusters, Compute Engine, Cloud Storage, Cloud SQL or other managed databases, VPC networking, load balancers, and Cloud DNS.
- Manage and operate Kubernetes workloads on GKE: deployments, services, ingresses, autoscaling, configuration, secrets and cluster upgrades.
- Participate in on‑call rotations for GCP services and lead or assist in incident response.
- Design and maintain observability for GKE and GCP workloads using Prometheus for metrics collection and Grafana for dashboards and visualization.
- Provide advanced production support for business‑critical applications (web and backend services), investigating incidents, performance issues and functional degradations together with development teams.
- Use metrics, logs, traces and error reports to triage and debug application issues across multiple services and components.
- Maintain and improve runbooks, playbooks and knowledge base articles so recurring production issues can be resolved quickly and consistently.
- Analyze incident and ticket trends to propose reliability improvements, automation and changes to application configuration or architecture.
- Define and implement SLIs and SLOs based on Prometheus metrics and GCP Operations Suite (Cloud Monitoring/Logging) and configure alerts (in Prometheus Alertmanager, Grafana, or Cloud Monitoring) that focus on real customer impact.
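As a rough illustration of the SLI/SLO responsibility above, the following Python sketch compares an availability SLI with an SLO target and reports how much of the error budget is left. It is only a sketch under stated assumptions: the names `SloReport` and `evaluate_slo`, the 99.9% target, and the request counts are hypothetical and do not come from CADEX. In practice the counts would be read from Prometheus or Cloud Monitoring rather than passed in by hand.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SloReport:
    """Result of comparing an observed SLI with an SLO target (hypothetical type)."""
    sli: float               # observed fraction of good requests
    budget_remaining: float  # fraction of the error budget still unspent
    exhausted: bool          # True once the error budget is fully spent


def evaluate_slo(good: int, total: int, target: float = 0.999) -> SloReport:
    """Compute an availability SLI and the remaining error budget.

    `good` and `total` are request counts over the SLO window; `target`
    is the SLO as a fraction (0.999 is an assumed example value).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= good <= total:
        raise ValueError("good must be between 0 and total")
    if not 0.0 < target < 1.0:
        raise ValueError("target must be strictly between 0 and 1")

    sli = good / total
    allowed_bad = (1.0 - target) * total  # failures the SLO tolerates
    actual_bad = total - good             # failures actually observed
    budget_remaining = 1.0 - actual_bad / allowed_bad
    return SloReport(sli, budget_remaining, budget_remaining <= 0.0)


if __name__ == "__main__":
    # Made-up numbers: 50 failed requests out of 100,000.
    print(evaluate_slo(good=99_950, total=100_000, target=0.999))
```

An alert tied to `exhausted` (or to a low `budget_remaining`) is one way to keep paging focused on customer impact rather than on individual resource thresholds.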
Qualifications

- 2–5 years of experience in SRE, DevOps or platform engineering operating production systems, with strong exposure to GCP.
- Solid experience with GKE and containerized applications (deployment strategies, scaling, troubleshooting) in production.
- Strong Infrastructure‑as‑Code skills with Terraform for provisioning GCP resources (projects, networks, IAM, GKE, databases, etc.).
- Experience with Prometheus and Grafana, including:
  - setting up metrics collection (exporters, scraping configs) for applications and infrastructure;
  - building and maintaining Grafana dashboards for services, platforms, and SLOs;
  - configuring alerts (Alertmanager/Grafana/Cloud Monitoring) with appropriate thresholds and routing.
- Good knowledge of Linux and Docker, including debugging performance, networking and security issues.
- Familiarity with GCP Operations Suite (Cloud Monitoring/Logging) and how to combine it with Prometheus/Grafana for a complete observability story.
- Understanding of GCP security basics: IAM, service accounts, least‑privilege, network security and Secret Manager.
- Experience supporting production applications (web or backend services), including debugging issues across logs, metrics, traces and application‑level errors.
- Mentoring and coaching mindset: enjoys guiding colleagues through new tools and practices.

Schedule

16:00-00:50 Romania time

More DevOps Engineer Jobs

Site Reliability Lead Engineer

Mastercard

Founded in 1966, Mastercard is a worldwide transaction, payment-processing, and consulting company best known for its line of personal and business credit cards. As an employer, Ma

DevOps Engineer | 3 days ago
Full Time | Remote | Team 38,800 | Since 1966

Our Purpose

Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.

Title and Summary

Site Reliability Lead Engineer

Who is Mastercard?

At Mastercard technology, we work to connect and power an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team, one that makes better decisions, drives innovation, and delivers better business results.

Technology at Mastercard

What we create today will define tomorrow. Revolutionary technologies that reshape the digital economy to be more connected and inclusive than ever before. Safer, faster, more sustainable. And we need the best people to do it. Technologists who are energized by the challenges of a truly global network. With the talent and vision to create the critical systems and products that power global commerce and connect people everywhere to the vital goods and services they need every day. Working at Mastercard means being part of a unique culture. Inclusive and diverse, a rich collaboration of ideas and perspectives. A place that celebrates your strengths, values your experiences, and offers you the flexibility to shape a career across disciplines and continents. And the opportunity to work alongside experts and leaders at every level of the business, improving what exists, and inventing what's next.

About the Role

The Business Operations (Biz Ops) team is seeking a Business Operations Site Reliability Engineer (SRE). The role of the Business Operations organization is to be the production readiness steward for Mastercard products. As Business Operations SREs, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to running our products by fostering developer-run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software run principles, including operational design, automation, capacity planning, and monitoring, that lead to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture.

All about you

We are seeking a highly motivated and experienced Sr./Lead/Principal Site Reliability Engineer (SRE) to join our growing team. You will play a critical role in ensuring the reliability, scalability, and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation, and the ability to mentor. We support daily operations with a sharp focus on triage and root cause analysis, understanding the business impact of our products and then performing blameless post-mortems.

The goal of every Business Operations team is to engage early in the development lifecycle, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.

Corporate Security Responsibility

All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization. Every person working for, or on behalf of, Mastercard is therefore responsible for information security and must:

- Abide by Mastercard's security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach; and
- Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.

Ireland
Full Time | Remote | Team 1-10 | H1B No Sponsor

• Support the full release lifecycle (intake → validation → release → post-release tracking)
• Validate release requests, including dependencies, readiness, and required inputs prior to submission
• Coordinate timelines, milestones, and deliverables across stakeholders
• Track release status, risks, and blockers and drive resolution to ensure on-time delivery

Washington
$100K - $110K / year

DevOps Engineer

Remote Recruitment

Remote Recruitment operates as a full-service employment agency providing recruitment and staffing for UK-based companies.

DevOps Engineer | 3 days ago
Full Time | Remote | Team 11-50 | H1B No Sponsor

• Design, build, and maintain CI/CD pipelines to support fast and reliable software delivery
• Manage cloud infrastructure on AWS or Azure using infrastructure-as-code
• Implement monitoring, alerting, and observability tools across all environments
• Collaborate with developers to improve build, test, and deployment processes
• Maintain security best practices across infrastructure and pipeline configurations

South Africa
R35K / month

DevOps Engineer

Leap Tools

Leap Tools is an equal opportunity employer committed to fostering an inclusive, equitable, and accessible environment. Accommodations are available on request for candidates taking part in all aspects of the interview process. If you require any accommodation, please contact us at ta@leaptools.com.

DevOps Engineer | 3 days ago
Full Time | Remote | Team 201-500

Role Description

At Leap Tools, we are building the world's most advanced solutions for the interior décor industry. Our technology lets you preview products in your own room before you buy them. You’ll be responsible for a variety of development-related automation tasks that involve:

- Smooth operation of state-of-the-art production systems used by millions of users
- Engineering tools (e.g. process and work tracking)
- CI/CD infrastructure and release automation
- Testing infrastructure (various environments and stages)
- Investigation and resolution of scalability bottlenecks and production incidents
- Communicating and sharing knowledge with peers, QA Engineers, and Developers/Engineers

Qualifications

- Strong computer science fundamentals based on a degree in computer science or distinctive work experience in software development
- Experience with Kubernetes, AWS, or GCP in a cloud environment
- Shell scripting prowess in a Linux environment
- Development track record in at least one of the following languages: Python, JavaScript, TypeScript, Java, C and/or C++
- Ability to develop foundational engineering infrastructure to be used across the entire company
- A demonstrated ability to provide guidance, mentorship, and support
- Exceptional attention to detail and focus on quality
- Strong communication skills for capturing requirements, as well as sharing designs and progress

Requirements

- Comfortable maintaining a personal Linux box and customizing it
- Ability to set up complex systems that work flawlessly
- Open-mindedness to listen and discover challenges

Benefits

- Remote-first work environment
- Work anywhere in the world for up to 3 months
- Parental leave program
- Work-from-home stipend
- Your birthday (and our company's birthday) is a day off!

Worldwide
C$65K - C$115K / year