Job Closed
This listing is no longer active.
FICO, also known as Fair Isaac Corporation, is one of the world’s leading credit history and financial analysis organizations. It was founded in 1956 on the i
Senior Engineer – DevOps, DataOps
Location
United States
Posted
95 days ago
Salary
$119K - $187K / year
Seniority
Senior
Job Description
FICO - Fair Isaac Corporation
- Design, build, and maintain scalable, resilient data and ML pipelines, infrastructure, and workflows using tools such as GitHub Actions, ArgoCD, Crossplane, Terraform, Helm, and others.
- Automate infrastructure provisioning and configuration management using cloud-native services (preferably AWS) with tools like Terraform, CloudFormation, or Crossplane.
- Design, containerize, and manage Kubernetes (EKS) clusters and/or ECS environments in AWS.
- Collaborate with development teams to optimize performance, deployment, and cost.
- Partner with DevOps and SRE teams to ensure high availability, observability, scalability, and security of the data and ML infrastructure.
- Work closely with Data Scientists and ML Engineers to operationalize machine learning models, including building CI/CD pipelines for model training, validation, and deployment.
- Implement observability for data pipelines and ML services using tools like Prometheus, Grafana, Datadog, or similar.
- Develop and maintain automated pipelines for model retraining, monitoring drift, and versioning in production.
- Support experimentation and prototyping in areas such as Machine Learning and Generative AI, transitioning successful prototypes into production systems.
- Ensure cloud infrastructure is secure, compliant, and cost-efficient, following best practices in governance, identity, and access management.
Job Requirements
- 7+ years of experience in DataOps, MLOps, or related fields, with at least 2 years focused on ML model operationalization and workflow automation.
- Proficient in AWS services including EC2, S3, IAM, ACM, Route 53, CloudWatch, EKS, and ECS.
- Experience with infrastructure as code (IaC) tools such as Terraform, CloudFormation, and Helm.
- Familiarity with CI/CD for ML pipelines, GitOps practices, and tools like GitHub Actions, Jenkins, or Argo Workflows.
- Strong scripting and automation skills using Python or GitHub workflows.
- Understanding of observability and monitoring tools (e.g., Prometheus, Grafana, Datadog, or OpenTelemetry).
- Solid understanding of security best practices for cloud and Kubernetes environments, including secrets management, identity & access control, and policy enforcement.
- Familiarity with data governance, lineage, and metadata management is a plus.
- Excellent collaboration and communication skills, with a proven ability to work effectively in cross-functional, globally distributed teams.
- A bachelor’s degree in computer science, engineering, or a related discipline, or equivalent hands-on industry experience.
Benefits
- An inclusive culture strongly reflecting our core values: Act Like an Owner, Delight Our Customers and Earn the Respect of Others.
- The opportunity to make an impact and develop professionally by leveraging your unique strengths and participating in valuable learning experiences.
- Highly competitive compensation, benefits and rewards programs that encourage you to bring your best every day and be recognized for doing so.
- An engaging, people-first work environment offering work/life balance, employee resource groups, and social events to promote interaction and camaraderie.
More DevOps Engineer Jobs
- Looking for a high energy, enthusiastic DevOps/Toolchain Engineer for automotive environments to join the Tools group to develop and maintain tools and systems for our organization.
- General maintenance and support
- Administer, configure and setup new projects in ALM tool
- Administer, configure, setup new users in problem management tool
- Manage tool upgrades, improvements, and vendor-released fixes
- Administer and configure license servers
- Troubleshoot tool issues, train & mentor users, etc.
- Facilitate workshops between development teams and 3rd party vendors
- Support tool testing and verification; support tools deployment
- Aid with other activities such as exit checklist, license tracking, and managing tool user groups, etc.
- Support tool proof-of-concepts (POCs) and R&Ds
- Establish strong relationship between process and development teams across multiple departments to ensure alignment with tools and process usage
Senior Site Reliability Engineer
Hopper
Hopper is an accredited, mobile-only travel agency using big data to analyze and predict airfare and accommodations. A fully remote employer, Hopper strives to give every member of
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.
Role Description
We are looking for a senior site reliability engineer to join the Cloud FinOps team at Hopper. We manage a large infrastructure in Google Cloud that is used by hundreds of engineers to provide a first class experience to millions of end users around the world. You are passionate about automating everything possible and ensuring systems remain optimized. You also like the infrastructure to be as scalable, reliable, secure, and optimized as possible. You like to solve problems in a practical way, building solutions that are simple, reliable, cost-effective, and easy to use.
What would your day-to-day look like:
- Work on projects that will drive a higher cost efficiency, such as:
  - Reduce our network egress costs by removing unnecessary headers.
  - Ensure that our warehouse data is in use and select the most efficient storage for it (e.g., cold storage for buckets with infrequent retrieval).
  - Ensure that autoscaling for both databases and compute is well optimized.
- Work on improving the current cost attribution to ensure all teams have clear visibility into their costs.
- Participate in providing support to incidents and be part of on-call rotation for platform incidents.
- Contribute to solving doubts and problems engineers might face with our infrastructure and approving PRs that require Platform supervision.
- Be part of a small and highly efficient team of SREs.
Qualifications
- Strong background in SRE, DevOps, Software Engineering or Systems engineering
- Troubleshooting skills
- System design with good analytical capabilities
- Good communication skills
- Knowledge of major cloud providers, preferably Google Cloud
- SQL knowledge
- Containers, Kubernetes, and related tooling like Kustomize and Helm
- Service Mesh, preferably with Istio
- Networking knowledge (DNS, TLS, certificates, ingresses, etc.)
- Observability with log collection, metrics, APM, etc., preferably Datadog
- Security knowledge (IAM, RBAC, network security, etc.)
- Knowledge on authentication and authorization technologies
- CI/CD
- Database technologies
- Competent in scripting with Bash and Python or other scripting languages
Benefits
- Well-funded and proven startup with large ambitions, competitive salary and the upsides of pre-IPO equity packages.
- Unlimited PTO.
- Carrot Cash travel stipend.
- Access to co-working space on demand through FlexDesk AND Work-from-home stipend.
- Very generous parental leave, much above industry standards.
- Entrepreneurial culture where pushing limits and taking risks is everyday business.
- Open communication with management and company leadership.
- Small, dynamic teams = massive impact.
- 100% employer paid Medical, Dental and Vision coverage for employees.
- Access to Disability & Life insurance.
- Health Reimbursement Account (HRA).
- DCA/ FSA and access to 401k plan.
Company Description
At Hopper, we are on a mission to become the leading travel platform globally – powering Hopper’s mobile app, website and our B2B business, HTS (Hopper Technology Solutions). By leveraging massive amounts of data and advanced machine learning algorithms, Hopper combines its world-class travel agency offering with proprietary fintech products to bring transparency, flexibility and savings to travelers globally.
- The Hopper platform serves hundreds of millions of travelers globally and continues to capture market share around the world.
- The Hopper app has been downloaded over 120 million times and has become largely popular among younger travelers – with 70% of its users being Gen Z and millennials.
- Hopper has been named the #1 most innovative company in travel by Fast Company.
- Hopper has raised over $750 million USD of private capital and is backed by some of the largest institutional investors and banks in the world.
DevOps Platform Architect
Carrier
Carrier Global Corporation, global leader in intelligent climate and energy solutions, is committed to creating innovations that bring comfort, safety and sustainability to life. Through cutting-edge advancements in climate solutions such as temperature control, air quality and transportation, we improve lives, empower critical industries and ensure the safe transport of food, life-saving medicines and more. Since inventing modern air conditioning in 1902, we lead with purpose: enhancing the lives we live and the world we share. We continue to lead because of our world-class, inclusive workforce that puts the customer at the center of everything we do.
About Carrier:
Carrier, global leader in intelligent climate and energy solutions, is committed to creating innovations that bring comfort, safety and sustainability to life. Through cutting-edge advancements in climate solutions such as temperature control, air quality and transportation, we improve lives, empower critical industries and ensure the safe transport of food, life-saving medicines and more. Since inventing modern air conditioning in 1902, we lead with purpose: enhancing the lives we live and the world we share. We continue to lead because of our world-class, inclusive workforce that puts the customer at the center of everything we do. For more information, visit corporate.carrier.com or follow Carrier on social media at @Carrier.
About This Role:
The DevOps Platform Architect will lead the design, delivery, and ongoing evolution of the organization's DevOps platform capabilities. This is a hands-on technical leadership role responsible for building the foundational CI/CD infrastructure that engineering teams across the organization depend on to deliver software reliably and at scale.
The primary focus of this role is GitHub Enterprise platform ownership and the development of a centralized, reusable CI/CD pipeline framework that serves multiple internal customer teams. The successful candidate will establish standards, tooling, and best practices that enable teams to adopt consistent software delivery patterns without building pipelines from scratch, reducing duplication, improving security posture, and accelerating time to delivery.
This individual will integrate CI/CD pipelines with Infrastructure as Code tooling, cloud-native deployment patterns, and security scanning capabilities, ensuring that delivery pipelines are secure, observable, and aligned with enterprise compliance requirements. They will work closely with Cloud Engineering, Security, and application development teams to embed DevOps practices across the platform lifecycle.
The role requires a self-motivated technical leader with strong problem-solving ability, excellent communication skills, and a demonstrated track record of building and scaling DevOps platforms in complex enterprise environments. The individual must be equally comfortable leading a team, engaging with stakeholders, and getting hands-on with platform tooling and automation.
Key Responsibilities:
GitHub Enterprise Platform Ownership
- Own the architecture, administration, and roadmap for GitHub Enterprise across the organization.
- Define and enforce GitHub governance policies, including branch protection rules, code review standards, secret scanning, and access control models.
- Establish and maintain GitHub Actions runner infrastructure, including self-hosted runners integrated with enterprise cloud environments.
- Drive adoption of GitHub Advanced Security (GHAS) capabilities, including code scanning, Dependabot, and secret detection.
- Develop and maintain a centralized GitHub Actions workflow library available to application and platform teams.
CI/CD Platform Engineering
- Design and deliver a centralized, reusable CI/CD pipeline framework enabling consistent software delivery practices across multiple internal customer teams.
- Build standardized pipeline templates covering build, test, security scanning, artifact management, and multi-environment deployment stages.
- Establish CI/CD best practices and developer experience standards, including pipeline-as-code patterns, shift-left testing, and automated quality gates.
- Define and implement deployment strategies (blue/green, canary, rolling) across cloud-native and hybrid workloads.
- Develop pipeline observability capabilities, including build metrics, deployment frequency tracking, and DORA metric dashboards.
Platform & Infrastructure Integration
- Lead CI/CD capability development for current and future internal platform products and services.
- Integrate CI/CD pipelines with Infrastructure-as-Code tooling to enable automated infrastructure provisioning and drift detection.
- Collaborate with cloud platform teams to embed DevOps pipelines into cloud foundation patterns across AWS, Azure, and GCP.
- Champion automation across the platform lifecycle, reducing toil and enabling self-service for engineering teams.
Technical Leadership & Team Management
- Lead and mentor the DevOps Platform team, providing technical guidance, career development, and delivery accountability.
- Set team standards for code quality, documentation, security, and operational readiness of DevOps tooling.
- Collaborate cross-functionally with engineering, security, and application development teams.
- Drive agile planning, backlog management, and sprint execution for the DevOps platform team.
- Act as an internal subject matter expert and advocate for DevOps, CI/CD, and platform engineering practices across the organization.
Governance, Security & Compliance
- Ensure all CI/CD pipelines and platform tooling adhere to organizational security and compliance requirements.
- Partner with Security teams to integrate scanning (SAST, DAST, SCA, secrets detection) natively into delivery pipelines.
- Define pipeline audit logging standards and ensure traceability of deployments across environments.
Required Qualifications:
- Bachelor's degree with 8+ years of experience in DevOps, platform engineering, or software delivery.
OR
- High School Diploma/GED with 10+ years experience in DevOps, platform engineering, or software delivery.
- 5+ years of hands-on experience with GitHub Enterprise, including administration, GitHub Actions, and GitHub Advanced Security.
Preferred Qualifications:
- Demonstrated expertise in designing and operating centralized CI/CD pipeline platforms serving multiple internal teams or customers.
- Strong proficiency with Infrastructure as Code tools such as Terraform, with experience integrating IaC into delivery pipelines.
- Deep knowledge of containerization (Docker) and Kubernetes-based deployment patterns across major cloud providers (AWS, Azure, GCP).
- Experience building and managing self-hosted GitHub Actions runner fleets integrated with enterprise cloud environments.
- Proficiency with scripting and automation languages such as Python, Bash, and/or Go.
- Working knowledge of artifact management platforms (e.g., JFrog Artifactory, AWS ECR, GCP Artifact Registry).
- Familiarity with software supply chain security concepts (SBOM, artifact signing, SLSA framework).
- Strong communication skills with the ability to translate complex technical concepts for a range of audiences.
- Experience with IaC orchestration platforms such as Spacelift, Atlantis, or similar tools.
- Experience with platform observability and developer experience tooling (e.g., Grafana, Datadog, Backstage).
- Knowledge of DORA metrics and platform engineering KPIs; experience building delivery performance dashboards.
- Experience operating in large-scale, multi-cloud enterprise environments.
- Background in regulated industries with experience navigating enterprise change management and compliance requirements.
Benefits:
Employees are eligible for benefits, including:
- Health Care benefits: Medical, Dental, Vision; wellness incentives
- Retirement benefits
- Time Off and Leave: Paid vacation days, up to 15 days; paid sick days, up to 5 days; paid personal leave, up to 5 days; paid holidays, up to 13 days; birth and adoption leave; parental leave; family and medical leave; bereavement leave; jury duty; military leave; purchased vacation
- Disability: Short-term and long-term disability
- Life Insurance and Accidental Death and Dismemberment
- Tax-Advantaged Accounts: Health Savings Account; Healthcare Spending Account; Dependent Care Spending Account
- Tuition Assistance
To learn more about our benefits offering, please click here: Work With Us | Carrier Corporate
The specific benefits available to any employee may vary depending on state and local laws and eligibility factors, such as date of hire and the applicability of collective bargaining agreements. This position is entitled to short-term cash incentives, subject to plan requirements.
Pay Range:
The annual salary for this position is $146,750–$205,250. Factors which may affect pay within this range include, but are not limited to, skills, education, experience, and other unique qualifications of the successful candidate.
Applications will be accepted for at least 3 days from Job Posting Date.
Job Posting Date: 03/05/2026
Carrier is An Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status, age or any other federally protected class.
Job Applicant's Privacy Notice: Click on this link to read the Job Applicant's Privacy Notice
Senior Site Reliability Engineer
Centene Corporation
Transforming the health of the communities we serve, one person at a time.
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. As a diversified, national organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility.
Position Purpose:
Helps lead projects that are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes, and continuous delivery (CI/CD) tools, processes, and designs. Develops complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents. Understands and advocates for standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams throughout the deployment process. Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability.
- Troubleshoots and resolves more complex problems with systems and services and initiates regular deployment of new versions of the systems and their subcomponents
- Leads more complex projects focused on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility
- Helps make decisions around periodic system validation and testing, service monitoring, and standing up new services/tools
- Uses knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization
- Identifies and implements necessary manual and automated procedures for improved collaborative response in real-time
- Leads lower level Engineers in stress, security, and performance testing
- Resolves issues that come up through support escalation
- Keeps documentation and runbooks up to date to effectively deal with new incidents that might arise
- Leads post incident reviews and documents findings for future informed decision making
- Reviews proposals to optimize Software Development Life Cycle (SDLC) to boost service reliability and makes decisions around which proposals should move forward
- Communicates complex topics with development teams to investigate and document issues and leads internal team to develop solutions to mitigate them
- Performs other duties as assigned
- Complies with all policies and standards
Education/Experience:
A Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science) and 4 – 6 years of related experience, or equivalent experience acquired through accomplishments of applicable knowledge, duties, scope and skill reflective of the level of this position.
Technical Skills:
- Experience with Linux Operating System; Operating Systems; Unix Operating System; Windows Operating System
- Experience with observability/monitoring tools such as Splunk, Dynatrace, Elastic, New Relic, Prometheus, Grafana
- Experience with enterprise level CICD Tools such as Ansible, Jenkins, Cloudbees, OpenShift
- Experience with working in public cloud platforms like AWS and Azure
- Experience with Programming Tools
- Experience with building and operating highly scaled applications
- Experience with MongoDB; MySQL; Oracle Database Management System (DBMS); PL SQL; SQL (Programming Language)
- Experience with varying code repositories, auto deployments, branching with tools such as Gitlab, Bitbucket, Subversion
- Experience with IT service management tools such as Service Now, Atlassian, BMC
Soft Skills:
- Intermediate - Seeks to acquire knowledge in area of specialty
- Intermediate - Ability to identify basic problems and procedural irregularities, collect data, establish facts, and draw valid conclusions
- Intermediate - Ability to work independently
- Intermediate - Demonstrated analytical skills
- Intermediate - Demonstrated project management skills
- Intermediate - Demonstrates a high level of accuracy, even under pressure
- Intermediate - Demonstrates excellent judgment and decision making skills
Pay Range: $87,000.00 - $161,300.00 per year
Centene offers a comprehensive benefits package including: competitive pay, health insurance, 401K and stock purchase plans, tuition reimbursement, paid time off plus holidays, and a flexible approach to work with remote, hybrid, field or office work schedules. Actual pay will be adjusted based on an individual's skills, experience, education, and other job-related factors permitted by law, including full-time or part-time status. Total compensation may also include additional forms of incentives. Benefits may be subject to program eligibility.
Centene is an equal opportunity employer that is committed to diversity, and values the ways in which we are different. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other characteristic protected by applicable law. Qualified applicants with arrest or conviction records will be considered in accordance with the LA County Ordinance and the California Fair Chance Act




