Job Closed
This listing is no longer active.
Helping Companies Bring Joy to CX.
Senior DevOps Engineer – Compute Platforms
Location
United States
Posted
92 days ago
Salary
$82.3K - $228.8K / year
Seniority
Senior
Job Description
Senior DevOps Engineer – Compute Platforms
Five9
• Operate and support enterprise compute platforms across hardware, OS, virtualization, and container orchestration layers • Deploy and maintain bare metal server infrastructure for Ubuntu OS with Kubernetes and hypervisors including Openstack & Harvester • Implement and maintain PXE-based provisioning environments leveraging Redfish APIs for large-scale server deployments • Install, patch, and maintain operating systems including Ubuntu and Harvester • Operate and support virtualization and private cloud platforms, including KVM on Ubuntu, OpenStack environments and Harvester HCI • Develop Infrastructure-as-Code using Ansible, Terraform, Helm and Git, with Python/Bash automation. • Implement CI/CD pipelines for infrastructure updates, patching, upgrades, testing, and rollback. • Perform firmware updates, patch management, and hardware health validation • Monitor system performance, capacity, and availability; proactively address reliability risks • Troubleshoot complex cross-stack issues spanning hardware, OS, virtualization, OpenStack, and Kubernetes • Manage to SLAs, KPIs and error budgets • Participate in on-call escalation support for complex platform-related issues • Collaborate globally on change management, documentation, and operational best practices. • Develop and maintain runbooks, operational procedures, and technical documentation
Job Requirements
- 6+ years of experience as a DevOps Engineer, Site Reliability Engineer, or Infrastructure Operations Engineer with a strong focus on compute
- Strong hands-on experience operating bare metal compute environments at scale
- Experience with PXE boot, automated OS provisioning, and server imaging systems
- Practical experience supporting Bare Metal as a Service (BMaaS) platforms leveraging Redfish APIs
- Strong Linux administration skills, especially with Ubuntu
- Operational experience with virtualization and private cloud platforms, including KVM on Ubuntu, OpenStack operations and troubleshooting, Harvester HCI
- Experience deploying and operating production Kubernetes environments
- Expertise with enterprise compute hardware, including Cisco UCS, Dell PowerEdge, Supermicro systems and HPE
- Proficiency with Infrastructure as Code tools (e.g., Terraform, Ansible, or similar)
- Experience building or supporting CI/CD pipelines for infrastructure and platform automation
- Strong scripting skills in Python, Bash, or similar languages
- Strong understanding on SRE functions like toil reduction, error budgets and meeting SLAs
- Proven troubleshooting and root cause analysis skills in complex distributed systems
- Excellent written and verbal communication skills
- Bachelor’s degree in computer science or equivalent professional experience.
Benefits
- Health, dental, and vision coverage, beginning on the first day of employment. Five9 covers 100% of the employee portion of the health, dental and vision coverage and shares a high portion of the dependent cost.
- Short & Long-Term Disability, Basic Life Insurance, and a 401k saving plan with employer matching.
- Access to an innovative mental health support platform that offers personalized care and resources in areas such as: therapy, coaching and self-guided mindfulness exercises for all covered employees and their covered dependents.
- Generous employee stock purchase plan.
- Paid Time Off, Company paid holidays, paid volunteer hours and 12 weeks paid parental leave.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer (Bazel) - US Remote
WorkstateWe believe that every great idea deserves to become reality.
Workstate is seeking a DevOps Engineer with deep experience designing, implementing, and operating deployment pipelines built around Bazel. This role focuses on Bazel as part of a build, packaging, artifact, and release platform, including reproducibility, remote caching/execution, CI/CD integration, dependency management, environment promotion, and operational reliability. In addition to Bazel expertise, strong experience with AWS, Terraform, and modern DevOps practices is highly valued. Residents of the United States with the right to work without need for visa sponsorship are eligible for this role.
Senior Software Engineer, DevOps - SDA
Katalyst Space TechnologiesAt Katalyst, our work on projects involving the U.S. Department of Defense requires adherence to International Traffic in Arms Regulations (ITAR), 22 C.F.R. Parts 120-130, which requires compliance with U.S. export laws before allowing employees to perform certain positions. Currently, our available roles necessitate access to ITAR-controlled information, and as a result, Katalyst would have to ensure any non-US person is authorized access to ITAR information before the commencement of employment. We are committed to equal employment opportunities and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin. *remote work will require an exception from the hiring team
Katalyst Description: We build robotic spacecraft that enable dynamic space operations, creating a future where maneuvering, upgrading, refueling, and exploration are as routine as they are on Earth. Getting to space and overcoming the gravity well is only half the story. What you do there is the future. We are not building the trains. We are building the machines that lay down the tracks. By developing the foundational capabilities that make sustained, responsive operations possible, we enable a new era of space activity that strengthens national security and ensures freedom of action in an increasingly contested domain. Katalyst Culture: Working at Katalyst is intense, hands-on, and deeply rewarding. You’ll take real ownership and see your work move from concept to operations in an environment where the problems are hard and the impact is real. We value humility, craftsmanship, and doing things the right way, even when it’s harder. You’ll work closely with thoughtful, driven teammates who push each other to do their best. What You'll Do - Manage, provision, and help guide architectural decisions to support SDA applications running in AWS - Work with cross-functional teams to help efficiently implement and deploy SDA algorithms - Manage networking infrastructure including VPCs, load balancers, and firewalls in AWS - Lead internal company infrastructure initiatives to improve Developer Experience (DevEx) including efforts like managing common shared resources such as repository managers, secrets managers, etc. - Design and optimize CI/CD pipelines (Gitlab CI) - Monitor spending across different AWS Orgs and provide common tagging strategies to track spending across the company - Establish comprehensive observability solutions and best practices (We use the LGTM stack) - Mentor and help develop other less experienced engineers What We Value: - Strong ownership mentality – you’ll lead technical execution and ensure your features make it from development to production - Accountable – The team knows they’ll be able to depend on you - Curiosity – We are solving the hardest problems in space and want to hear everyone’s voice to ensure we’re always innovating - Leadership – You will be on a small team and will be empowered to help shape the culture and influence technical direction Your Ideal Background: Required: - 3+ years of full-time software engineering experience - 2+ years of DevOps experience - Comfortable working in a small, fast-paced Agile environment - Strong communication skills and ability to execute under uncertainty - Proven track owning features end-to-end from initial requirements, design, to application development and deployment - Strong proficiency in containerization and deployment technologies like Terraform, Docker, Kubernetes, etc. - Strong knowledge of software engineering best practices and CI/CD tooling - Eligible to obtain and maintain an active U.S. security clearance Nice to Have: - Knowledge of security best practices for DevOps and MLOps - Familiarity with MLOps concepts - Familiarity with Kubernetes and deployments into air-grapped environments Additional Requirements: Must be willing to work extended hours and weekends as needed. Compensation and Benefits: Your base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience. The anticipated salary range for this role is $120,000 - $160,000 annually. Base salary is just one part of your total rewards package at Katalyst. You will also be eligible for long-term incentives, in the form of the Employee Stock Option and Equity Plan, as well as a relocation bonus and other discretionary bonuses. You will also receive access to comprehensive medical, vision, and dental coverage, and unlimited Paid Time Off. At Katalyst our work on projects involving the U.S. Department of Defense requires adherence to International Traffic in Arms Regulations (ITAR), 22 C.F.R. Parts 120-130, which requires compliance with U.S. export laws before allowing employees to perform certain positions. Currently, our available roles necessitate access to ITAR-controlled information, and as a result, Katalyst would have to ensure any non-US person is authorized access to ITAR information before the commencement of employment. We are committed to equal employment opportunities and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin. *remote work will require an exception from the hiring team
• Liderar o desenho, implementação e evolução da infraestrutura como código, com forte foco em Terraform e Terragrunt, definindo padrões, módulos reutilizáveis e boas práticas para todo o time. • Desenhar e implementar arquiteturas de pipelines de CI/CD robustas, escaláveis e seguras, garantindo qualidade, rastreabilidade e governança de deploys em múltiplos ambientes. • Conduzir iniciativas de modernização e migração de pipelines legados, propondo novas abordagens, ferramentas e fluxos que reduzam tempo de entrega e aumentem confiabilidade. • Atuar como referência técnica em containerização e orquestração, utilizando Docker e Kubernetes (preferencialmente em AWS/EKS), incluindo observabilidade, segurança, autoscaling e estratégias de deploy (blue/green, canary, etc.). • Apoiar e orientar times de desenvolvimento na adoção de arquitetura orientada a eventos e APIs, garantindo boas práticas de integração, resiliência e monitoramento. • Propor e implementar melhorias contínuas em monitoramento, logging, métricas e alertas, construindo um ambiente observável e confiável. • Desenvolver e manter scripts e ferramentas internas (principalmente em Bash e/ou Python) para automação de tarefas recorrentes, integrações entre sistemas e verificação de conformidade de infraestrutura. • Atuar como guardião de padrões de DevOps, disseminando práticas de automação, segurança, observabilidade, custo otimizado e confiabilidade ao longo de todo o ciclo de vida das aplicações. • Realizar revisões técnicas (code review de Terraform, pipelines, scripts) e apoiar na resolução de incidentes complexos, atuando em análises de causa raiz e melhoria permanente. • Colaborar com arquitetura, segurança e desenvolvimento na definição de roadmaps de plataforma, trazendo visão técnica de longo prazo para a evolução do ambiente.
• Provision and manage AWS infrastructure for Databricks (VPCs, subnets, S3 buckets, IAM roles and policies) • Implement Infrastructure as Code (IaC) with Terraform for all environment resources • Create and maintain CI/CD pipelines for automated deployment of notebooks, jobs, and Databricks configurations • Configure Databricks on AWS (workspaces, clusters, instance profiles, Unity Catalog) • Plan and execute the migration of Databricks Workspaces from Azure to AWS • Implement security policies, encryption, and access controls (IAM, KMS, Lake Formation) • Set up monitoring, alerting, and logging (CloudWatch, Databricks Overwatch) • Manage environments (dev, staging, production) and code promotion strategies • Optimize infrastructure costs (Spot Instances, cluster auto-scaling, reservations)



