Job Closed
This listing is no longer active.
Create, manage, optimize, and scale content across channels with dotCMS
Senior Site Reliability Engineer – 100% Remote
Location
Florida
Posted
124 days ago
Salary
Not specified
Seniority
Senior
Job Description
dotCMS
- Build the "Golden Path": You will own and evolve the build pipelines, dev setups, and development containers that allow our Stream Aligned teams to ship code independently and safely.
- Institute Observability (O11y): You will own the strategy and tooling for Alerting, Monitoring, and Tracing, empowering developers to see inside their own applications.
- Drive Reliability via SLOs: You will help Stream Aligned teams define and implement Service Level Objectives (SLOs) to back their code pipelines and observability tools.
- Enable, Don't Gatekeep: You will act as a consultant and mentor ("on loan" to teams when necessary) to help them tackle complex infrastructure challenges while ensuring final decision-making and ownership remain with the Stream Aligned team.
- Future-Proofing: You will help explore and implement new capabilities, including our AI toolchain adoption and modernization efforts.
- Incident Management: Participate in an on-call rotation with a focus on blameless post-mortems and systematically removing the root causes of fatigue.
Job Requirements
- 5+ years of total experience
- At least 3 years of experience in an SRE, DevOps, or Platform Engineering role
- Proven track record of at least 3 years of experience with Kubernetes, AWS, Linux, Terraform, and PostgreSQL
- Experience with Java applications is highly preferred
- A deep understanding of CI/CD, Infrastructure as Code, and Observability stacks
- Experience using and contributing to open source projects
Benefits
- Open PTO policy
- Generous local company-paid holidays
- Annual company-paid training and development
More DevOps Engineer Jobs
Senior DevOps Engineer Azure
Zup Innovation
We create digital assets to build, grow and accelerate your applications with efficiency, security and scalability.
This description is a summary of our understanding of the job description.
Role Description
You will work on evolving the cloud architecture and platform engineering practices, ensuring that Azure environments are secure, scalable, and efficient. The role focuses heavily on automation, observability, and reliability, supporting multiple engineering teams and contributing to architectural decisions for distributed systems. We are looking for someone with a systems-level view, technical autonomy, and the ability to raise the teams' DevOps maturity.
- Define and evolve the Azure cloud architecture, promoting security, reliability, and good practices
- Implement and maintain infrastructure as code with Terraform, supporting scalable and resilient environments
- Ensure efficient CI/CD strategies, structuring and optimizing delivery pipelines
- Structure and improve monitoring, traceability, and observability practices with Datadog
- Support product and engineering teams in resolving incidents and making continuous infrastructure improvements
- Act as a technical reference in cloud, security, networking, and automation, sharing knowledge
- Participate in architectural decisions for distributed systems
- Monitor and optimize cloud infrastructure costs, seeking operational efficiency
Qualifications
- Solid experience with the architecture and administration of Azure environments
- Proficiency with Kubernetes (AKS) in production environments
- Advanced experience with infrastructure as code (Terraform)
- Building and evolving CI/CD pipelines with delivery automation
- Experience with monitoring and observability, especially Datadog
- Knowledge of security for cloud environments (IAM, networking, access policies)
- Experience with distributed architecture and operations automation
- Knowledge of networking, protocols, and cloud troubleshooting
Requirements
- Good communication skills for explaining technical decisions to different audiences
- A systems-level view of how technical choices affect the business
- Analytical ability for resolving complex incidents
- Experience supporting multiple teams or acting as a technical reference
- Experience with other clouds (AWS, GCP)
- Relevant Azure or Kubernetes certifications
- Proactivity in pursuing automation, efficiency, and continuous improvement
Benefits
- Remote-by-default work model, prioritizing your freedom and responsibility
- Freedom to work from wherever you want
- Flexible hours
- Education allowance
- In-house career development tool
- Internal guilds and study and interest groups
- Health plan
- Dental plan
- Partnership for medication purchases
- Telemedicine available 24x7
- Free online therapy
- Extended maternity leave
- Extended paternity leave
- Meal and food vouchers
- Life insurance
- Transportation voucher
- Home office allowance
- Childcare allowance
- Phone plan allowance
- Profit and results sharing
Lead Configuration Management Engineer
Switzerland Global Enterprise
We support Swiss SMEs in their international business and help innovative foreign companies to establish in Switzerland.
- Provide direction and assistance to the Engineering teams to meet assigned configuration management objectives
- Define and support nuclear industry configuration management practices, document management, and change control processes for new commercial nuclear power plant projects
- Drive specification, development, implementation, and operation of assigned CM processes consistent with GEH and regulatory requirements
- Develop in-depth knowledge of CM processes and tools for nuclear plant projects; coordinate GEH project CM processes with other organizations
- Assist with engineering configuration management implementation activities inside GEH, in partnership with Information Technology (IT) and support CM interface activities at suppliers
- Master the project Information Management System and engineering design tool suites necessary to maintain CM equilibrium
- Coordinate project CM processes and tools with those of major suppliers
- Support Engineering and Project Management in planning, assessment, reporting and tracking activities and in preparation of related presentations and reports
- Work within a diverse team environment to execute work plans and schedules as applicable for mission success
- Perform work in compliance with policies and procedures
- Support GEH quality requirements, including participation in design reviews, and initiating and responding to Corrective Actions
- Provide on-time, quality delivery of documentation packages in accordance with contract requirements, business procedures, and regulatory agency guidelines
CDN Site Reliability Engineer 5 – Live Streaming, Open Connect CDN
Netflix
Described as the world's top internet television network, Netflix is a publicly traded entertainment company offering video-on-demand and streaming media.
- Support the CDN delivery and day-to-day live-streaming operations for Netflix
- Participate in the preparation, validation, and execution of live streaming focused initiatives in collaboration with related production and engineering teams
- Impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days
- Lead innovation initiatives, implement new features, and drive enhancements in the streaming services delivery
- Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal of maintaining highly scalable and reliable CDN services worldwide with excellent quality of experience (QoE)
- Implement, automate, execute, and analyze the results from a broad range of streaming CDN delivery focused functional, performance, resilience, and fault injection testing
- Coordinate, collaborate, and partner with multiple stakeholders for the smooth execution of live-streaming events
- Aggregate, analyze, and correlate large amounts of server and application performance data
- Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset for service delivery optimization and system reliability improvements
- Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule, including weekends and holidays




