Job Closed
This listing is no longer active.
We are an equal opportunity employer with a commitment to diversity. All individuals, regardless of personal characteristics, are encouraged to apply. All qualified applicants will receive consideration for employment without regard to age, race, color, national origin, ancestry, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, religion, physical or mental disability, military or veteran status, genetic information, or any other status protected by applicable state or local law.
Senior Manager – Site Reliability Engineering
Location
United States
Posted
63 days ago
Salary
$125.4K - $181.9K / year
Seniority
Senior
Job Description
Senior Manager – Site Reliability Engineering
Empower
• Lead SRE managers and senior individual contributors, develop leaders through coaching and delegation, and build a strong bench of technical and leadership talent. • Drive talent reviews, succession planning, organizational design discussions, performance management, and development planning for high-potential team members. • Foster a culture of operational excellence, innovation, and continuous learning across the organization. • Define multi-quarter roadmaps and OKRs aligned with business objectives, and balance investments across reliability, scalability, cost, and security priorities. • Develop strategies to reduce technical debt and operational toil, plan capacity and resources across teams, and lead initiatives that span multiple teams or value streams. • Make build versus buy versus partner decisions for platform capabilities and support technology investment decisions within the area of responsibility. • Own reliability across multiple critical systems or value streams, including SLO and SLI frameworks, operational metrics, incident management, on-call practices, and disaster recovery and business continuity capabilities. • Provide technical direction on major infrastructure decisions, participate in architecture reviews, and drive adoption of best practices across teams, including infrastructure as code, GitOps, observability, CI/CD, and zero-trust security architecture. • Partner with Engineering, Product Management, Security, Compliance, Finance, and executive stakeholders on reliability requirements, roadmap planning, infrastructure initiatives, and broader technology strategy. • Communicate vision, strategy, progress, incident summaries, and investment needs to stakeholders at all levels, including VP and Director audiences, while influencing technical and process decisions across the organization.
Job Requirements
- 3 to 5 years of management experience or equivalent, ideally including experience managing managers.
- 8+ years of hands-on technical experience in Site Reliability Engineering, DevOps, or Operational Excellence.
- Proven track record of leading engineering teams through operational challenges.
- Deep technical knowledge of AWS, Kubernetes, and modern infrastructure practices.
- Strong understanding of SRE principles and experience applying them at scale.
- Experience setting strategy and executing multi-quarter initiatives.
- Demonstrated ability to develop managers and senior engineers.
- Excellent communication skills with the ability to influence senior leadership.
- Strong judgment in balancing competing priorities and making trade-offs.
- Experience managing distributed or hybrid teams.
Benefits
- Medical, dental, vision and life insurance
- Retirement savings – 401(k) plan with generous company matching contributions (up to 6%), financial advisory services, potential company discretionary contribution, and a broad investment lineup
- Tuition reimbursement up to $5,250/year
- Business-casual environment that includes the option to wear jeans
- Generous paid time off upon hire – including a paid time off program plus ten paid company holidays and three floating holidays each calendar year
- Paid volunteer time — 16 hours per calendar year
- Leave of absence programs – including paid parental leave, paid short- and long-term disability, and Family and Medical Leave (FMLA)
- Business Resource Groups (BRGs) – BRGs facilitate inclusion and collaboration across our business internally and throughout the communities where we live, work and play. BRGs are open to all.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer
AvengaA global IT engineering and consulting company specializing in custom software development.
• Lead infrastructure initiatives for a cutting-edge SaaS platform • Own complex AWS/Kubernetes projects, CI/CD standards, and platform tooling • Enable self-service patterns for product teams • Automate security and compliance workflows
SRE – Site Reliability Engineering
Sofka TechnologiesTo transform people’s lives being the most trusted technology partner
• Actuar como el puente entre la innovación y la estabilidad, dominando conceptos de Observabilidad & Reliability, Arquitectura de Sistemas Distribuidos y Automatización (IaC). • Adaptar las necesidades de observabilidad a cada solución técnica para asegurar cobertura, visibilidad y eficiencia operativa. • Configurar y mantener dashboards, métricas, alertas y controles críticos para el negocio. • Validar la resiliencia de las soluciones mediante pruebas de caos y evaluaciones de escalabilidad bajo carga. • Implementar patrones de diseño resilientes como circuit breakers, fallbacks y retries en arquitecturas distribuidas. • Identificar y automatizar procesos manuales utilizando herramientas de infraestructura como código para reducir el MTTR. • Liderar la implementación de flujos de autorremediación y promover prácticas de mejora continua en la operación. • Colaborar con equipos de desarrollo y arquitectura para asegurar la calidad técnica en los journeys críticos de los usuarios.
DevOps Freelance – Instalación, Configuración Sonatype Nexus Repository
TheWhiteamSPECIALIZED IT CONSULTANTS
• Instalación y configuración de Sonatype Nexus Repository (v3) en entorno Linux (on‑prem o cloud, según se defina). • Configuración de repositorios: Maven (hosted, proxy, group). npm (hosted y proxy). • Opcional: Docker registry (hosted y/o proxy), si hay experiencia. • Configuración como servicio (systemd), logs, backup básico y tareas de limpieza (cleanup policies). • Definición de usuarios, roles y permisos (desarrolladores, CI/CD, administración). • Integración mínima con CI/CD (Jenkins / GitLab CI / GitHub Actions, según stack del cliente). • Documentación para el equipo del cliente (guía de uso y operación básica).
DevSecOps Engineer
Phorest SoftwarePhorest is your all-in-one solution to managing and growing your business. #TogetherWeGrow
• Own & Evolve Security Standards - Take ownership of security standards across Phorest, ensuring they are practical, up-to-date, and consistently applied. Continuously improve them in line with evolving threats, business needs, and industry best practice. • Protect Our Cloud & Infrastructure - Configure, maintain, and optimise security tooling across our AWS environment. Lead threat monitoring, improve alert quality, and proactively identify gaps in our security coverage. • Drive Risk Reduction - Lead security assessments across infrastructure and applications. Prioritise vulnerabilities based on risk and work closely with teams to ensure effective remediation. Facilitate threat modelling to catch risks early in the development lifecycle. • Embed Security into Engineering (Shift-Left) - Partner with engineering teams to integrate security into CI/CD pipelines and development workflows — enabling secure-by-default practices without slowing delivery. • Incident Response & Triage - Lead the triage and analysis of security alerts and incidents. Provide clear guidance on remediation and identify patterns to reduce recurring risks. • Be a Trusted Security Partner - Act as a go-to security point of contact across the business. Support teams in making secure decisions, balancing risk with practicality and speed. • Build Security Awareness & Culture - Contribute to internal security education and secure coding initiatives, helping teams understand not just the “what” but the “why” behind security. • Continuously Improve Our Security Posture - Identify opportunities to strengthen our tools, processes, and ways of working — and take ownership of driving those improvements forward.




