Job Closed

This listing is no longer active.

We are an equal opportunity employer with a commitment to diversity. All individuals, regardless of personal characteristics, are encouraged to apply. All qualified applicants will receive consideration for employment without regard to age, race, color, national origin, ancestry, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, religion, physical or mental disability, military or veteran status, genetic information, or any other status protected by applicable state or local law.

Senior Manager – Site Reliability Engineering

DevOps Engineer, Full Time, Remote, Senior, Team 10,001+, H1B Sponsor

Location

United States

Posted

63 days ago

Salary

$125.4K - $181.9K / year

Seniority

Senior

Bachelor Degree, 8 yrs exp, English, AWS, Kubernetes

Job Description

• Lead SRE managers and senior individual contributors, develop leaders through coaching and delegation, and build a strong bench of technical and leadership talent.
• Drive talent reviews, succession planning, organizational design discussions, performance management, and development planning for high-potential team members.
• Foster a culture of operational excellence, innovation, and continuous learning across the organization.
• Define multi-quarter roadmaps and OKRs aligned with business objectives, and balance investments across reliability, scalability, cost, and security priorities.
• Develop strategies to reduce technical debt and operational toil, plan capacity and resources across teams, and lead initiatives that span multiple teams or value streams.
• Make build versus buy versus partner decisions for platform capabilities and support technology investment decisions within the area of responsibility.
• Own reliability across multiple critical systems or value streams, including SLO and SLI frameworks, operational metrics, incident management, on-call practices, and disaster recovery and business continuity capabilities.
• Provide technical direction on major infrastructure decisions, participate in architecture reviews, and drive adoption of best practices across teams, including infrastructure as code, GitOps, observability, CI/CD, and zero-trust security architecture.
• Partner with Engineering, Product Management, Security, Compliance, Finance, and executive stakeholders on reliability requirements, roadmap planning, infrastructure initiatives, and broader technology strategy.
• Communicate vision, strategy, progress, incident summaries, and investment needs to stakeholders at all levels, including VP and Director audiences, while influencing technical and process decisions across the organization.

Job Requirements

3 to 5 years of management experience or equivalent, ideally including experience managing managers.
8+ years of hands-on technical experience in Site Reliability Engineering, DevOps, or Operational Excellence.
Proven track record of leading engineering teams through operational challenges.
Deep technical knowledge of AWS, Kubernetes, and modern infrastructure practices.
Strong understanding of SRE principles and experience applying them at scale.
Experience setting strategy and executing multi-quarter initiatives.
Demonstrated ability to develop managers and senior engineers.
Excellent communication skills with the ability to influence senior leadership.
Strong judgment in balancing competing priorities and making trade-offs.
Experience managing distributed or hybrid teams.

Benefits

Medical, dental, vision and life insurance
Retirement savings – 401(k) plan with generous company matching contributions (up to 6%), financial advisory services, potential company discretionary contribution, and a broad investment lineup
Tuition reimbursement up to $5,250/year
Business-casual environment that includes the option to wear jeans
Generous paid time off upon hire – including a paid time off program plus ten paid company holidays and three floating holidays each calendar year
Paid volunteer time — 16 hours per calendar year
Leave of absence programs – including paid parental leave, paid short- and long-term disability, and Family and Medical Leave (FMLA)
Business Resource Groups (BRGs) – BRGs facilitate inclusion and collaboration across our business internally and throughout the communities where we live, work and play. BRGs are open to all.

More DevOps Engineer Jobs

Senior DevOps Engineer

Avenga

A global IT engineering and consulting company specializing in custom software development.

DevOps Engineer, 63 days ago

Full Time, Remote, Team 5,001-10,000, H1B No Sponsor

• Lead infrastructure initiatives for a cutting-edge SaaS platform
• Own complex AWS/Kubernetes projects, CI/CD standards, and platform tooling
• Enable self-service patterns for product teams
• Automate security and compliance workflows

AWS, Flux, Kubernetes, Linux, Prometheus, Python, Terraform

United States

Job Closed

SRE – Site Reliability Engineering

Sofka Technologies

To transform people’s lives by being the most trusted technology partner

DevOps Engineer, 63 days ago

Full Time, Remote, Team 1,001-5,000, Since 2013, H1B No Sponsor

• Act as the bridge between innovation and stability, mastering concepts of Observability & Reliability, Distributed Systems Architecture, and Automation (IaC).
• Adapt observability needs to each technical solution to ensure coverage, visibility, and operational efficiency.
• Configure and maintain dashboards, metrics, alerts, and business-critical controls.
• Validate the resilience of solutions through chaos testing and scalability assessments under load.
• Implement resilient design patterns such as circuit breakers, fallbacks, and retries in distributed architectures.
• Identify and automate manual processes using infrastructure-as-code tools to reduce MTTR.
• Lead the implementation of self-remediation workflows and promote continuous improvement practices in operations.
• Collaborate with development and architecture teams to ensure technical quality in users' critical journeys.

Ansible, AWS, Azure, Docker, GCP, Grafana, Kubernetes, OpenShift, Prometheus, Python, Terraform

Ecuador

Job Closed

DevOps Freelance – Installation and Configuration of Sonatype Nexus Repository

TheWhiteam

SPECIALIZED IT CONSULTANTS

DevOps Engineer, 63 days ago

Contract, Remote, Team 201-500, Since 2012, H1B No Sponsor

• Install and configure Sonatype Nexus Repository (v3) in a Linux environment (on-prem or cloud, as defined).
• Configure repositories: Maven (hosted, proxy, group) and npm (hosted and proxy).
• Optional: Docker registry (hosted and/or proxy), if experience is available.
• Configure Nexus as a service (systemd), with logs, basic backup, and cleanup tasks (cleanup policies).
• Define users, roles, and permissions (developers, CI/CD, administration).
• Minimal integration with CI/CD (Jenkins / GitLab CI / GitHub Actions, depending on the client's stack).
• Documentation for the client's team (usage guide and basic operation).

Docker, Gradle, Jenkins, Linux, Maven

Spain

Job Closed

DevSecOps Engineer

Phorest Software

Phorest is your all-in-one solution to managing and growing your business. #TogetherWeGrow

DevOps Engineer, 63 days ago

Full Time, Remote, Team 201-500, Since 2003, H1B No Sponsor

• Own & Evolve Security Standards: Take ownership of security standards across Phorest, ensuring they are practical, up-to-date, and consistently applied. Continuously improve them in line with evolving threats, business needs, and industry best practice.
• Protect Our Cloud & Infrastructure: Configure, maintain, and optimise security tooling across our AWS environment. Lead threat monitoring, improve alert quality, and proactively identify gaps in our security coverage.
• Drive Risk Reduction: Lead security assessments across infrastructure and applications. Prioritise vulnerabilities based on risk and work closely with teams to ensure effective remediation. Facilitate threat modelling to catch risks early in the development lifecycle.
• Embed Security into Engineering (Shift-Left): Partner with engineering teams to integrate security into CI/CD pipelines and development workflows, enabling secure-by-default practices without slowing delivery.
• Incident Response & Triage: Lead the triage and analysis of security alerts and incidents. Provide clear guidance on remediation and identify patterns to reduce recurring risks.
• Be a Trusted Security Partner: Act as a go-to security point of contact across the business. Support teams in making secure decisions, balancing risk with practicality and speed.
• Build Security Awareness & Culture: Contribute to internal security education and secure coding initiatives, helping teams understand not just the “what” but the “why” behind security.
• Continuously Improve Our Security Posture: Identify opportunities to strengthen our tools, processes, and ways of working, and take ownership of driving those improvements forward.

AWS, Cloud, JavaScript, Python, Terraform

United Kingdom

Job Closed
