Job Closed
This listing is no longer active.
Enabling the development of electric vehicles of the future. From #materialscience to ultimate #emobility products.
Senior Manager, Site Reliability Engineering – SRE
Location
United States
Posted
112 days ago
Salary
$160K - $180K / year
Seniority
Senior
Job Description
Senior Manager, Site Reliability Engineering – SRE
UJET
• Build and lead a new SRE team, including hiring, onboarding, and career development • Define and implement SRE best practices: SLIs/SLOs, error budgets, incident management, on-call models, and postmortems • Establish clear operational ownership between SRE and product engineering teams • Drive reliability as a feature, balancing velocity and stability with data—not vibes • Reduce toil through automation and self-service platforms • Design and evolve incident response, escalation, and learning loops (no blame, lots of learning) • Partner with engineering leaders to influence architecture, capacity planning, and launch readiness • Own reliability metrics and communicate risk and performance clearly to technical and executive audiences
Job Requirements
- 8+ years in SRE, infrastructure, or platform engineering, with 3+ years managing managers or senior ICs
- Experience building an SRE team or function from scratch (or significantly scaling one)
- Deep knowledge of modern cloud infrastructure, distributed systems, and observability
- Strong opinions about what SRE is and is not - and the judgment to apply them pragmatically
- Proven ability to collaborate with product and engineering leaders without becoming the “department of no”
- Calm, decisive leadership during incidents (the person people want on the conference call)
Benefits
- Medical
- Dental
- Vision
- 401(k) plan
- Commuter benefits
- Comprehensive Benefits
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Own and influence the incident management process end-to-end • Maintain and evolve on-prem observability stack • Keep production applications running smoothly by participating in the on-call rotation • Develop automations and tools to support platform reliability • Contribute to production services with performance and resiliency in mind • Collaborate with product engineers to foster SRE principles within the R&D organization • Be a mentor for the SRE team or product engineers
• Own and influence the incident management process end-to-end • Maintain and evolve on-prem observability stack • Keep production applications running smoothly by participating in the on-call rotation • Develop automations and tools to support platform reliability • Contribute to production services with performance and resiliency in mind • Collaborate with product engineers to foster SRE principles within the R&D organization • Be a mentor for the SRE team or product engineers
• Own and influence the incident management process end-to-end • Maintain and evolve on-prem observability stack • Keep production applications running smoothly by participating in the on-call rotation • Develop automations and tools to support platform reliability • Contribute to production services with performance and resiliency in mind • Collaborate with product engineers to foster SRE principles within the R&D organization • Be a mentor for the SRE team or product engineers
• Own and influence the incident management process end-to-end • Maintain and evolve on-prem observability stack • Keep production applications running smoothly by participating in the on-call rotation • Develop automations and tools to support platform reliability • Contribute to production services with performance and resiliency in mind • Collaborate with product engineers to foster SRE principles within the R&D organization • Be a mentor for the SRE team or product engineers

