Powering Live.
Lead Site Reliability Developer – CSRE Consulting
Location
Arizona + 3 moreAll locations: Arizona | Florida | Texas | Virginia
Posted
31 days ago
Salary
0
Seniority
Senior
Job Description
Lead Site Reliability Developer – CSRE Consulting
Ticketmaster
• Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes. • Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines. • Align product, platform, and engineering stakeholders on reliability targets and trade-offs using SLOs and error budgets. • Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned. • Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents. • Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on-call practices. • Design and implement reusable reliability mechanisms, templates, and tooling that can be adopted across teams. • Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety. • Drive observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards. • Lead complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification. • Lead reliability-focused design and code reviews and guide teams toward simpler, safer architectures. • Mentor Senior engineers and other consultants through pairing, reviews, and structured coaching to multiply impact. • Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption. • Improve CSRE Consulting playbooks and operating practices based on repeated patterns observed across teams.
Job Requirements
- Deep practical understanding of SRE principles, including SLO governance and error budget policy in practice.
- Proven ability to lead cross-team technical work and influence without authority.
- Strong experience designing and troubleshooting distributed systems with cross-service failure modes.
- Experience shaping observability and alerting strategy and improving operational signal quality.
- Strong Kubernetes and AWS experience, including governance and cost trade-offs.
- Ability to design reliability automation and tooling that is reusable and adopted by multiple teams.
- Experience leading production readiness and resilience practices, including DR validation and controlled testing.
- Strong software engineering fundamentals with the ability to deliver and review high-quality changes in enterprise codebases.
- Advanced incident analysis skills focused on systemic risk reduction and organizational learning.
- Excellent communication skills, including exec-ready summaries and clear technical diagrams.
Benefits
- Medical, vision, dental and mental health benefits for you and your family, with access to a health care concierge, and Flexible or Health Savings Accounts (FSA or HSA)
- Free concert tickets, generous paid time off including paid holidays, sick time, and personal days
- 401(k) program with company match, stock reimbursement program
- New parent programs including caregiver leave, plus fertility, adoption, foster, or surrogacy support
- Career and skill development programs with School of Live, tuition reimbursement, and student loan repayment
- Volunteer time off, crowdfunding match
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps / Platform Engineer
High SpiritsServing up unfiltered insights, insider perspectives and transformative ideas to drive clarity in the cannabis industry
• Build and run the infrastructure behind platforms that accelerate the transition to carbon-free energy • Design, automate, and operate the infrastructure for the virtual battery platform • Make decisions on deployment and security
Lead Site Reliability Developer – CSRE Consulting
Live Nation EntertainmentLive Nation produces more concerts, sells more tickets and connects more brands to music than anyone else in the world.
• Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes. • Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines. • Align product, platform, and engineering stakeholders on reliability targets and trade-offs using SLOs and error budgets. • Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned. • Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents. • Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on-call practices. • Design and implement reusable reliability mechanisms, templates, and tooling that can be adopted across teams. • Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety. • Drive observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards. • Lead complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification. • Lead reliability-focused design and code reviews and guide teams toward simpler, safer architectures. • Mentor Senior engineers and other consultants through pairing, reviews, and structured coaching to multiply impact. • Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption. • Improve CSRE Consulting playbooks and operating practices based on repeated patterns observed across teams.
Lead Site Reliability Developer
Live Nation EntertainmentLive Nation produces more concerts, sells more tickets and connects more brands to music than anyone else in the world.
• Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes. • Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines. • Align product, platform, and engineering stakeholders on reliability targets and trade-offs using SLOs and error budgets. • Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned. • Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents. • Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on-call practices. • Design and implement reusable reliability mechanisms, templates, and tooling that can be adopted across teams. • Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety. • Drive observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards. • Lead complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification. • Lead reliability-focused design and code reviews and guide teams toward simpler, safer architectures. • Mentor Senior engineers and other consultants through pairing, reviews, and structured coaching to multiply impact. • Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption. • Improve CSRE Consulting playbooks and operating practices based on repeated patterns observed across teams.
DevOps Engineer
Infopro DigitalInfopro Digital est le 8 ème groupe technologique français. Suivez notre actualité et rejoignez-nous !
Role Description Infopro Digital group is recruiting for a DevOps Engineer on a permanent basis to join our Eucon Americas LLC business unit, supporting our North American team. This role is ideally based in Atlanta, Georgia, with fully remote working options available within the United States. As part of our continued growth across the Americas, we are seeking a DevOps Engineer to support and enhance the operation of our digital products and cloud infrastructure. This role is central to ensuring secure, stable and scalable cloud environments. You will act as a key technical partner to colleagues across Product, Application Development and IT Operations, translating complex technical challenges into practical, effective solutions. This is a full-time W-2 position, ideally suited to a DevOps professional who thrives in collaborative, agile environments and is passionate about automation, reliability and continuous improvement. Key Tasks and Responsibilities - Take significant responsibility for the secure cloud setup and operation of digital products, including monitoring - Administer systems and solutions, including processes, users and environments - Continuously develop and improve Azure cloud architecture with a focus on high operational stability - Manage code and system resources using Continuous Integration and Continuous Delivery (CI/CD) practices - Optimise and automate CI/CD pipelines and operational processes - Analyse, troubleshoot and resolve incidents in live production environments - Support colleagues with error analysis, root cause investigation and problem-solving Qualifications - Completed IT specialist apprenticeship or a degree in business or technical IT - 3+ years’ experience in application support and software development within a DevOps context - Hands-on experience with CI/CD tools and pipelines - Experience with scripting and infrastructure-as-code tools such as YAML and Terraform - Experience working with Docker and Kubernetes - Strong analytical skills, high quality awareness, and a structured approach to problem-solving - Fluency in English and confidence working in international teams Requirements - In-depth knowledge of Microsoft Azure services - Experience with cloud security best practices - Strong enthusiasm for agile ways of working, DevOps principles and automation Benefits - Competitive base salary with bonus structure - 15 days paid annual leave - Group health insurance plan - Employer contribution towards health insurance - 401(k) retirement plan - Workers’ Compensation coverage - Standard full-time working hours (40 hours per week) - Paid semi-monthly



