Job Closed
This listing is no longer active.
We are an equal opportunity employer with a commitment to diversity. All individuals, regardless of personal characteristics, are encouraged to apply. All qualified applicants will receive consideration for employment without regard to age, race, color, national origin, ancestry, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, religion, physical or mental disability, military or veteran status, genetic information, or any other status protected by applicable state or local law.
Lead Engineer – Site Reliability
Location
India
Posted
79 days ago
Salary
0
Seniority
Senior
Job Description
Lead Engineer – Site Reliability
Empower
• combine deep technical expertise with team leadership to drive reliability • lead other SREs in solving complex operational challenges • establish technical standards and serve as an advisor to engineering leadership • lead cross-functional reliability initiatives • architect enterprise-scale infrastructure solutions • establish Service Level Objectives (SLOs) • lead major incident response as incident commander • drive strategic improvements to observability • evaluate and introduce new technologies
Job Requirements
- 6-10 years of experience in Site Reliability Engineering (or equivalent)
- Proven ability to lead technical teams
- Expert-level knowledge of AWS
- Deep Kubernetes expertise
- Mastery of Infrastructure as Code using Terraform
- Strong software engineering background with production experience in Python and/or Go
- Extensive experience with observability platforms (Datadog, Splunk)
- Deep understanding of CI/CD principles
- Proven track record leading major incidents
Benefits
- flexible work environment
- fluid career paths
- celebrating internal mobility
- purpose and well-being recognition
- work-life balance initiatives
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• As a DevOps / Data Engineer, you play a central role in building and evolving our data and platform infrastructure. • You ensure stable, scalable and automated operations, from data integration through to production run-time. • You design, develop and optimize CI/CD pipelines for data workloads and infrastructure components (Infrastructure as Code). • You implement high-performance, robust ETL/ELT processes and orchestrate production-ready data pipelines including monitoring, logging and alerting. • You ensure the stability, scalability and platform compatibility of our systems and continuously evolve our architecture according to current best practices. • You work closely with data scientists, analysts and business units to reliably deliver high-quality, production-ready data products. • You drive the development of a sustainable DataOps culture with a focus on automation, testing, versioning and quality assurance. • You evaluate new technologies in the cloud and data engineering space and introduce them selectively where they add value.
• Partner directly with clients to assess infrastructure requirements, security constraints, and deployment preferences. • Design and implement deployment strategies for Kubernetes clusters across AWS, Azure, and on-premise environments. • Serve as the primary technical point of contact throughout the deployment lifecycle. • Troubleshoot complex deployment issues, distinguishing between infrastructure and application-level concerns. • Act as a system validator to ensure our solutions function seamlessly within client environments. • Gather client feedback to inform internal development priorities. • Continuously apply and improve deployment best practices and coach peers in their adoption.
DevOps Engineer
Transfermarkt GmbH & Co. KGTransfermarkt is the leading digital platform for football facts, statistics, market values and community.
• As a DevOps Engineer you are part of our infrastructure team and work closely with backend, frontend and microservice developers. • You design our CI/CD processes, advance our infrastructure, and ensure that our platform operates stably, is scalable, and performs well. • You develop and maintain our CI/CD pipelines, ensuring automated, seamless build, test, and deployment processes. • Together with the team you will support the migration of our platform from a monolith to a service-oriented architecture. • You monitor and optimize our systems for performance, availability, and security. • You work with containers and orchestrate our services using the HashiCorp stack (Consul, Nomad, Vault, Terraform/Terragrunt). • In close coordination with developers you plan our infrastructure roadmap and integrate new tools where appropriate. • You actively contribute to reviews, retrospectives, and the technical advancement of our DevOps processes.
• Improve and maintain CI/CD, deployment workflows, and environment management across backend, web, and internal services • Build, maintain and scale infrastructure across AWS and container based services • Improve monitoring, alerting, logging, dashboards, tracing, and runbooks • Work with engineers on safer deploys, rollback plans, and recovery from failures • Automate repetitive operational work and improve internal tooling • Maintain and improve infrastructure as code and deployment tooling • Help improve failover planning, recovery procedures, and backup/restore testing for critical systems • Support production systems and take part in on-call for critical services • Manage and scale infrastructure across AWS, ECS, Docker, PostgreSQL, Redis, Celery, and Go/Python-based services • Lead incident response and postmortems, and drive follow-up actions to reduce repeat issues • Improve reliability, resilience, and operational readiness across critical systems




