Scale Faster, Reduce Costs, Meet Diversity Targets
Senior Site Reliability Engineer Manager
Location
United Kingdom
Posted
74 days ago
Salary
0
Seniority
Senior
Job Description
Senior Site Reliability Engineer Manager
RemoteStar
• Ensuring the reliability, scalability, and performance of infrastructure and services • Taking full ownership of the production estate from both a technical and process perspective • Providing consistent smooth operation of live systems • Designing and operating a new incident tracking process • Creating and maintaining high-end monitoring and automation tooling • Driving automation initiatives to improve operational workflows • Developing and maintaining tools, scripts, and dashboards to monitor system health • Building a first-class SRE team and providing leadership and guidance
Job Requirements
- Proven experience in a senior or lead SRE role
- Expertise in incident management including incident response, resolution, and post-mortem analysis
- Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog
- Experience with cloud platforms such as AWS, Azure, or GCP
- Strong scripting and automation skills with proficiency in languages such as Python, Bash, or Go
- Excellent communication and collaboration skills
- Demonstrated leadership capabilities
- Passion for mentoring and developing team members
Benefits
- Dynamic working environment in an extremely fast-growing company
- Work in an international environment
- Work in a pleasant environment with very little hierarchy
- Intellectually challenging
- Flexible working hours
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Join Dev.Pro’s exclusive screening process to gain valuable career insights and access personalized feedback on your skill set. • Get priority consideration by Dev.Pro for suitable job openings. • Opportunity to work with top global corporations and participate in industry-shaping projects. • Send CV in English. • Schedule a call with recruiters. • Participate in an experience interview focused on soft skills. • Undergo online evaluation of technical skills.
Senior Site Reliability Engineer – Team Lead
Dev.ProSoftware Development Partner. Result-driven. Quality-obsessed.
• Oversee a Cloud/SRE support team, ensuring reliable operations, effective processes, and strong collaboration across global and cross-functional teams • Lead our Cloud/SRE Support team, providing coaching, prioritization, and oversight • Drive team performance, ensuring high-quality support, SLA compliance, and continuous improvement • Coordinate with India-based and cross-functional teams for alignment and 24/7 coverage • Translate complex issues into actionable plans and scalable solutions • Design and improve support processes and operational frameworks • Identify gaps and risks, improving operations and team engagement • Collaborate with cross-functional teams to define priorities and communicate progress, risks, and solutions • Oversee execution across MDM operations, access management, monitoring, incidents, and RCA • Maintain clear documentation, runbooks, and escalation procedures • Promote best practices in reliability and customer-focused support
• Develop, implement and manage practical solutions to support clients' growth • Innovate and turn ideas into reality using the best market solutions • Oversee the entire process from conception through solution implementation
DevOps – AWS
KonnectboxUnleashing Infinite Possibilities Of Growth, We are not just a tech company; we are architects of the future.
• Strong Experience with AWS services for DevOps. • Setting up CI/CD • Repository branching strategies using Git/BitBucket, Jenkins, Kubernetes • Troubleshooting deployment, configuration, and networking issues • Designing, deploying and scaling Docker based production systems



