EVOLVE BY INTEGRATION
Senior Site Reliability Engineer
Location
United Kingdom
Posted
67 days ago
Salary
0
Seniority
Senior
Job Description
Senior Site Reliability Engineer
Prima Power
• Design, build, and operate reliable and scalable systems by defining and monitoring SLOs/SLIs, working directly on production infrastructure, and collaborating closely with software engineers on system design and reliability improvements • Actively develop automation for infrastructure and operational workflows to eliminate toil and reduce MTTR, participate in and lead incident response, and drive blameless post-incident reviews with concrete follow-ups implemented in code and tooling • Continuously analyze and optimize system performance and cost, provide data, insights, and recommendations to inform capacity planning, and support security best practices through hands-on vulnerability remediation and threat mitigation
Job Requirements
- SRE & Cloud Engineering: Hands-on experience with SRE practices in production, strong AWS expertise, Kubernetes, networking, DNS, and Infrastructure as Code (Pulumi preferred, Terraform a plus)
- Automation & Software Engineering: Demonstrate strong software engineering fundamentals with an emphasis on code quality and maintainability. This includes solid Python proficiency and deep knowledge of the Python ecosystem (testing, debugging, packaging) and a consistent focus on writing clean, well-structured, and maintainable code
- Reliability, Data & Operations: add stakeholder engagement and mentoring e.g. lead incident response and RCAs, improve system reliability, and engage stakeholders to propose solutions, share learnings, and mentor others
- Nice-to-Have: Experience operating in highly regulated industries (e.g. Insurance, Banking, Healthcare), managing sensitive data, and supporting secure networking setups, including exposure to security technologies such as Cloudflare.
- Strong understanding of microservices architectures, their principles and trade-offs, with the ability to troubleshoot and maintain distributed systems and supporting technologies (RabbitMQ, Kafka, PostgreSQL, Redis).
- Hands-on experience with Datadog for platform and application monitoring, performance optimisation, and solid fundamentals in database structures and operational troubleshooting. Hands-on experience with PySpark and familiarity with MLOps practices including model registries, versioning, retraining workflows, and deployment lifecycles.
Benefits
- Work Your Way: Enjoy full flexibility – work from home, the office or a mix of both. Plus, work from anywhere for up to 30 days a year.
- Grow with us: Get access to learning resources, mentorship and a growth plan tailored to you.
- Thrive and perform: Enjoy private healthcare, gym discounts, wellbeing programs and mental health support.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Hands-on Reliability & System Engineering: Design, build, and operate reliable and scalable systems by defining and monitoring SLOs/SLIs, working directly on production infrastructure, and collaborating closely with software engineers on system design and reliability improvements • Automation, Operations & Incident Response: Actively develop automation for infrastructure and operational workflows to eliminate toil and reduce MTTR, participate in and lead incident response, and drive blameless post-incident reviews with concrete follow-ups implemented in code and tooling • Performance, Capacity & Security: Continuously analyze and optimize system performance and cost, provide data, insights, and recommendations to inform capacity planning, and support security best practices through hands-on vulnerability remediation and threat mitigation
Junior Java Developer / DevOps Engineer (Remote/Thessaloniki)
EUROPEAN DYNAMICS"{ engineer; innovate; excite; }"
Are you a Junior Java Developer or DevOps Engineer looking for a dynamic role in a growing development team? Join us in Thessaloniki or work fully remotely, and become part of our innovative team implementing cutting-edge IT software projects for major international public organizations. What You'll Do: - Participate in the design and development of large-scale web-based and mobile applications using state-of-the-art techniques and technologies; - Maintain and develop DevOps tools and services such as CI/CD pipelines and automated processes; - Participate and contribute to the process of defining the software architecture; - Work in a multi-national environment.
Are you a Junior Java Developer eager to dive into challenging IT software projects? Join our team in Athens or work fully remotely, and become part of our dynamic software development team working on projects for major international public organizations. What You'll Do: - Develop excellent quality web and mobile software, using state-of-the-art software development techniques and technologies; - Participate and contribute to software architecture design; - Work in a multi-national environment.
• Analizar y entender las políticas de acceso condicional actuales en tres inquilinos de Entra ID. • Desarrollar módulos de Terraform y pipelines de Azure DevOps para garantizar una gestión segura y automatizada de las políticas. • Preparar y apoyar la transición operativa de la gestión de políticas de acceso condicional del equipo actual al equipo de ciberseguridad. • Asegurar el mantenimiento y operación continuos de las políticas de acceso condicional, incluyendo: - Solucionar y resolver problemas. - Implementar nuevas políticas. - Mejorar y optimizar las políticas existentes.


