Job Closed
This listing is no longer active.
Site Reliability Engineer – SRE
Location
Ireland
Posted
31 days ago
Salary
€55K - €68K / year
Seniority
Senior
Job Description
Air Apps
- Design and implement scalable, reliable, and fault-tolerant systems across cloud environments.
- Develop and maintain observability tools, including monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK).
- Automate infrastructure provisioning, deployment, and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
- Optimize system performance, scalability, and incident response workflows to improve uptime.
- Work closely with development and DevOps teams to improve system design for reliability.
- Conduct root cause analysis (RCA) and implement preventative measures to minimize failures.
- Ensure high availability by designing and maintaining load balancing, failover, and disaster recovery strategies.
- Improve CI/CD pipelines to enhance deployment speed while maintaining stability.
- Optimize cloud cost and resource utilization for AWS, Azure, or Google Cloud Platform (GCP).
- Participate in on-call rotations to quickly address system failures and minimize downtime.
Job Requirements
- 4+ years of experience in Site Reliability Engineering (SRE), DevOps, or Systems Engineering.
- Strong knowledge of cloud platforms (AWS, Azure, or GCP) and cloud-native architectures.
- Experience with observability and monitoring tools (Prometheus, Grafana, ELK, Datadog, New Relic).
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi.
- Hands-on experience with containerization and orchestration (Docker, Kubernetes, Helm).
- Strong Linux system administration and networking fundamentals.
- Experience with incident management, debugging, and root cause analysis.
- Proficiency in scripting (Bash, Python, or Go) for automation and system monitoring.
- Knowledge of load balancing, failover strategies, and distributed systems.
- Understanding of security best practices, access control, and compliance requirements.
- Strong communication skills and the ability to collaborate with cross-functional teams.
Benefits
- Apple hardware ecosystem for work.
- Annual bonus.
- Top-tier Health and Life Insurance for peace of mind.
- Transportation Budget to support your commute needs.
- Coverflex benefits package for meal allowances, well-being, and more.
- Childcare support.
- Air Conference - an opportunity to meet the team, collaborate, and grow together.
- Pension Fund to support your long-term financial planning.
- Urban Sports Club membership to keep you active.
- Meals 100% free at the hub.