Airalo is an eSIM store where travelers can access more than 200 eSIMS at affordable, local rates from around the world while using an eSIM-compatible tablet, s
Senior Site Reliability Engineer
Location
Spain
Posted
3 days ago
Salary
0
Seniority
Senior
Job Description
Senior Site Reliability Engineer
Airalo
• Lead the design of scalable, fault-tolerant and self-healing systems in a multi-region AWS environment. • Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to drive architectural decisions and error budget policies. • Conduct blameless post-incident reviews to uncover systemic root causes and implement long-term preventive measures. • Identify patterns of manual work and lead the development of internal tools/automation to permanently eliminate them. • Develop and maintain automated runbooks and playbooks for common operational tasks and complex incident response. • Shift from simple monitoring to deep observability, ensuring high cardinality data leads to proactive actionable insights. • Proactively identify and mitigate operational risks through chaos engineering and architecture reviews. • Work with software engineers to design systems for reliability, scalability, and maintainability from the early stages of the SDLC. • Continuously evaluate and optimize system performance, capacity, and cost efficiency. • Beyond just participating, you will refine the on-call experience to reduce alert fatigue, improve MTTR, and ensure sustainable rotation health.
Job Requirements
- Bachelor’s degree in Computer Engineering or a similar discipline.
- 5+ years of experience as a Site Reliability Engineer or in a similar role.
- 3+ years of experience with AWS services including strong knowledge of container orchestration.
- 2+ years of Kubernetes experience.
- Deep understanding of observability principles and tools such as: Prometheus, Datadog, OpenTelemetry and similar.
- Experience with leading incident management and complex postmortem analysis.
- Experience and interest in managing infrastructure as code (Terraform).
- Experience with chaos engineering and other techniques for testing system resilience.
- Experience with CI/CD tools such as GitHub Actions for automated delivery.
- Proficiency in at least one programming language (Python, Go, Java, etc.) for building automation and internal tooling.
- Event-driven architecture experience (SNS, SQS etc).
- Ability to work independently and collaboratively in a fast-paced environment.
- Team player and open to new ideas.
- Good communication skills and fluency in English.
Benefits
- Remote work
- Generous PTO
- Wellness allowances
- Learning allowances
- Annual Airalo Away retreat
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Be responsible for maintaining and improving the infrastructure; • Communicate effectively and remain in close contact with developers; • Improve application performance and scalability in the cloud; • Collaborate with the internal technology team and with external partner/integrator teams.
• Build and operate the infrastructure that keeps Themis secure, reliable, and fast • Own the systems for cloud infrastructure, CI/CD pipelines, observability, and security controls • Automate provisioning, configuration, scaling, and routine operational tasks • Manage containerized workloads and orchestration • Build monitoring, logging, alerting, and dashboards to ensure system health and performance • Define and improve incident response processes • Drive reliability improvements, capacity planning, and performance tuning • Implement and maintain security controls and access management
Senior DevOps Engineer
CodiLimeA strategic partner for technology-driven companies | Network engineering | Software engineering
• Design, provision, and maintain cloud infrastructure using Terraform and Terraform Cloud. • Manage Azure networking, including VNets, subnets, Private Endpoints, DNS zones, and NSGs. • Manage Azure Kubernetes Service (AKS) clusters. • Implement and optimize CI/CD pipelines using GitHub Actions. • Manage container build, deployment, and release processes. • Implement monitoring and observability solutions. • Support incident analysis and root-cause investigations. • Collaborate with architects, developers, and security teams. • Promote an automation-first and DevOps culture across engineering teams. • Participate in technical discovery, proof-of-concepts, and architecture discussions.
Senior DevOps Engineer
CodiLimeA strategic partner for technology-driven companies | Network engineering | Software engineering
• Design, provision, and maintain cloud infrastructure using Terraform and Terraform Cloud. • Manage Azure networking, including VNets, subnets, Private Endpoints, DNS zones, and NSGs. • Manage Azure Kubernetes Service (AKS) clusters. • Implement and optimize CI/CD pipelines using GitHub Actions. • Manage container build, deployment, and release processes. • Implement monitoring and observability solutions. • Support incident analysis and root-cause investigations. • Collaborate with architects, developers, and security teams. • Promote an automation-first and DevOps culture across engineering teams. • Participate in technical discovery, proof-of-concepts, and architecture discussions.



