Airalo logo
Airalo

Airalo is an eSIM store where travelers can access more than 200 eSIMS at affordable, local rates from around the world while using an eSIM-compatible tablet, s

Senior Site Reliability Engineer

Location

Spain

Posted

3 days ago

Salary

0

Seniority

Senior

Job Description

Senior Site Reliability Engineer

Airalo

• Lead the design of scalable, fault-tolerant and self-healing systems in a multi-region AWS environment. • Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to drive architectural decisions and error budget policies. • Conduct blameless post-incident reviews to uncover systemic root causes and implement long-term preventive measures. • Identify patterns of manual work and lead the development of internal tools/automation to permanently eliminate them. • Develop and maintain automated runbooks and playbooks for common operational tasks and complex incident response. • Shift from simple monitoring to deep observability, ensuring high cardinality data leads to proactive actionable insights. • Proactively identify and mitigate operational risks through chaos engineering and architecture reviews. • Work with software engineers to design systems for reliability, scalability, and maintainability from the early stages of the SDLC. • Continuously evaluate and optimize system performance, capacity, and cost efficiency. • Beyond just participating, you will refine the on-call experience to reduce alert fatigue, improve MTTR, and ensure sustainable rotation health.

Job Requirements

  • Bachelor’s degree in Computer Engineering or a similar discipline.
  • 5+ years of experience as a Site Reliability Engineer or in a similar role.
  • 3+ years of experience with AWS services including strong knowledge of container orchestration.
  • 2+ years of Kubernetes experience.
  • Deep understanding of observability principles and tools such as: Prometheus, Datadog, OpenTelemetry and similar.
  • Experience with leading incident management and complex postmortem analysis.
  • Experience and interest in managing infrastructure as code (Terraform).
  • Experience with chaos engineering and other techniques for testing system resilience.
  • Experience with CI/CD tools such as GitHub Actions for automated delivery.
  • Proficiency in at least one programming language (Python, Go, Java, etc.) for building automation and internal tooling.
  • Event-driven architecture experience (SNS, SQS etc).
  • Ability to work independently and collaboratively in a fast-paced environment.
  • Team player and open to new ideas.
  • Good communication skills and fluency in English.

Benefits

  • Remote work
  • Generous PTO
  • Wellness allowances
  • Learning allowances
  • Annual Airalo Away retreat

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Full TimeRemoteTeam 201-500H1B No Sponsor

• Be responsible for maintaining and improving the infrastructure; • Communicate effectively and remain in close contact with developers; • Improve application performance and scalability in the cloud; • Collaborate with the internal technology team and with external partner/integrator teams.

Brazil
Full TimeRemoteTeam 11-50Since 2018H1B No Sponsor

• Build and operate the infrastructure that keeps Themis secure, reliable, and fast • Own the systems for cloud infrastructure, CI/CD pipelines, observability, and security controls • Automate provisioning, configuration, scaling, and routine operational tasks • Manage containerized workloads and orchestration • Build monitoring, logging, alerting, and dashboards to ensure system health and performance • Define and improve incident response processes • Drive reliability improvements, capacity planning, and performance tuning • Implement and maintain security controls and access management

New York
CodiLime logo

Senior DevOps Engineer

CodiLime

A strategic partner for technology-driven companies | Network engineering | Software engineering

DevOps Engineer3 days ago
ContractRemoteTeam 201-500Since 2011H1B No Sponsor

• Design, provision, and maintain cloud infrastructure using Terraform and Terraform Cloud. • Manage Azure networking, including VNets, subnets, Private Endpoints, DNS zones, and NSGs. • Manage Azure Kubernetes Service (AKS) clusters. • Implement and optimize CI/CD pipelines using GitHub Actions. • Manage container build, deployment, and release processes. • Implement monitoring and observability solutions. • Support incident analysis and root-cause investigations. • Collaborate with architects, developers, and security teams. • Promote an automation-first and DevOps culture across engineering teams. • Participate in technical discovery, proof-of-concepts, and architecture discussions.

Poland
zł22K - zł29K / month
CodiLime logo

Senior DevOps Engineer

CodiLime

A strategic partner for technology-driven companies | Network engineering | Software engineering

DevOps Engineer3 days ago
ContractRemoteTeam 201-500Since 2011H1B No Sponsor

• Design, provision, and maintain cloud infrastructure using Terraform and Terraform Cloud. • Manage Azure networking, including VNets, subnets, Private Endpoints, DNS zones, and NSGs. • Manage Azure Kubernetes Service (AKS) clusters. • Implement and optimize CI/CD pipelines using GitHub Actions. • Manage container build, deployment, and release processes. • Implement monitoring and observability solutions. • Support incident analysis and root-cause investigations. • Collaborate with architects, developers, and security teams. • Promote an automation-first and DevOps culture across engineering teams. • Participate in technical discovery, proof-of-concepts, and architecture discussions.

Egypt