Job Closed

This listing is no longer active.

MLabs

We are a Haskell, Rust, Blockchain and AI consultancy.

Senior Site Reliability Engineer (Azure)

DevOps Engineer | Other | Remote | Senior | Team 51-200 | H1B No Sponsor

Location

France

Posted

67 days ago

Salary

$150K - $200K / year

Seniority

Senior

English Azure Terraform Python Kubernetes Prometheus Grafana

Job Description

Senior Site Reliability Engineer (Enterprise Platform)
Location: US - Open to Europe if happy to overlap with EST
Remote | Full-time
Compensation: $150K - $200K

Our client is seeking a Senior Site Reliability Engineer (Azure) to architect and scale a robust infrastructure foundation for a high-growth distributed systems platform. This position is critical for ensuring that the platform operates as a secure, scalable, and production-ready environment capable of supporting complex enterprise use cases and high reliability standards.

The successful candidate will take a lead role in designing infrastructure from first principles, bridging the gap between product requirements and technical execution. This is a high-impact opportunity for a seasoned engineer to build greenfield Azure environments and establish operational excellence across a global ecosystem.

Key Responsibilities
- Infrastructure Design: Architect and deploy secure, scalable Azure infrastructure tailored for production-grade distributed systems.
- Automation & IaC: Develop and maintain Terraform-based infrastructure as code to enable repeatable, automated deployments across various environments.
- Technical Leadership: Translate ambiguous product and customer requirements into structured technical architecture and actionable execution plans.
- Platform Enhancement: Build and optimize platform services, APIs, and integrations to extend core system capabilities.
- Cross-Functional Collaboration: Partner with engineering, security, and product teams to deliver enterprise-ready infrastructure solutions.
- Operational Excellence: Drive improvements in reliability, observability, and incident response while providing Tier 2 infrastructure support for customer deployments.

Performance Goals: What Success Looks Like
In the first 6–12 months of this role, the following milestones are expected to be achieved:
- Production Readiness: The Azure environment is established as a fully production-ready deployment setting for the platform.
- Scalable Deployments: All customer deployments are verified as repeatable, scalable, and secure.
- Feature Parity: Azure achieves full feature parity with all other supported cloud environments within the organization’s ecosystem.

Interview Process
- Recruiter & Technical Screening: Initial HR call followed by an introductory technical interview covering foundational questions.
- Hiring Manager Interview: A deeper dive into experience, alignment, and role-specific expectations.
- Technical Interview: A comprehensive evaluation of architectural and technical execution skills.
- Final Leadership Interview: A concluding session with the VP of Engineering.

Job Requirements

Proven Track Record: Extensive experience designing and building production-grade systems specifically on the Azure stack.
Problem Solving: Ability to transform high-level requirements into scalable, delivered systems.
Communication: Strong technical communication skills with the ability to interface with both engineering teams and non-technical stakeholders.
Mindset: A high-ownership approach with a strong bias for action and accountability.
Functional Expertise
Azure Services: Deep knowledge of Azure networking, compute, identity, security, and storage.
Infrastructure as Code: Advanced proficiency with Terraform at production scale.
Programming: Professional experience in Go and/or Python.
Systems Engineering: Background in distributed systems, high-availability architectures, or platform engineering.
CI/CD: Experience with automation tooling for the entire infrastructure lifecycle.
Preferred Qualifications
Hands-on experience with Kubernetes and container orchestration.
Familiarity with observability tools such as Prometheus and Grafana.
Experience with workflow/orchestration platforms like Argo or Spacelift.

Benefits

Our client offers a competitive compensation package designed to reward high-impact contributors, including:
Equity & Tokens: Participation in the long-term growth of the project.
Performance Bonuses: Annual incentives based on individual and company milestones.
Health & Retirement: Comprehensive health insurance and 401k plans (available for US-based employees).
Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.
Commitment to Equality and Accessibility:
At MLabs, we are committed to offering equal opportunities to all candidates. We ensure that there is no discrimination, that job adverts are accessible, and that information is provided in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process, or you would like to see the job advert in an accessible format, please let us know at the earliest opportunity by emailing human-resources@mlabs.city.
MLabs Ltd collects and processes the personal information you provide such as your contact details, work history, resume, and other relevant data for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd’s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by contacting legal@mlabs.city.

More DevOps Engineer Jobs

DevOps AWS Cloud Engineer

N Consulting Ltd

DevOps Engineer | 67 days ago

Contract | Remote | Team 51-200

Role Description
We are looking for a passionate DevOps AWS Cloud Engineer to join our dynamic team and help drive automation, scalability, and reliability across our cloud environments.

Job Title: DevOps AWS Cloud Engineer
Location: Bratislava, Slovakia

Key Responsibilities:
- Collaborate with DevOps engineers and agile development teams to enhance and automate software delivery processes
- Build, maintain, and optimize CI/CD pipelines for fast and reliable deployments
- Implement Configuration as Code and GitOps practices
- Automate provisioning of test and production environments in AWS Cloud
- Operate and scale Kubernetes clusters (Amazon EKS)
- Manage Helm charts and Kubernetes manifests for application deployments
- Develop automation scripts using Go, Bash, or Java
- Work effectively with global teams across different time zones

Qualifications
- Strong hands-on experience with AWS Cloud services
- Expertise in CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.)
- Solid experience with Kubernetes & Amazon EKS
- Good understanding of Infrastructure as Code (Terraform, CloudFormation)
- Scripting experience (Go, Bash, or Java)
- Strong communication and collaboration skills

AWS Kubernetes Amazon EKS Helm Java Jenkins GitHub Actions GitLab CI Terraform

Slovakia

€270 - €280 / day

Job Closed

Cloud - Lead Nre (Network Reliability Engineering)

Ministère des armées. Liberté, égalité, fraternité.

Contacts: dcsca-arcueil.gestionnaire.fct@intradef.gouv.fr, stephanie.porcher@intradef.gouv.fr

DevOps Engineer | 67 days ago

Contract | Remote

Role Description
This position is part of the cloud infrastructure project for defense digital services. The infrastructure is designed, integrated, and operated by our teams, and is built on a controlled, secure, high-performance, and resilient stack deployed across the entire national territory.

Your responsibilities will be:
- Design, deploy, and operate the datacenter network (including the programmable fabric) dedicated to the cloud infrastructure: ToR/Spine architecture, orchestration, automation.
- Implement and sustainably operate network security mechanisms from an availability and reliability standpoint: segmentation (VRF, tenant isolation at the routing level), filtering (perimeter firewalls, etc.).
- Implement and maintain network services: BGP routing, load balancing (L4/L7), ingress, internal DNS, service discovery.
- Implement and operate the cluster network architecture: CNI, overlay, network policies, segmentation.
- Deploy and operate the network observability stack (e.g., latency, packet loss) and the associated measurement mechanisms.
- Run operations pragmatically, guided by network SLOs, error budgets, and post-incident reviews (RETEX).
- Take part in crisis management (N3/N4 level) and on-call rotations.
- Write operating procedures and technical documentation.
- Provide technical leadership to the NRE engineers; contribute to recruitment and skills development.
- Act as the technical point of contact for the ministry's network teams.

Qualifications
- Network expert (10+ years of production experience).
- Able to design and operate the network architecture of a critical production platform at national scale, with an SRE approach.
- Datacenter network operations: command of routing (BGP, OSPF), ToR/Spine architecture, VRF, segmentation, firewalling, L4/L7 load balancing.
- Experience with network fabric orchestration and automation.
- Experience with network observability at scale.
- Experience with SRE culture (SLO/SLI-driven operations, automation, network observability, post-incident reviews (RETEX), and continuous improvement).

Requirements
- Expertise in how a datacenter network works: command of routing (BGP, OSPF), ToR/Spine architecture, VRF, segmentation, firewalling, L4/L7 load balancing.
- Command of cluster network architecture (containers or VMs): CNI, overlay, network policies, segmentation.
- Expertise in a network observability stack and in network measurement tools and methods.
- Proficiency in at least one programming language (Go or Python), sufficient to implement orchestration routines (controllers).
- Very good knowledge of security issues and the ability to engage with an information security (SSI) chain.

Benefits
- Rigorous: ability to design and maintain critical infrastructure with meticulous attention to detail, particularly regarding security and reproducibility.
- Innovative: ability to propose advanced technical solutions and implement best practices.
- Grounded in a culture of fact-based analysis and continuous improvement.

Company Description
Desirable assets:
- Experience with multi-site / multi-region environments.
- Experience with air-gapped environments.
- Knowledge of SecNumCloud and IGI 1300.
- Open source contributions.

Application details
Documents to submit: To apply for this position, you must send a CV and a cover letter.

Contacts
- dc-dirisi-sdorh-rrh-gpc-gpec.mobilite.fct@intradef.gouv.fr
- laurent.prosperi@intradef.gouv.fr

Firewalls DNS R Python

France

Job Closed

DevOps Engineer

Kayzen

Kayzen powers the world's best mobile marketing teams to take programmatic in-house.

DevOps Engineer | 67 days ago

Full Time | Remote | Team 51-200 | H1B No Sponsor

• Develop and maintain automated provisioning pipelines (PXE, ZTP) to deploy bare-metal servers (configure hardware, peripherals, services, settings, directories, storage, etc. in accordance with standards and project/operational requirements) at scale across global data centers.
• You research and recommend innovative and automated approaches for system administration tasks.
• You perform regular security monitoring to identify any possible intrusions.
• You repair and recover from hardware or software failures, coordinating with impacted teams.
• You apply OS patches and upgrades on a regular basis, and upgrade administrative tools and utilities.
• You maintain data center environmental and monitoring equipment.
• You perform ongoing performance tuning, hardware upgrades, and resource optimization as required.

Ansible Chef Grafana HAProxy Java Jenkins Kubernetes Nginx Prometheus Puppet Python SQL TCP/IP Terraform Unix

India
