Job Closed
This listing is no longer active.
Founded in 2018, MLabs is a private software engineering consultancy specializing in Haskell and Rust development with a focus on blockchain, artificial intelli
Senior Site Reliability Engineer
Location
United States + 40 moreAll locations: United States | United Kingdom | Germany | France | Estonia | Portugal | Hungary | Poland | Ukraine | Romania | Bulgaria | Czechia | Slovakia | Belarus | Moldova | Sweden | Greece | Belgium | Italy | Ireland | Switzerland | Netherlands | Finland | Malta | Denmark | Lithuania | Croatia | Spain | Austria | Bosnia And Herzegovina | Iceland | Luxembourg | North Macedonia | Montenegro | Norway | Serbia | Slovenia | Albania | Cyprus | Latvia | Monaco
Posted
82 days ago
Salary
0
Seniority
Senior
Job Description
Senior Site Reliability Engineer
MLabs LTD
Role Description We are hiring on behalf of our client, a high-growth software company supporting the development of a premier open-source, EVM-compatible public ledger built for global enterprise and Web3 use cases. They are currently hiring a Senior Site Reliability Engineer for their "greenfield" enterprise-focused team. This team is building a private and consortium distributed ledger platform designed specifically for sectors with high security and privacy requirements, such as financial services, healthcare, and supply chain. This is a hands-on, high-impact role where you will own the design, deployment, and reliability of mission-critical, multi-region infrastructure. This is not a traditional support role; they are looking for an engineer who has operated real systems at scale and is eager to take end-to-end ownership of architecture and operational standards from the ground up. Key Responsibilities - Systems Architecture: Design and operate highly available, multi-region distributed systems with rigorous recovery strategies (RTO/RPO). - Infrastructure as Code: Own large-scale IaC using Terraform, developing reusable modules and multi-account patterns with policy guardrails. - Kubernetes Orchestration: Scale production environments (EKS, GKE, or AKS) utilizing GitOps (ArgoCD), Helm, and strict network policies. - CI/CD Leadership: Build secure pipelines supporting blue/green and canary deployments, artifact signing (SBOM), and automated rollback strategies. - SRE Advocacy: Define and improve SLOs, error budgets, and observability metrics to drive measurable reductions in MTTR. - Collaboration: Partner with the Head of SRE and VP of Engineering to translate complex business requirements into reliable, secure platform services. Qualifications - 7+ years of experience in SRE, Platform Engineering, or Infrastructure Engineering operating production distributed systems. - Multi-Cloud Mastery: Deep expertise in AWS or GCP, with experience running multi-region production environments and disaster recovery testing. - Containerization: Hands-on experience with Kubernetes at scale, including GitOps workflows and production-grade security controls. - Security Mindset: Strong background in Zero Trust principles, secrets management (Vault), and compliance frameworks (SOC 2, HIPAA, or NIST). - Tooling: Extensive experience with Terraform-first infrastructure in large-scale, real-world environments. Nice to Have - Experience with distributed ledger technology (DLT) or blockchain systems, particularly private/consortium deployments. - Familiarity with EVM-based systems and smart contract tooling (Solidity, Hardhat). - Experience operating active-active, globally distributed architectures. - Background in supporting financial services or other highly regulated industries. Benefits - Competitive base salary with Performance Bonuses. - Equity and Token participation. - 401k and comprehensive health insurance (for US-based employees). - The opportunity to build a "greenfield" platform from scratch within a stable, venture-backed organization. - Work on infrastructure that powers the world’s leading organizations across multiple sectors.
Job Requirements
- 7+ years of experience in SRE, Platform Engineering, or Infrastructure Engineering operating production distributed systems.
- Multi-Cloud Mastery: Deep expertise in AWS or GCP, with experience running multi-region production environments and disaster recovery testing.
- Containerization: Hands-on experience with Kubernetes at scale, including GitOps workflows and production-grade security controls.
- Security Mindset: Strong background in Zero Trust principles, secrets management (Vault), and compliance frameworks (SOC 2, HIPAA, or NIST).
- Tooling: Extensive experience with Terraform-first infrastructure in large-scale, real-world environments.
- Nice to Have
- Experience with distributed ledger technology (DLT) or blockchain systems, particularly private/consortium deployments.
- Familiarity with EVM-based systems and smart contract tooling (Solidity, Hardhat).
- Experience operating active-active, globally distributed architectures.
- Background in supporting financial services or other highly regulated industries.
Benefits
- Competitive base salary with Performance Bonuses.
- Equity and Token participation.
- 401k and comprehensive health insurance (for US-based employees).
- The opportunity to build a "greenfield" platform from scratch within a stable, venture-backed organization.
- Work on infrastructure that powers the world’s leading organizations across multiple sectors.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior Site Reliability Engineer
MLabs LTDFounded in 2018, MLabs is a private software engineering consultancy specializing in Haskell and Rust development with a focus on blockchain, artificial intelli
Senior Site Reliability Engineer (Enterprise Platform) Location: Remote - US - Open to Europe if happy to overlap with EST Compensation: Competitive We are hiring on behalf of our client, a high-growth software company supporting the development of a premier open-source, EVM-compatible public ledger built for global enterprise and Web3 use cases. They are currently hiring a Senior Site Reliability Engineer for their "greenfield" enterprise-focused team. This team is building a private and consortium distributed ledger platform designed specifically for sectors with high security and privacy requirements, such as financial services, healthcare, and supply chain. This is a hands-on, high-impact role where you will own the design, deployment, and reliability of mission-critical, multi-region infrastructure. This is not a traditional support role; they are looking for an engineer who has operated real systems at scale and is eager to take end-to-end ownership of architecture and operational standards from the ground up. Key Responsibilities: - Systems Architecture: Design and operate highly available, multi-region distributed systems with rigorous recovery strategies (RTO/RPO). - Infrastructure as Code: Own large-scale IaC using Terraform, developing reusable modules and multi-account patterns with policy guardrails. - Kubernetes Orchestration: Scale production environments (EKS, GKE, or AKS) utilizing GitOps (ArgoCD), Helm, and strict network policies. - CI/CD Leadership: Build secure pipelines supporting blue/green and canary deployments, artifact signing (SBOM), and automated rollback strategies. - SRE Advocacy: Define and improve SLOs, error budgets, and observability metrics to drive measurable reductions in MTTR. - Collaboration: Partner with the Head of SRE and VP of Engineering to translate complex business requirements into reliable, secure platform services.
DevSecOps Engineer
PerkboxHelping businesses care for, connect with and celebrate their people— no matter where they are or what they want 🎈
• Take ownership of our security posture across our AWS and Azure estates • Work closely with DevOps and Developer teams to integrate security into delivery pipelines • Enhance threat detection, manage vulnerability scanning, and ensure infrastructure resilience • Monitor applications and respond promptly to security alerts • Perform static and dynamic security testing as part of pipeline enhancements • Document security procedures and report on findings with clarity and precision
• Diseñar e implementar el sistema de monitoreo y alertas centralizadas (la alerta debe llegar al sistema, no al cliente). • Definir métricas de confiabilidad (SLOs, SLIs, SLAs) y garantizar su cumplimiento. • Analizar y prevenir incidentes de disponibilidad, identificando patrones y causas raíz. • Colaborar con DevOps y Data para diseñar arquitecturas que sean resilientes por diseño. • Documentar runbooks, dashboards y protocolos de respuesta a incidentes. • Liderar revisiones postmortem con foco en mejora continua y aprendizaje organizacional.
Development, security, and operations Engineer
MetaPhase ConsultingMetaPhase Consulting is a business management and technology consulting company that specializes in providing its services to commercial clients, nonprofit organizations, and gover
Enhance developer experience by maintaining tools for secure code delivery, implement secure configurations in collaboration with teams, and ensure compliance with government security standards while supporting incident response protocols.

