Job Closed
This listing is no longer active.
Site Reliability Engineer
Location
Wisconsin
Posted
100 days ago
Salary
0
Seniority
Senior
Job Description
Site Reliability Engineer
Crunchafi
• Design, build, and maintain scalable and resilient infrastructure on Microsoft Azure to support production SaaS workloads • Define and track service level objectives (SLOs), service level indicators (SLIs), and error budgets to drive reliability decisions • Build and maintain comprehensive monitoring, alerting, and observability systems to ensure early detection of issues • Develop and maintain CI/CD pipelines using GitHub Actions to enable safe, rapid, and repeatable deployments • Lead incident response and on-call rotations, conduct blameless post-incident reviews, and drive follow-up action items to completion • Automate operational tasks and eliminate toil through scripting, infrastructure-as-code, and self-healing systems • Manage and optimize Azure Kubernetes Service (AKS) clusters, container orchestration, and related networking and storage configurations • Collaborate with software engineering teams to embed reliability into application architecture, including capacity planning, load testing, and chaos engineering • Maintain and improve infrastructure-as-code using tools such as Terraform, Bicep, or ARM templates • Partner cross-functionally with Product, Support, and Quality to reduce friction and accelerate delivery
Job Requirements
- 5+ years of professional experience in site reliability engineering, DevOps, or infrastructure engineering roles
- Strong hands-on experience with Microsoft Azure cloud services (AKS, Azure SQL, App Services, Virtual Networks, Azure Monitor, etc.)
- Proficiency in at least one programming or scripting language (Python, Go, Bash, PowerShell, or C#)
- Experience designing and managing CI/CD pipelines using GitHub Actions, Azure DevOps, or equivalent
- Hands-on experience with containerization and orchestration technologies (Docker, Kubernetes)
- Demonstrated experience with infrastructure-as-code tools (e.g. Bicep + ARM templates)
- Strong understanding of networking fundamentals, DNS, load balancing, and TLS/SSL management
- Experience with monitoring and observability platforms (Azure Monitor, Alerts, App Insights, Seq, etc.)
- Proven track record of managing production incidents, conducting post-mortems, and driving reliability improvements
- Exceptional analytical, interpersonal, and communication skills
Benefits
- Competitive salary
- Health, dental, and vision plans
- 401(k) Retirement savings plan for US-based employees
- 100% remote work environment, with occasional travel for in-person company and/or team meetings
- Unlimited PTO
- Significant professional development growth opportunities
- Dynamic and inclusive company culture with real commitment to our values
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps – Platform Engineer, Harness
XebiaCreating Digital Leaders. Digital Transformation Consultancy Services and Solutions
• Own and evolve the Harness platform while enabling fast, safe, and reliable cloud-native deployments across AWS, Azure, and GCP environments • Design and maintain Harness CI/CD pipelines for Kubernetes, ECS, Serverless, and VM workloads • Implement modern deployment strategies including Canary and Blue-Green releases • Build reusable pipeline templates and delivery workflows • Standardize infrastructure provisioning using Terraform and Helm / Kustomize • Embed Security, quality gates, and automated testing into CI/CD pipelines • Integrate Observability tooling and support platform reliability • Onboard and enable engineering teams on platform capabilities
• Architect and operate multi-region deployments across AWS, GCP, or Azure • Build and maintain high-throughput telemetry ingestion pipelines • Design autoscaling and failover strategies for mission-critical services • Own observability systems including Prometheus, Grafana, and distributed tracing • Improve MTTR and operational readiness processes • Manage CI/CD pipelines, GitOps workflows, and automated deployments • Collaborate with backend teams on API performance and infrastructure reliability • Harden infrastructure for security, compliance, and tenant isolation • Drive long-term infrastructure roadmap and architectural direction
Senior DevOps, Platform Engineer
XebiaCreating Digital Leaders. Digital Transformation Consultancy Services and Solutions
• own and evolve the Harness platform • enable fast, safe, and reliable cloud-native deployments across AWS, Azure, and GCP environments • design and maintain Harness CI/CD pipelines for Kubernetes, ECS, Serverless, and VM workloads • implement modern deployment strategies including Canary and Blue-Green releases • build reusable pipeline templates and delivery workflows • standardize infrastructure provisioning using Terraform and Helm / Kustomize • embed Security, quality gates, and automated testing into CI/CD pipelines • integrate Observability tooling and support platform reliability • onboard and enable engineering teams on platform capabilities
• Own and evolve the Harness platform while enabling fast, safe, and reliable cloud-native deployments across AWS, Azure, and GCP environments • Design and maintain Harness CI/CD pipelines for Kubernetes, ECS, Serverless, and VM workloads • Implement modern deployment strategies including Canary and Blue-Green releases • Build reusable pipeline templates and delivery workflows • Standardize infrastructure provisioning using Terraform and Helm / Kustomize • Embed Security, quality gates, and automated testing into CI/CD pipelines • Integrate Observability tooling and support platform reliability • Onboard and enable engineering teams on platform capabilities



