Job Closed

This listing is no longer active.

Red Hat

The leading provider of enterprise open source solutions.

Senior Site Reliability Engineer

DevOps Engineer · Other · Remote · Senior · Team 10,001+ · Since 1993 · H1B Sponsor

Location

North Carolina

Posted

133 days ago

Salary

$111.3K - $183.6K / year

Seniority

Senior

Job Description

  • Contribute code to increase the scalability and reliability of the service
  • Contribute software tests and participate in peer review to increase the quality of our codebase
  • Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
  • Participate in a regular on-call schedule, including occasional paid weekends and holidays
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from the Red Hat Global Support team
  • Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve
  • Collaborate with cross-functional teams to identify opportunities for AI integration within the software development lifecycle

Job Requirements

  • A bachelor's degree in Computer Science or a related technical field involving software or systems engineering is required
  • Experience programming in at least one of these languages: Python, Golang, Java, C, C++ or another object-oriented language
  • Experience working with public clouds such as AWS, GCP, or Azure
  • Ability to collaboratively troubleshoot and solve problems in a team setting
  • Experience troubleshooting an as-a-service offering (SaaS, PaaS, etc.)
  • Experience working with complex distributed systems
  • Direct experience with Kubernetes or OpenShift is a plus
  • A demonstrated ability to debug, optimize code and automate routine tasks
  • A basic understanding of Unix/Linux operating systems
  • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider
  • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus
  • 3+ years of experience with enterprise configuration management software like Ansible, Puppet, or Chef
  • 2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred
  • 2+ years of experience delivering a hosted service
  • Demonstrated ability to quickly and accurately troubleshoot system issues
  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
  • Solid communications skills and experience working directly with and presenting to customers
  • 1+ year(s) of experience with Kubernetes is a plus
  • 1+ year(s) of experience with Docker-based containers is a plus

Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, and employee assistance program

More DevOps Engineer Jobs

Site Reliability Engineer

Beacon Biosignals

Our mission is to improve human health by enabling rapid, targeted interventions through advanced brain analytics.

DevOps Engineer · 133 days ago
Other · Remote · Team 11-50 · H1B No Sponsor

  • Design and implement infrastructure as code solutions that improve reliability, security, and maintainability of our cloud infrastructure
  • Lead and execute major infrastructure initiatives including cluster upgrades, security improvements, and architectural changes
  • Develop and maintain CI/CD pipelines that enable teams to deploy safely and efficiently
  • Improve observability across our systems through enhanced monitoring, logging, and alerting
  • Participate in an on-call rotation and lead incident response efforts when issues arise
  • Collaborate with development teams to improve application reliability and performance
  • Maintain and enhance our security posture through infrastructure hardening and automation
  • Create and maintain documentation for infrastructure, deployment processes, and incident response procedures

Massachusetts
$150K - $170K / year
Job Closed

Staff DevOps Engineer

Life360

The #1 family safety app 📱

DevOps Engineer · 133 days ago
Other · Remote · Team 201-500 · Since 2008 · H1B Sponsor

  • Partner closely with data engineering and data science teams to enable reliable data pipelines, analytics, and ML workflows
  • Support, operate, and optimize Databricks and Snowflake environments in production
  • Operate and support Databricks Jobs and Apache Airflow DAGs created by development teams, ensuring reliable orchestration of data pipelines and Databricks workflows
  • Monitor, troubleshoot, and optimize systems for performance, reliability, and cost efficiency
  • Partner closely with security and compliance teams to implement data security, auditing, and access review processes
  • Design, implement, and maintain CI/CD pipelines using Jenkins and GitOps practices
  • Provision and manage cloud and data platform infrastructure using Terraform and infrastructure-as-code best practices
  • Deploy and operate platform and data services on Kubernetes (EKS) clusters
  • Implement and maintain GitOps workflows using Argo CD
  • Package, deploy, and manage services using Helm and Kustomize
  • Automate configuration management and operational tasks using Ansible
  • Implement and maintain observability and monitoring for infrastructure and data platforms
  • Participate in incident response and on-call rotations supporting data and platform systems
  • Contribute to operational standards, documentation, and best practices
  • Lead technical initiatives, mentor team members, and influence architectural direction

United States
$163.5K - $237.5K / year
Job Closed
Other · Remote · Team 1,001-5,000

This description is a summary of our understanding of the job description.

Role Description

VOLT is seeking a hands-on Field & Cloud DevOps Engineer to own the deployment, operation, and reliability of our edge and cloud infrastructure. This role sits at the intersection of customer environments, edge computing, and AWS. You will be responsible for ensuring that our systems deploy cleanly, operate reliably in real-world conditions, and scale efficiently as we grow.

This is not a pure cloud role and not a traditional field engineering role. You will work directly with customer IT teams, deploy and maintain edge hardware, and operate the AWS and Kubernetes infrastructure that supports those systems. We are looking for someone who is comfortable owning systems end to end (hardware, networking, automation, and cloud) and who can confidently drive technical conversations with other engineers and IT professionals.

  • Own end-to-end infrastructure operations spanning customer sites, edge devices, and AWS production environments.
  • Plan and execute customer deployments in collaboration with customer IT teams, including:
      - Leading technical discussions around network architecture, security requirements, and deployment constraints.
      - Translating customer IT policies into executable deployment plans.
      - Driving deployment readiness and timelines to ensure on-time launches.
  • Deploy, configure, and maintain edge compute hardware in customer environments, including:
      - Installing and maintaining Linux-based operating systems.
      - Creating, deploying, and updating standardized golden images.
      - Managing OS patches, drivers, firmware, and security updates.
      - Diagnosing hardware, OS, and performance issues remotely and on-site.
  • Design, integrate, and troubleshoot customer-side networking, including:
      - VLANs, firewall rules, NAT, routing, and bandwidth constraints.
      - IP camera and video infrastructure (RTSP, ONVIF, PoE, managed switches).
      - Operating effectively within locked-down or highly regulated networks.
  • Build and operate AWS-based infrastructure supporting edge systems, including:
      - Kubernetes (EKS) clusters and containerized workloads.
      - CI/CD pipelines for deployment and upgrades.
      - Logging, monitoring, and alerting across distributed systems.
  • Develop and maintain automation and internal tooling, including:
      - Python for deployment automation, operational tooling, and system workflows.
      - Shell scripting for provisioning, configuration, and diagnostics.
      - Tools that reduce manual effort and improve deployment repeatability.
  • Ensure reliable edge-to-cloud connectivity and data flow, debugging failures across networking, software, and infrastructure boundaries.
  • Lead incident response and operational debugging across edge and cloud systems, driving issues to root cause and permanent resolution.
  • Own infrastructure cost visibility and optimization, including:
      - Monitoring and analyzing AWS spend.
      - Implementing right-sizing, scaling, and cost controls.
      - Working with external cost-optimization partners and tooling.
  • Feed real-world deployment and operations insights back into product and infrastructure design, improving reliability and scalability.

Qualifications

  • Experience with edge computing or on-prem deployments.
  • Familiarity with IP camera systems, RTSP, video infrastructure, and Video Management Systems.
  • Experience with infrastructure-as-code (Terraform or CloudFormation).
  • Experience managing or optimizing cloud infrastructure costs.
  • Background working in fast-moving startup environments.

Requirements

  • Customer deployments are predictable, on-time, and repeatable.
  • Edge systems are stable, secure, and easy to operate at scale.
  • Cloud infrastructure is reliable, observable, and cost-efficient.
  • Issues are diagnosed quickly and resolved permanently.
  • Engineering teams move faster because infrastructure “just works.”

Benefits

  • $135,000 - $160,000 a year.
  • Ownership, autonomy, and direct impact on how the company scales its infrastructure and serves its customers.
  • Opportunity to work on systems that matter: deployed in real environments, solving real problems.

United States
$135K - $160K / year
Job Closed

Head of Dev Ops, Cloud & Infrastructure

EnterpriseAlumni

Corporate Alumni Engagement & Management Platform For The Enterprise

DevOps Engineer · 133 days ago
Full Time · Remote · Team 51-200 · H1B No Sponsor

  • Lead our infrastructure strategy and team
  • Own the reliability, security, scalability, and cost-efficiency of our cloud infrastructure
  • Work closely with the CTO and collaborate directly with our development teams
  • Architect, build, and maintain scalable, secure, multi-regional cloud infrastructure on AWS
  • Own our Infrastructure as Code practices using Terraform
  • Design and optimize CI/CD pipelines across Jenkins and CircleCI
  • Manage container orchestration via EC2/ECS/ECR and Kubernetes
  • Lead observability strategy using Grafana and Prometheus
  • Drive high availability and disaster recovery planning across regions
  • Ensure infrastructure meets SOC 2, ISO 27001, and Cyber Essentials+ requirements
  • Implement and maintain robust security practices
  • Continuously monitor and optimize cloud spend
  • Build, mentor, and lead the DevOps and infrastructure team
  • Foster a culture of ownership, collaboration, and continuous improvement
  • Partner closely with development teams to ensure infrastructure supports application needs

Argentina
$55K - $65K / year
Job Closed