Job Closed

This listing is no longer active.

Data Reliability Engineer

DevOps Engineer | Full Time | Remote | Senior | Team 10,001+ | H1B Sponsor

Location

United States

Posted

70 days ago

Salary

$87.4K - $123.4K / year

Seniority

Senior

Job Description

Data Reliability Engineer

Empower

  • Own the reliability and stability of production data pipelines and data platform services
  • Diagnose and resolve data pipeline failures, delays, and data quality issues in production environments
  • Investigate issues across distributed data systems (e.g., Spark/EMR workloads, ingestion pipelines, warehouse performance)
  • Lead or support incident response, including triage, mitigation, and long-term resolution
  • Perform root cause analysis (RCA) and implement durable fixes to prevent recurrence
  • Define and improve data SLAs (freshness, latency, completeness) and ensure adherence
  • Design and enhance monitoring, alerting, and observability for data systems
  • Develop automation and tooling to reduce operational toil and improve system resilience
  • Contribute to disaster recovery (DR) and resiliency planning, including backup validation and recovery workflows
  • Partner with engineering teams to improve pipeline design, reliability, and operational readiness
  • Create and maintain runbooks, SOPs, and operational documentation
  • Participate in occasional off-hours support for production data systems when required

Job Requirements

  • Strong experience working with production data platforms in AWS environments
  • Prior experience building data pipelines and seeing them through production, including exposure to real-world failures and operational challenges
  • Strong experience with Python and SQL in real data systems
  • Hands-on experience troubleshooting distributed data processing systems (e.g., Spark/EMR, Redshift, streaming systems)
  • Proven ability to debug and resolve production issues in data pipelines and data platforms
  • Experience with AWS data services (such as EMR, Redshift, DynamoDB, S3, or similar)
  • Experience handling production incidents and performing root cause analysis
  • Strong problem-solving mindset and ability to work through ambiguous production issues

Benefits

  • Medical, dental, vision and life insurance
  • Retirement savings – 401(k) plan with generous company matching contributions (up to 6%), financial advisory services, potential company discretionary contribution, and a broad investment lineup
  • Tuition reimbursement up to $5,250/year
  • Business-casual environment that includes the option to wear jeans
  • Generous paid time off upon hire – including a paid time off program plus ten paid company holidays and three floating holidays each calendar year
  • Paid volunteer time – 16 hours per calendar year
  • Leave of absence programs – including paid parental leave, paid short- and long-term disability, and Family and Medical Leave (FMLA)
  • Business Resource Groups (BRGs) – BRGs facilitate inclusion and collaboration across our business internally and throughout the communities where we live, work and play. BRGs are open to all.

More DevOps Engineer Jobs

Part Time | Remote | Team 201-500 | H1B No Sponsor

  • Research European cloud providers
  • Support the testing of tools
  • Contribute to our DevOps curriculum
  • Opportunity to work on development tasks in real projects

Germany
€15 - €20 / hour

ICT Trainee Program

Swisscom

Top quality | Ground-breaking innovations | Connected to people and the environment

DevOps Engineer | 70 days ago
Full Time | Remote | Team 10,001+ | Since 1998 | H1B No Sponsor

Your future starts here

In our twelve-month trainee program, you start your career in the exciting, fast-paced ICT environment and actively help shape the connected Switzerland of the future. In our ICT trainee program, you work in several projects and teams. You can attend various workshops and expand your network, laying the foundation for a successful future.

  • Several (IT) project assignments and a great future lie ahead of you
  • Build a network that is valuable for your professional future
  • Take part in workshops on topics such as agility, human-centered design, or presentation skills
  • Exchange ideas with nine other trainee buddies who start together with you on 1 November 2026

Your skills

  • Master's degree (ETH/university/university of applied sciences), completed by summer 2026 at the latest and no more than one year ago
  • ICT affinity: terms such as cloud, security, blockchain, AI, big data & analytics, IoT, and 5G are familiar to you
  • Initial professional experience is desirable
  • Good German and English
  • You are curious and enthusiastic, with a strong ability to learn and a strong drive to perform
  • You have completed the video interview and uploaded your dossier
  • You enjoy building a large network
  • You are communicative and see presenting to a management board as an exciting challenge

Would you like to make a difference within Swisscom and become part of the trainee program? Then apply directly here, upload your documents, and also complete your video interview. Further information about the trainee program at Swisscom can be found here. Important: applications without a video interview will not be considered.

With us, you can work in one of our offices in Switzerland or from home. You will come into contact with agile working methods and the latest technologies. We offer flexible working hours to suit your personal needs. As a Swisscom employee, you can look forward to a wide range of attractive benefits that will enrich your work experience, including a pleasant working environment, financial benefits, and exciting opportunities for professional development. Discover your benefits.

Is it a match? Apply now. Discover your possibilities.

Any questions? Here you'll find answers to the most important and frequently asked questions.

To all recruitment agencies: Swisscom does not accept agency CVs. Please do not forward CVs to our job's alias, Swisscom employees or any other organisation location. Swisscom is not responsible for any fees related to unsolicited CVs.

Contact person: Nadine Eschbach, Talent Acquisition Manager, +41 (58) 2233918

Your homebase: Swisscom (Schweiz) AG, Alte Tiefenaustrasse 6, 3048 Worblaufen

Switzerland
Job Closed
Full Time | Remote | Team 51-200 | Since 1997 | H1B No Sponsor

  • Automation of CI/CD processes
  • Configuration management
  • Automation of infrastructure provisioning
  • Understand the company's products and their business rules
  • Develop product requirements based on specifications
  • Report activity results and clarify questions with the technical lead and Product Owner
  • Identify, report, and address technical debt
  • Analyze and resolve issues in production environments

Brazil
Job Closed

Site Reliability Engineer

BEL USA LLC

Be Part of Something Great

DevOps Engineer | 70 days ago
Full Time | Remote | Team 1,001-5,000 | Since 1995 | H1B No Sponsor

  • Focus on improving service reliability through automation
  • Reduce operational toil and implement SLOs and error budgets
  • Partner closely with software engineering teams for production operations
  • Define, measure, and manage SLIs, SLOs, and error budgets
  • Analyze system performance and improve reliability, resilience, and scalability
  • Lead reliability reviews and prevent incidents
  • Build and optimize monitoring, logging, and alerting systems
  • Implement distributed tracing
  • Enhance CI/CD pipelines for safe deployments
  • Participate in on-call rotations and lead incident responses

Philippines