Job Closed

This listing is no longer active.

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world.

Site Reliability Engineer – DevOps

DevOps Engineer | Full Time | Remote | Lead | Team 10,001+ | Since 2015 | H1B Sponsor

Location

Netherlands

Posted

146 days ago

Salary

Seniority

Lead

Bachelor Degree | 10 yrs exp | English

AWS GCP Java Jenkins Kubernetes Microservices Python Terraform

Job Description

• Express your passion for infrastructure as code and continuous deployment to build scalable and highly reliable systems.
• Define and own KPIs around system availability, quality and scale.
• Partner with our developers and quality engineering teams to automate the monitoring, alerting, availability and scalability of our applications and systems.
• Ensure system availability and business continuity by implementing redundant servers/services.
• Manage after-hours infrastructure updates and maintenance.
• Proactively research and propose the use of new concepts, processes, technologies, and tools.
• Partner with software developers to create Mist standards for Microservices (APIs, schemas, serialization, data stores and best practices).
• Run secure and scalable applications for highly available, multi-region, AWS and GCP deployments.
• Ship code several times per week.
• Be a part of our On-Call rotation.
• Own disaster recovery and business continuity plans.

Job Requirements

An extensive background in developing and operating large-scale cloud-based distributed applications.
Direct experience developing/running applications on AWS or Google Cloud.
Laser focus and the ability to design infrastructure solutions for scalability, reliability, high availability, performance, security, software maintainability, and operational excellence.
The ability to 'fix the plane while in flight' (not just support greenfield solutions).
The ability to prioritize existing technical and infrastructure debt, and experience to build and execute a plan to pay it off.
Experience delivering web-scale infrastructure for a global market at high release velocity.
A deep understanding of distributed system design and dependency management.
Must have solid experience with at least two of these languages: Go, Java, Python.
10+ years industry experience in managing infrastructure.
5 years Kubernetes administration in a large-scale SaaS environment.
5 years maintaining production systems on AWS or GCP.
3 years in implementing, managing, and monitoring metrics specific to SaaS applications.
3 years using infrastructure-as-code software (e.g., Terraform, AWS and Google Cloud Deployment, CloudFormation).
5 years’ experience in continuous integration practices & tools (Jenkins, Travis CI, CircleCI, etc.).

Benefits

Health & Wellbeing: We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
Personal & Professional Development: We also invest in your career because the better you are, the better we all are. We have specific programs tailored to helping you reach any career goals you have, whether you want to become a knowledge expert in your field or apply your skills to another division.
Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

More DevOps Engineer Jobs

DevSecOps Engineer II

Kapitus

Kapitus is a financial services company that handles the financing for clients, enabling them to focus more on running their businesses. Headquartered in New Yo

DevOps Engineer | 146 days ago | Other | Remote

• Perform day-to-day Salesforce administration, including user setup, profiles, permissions, roles, workflows, validation rules, automation (Flows), and other configurations to streamline operations
• Execute Salesforce deployments using Gearset and Salesforce Change Sets to maintain consistent, compliant release cycles
• Evaluate, implement, and maintain Gearset deployments and third-party integrations
• Manage data imports, migrations, and bulk updates, ensuring high levels of data accuracy and integrity
• Conduct recurring data audits and cleanup activities to ensure ongoing database health
• Establish and enforce data entry standards, deduplication processes, and governance practices
• Create and manage custom objects, fields, page layouts, and configurations to support new business functionality
• Collaborate with cross-functional teams to troubleshoot issues, implement enhancements, and ensure system stability
• Maintain Salesforce platform updates, security standards, release features, and industry best practices.
• Maintain accurate system documentation and create training materials for end users
• Cloud architecture experience in AWS environment and container-based deployments using AWS CodePipeline and CloudFormation.
• Experience with various AWS services like ECS, S3, Lambda, and Route53

AWS Azure

United States

$96.3K - $154.4K / year

Founding DevOps Engineer

Perfect Venue

The best event management software for independent venues and hospitality groups.

DevOps Engineer | 146 days ago | Other | Remote | Team 1-10 | H1B No Sponsor

• Define and implement our infrastructure architecture from scratch
• Rebuild our CI/CD pipeline to better scale with a growing team
• Own all infrastructure-as-code and environment provisioning
• Design our observability strategy (metrics, logs, traces, alerting)
• Establish best practices for reliability, scaling, and incident response
• Own security fundamentals (secrets, access control, production hardening)
• Partner with application engineers to create a fast, reliable developer experience
• Make foundational decisions on cloud, tooling, and architecture
• Contribute as an application developer when needed

JavaScript Python Ruby Terraform

United States

Senior DevOps Engineer, Cloud Delivery

NetBox Labs

We make it easier to build and manage complex networks.

DevOps Engineer | 146 days ago | Full Time | Remote | Team 11-50 | Since 2023 | H1B No Sponsor

• Own and evolve the AWS infrastructure underpinning NetBox Cloud: EKS clusters, VPC design, IAM, RDS, and supporting services.
• Lead infrastructure projects from design through delivery, scoping work into clear milestones and distributing tasks across the team.
• Work with Product teams to deliver successful customer launches reliably and at scale.
• Drive improvements to deployment automation and CI/CD pipelines (GitHub Actions), with a focus on reliability, speed, and developer self-service.
• Identify and address technical debt before it becomes a reliability or security risk.
• Set and own SLOs for the systems you're responsible for; reduce toil through automation.
• Contribute to SOC 2 compliance, security controls, and IAM governance.
• Participate in the on-call rotation for the platform.
• Mentor engineers through code review, pairing, and design discussions.
• Participate in on-call rotation and lead incident response end to end, including postmortems.

AWS Cloud Grafana Kubernetes Prometheus Python Shell Scripting Terraform Go

New Jersey

$165K - $185K / year

Staff Site Reliability Engineer

AlphaSense

The market intelligence and search platform trusted by over 3,500 leading organizations

DevOps Engineer | 146 days ago | Full Time | Remote | Team 1,001-5,000 | Since 2011 | H1B Sponsor

• Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a 'You Build It, You Run It' culture.
• Lead AI-Driven Reliability: Drive our AIOps strategy: automating diagnostics, remediation, and proactive failure prevention.
• Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards.
• Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements.
• Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively.
• Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing.

AWS Azure Cloud DNS Google Cloud Platform Grafana Kubernetes Prometheus Python TCP/IP Go

United States

$150K - $225K / year
