Job Closed

This listing is no longer active.

Hewlett Packard Enterprise logo
Hewlett Packard Enterprise

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world.

Site Reliability Engineer – DevOps

DevOps EngineerDevOps EngineerFull TimeRemoteLeadTeam 10,001+Since 2015H1B SponsorCompany SiteLinkedIn

Location

Netherlands

Posted

95 days ago

Salary

0

Seniority

Lead

Job Description

Site Reliability Engineer – DevOps

Hewlett Packard Enterprise

• Express your passion about infrastructure as code and continuous deployment to build scalable and highly reliable systems. • Define and own KPIs around system availability, quality and scale. • Partner with our developers and quality engineering teams to automate the monitoring, alerting, availability and scalability of our applications and systems. • Ensure system availability and business continuity by implementing redundant servers/services. • Manage after-hours infrastructure updates and maintenance. • Proactively research and propose the use of new concepts, processes, technologies, and tools. • Partner with software developers to create Mist standards for Microservices (APIs, schemas, serialization, data stores and best practices). • Run secure and scalable applications for highly available, multi-region, AWS and GCP deployments. • Ship code several times per week. • Be a part of our On-Call rotation. • Own disaster recovery and business continuity plans.

Job Requirements

  • An extensive background in developing and operating large-scale cloud-based distributed applications.
  • Direct experience developing/running applications on AWS or Google Cloud.
  • Laser focus and be able to design infrastructure solutions for scalability, reliability, high availability, performance, security, software maintainability, and operational excellence.
  • The ability to 'fix the plane while in flight' (not just support greenfield solutions).
  • The ability to prioritize existing technical and infrastructure debt, and experience to build and execute a plan to pay it off.
  • Delivering web-scale infrastructure for a global market at high release velocity.
  • A deep understanding of distributed system design and dependency management.
  • Must have solid experience with at least 2 of the languages: Go, Java, Python.
  • 10+ years industry experience in managing infrastructure.
  • 5 years Kubernetes administration in a large-scale SaaS environment.
  • 5 years maintaining production systems on AWS or GCP.
  • 3 years in implementing, managing, and monitoring metrics specific to SaaS applications.
  • 3 years using infrastructure as code software (eg. Terraform, AWS and Google Cloud Deployment, CloudFormation).
  • 5 years’ experience in continuous integration practices & tools (Jenkins, Travis CI, CircleCI, etc…).

Benefits

  • Health & Wellbeing We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
  • Personal & Professional Development We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.
  • Unconditional Inclusion We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Kapitus logo

DevSecOps Engineer II

Kapitus

We believe business owners should be able to focus on running their business, while we take care of the financing.

DevOps Engineer95 days ago
OtherRemoteTeam 201-500Since 2006H1B No Sponsor

• Perform day-to-day Salesforce administration, including user setup, profiles, permissions, roles, workflows, validation rules, automation (Flows), and other configurations to streamline operations • Execute Salesforce deployments using Gearset and Salesforce Change Sets to maintain consistent, compliant release cycles • Evaluate, implement, and maintain Gearset deployments and third-party integrations • Manage data imports, migrations, and bulk updates, ensuring high levels of data accuracy and integrity • Conduct recurring data audits and cleanup activities to ensure ongoing database health • Establish and enforce data entry standards, deduplication processes, and governance practices • Create and manage custom objects, fields, page layouts, and configurations to support new business functionality • Collaborate with cross-functional teams to troubleshoot issues, implement enhancements, and ensure system stability • Maintain Salesforce platform updates, security standards, release features, and industry best practices. • Maintain accurate system documentation and create training materials for end users • Cloud architecture experience in AWS environment and container-based deployments using AWS CodePipeline and CloudFormation. • Experience with various AWS services like ECS, S3, Lambda, and Route53

United States
$96.3K - $154.4K / year
Job Closed
Perfect Venue logo

Founding DevOps Engineer

Perfect Venue

The best event management software for independent venues and hospitality groups.

DevOps Engineer95 days ago
OtherRemoteTeam 1-10H1B No Sponsor

• Define and implement our infrastructure architecture from scratch • Rebuild our CI/CD pipeline to better scale with a growing team • Own all infrastructure-as-code and environment provisioning • Design our observability strategy (metrics, logs, traces, alerting) • Establish best practices for reliability, scaling, and incident response • Own security fundamentals (secrets, access control, production hardening) • Partner with application engineers to create a fast, reliable developer experience • Make foundational decisions on cloud, tooling, and architecture • Contribute as an application developer when needed

United States
Job Closed
NetBox Labs logo

Senior DevOps Engineer, Cloud Delivery

NetBox Labs

We make it easier to build and manage complex networks.

DevOps Engineer95 days ago
Full TimeRemoteTeam 11-50Since 2023H1B No Sponsor

• Own and evolve the AWS infrastructure underpinning NetBox Cloud: EKS clusters, VPC design, IAM, RDS, and supporting services. • Lead infrastructure projects from design through delivery, scoping work into clear milestones and distributing tasks across the team. • Work with Product teams to deliver successful customer launches reliably and at scale. • Drive improvements to deployment automation and CI/CD pipelines (GitHub Actions), with a focus on reliability, speed, and developer self-service. • Identify and address technical debt before it becomes a reliability or security risk. • Set and own SLOs for the systems you're responsible for; reduce toil through automation. • Contribute to SOC 2 compliance, security controls, and IAM governance. • Participate in the on call rotation for the platform • Mentor engineers through code review, pairing, and design discussions. • Participate in on-call rotation and lead incident response end to end, including postmortems.

United States
$165K - $185K / year
AlphaSense logo

Staff Site Reliability Engineer

AlphaSense

The market intelligence and search platform trusted by over 3,500 leading organizations

DevOps Engineer95 days ago
Full TimeRemoteTeam 1,001-5,000Since 2011H1B Sponsor

• Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a 'You Build It, You Run It' culture. • Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention. • Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards. • Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements. • Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively. • Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing.

United States
$150K - $225K / year
Job Closed