Job Closed

This listing is no longer active.

Jobgether logo
Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Senior Reliability Engineer Executive

Location

United States

Posted

95 days ago

Salary

0

No structured requirement data.

Job Description

Senior Reliability Engineer Executive

Jobgether

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description This position is posted by Jobgether on behalf of a partner company. We are currently looking for a VP of Engineering, Reliability. In this pivotal role, you'll define and execute the reliability engineering roadmap while managing a team responsible for ensuring system stability across cutting-edge infrastructure and AI-native architectures. Your impact will bridge the gap between engineering efficiency and operational excellence, paving the way for scalable growth and enhanced service delivery. This position demands a visionary leader with a track record of transforming reliability within innovative technology environments. You will leverage your extensive experience to create a forward-looking vision that meets organizational goals while ensuring compliance and security. - Define and execute the reliability engineering roadmap, aligning with enterprise growth. - Balance centralized platform capabilities with distributed ownership for scalability. - Establish SLO/SLI/error budget frameworks for feature velocity and system stability. - Lead infrastructure cost management and capacity planning to meet enterprise commitments. - Develop and scale a multi-disciplinary team while fostering a culture of ownership. - Drive continuous improvement through DORA metrics and incident trend analysis. - Empower developers with self-service tooling and clear documentation. - Act as the primary engineering interface for compliance and security requirements. - Collaborate with executives to position reliability as a key enabler for success. Qualifications - 15+ years of engineering experience, with 7+ years in leading reliability or infrastructure teams. - Proven track record managing organizations of 40+ engineers across multiple teams. - Demonstrated experience evolving reliability operating models for scalable businesses. - Expertise in regulated sectors where compliance and data sensitivity are critical. - Strong understanding of SRE principles, including SLOs and incident management. - Technical command of AWS, Terraform (IaC), and modern observability stacks. - Experience owning cloud infrastructure budgets and cost management. - Familiarity with AI/ML workloads and their reliability requirements. - Executive presence for engaging with the C-suite on risk management. Benefits - A dynamic, rapidly growing organization focused on helping businesses thrive. - Comprehensive Medical, Dental, & Vision Insurance for full-time employees. - Competitive and fair pay commensurate with experience. - Maternity and paternity leave policies for full-time employees. - Short and long-term disability coverage. - Opportunities to learn from a dedicated leadership team. - Top-of-the-line company swag for team members.

Job Requirements

  • 15+ years of engineering experience, with 7+ years in leading reliability or infrastructure teams.
  • Proven track record managing organizations of 40+ engineers across multiple teams.
  • Demonstrated experience evolving reliability operating models for scalable businesses.
  • Expertise in regulated sectors where compliance and data sensitivity are critical.
  • Strong understanding of SRE principles, including SLOs and incident management.
  • Technical command of AWS, Terraform (IaC), and modern observability stacks.
  • Experience owning cloud infrastructure budgets and cost management.
  • Familiarity with AI/ML workloads and their reliability requirements.
  • Executive presence for engaging with the C-suite on risk management.

Benefits

  • A dynamic, rapidly growing organization focused on helping businesses thrive.
  • Comprehensive Medical, Dental, & Vision Insurance for full-time employees.
  • Competitive and fair pay commensurate with experience.
  • Maternity and paternity leave policies for full-time employees.
  • Short and long-term disability coverage.
  • Opportunities to learn from a dedicated leadership team.
  • Top-of-the-line company swag for team members.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Software Mind logo

Data Platform DevOps Engineer, GCP

Software Mind

Software House focused on results since 1999

DevOps Engineer95 days ago
Full TimeRemoteTeam 1,001-5,000Since 1999H1B No Sponsor

• Join a new, strategic data transformation project, moving analytics from on-premise to GCP and building data architecture and model from scratch with focus on business value creation. • Collaborate with data engineering, analytics and operations teams to streamline data applications, including big data and operational workflows. • Provide documentation of infrastructure, processes and compliance controls. • Monitor infrastructure health, performance and security, and resolve issues promptly. • Conduct regular reviews and audits of systems to ensure ongoing compliance and drive remediation as needed. • Implement and enforce privacy and security requirements in line with organizational and regulatory standards. • Lead the technical implementation of access controls, encryption, data retention and security monitoring. • Automate and document recurring operational and compliance procedures to ensure reliability and transparency.

Poland
Job Closed
Hewlett Packard Enterprise logo

Site Reliability Engineer – DevOps

Hewlett Packard Enterprise

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world.

DevOps Engineer95 days ago
Full TimeRemoteTeam 10,001+Since 2015H1B Sponsor

• Express your passion about infrastructure as code and continuous deployment to build scalable and highly reliable systems. • Define and own KPIs around system availability, quality and scale. • Partner with our developers and quality engineering teams to automate the monitoring, alerting, availability and scalability of our applications and systems. • Ensure system availability and business continuity by implementing redundant servers/services. • Manage after-hours infrastructure updates and maintenance. • Proactively research and propose the use of new concepts, processes, technologies, and tools. • Partner with software developers to create Mist standards for Microservices (APIs, schemas, serialization, data stores and best practices). • Run secure and scalable applications for highly available, multi-region, AWS and GCP deployments. • Ship code several times per week. • Be a part of our On-Call rotation. • Own disaster recovery and business continuity plans.

Netherlands
Job Closed
Kapitus logo

DevSecOps Engineer II

Kapitus

We believe business owners should be able to focus on running their business, while we take care of the financing.

DevOps Engineer95 days ago
OtherRemoteTeam 201-500Since 2006H1B No Sponsor

• Perform day-to-day Salesforce administration, including user setup, profiles, permissions, roles, workflows, validation rules, automation (Flows), and other configurations to streamline operations • Execute Salesforce deployments using Gearset and Salesforce Change Sets to maintain consistent, compliant release cycles • Evaluate, implement, and maintain Gearset deployments and third-party integrations • Manage data imports, migrations, and bulk updates, ensuring high levels of data accuracy and integrity • Conduct recurring data audits and cleanup activities to ensure ongoing database health • Establish and enforce data entry standards, deduplication processes, and governance practices • Create and manage custom objects, fields, page layouts, and configurations to support new business functionality • Collaborate with cross-functional teams to troubleshoot issues, implement enhancements, and ensure system stability • Maintain Salesforce platform updates, security standards, release features, and industry best practices. • Maintain accurate system documentation and create training materials for end users • Cloud architecture experience in AWS environment and container-based deployments using AWS CodePipeline and CloudFormation. • Experience with various AWS services like ECS, S3, Lambda, and Route53

United States
$96.3K - $154.4K / year
Job Closed
Perfect Venue logo

Founding DevOps Engineer

Perfect Venue

The best event management software for independent venues and hospitality groups.

DevOps Engineer95 days ago
OtherRemoteTeam 1-10H1B No Sponsor

• Define and implement our infrastructure architecture from scratch • Rebuild our CI/CD pipeline to better scale with a growing team • Own all infrastructure-as-code and environment provisioning • Design our observability strategy (metrics, logs, traces, alerting) • Establish best practices for reliability, scaling, and incident response • Own security fundamentals (secrets, access control, production hardening) • Partner with application engineers to create a fast, reliable developer experience • Make foundational decisions on cloud, tooling, and architecture • Contribute as an application developer when needed

United States
Job Closed