Job Closed

This listing is no longer active.

HiBob is a modern HR technology company focused on transforming the way organizations operate in today’s dynamic workplace. Its platform streamlines core HR processes, enhances e

Senior Site Reliability Engineer - Remote EST

DevOps EngineerDevOps EngineerFull Time Remote SeniorTeam 1,350Since 2015

Location

Connecticut + 13 more

Posted

82 days ago

Salary

$170K - $215K / year

Seniority

Senior

Bachelor Degree9 yrs expEnglishAI Argo CD AWS CI/CD Datadog GitHub Actions Kubernetes Python

Job Description

Job Description Join us as a Senior SRE where you'll bridge the gap between cutting-edge AI innovation and rock-solid production stability. Working independently from the East Coast, you will collaborate with our global DevOps teams to automate 70% of your workload while owning the reliability of our AWS/Kubernetes environment. This is a role for a production-hardened engineer who wants a strong voice in technology decisions and the opportunity to build the future of AI-driven operations. This is a fully remote role, however, you must be physically located in EST and be willing and able to work EST hours Monday-Friday and participate in on-call rotations. We cannot consider candidates located in CST, MST or PST at this time. Base salary for this role ranges from $170,000 - $215,000 per year. Job Requirements - 5+ years of experience as a Senior SRE or Production Engineer (this is a hard requirement). - Deep Production Expertise: You must have extensive experience managing live, high-traffic SaaS environments; developer-only backgrounds without ops experience will not be a fit. - Cloud & Orchestration: Proven mastery of Kubernetes and AWS in production settings. - Coding/Scripting: Advanced proficiency in Python (preferred) or Go for automation; we need more than just Bash skills. - AI Knowledge: A strong understanding of or direct experience with AI/LLM technologies. - Observability: Hands-on experience with Datadog for monitoring and incident response. - Autonomy: Ability to work independently without direct daily oversight, managing production incidents and on-call responsibilities. - Time Zone: Located in the East Coast time zone to provide coverage overlap with our global teams. Job Responsibilities - Design, build, and operate production-grade Kubernetes infrastructure on AWS - Developing Ai Agents to handle incidents and root cause analisys - Build and maintain GitOps-based CI/CD pipelines using GitHub Actions and ArgoCD - Develop internal DevOps tooling and developer self-service platforms - Own monitoring, observability, and operational excellence using Datadog - Collaborate with engineering teams to improve delivery speed and reliability Benefits HiBob is a village filled with amazing people and we're especially proud of that. It's a place where Bobbers can be themselves. We're about fun, dreams, hopes and ambition, just as much as we are about precision, growth, and top performance. Becoming a Bobber means you'll receive competitive compensation, benefits, and pre-IPO equity alongside all of this: - Stock options at a high-growth unicorn startup - 100% subsidized medical, dental, and vision coverage for employees - 401(k) with a 3% company match starting from Day 1 - Hybrid working model for bobbers in the NY metro area - Work from home allowance to get your home office set up! - Temporary remote work-from-anywhere in the world for up to 2 months after 6 months of employment - Annual Headspace subscription and wellness benefits - Two social impact days per year for volunteering - Bob balance days - 4 additional days within a calendar year - Enjoy a company-wide long weekend at the beginning of each quarter - Employee referral program - $2,500 bonus for each successful referral with an additional ambassador bonus - Fun and frequent social events (in-person and virtual) - We love birthdays - take the day off and receive a special gift - Dog-friendly office If this sounds like something you've been looking for, we'd love to have you. Come on, join our village! Location Eligibility: While this is a remote position, HiBob is currently authorized to hire in the following states: CA. CO, CT, DC, FL, GA, IL, IN, KS, MA, MD, MN, NC, NH, NJ, NV, NY, OH, OK, OR, PA, RI, SC, TN, TX, UT, VA, WA. Will consider Canadian residents as well! Candidates must reside in one of these states to be considered for employment.

Benefits

401(K), 401(K) matching, Commuter benefits, Company equity, Company-sponsored outings, Company sponsored family events, Dental insurance, Disability insurance, Volunteer in local community, Family medical leave, Flexible Spending Account (FSA), Generous parental leave, Generous PTO, Company-sponsored happy hours, Health insurance, Highly diverse management team, Open door policy, Life insurance, Mentorship program, Paid volunteer time, Online course subscriptions available, Open office floor plan, Paid holidays, Paid sick days, Performance bonus, Pet friendly, Pet insurance, Promote from within, Lunch and learns, Remote work program, Return-to-work program post parental leave, Free snacks and drinks, OKR operational model, Team workouts, Mandated unconscious bias training, Vision insurance, Wellness programs, Some meals provided, Mental health benefits, Home-office stipend for remote employees, Diversity employee resource groups, Hiring practices that promote diversity, Employee resource groups, Employee-led culture committees, Day off for your birthday, Quarterly engagement surveys, Hybrid work model, In-person revenue kickoff, Employee awards, Diversity recruitment program, Pay transparency, Wellness days, Mother's room, Virtual coaching services, Bereavement leave benefits

Related Categories

DevOps Engineer

Related Job Pages

DevOps Engineer Jobs in Connecticut Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior DevOps Engineer

SuccessKPI

All-in-one revolutionary insight and action platform that uses AI, analytics, and automation to remove CX obstacles.

DevOps Engineer82 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Be a key member of the Engineering team, responsible for designing, building, and maintaining the infrastructure that supports our SaaS analytics platform. • Champion automation, reliability, security, and scalability as you optimize cloud-based environments and drive best practices across CI/CD pipelines, monitoring, and infrastructure-as-code. • Design, implement, and maintain scalable, reliable, and secure infrastructure on cloud platforms (primarily AWS). • Create new infrastructure or environments to meet evolving customer and product demands. • Monitor infrastructure performance and availability, ensuring high uptime and efficiency. • Apply infrastructure-as-code principles using tools such as Terraform, AWS CloudFormation, or the Serverless framework. • Build, maintain, and optimize CI/CD pipelines for application deployments using tools like Jenkins, Bitbucket Pipelines, or equivalent. • Automate and standardize release processes to support frequent, reliable, and fast software delivery. • Support production release and bug-fix deployments, including environment configurations. • Develop scripts and tooling (using Python, Bash, Node.js, etc.) to automate infrastructure management, deployments, and operational tasks. • Champion continuous improvement in automation to reduce manual effort and improve reliability. • Implement and manage robust monitoring and logging systems using AWS CloudWatch, Datadog, Dynatrace, or custom solutions. • Proactively identify, troubleshoot, and resolve infrastructure and application issues before they impact end users. • Participate in on-call rotations for critical production systems support. • Apply security best practices across all infrastructure layers to ensure secure operations. • Conduct routine security audits and vulnerability assessments to maintain compliance with applicable standards and frameworks. • Partner closely with development teams to support application architecture, deployments, and infrastructure decisions. • Collaborate with QA, Product, and Customer Support teams to resolve customer-impacting issues and improve system reliability.

AWS Cloud Docker JavaScript Jenkins Kubernetes Linux Node.js Python Terraform

View details: Senior DevOps Engineer

Maryland + 1 more

Apply

Job Closed

Senior SRE / DevOps Engineer

Minor Hotels Europe and Americas

DevOps Engineer82 days ago

Full Time RemoteTeam 10,001+Since 1978H1B No Sponsor

Company Site LinkedIn

• Ensure high system reliability and uptime through proactive monitoring, incident response, and root-cause investigations using leading observability and alerting tools (Grafana, Splunk, Datadog, Prometheus, Elastic, Tines, Jira, ServiceNow, PagerDuty, VictorOps, Slack) • Maintain, optimize, and scale cloud infrastructure across AWS and Azure, ensuring cost efficiency, security, and operational excellence • Develop automation solutions (Python, bash or similar) to reduce manual work, streamline operations, and improve overall platform stability • Support and enhance CI/CD and deployment workflows using GitHub, GitLab, Terraform/Terragrunt • Manage containerized workloads, including building, deploying, and troubleshooting services running in Docker and Kubernetes environments • Collaborate closely with engineering, DevOps, security, and operations teams to drive continuous improvement and ensure service SLIs/SLOs/SLA compliance • Participate in on-call rotation and provide timely response to incidents, documenting findings and contributing to long-term reliability improvements

AWS Azure Distributed Systems Docker Grafana Kubernetes Prometheus Python ServiceNow Splunk Terraform

View details: Senior SRE / DevOps Engineer

Ukraine

Apply

Job Closed

Azure DevOps / Developer & Automation Skills

CNX

We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future.

DevOps Engineer82 days ago

Full Time RemoteTeam 10,001

Role Description We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The Concentrix Catalyst team is the driving force behind Concentrix’s transformation, data, and technology services. You will be surrounded by the best in the world providing market leading technology and insights to modernize and simplify the customer experience. - Lead end-to-end architecture for large-scale Dev platforms - Engage C-level and senior IT leadership to align outcomes with business objectives - Drive architectural decisions, risk mitigation, and long-term optimization - Establish engineering governance and operational excellence using DevOps practices and automation Qualifications - Principal Customer Engineer (15+ years) serving as a technical authority and trusted advisor for enterprise and public sector customers - Strong DevOps engineering capability with Azure DevOps and GitHub - Experience using automation and engineering practices to standardize, secure, and operationalize mission-critical platforms Requirements - Azure DevOps: Repos, Pipelines (CI/CD), Boards, Artifacts; engineering governance and delivery standardization - GitHub: GitHub repos, PR workflows, branching strategies; automation with GitHub Actions (where applicable) - DevOps operating model: release management, environment promotion, policy & approval gates, quality controls, auditability - PowerShell: advanced automation for Windows + Azure operations, diagnostics, deployment, and configuration - C# / .NET: building automation utilities, integration components, and platform tooling - Infrastructure as Code (DevOps-enabled delivery) - Standardized deployments through IaC + DevOps practices (ARM/Bicep/Terraform as engagement requires) Company Description Join us and be part of this journey towards greater opportunities and brighter futures.

Azure DevOps Azure GitHub GitHub Actions PowerShell C#.NET Infrastructure as Code Terraform CI/CD

View details: Azure DevOps / Developer & Automation Skills

Netherlands

Apply

SRE Production DevOps

General Electric - GE

Built on more than 130 years of experience, GE Vernova, a division of General Electric (GE), is leading a new era of energy by electrifying the world while work

DevOps Engineer82 days ago

Full Time Remote

Company Site

Role Description The Production DevOps Engineer serves as a critical link in the "Middle-Mile" of software delivery for the GE Vernova’s Grid Software SaaS products. This role is responsible for ensuring that software moves from development to production environments through a standardized, secure, and highly observable path. You will own the Change Management Process, serving as a primary authority for production deployments to ensure that new SaaS product versions do not compromise the stability of global energy grid operations. This position requires a strong technical background in automation and a disciplined approach to release safety in a 24/7 operational environment. Works independently and is seen as a Technical Leader. The role demonstrates deep understanding of concurrent software development, its effect on build management and releasing the builds across versions and environments. Qualifications - 3–5 years of experience in DevOps, SRE, or Release Engineering roles for cloud-native SaaS applications. - Bachelor's Degree in Computer Science or “STEM” Majors (Science, Technology, Engineering and Math) with advanced experience. Requirements - Hands-on experience with Jenkins, Artifactory, GitHub Actions and ArgoCD for automated software delivery. - Proficiency in managing workloads on Kubernetes, specifically with EKS clusters. - Strong skills in Ansible and Terraform for configuration management and infrastructure-as-code. - Solid understanding of AWS cloud services (VPC, IAM, EKS, RDS, S3, MSK, etc) in a production setting. - Experience using Prometheus, Grafana, Splunk, Datadog or Dynatrace to monitor deployment health and system performance. - Experience building dynamic build pipelines using Groovy Script, Python, Bash or Go languages. - Proven ability to manage production changes and troubleshooting under pressure in a high-stakes environment. - Familiarity with regulated industries and security frameworks such as NERC CIP, SOC2, ISO 27001, IEC 62443 is highly preferred. - Strong ability to document technical procedures and communicate clearly with stakeholders during global shift handovers. Benefits - Relocation Assistance Provided: No - #LI-Remote - This is a remote position Key Performance Indicators (KPIs) - Contribution towards the 4-hour SLA target for Customer Onboarding Speed. - Help maintain 99.99% availability of mission critical grid SaaS products. - Maintaining a low rate of failed production deployments through improved quality gates for Change Failure Rate. - Ensuring fast restoration of service through automated rollbacks and clear runbooks for Mean Time to Recover (MTTR). - Automating repetitive manual tasks to ensure at least 50% of time is spent on engineering improvements for Toil Reduction. Business Acumen - Strong problem solving abilities and capable of articulating specific technical topics or assignments. - Experience in building scalable and highly available distributed systems. - Skilled in breaking down problems and estimating time for development tasks. - Evangelizes how our technology solves customer problems from a technology and business perspective. Leadership - Demonstrates clarity of thinking to work through limited information and vague problem definitions. - Influences through others; builds direct and "behind the scenes" support for ideas. - Proactively identifies and removes project obstacles or barriers on behalf of the team. - Shares knowledge, power, and credit, establishing trust, credibility, and goodwill. Personal Attributes - Able to work under minimal supervision. - Excellent communication skills and the ability to interface with senior leadership with confidence and clarity. - Skilled in providing oversight and mentoring team members. Shows ability to effectively delegate work. - Applies values, business strategy, policies, precedent, and experience to make complex decisions in ambiguity and with uncertain consequences.

View details: SRE Production DevOps

Worldwide

Apply

Job Closed

Senior Site Reliability Engineer - Remote EST

Job Description

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior DevOps Engineer

Senior SRE / DevOps Engineer

Azure DevOps / Developer & Automation Skills

SRE Production DevOps