Job Closed

This listing is no longer active.

LivePerson is an online engagement solutions company, which means that it works with clients to provide their customers with real, live assistance and advice. The company was found

Site Reliability Engineer II

DevOps EngineerDevOps EngineerFull Time Remote Senior

Location

India

Posted

111 days ago

Salary

₹312K / year

Seniority

Senior

Bachelor Degree5 yrs expEnglishAWS GCP Grafana Kubernetes Linux Prometheus Python Shell Terraform

Job Description

• Collaborate closely with Developers, QA, and Product teams during sprint planning to understand release plans, dependencies, and infrastructure requirements. • Participate in the application release cycle, ensuring deployments are automated, consistent, and reliable. • Manage and operate Kubernetes clusters in Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS). • Develop and manage Terraform modules for provisioning and configuring cloud infrastructure across GCP and AWS. • Standardize service deployments using Helm for templating and versioned releases. • Build and enhance observability with Prometheus, Grafana, and Datadog to monitor application and platform performance. • Design, implement, and maintain GitLab CI/CD pipelines for build, test, and deployment automation. • Drive an automation-first culture by developing scripts and tooling in Python, Go, or Shell to minimize manual effort and improve efficiency. • Participate in a 24/7 on-call rotation, ensuring quick detection, mitigation, and resolution of incidents. • Perform root cause analysis (RCA) and contribute to post-incident reviews to prevent recurrence. • Proactively identify reliability or scalability gaps, raise early warnings, and partner with teams to address systemic risks.

Job Requirements

5-8 years of experience as a Site Reliability Engineer, Platform Engineer, or DevOps Engineer.
Hands-on experience managing Kubernetes clusters (GKE, EKS) in GCP and AWS.
Strong knowledge of Terraform, Helm, and GitLab CI/CD pipelines.
Proficiency in Python, Go, or Shell scripting for automation and tooling.
Experience implementing and managing observability stacks (Prometheus, Grafana, Datadog).
Deep understanding of Linux systems, cloud networking, and container orchestration concepts.
Experience working in Agile/Scrum environments and partnering closely with developers.
Excellent analytical skills with a proactive attitude — able to question assumptions and escalate potential risks early.

Benefits

15 Days PTO + Casual & Sick Leave
Insurance: 8 Lakhs Family Floater Coverage; Personal Accident & Life Insurance: 3x of Gross Annual Salary*

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior AWS DevOps Engineer, SRE – AI

Xebia

Creating Digital Leaders. Digital Transformation Consultancy Services and Solutions

DevOps Engineer111 days ago

Full Time RemoteTeam 5,001-10,000H1B Sponsor

Company Site LinkedIn

• Building and supporting the tools, processes and infrastructure empowering the faster delivery and scaling of software iterations • Ensuring availability, reliability and scalability of application infrastructure • Building and supporting continuous integration/delivery and release tools • Ensuring the right metrics are collected and monitored

AWS Grafana Kubernetes Prometheus Python Terraform

View details: Senior AWS DevOps Engineer, SRE – AI

Poland

zł22K - zł30K / month

Apply

Job Closed

Senior AWS DevOps Engineer, SRE, AI

Xebia Poland

A place where experts grow.

DevOps Engineer111 days ago

Full Time RemoteTeam 1,001-5,000Since 2001H1B No Sponsor

Company Site LinkedIn

AWS Grafana Kubernetes Prometheus Python Terraform

View details: Senior AWS DevOps Engineer, SRE, AI

Poland

zł22K - zł30K / month

Apply

Job Closed

Senior Site Reliability Engineer

Selector

Industry leading AIOps platform for operational intelligence.

DevOps Engineer111 days ago

Full Time RemoteTeam 51-200H1B Sponsor

Company Site LinkedIn

• Serve as a senior technical expert in deploying and maintaining Selector’s operational analytics platform across on-premises and SaaS environments. • Lead complex customer installations, including deployments in air-gapped and highly regulated environments. • Partner directly with customers via Zoom/Teams to troubleshoot, triage services, and resolve installation or performance nuances. • Author, review, and maintain Infrastructure as Code (IaC) using Terraform/OpenTofu, ensuring scalable and maintainable infrastructure design. • Deploy and manage containerized applications using Kubernetes (including RKE) and Kustomize in production environments. • Triage and resolve issues across distributed systems, Kafka pipelines, CI/CD workflows (Jenkins), and Google Cloud infrastructure. • Provide structured, actionable feedback to Platform Engineering and DevOps teams to improve reliability, scalability, and performance. • Participate in and help mature on-call processes, ensuring high availability and operational excellence. • Perform root cause analysis for production incidents and implement long-term corrective and preventative solutions. • Research, evaluate, and implement new tools or architectural improvements to address infrastructure and operational challenges. • Mentor junior engineers and promote SRE best practices across reliability, observability, and automation. • Improve internal tooling, automation, and operational workflows to enhance developer productivity and system stability.

Distributed Systems GCP Jenkins Apache Kafka Kubernetes Python Terraform

View details: Senior Site Reliability Engineer

India

Apply

Job Closed

Junior DevSecOps Engineer

EUROPEAN DYNAMICS

"{ engineer; innovate; excite; }"

DevOps Engineer111 days ago

Full Time RemoteTeam 501-1,000Since 1998H1B No Sponsor

Company Site LinkedIn

• Write and maintain Bash scripts to automate operational and deployment tasks; • Create, support and improve existing CI/CD pipelines for application delivery; • Help manage and monitor cloud environments (dev, test, production); • Perform basic troubleshooting of infrastructure and deployment issues; • Work with Linux systems (services, processes, permissions, networking basics); • Assist with monitoring, logging, and alerting solutions; • Document procedures, configurations, and runbooks; • Collaborate with developers and senior engineers to improve system reliability; • Assist in building and maintaining cloud infrastructure using Terraform and other IaC tools.

AWS Azure DNS Docker Firewalls GCP Grafana Jenkins Kubernetes Linux Prometheus Python Terraform Unix

View details: Junior DevSecOps Engineer

Greece

Apply

Site Reliability Engineer II

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior AWS DevOps Engineer, SRE – AI

Senior AWS DevOps Engineer, SRE, AI

Senior Site Reliability Engineer

Junior DevSecOps Engineer