Cribl

Cribl, the Data Engine for IT and Security, empowers organizations to transform their data strategy.

Senior Site Reliability Engineer

DevOps EngineerDevOps EngineerFull Time Remote SeniorTeam 501-1,000Since 2017H1B SponsorCompany Site LinkedIn

Location

Poland

Posted

75 days ago

Salary

Seniority

Senior

EnglishAnsible AWS Azure Cloud Grafana JavaScript Linux Node.js Prometheus Splunk Terraform TypeScript

Job Description

• Engage with teams and improve service delivery and reliability across their entire lifecycle • Measure and monitor all production systems with an eye towards availability, latency and overall system health • Seek out the cause of errors and instability in our production cloud services and drive teams towards better operational excellence • Engage with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability • Help identify and drive down toil with creative innovation and automation • This position will require stand-by, on-call, or off-hours duties

Job Requirements

Proven experience designing, implementing, and operating observability systems for complex cloud-based platforms
Experience with Configuration Management and Infrastructure as a Code Tools like Terraform (preferred) or Ansible
Knowledge of cloud platforms (prefer AWS and Azure)
Experience with APM and Observability and related tools such as, New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry etc.
Extensive experience with enterprise scale continuous delivery environments
Development with JavaScript/Node.js/TypeScript in a Linux/Mac environment
Experience with sustainable incident response in a blameless environment
Background in Linux Systems Engineering
Experience with Incident response related tools for instance, PagerDuty, FireHydrant, Blameless etc.
Comfortable with a high level of autonomy and working with a distributed team
Knowledge of Cloud and application security best practices
Strong knowledge of cloud design patterns for scale, data management, resiliency, etc.
A love for high quality and a knack for testing
Opinions about business metrics, and SLOs

Benefits

Diversity drives innovation and better decisions
Remote-first culture
Welcoming and valuing differences

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior DevOps

Resilient Co.

WE ARE RESILIENT CO. We adapt to your needs.

DevOps Engineer75 days ago

Contract RemoteTeam 11-50Since 2020H1B No Sponsor

Company Site LinkedIn

• Design and implement infrastructure-as-code using Terraform for Azure services including AKS, Blob Storage and App Services. • Build, maintain and optimize CI/CD pipelines and mobile/web build pipelines. • Operate, troubleshoot and tune Kubernetes and Docker-based workloads running on AKS. • Implement and manage SSO and External ID flows using Microsoft Entra. • Create reusable templates, Terraform modules and pipeline templates to enable developer self-service. • Collaborate directly with technical leads to define platform direction and deployment patterns. • Mentor engineers on deployment best practices, observability and platform usage. • Own platform-level decisions and improvements, prioritizing strategic work over ticket-level execution. • Write clear, async-friendly documentation and communicate effectively in AI-augmented workflows. • Manage and support PostgreSQL-related deployment and operational concerns as they relate to platform infrastructure.

Azure Docker JavaScript Kubernetes Node.js PostgreSQL React Terraform

View details: Senior DevOps

Argentina

Apply

Job Closed

Site Reliability Engineer

SupplyHouse.com

Plumbing, Heating & HVAC Supplies. Real People. Real Service.

DevOps Engineer75 days ago

Full Time RemoteTeam 501-1,000Since 2004H1B Sponsor

Company Site LinkedIn

• Design, build, and maintain scalable, reliable systems on GCP (Compute Engine, GKE, Cloud Storage, Cloud SQL) • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager • Build and maintain observability platforms (monitoring, logging, tracing) using tools such as Stackdriver (Cloud Monitoring), Prometheus, or Grafana • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence • Partner with DevOps and engineering teams to enhance CI/CD pipelines for resilient deployments • Define and monitor SLAs, SLOs, and SLIs to ensure application availability and performance • Implement disaster recovery (DR) and backup strategies across cloud services • Continuously optimize performance, capacity, and cost-efficiency of GCP resources

Ansible Cloud Docker Google Cloud Platform Grafana Jenkins Kubernetes Linux Prometheus Python SQL Terraform Unix Go

View details: Site Reliability Engineer

India

$29K - $36K / year

Apply

Job Closed

DevOps Engineer, AWS, Terraform

HRM Group

Accelerating Digital Evolution

DevOps Engineer75 days ago

Full Time RemoteTeam 201-500H1B No Sponsor

Company Site LinkedIn

• Manage, automate and optimize cloud environments, with a particular focus on AWS. • Implement Infrastructure as Code, manage CI/CD pipelines, and support continuous delivery of applications. • Collaborate with development and operations teams to ensure system reliability, scalability and performance. • Contribute to platform evolution and process automation.

AWS Cloud Jenkins Terraform

View details: DevOps Engineer, AWS, Terraform

United States

Apply

Job Closed

DevOps Engineer

PandaDoc

Taking the work out of document workflow.

DevOps Engineer75 days ago

Full Time RemoteTeam 501-1,000Since 2011H1B Sponsor

Company Site LinkedIn

• Platform & IaC Ownership: Analyze and implement infrastructure designs for services and shared components, managing them as Infrastructure as Code (IaC) using tools like Terraform and Helm within our cloud environment (AWS). • Delivery Lifecycle Management: Design and implement robust CI/CD pipelines and own the full delivery lifecycle of infrastructure tools, services, and components from development testing through to production rollout. • Developer Enablement: Actively participate in regular support cadences to provide hands-on technical assistance and expertise to development teams regarding platform adoption and usage. • Reliability Integration: Integrate and maintain monitoring, logging, and alerting components for platform services, and participate in the team's on-call rotation for immediate incident mitigation within the platform ownership scope. • Security & Compliance: Collaborate closely with the Security team to embed DevSecOps best practices and guardrails, ensuring the security and compliance of the platform and delivery process. • Process Improvement: Drive continuous improvements in platform tooling usability, deployment efficiency, and environment stability.

AWS Cloud DNS Jenkins Kubernetes Python Terraform Go

View details: DevOps Engineer

Poland

zł17.4K - zł26K / month

Apply

Job Closed

Senior Site Reliability Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior DevOps

Site Reliability Engineer

DevOps Engineer, AWS, Terraform

DevOps Engineer