The all-in-one sales & marketing platform that agencies can white-label. CRM, Email, 2-way SMS, Funnel Builder, & more!
Lead DevOps Engineer
Location
India
Posted
36 days ago
Salary
0
Seniority
Senior
Job Description
Lead DevOps Engineer
HighLevel
• Drive visibility, efficiency, and savings across multi-cloud infrastructure • Design and implement strategies to reduce operational costs across GCP, AWS, Firebase, and managed services • Build and maintain cost dashboards to track spend across cloud providers • Define and maintain Cost per Operation (CPO) metrics; collaborate with product owners • Set up policies, guardrails, budgets, and alerts to prevent cost overruns • Develop automation scripts to detect idle resources, right-size workloads • Break down cloud bills by team, project, and service; provide actionable insights • Partner with finance, DevOps, platform, and product teams • Provide guidance on designing cost-efficient cloud-native systems
Job Requirements
- 7+ years in Cloud Engineering roles with a focus on cost optimization
- Deep hands-on experience with GCP (BigQuery, GKE, Firebase, Pub/Sub), AWS (EC2, S3, Lambda, EKS), and managed services like MongoDB Atlas, Elastic.co, ClickHouse
- Strong working knowledge of DoiT Cloud Analytics, GCP Billing Export, AWS CUR (Cost & Usage Reports), and BigQuery
- Proficient in Python, Bash, and automation frameworks for cost cleanup and reporting
- Comfortable querying and visualizing cloud billing data to derive unit economics (e.g., cost per user, per API call, per deployment)
- Familiar with cost management in Kubernetes (e.g., node cost allocation, workload optimization, spot/preemptible usage)
- Ability to translate complex cost insights into actionable plans for engineering and business stakeholders
Benefits
- EEO Statement: The company is an Equal Opportunity Employer.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Own reliability outcomes for Tango’s cloud platform (availability, latency, performance, and scalability) across production and non-production environments • Design, implement, and operate SLOs/SLIs, error budgets, and reliability reporting; drive prioritization of reliability work with Engineering and Product • Build and maintain observability foundations: metrics, logging, tracing, dashboards, and alerting that are actionable and reduce noise • Lead incident response and post-incident reviews (blameless RCAs); implement remediation and prevention work to measurably reduce repeat incidents • Engineer and evolve CI/CD and release safety practices (progressive delivery, canary/blue-green, automated rollbacks, change controls) • Improve infrastructure-as-code and environment consistency; standardize and harden platform components • Partner with Security and Compliance to support secure operations, vulnerability remediation, audits, and customer trust requirements • Optimize cloud cost and capacity through right-sizing, autoscaling, and performance tuning; track and report on cost drivers • Enable engineering teams with reliable internal tooling, runbooks, and self-service operational capabilities • Mentor engineers on reliability best practices, operational excellence, and automation
Site Reliability Engineer Team Lead
CoralogixFull-stack observability for logs, metrics, traces and security events with built-in cost optimization.
Role Description Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive monitoring of logs, metrics, trace and security events with features such as APM, RUM, SIEM, Kubernetes monitoring and more, all enhancing operational efficiency and reducing observability spend by up to 70%. We are looking for a Site Reliability Engineer Team Lead to lead our Cloud Infrastructure Team, focusing on Enterprise FedRAMP Cloud Infrastructure. In this role, you will: - Lead and mentor a team of engineers, including hiring, onboarding, and performance management. - Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day. - Adopt cutting edge technologies with end-to-end responsibility. - Build internal tools to expand our platform capabilities. - Collaborate with R&D to improve stability & reliability of the system. - Lead the product roadmap - our product is designed for engineers. Therefore, our engineers promote, enhance, and take a crucial part in influencing the product roadmap. - Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management. This role is remote; employees must be within EST / CT time zone. Our tech stack is unique and in constant growth: - Kubernetes - Kops - AWS - Kafka - Prometheus - Thanos - Coralogix - Git - Argo CD - Istio - and many more! Qualifications - 2+ years of experience as a Team Lead / Tech Lead. - At least 5 years of experience as a DevOps Engineer/ SRE in production environments. - At least 2 years of experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting. - In-depth experience with Kubernetes - operating & monitoring are key parts. - High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus. - Experience in AWS or other cloud providers. - Experience with infrastructure as code (Terraform, Crossplane, etc.). - Understanding of networking - from networking layers to different networking protocols (http, grpc, ssl). - Some software engineering experience, preferably in Golang. - An advantage - operating data pipelines. - An advantage - familiarity with Apache Kafka. Cultural Fit We’re seeking candidates who are hungry, humble, and smart. Coralogix fosters a culture of innovation and continuous learning, where team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we’d love to hear from you. Compensation and Rewards The earnings range for this role is $230,000 - $270,000. When determining your salary, we consider your experience, skills, education, and work location. Our total compensation package includes comprehensive and inclusive employee benefits for healthcare, dental, and mental health benefits, a 401(k) plan and match, paid sick time, and paid time off. Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply.
Site Reliability Engineer
CoralogixFull-stack observability for logs, metrics, traces and security events with built-in cost optimization.
Role Description We are looking for a Site Reliability Engineer to work as part of our Cloud Infrastructure Team, focusing on Enterprise FedRal Cloud Infrastructure. - Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day - Adopt cutting edge technologies with end-to-end responsibility - Building internal tools to expand our platform capabilities - Collaborate with R&D to improve stability & reliability of the system - Lead the product roadmap - our product is designed for engineers. Therefore, our engineers promote, enhance, and take a crucial part in influencing the product roadmap. - Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management. This role is remote, employees must be within EST / CT time zone. Our Tech Stack Is Unique And In Constant Growth: - Kubernetes - Kops - AWS - Kafka - Prometheus - Thanos - Coralogix - Git - Argo CD - Istio - and many more! Qualifications - At least 5 years of experience as a DevOps Engineer/ SRE in production environments - In-depth experience with Kubernetes - operating & monitoring are key parts - At least 2 years of experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting - advantage - High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus - Experience in AWS or other cloud providers - Experience with infrastructure as code (Terraform, Crossplane, etc.) - Understanding of networking - from networking layers to different networking protocols (http, grpc, ssl) - Some software engineering experience, preferably in Golang. - An advantage - operating data pipelines - An advantage - familiarity with Apache Kafka Cultural Fit We’re seeking candidates who are hungry, humble, and smart. Coralogix fosters a culture of innovation and continuous learning, where team members are encouraged to challenge the status quo and contribute to our shared mission. If you thrive in dynamic environments and are eager to shape the future of observability solutions, we’d love to hear from you. Compensation and Rewards The earnings range for this role is $170,000-$220,000. When determining your salary, we consider your experience, skills, education, and work location. - Our total compensation package includes comprehensive and inclusive employee benefits for healthcare, dental, and mental health benefits - A 401(k) plan and match - Paid sick time and paid time off Coralogix is an equal opportunity employer and encourages applicants from all backgrounds to apply.
Senior Software/DevSecOps Engineer
KBRKBR, formerly a subsidiary of Halliburton, is a company in defense and space, offering services in technology, engineering, procurement, and construction on a global scale. Since i
• Work as part of the team supporting the Test Resource Management Center’s (TRMC) Test and Training Enabling Architecture (TENA), Cloud Hybrid Edge to Enterprise Test Analysis Suite (CHEETAS), and Joint Mission Environment Testing Capability (JMETC) User Support Teams • Develop software applications that interface with the TENA and CHEETAS products • Lead and mentor junior members of the team on development efforts • Documenting, managing configuration, testing, analysis and bug fixing involved in creating and maintaining applications and frameworks involved within an agile software release life cycle



