EPAM Systems is an information technology (IT) company that has become a leading global digital and product design, digital platform engineering, and product de
Senior Data DevOps Engineer
Location
Georgia + 1 moreAll locations: Georgia | Georgia
Posted
4 days ago
Salary
0
Seniority
Senior
No structured requirement data.
Job Description
Senior Data DevOps Engineer
EPAM Systems
Title: Senior Data DevOps Engineer Location: Remote in Georgia Job Description: We are looking for a proactive and detail-oriented Senior Data DevOps Engineer with strong experience in cloud infrastructure and automation, particularly within Google Cloud Platform (GCP). The ideal candidate is passionate about building scalable, secure and reliable systems, and is comfortable working in a fast-paced, collaborative environment. You should have a strong ownership mindset, the ability to streamline operations through automation, and the judgment to balance speed with stability. Experience the freedom of remote work from anywhere in Georgia, whether from the comfort of your home, our modern offices in Tbilisi and Batumi or a coworking space in Kutaisi. Responsibilities - Design, implement and manage cloud infrastructure on GCP using Infrastructure as Code (IaC) principles - Development and maintenance of Terraform modules for environment provisioning and standardization - Use of Ansible for configuration management and system automation - Build, optimize and maintain CI/CD pipelines using Jenkins to support efficient software delivery - Management and security of GCP services, including IAM roles, networking configurations and access controls - Administration of BigQuery environments, ensuring performance, cost optimization and data governance - Management of Google Cloud Storage (GCS) buckets, including lifecycle policies, access control and security best practices - Deployment and operation of data processing workloads using GCP Dataproc (Spark jobs) - Collaboration with engineering, data and product teams to support reliable and scalable platform operations - Monitor system performance, troubleshoot issues and implement improvements to enhance reliability and efficiency - Contribution to best practices, documentation and continuous improvement of DevOps processes Requirements - 4+ years of experience in DevOps, Cloud Engineering or related roles - Expertise in Terraform for infrastructure provisioning and Ansible for configuration management and automation - Proficiency in building and maintaining CI/CD pipelines using Jenkins - Solid experience with Google Cloud Platform (GCP), including IAM and networking - Background in administering BigQuery and managing large-scale data environments - Skills in GCS bucket management, including lifecycle and security configurations - Familiarity with GCP Dataproc and Spark job deployment/operations - Understanding of cloud security, scalability and reliability principles - Good problem-solving skills and ability to work independently under general direction - Strong communication and collaboration skills - English proficiency at B2 level or higher Nice to have - Experience with Looker or similar BI/reporting tools - Exposure to data engineering workflows or analytics platforms We offer/Benefits We connect like-minded people - Delivering innovative solutions to industry leaders, making a global impact - Enjoyable working environment, whether it is the vibrant office or the comfort of your own home - Opportunity to work abroad for up to two months per year - Relocation opportunities within our offices in 55+ countries - Corporate and social events We invest in your growth - Leadership development, career advising, soft skills and well-being programs - Certifications, including GCP, Azure and AWS</li> - Unlimited access to LinkedIn Learning and Udemy - Free English classes with certified teachers We cover it all - Participation in the Employee Stock Purchase Plan - Monetary bonuses for engaging in the referral program - Comprehensive medical & family care package - Five trust days per year (sick leave without a medical certificate) - Benefits package (sports activities, a variety of stores and services) EPAM Georgia is a team of innovators united by a passion for technology. The dynamic and inclusive culture we embrace helps positively impact our communities, clients, and employees. Here you will collaborate with multi-national teams, contribute to numerous cutting-edge projects, deliver the most creative solutions, and have an opportunity to learn. Our people are at the heart of our success, and we are proud to provide talents with a solid ground to develop and grow.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior Software Engineer/SRE
itDFormerly known as iTalent Digital. We are a different kind of global software development and technology consultancy.
• Lead the design, development and operation of large-scale, secure observability systems • Collaborate with internal teams on industry thought leadership • Attend regular internal practice community meetings • Complete client case studies and learning material (blogs, media material) • Build out material to contribute to the Digital Transformation practice • Attend internal itD networking events (in person and virtual)
• Troubleshoot system issues using logs, diagnostics, and monitoring tools • Develop and maintain automation scripts using PowerShell and Bash • Design and manage CI/CD pipelines for continuous integration and deployment • Configure and maintain web servers (IIS) and support application hosting environments • Deploy, manage, and scale applications in Azure Kubernetes Service (AKS) • Build and maintain Docker images, Docker files, and containerized environments • Implement and maintain monitoring, logging, and observability solutions (Grafana, Prometheus, ELK, etc.) • Collaborate with development teams to ensure smooth release cycles and reliable deployments • Contribute to improving system architecture and adopting best DevOps practices
Senior Manager, Site Reliability Engineering
Clover HealthClover is a healthcare technology company helping members live their healthiest lives with our Medicare Advantage plans.
• Lead and grow our SRE team of ~10 engineers, including hiring, retention, career development, and performance management across multiple time zones (US, HK, NZ). • Build strategic partnerships with product engineering pillars — shifting SRE from reactive, ticket-based support to proactive co-ownership of reliability outcomes. • Scale our multi-tenant infrastructure to support new customer onboarding and growing patient populations. • Own cloud cost management and FinOps practices, building frameworks that balance cost control with reliability and performance. • Champion developer self-service and platform engineering. Build self-service capabilities so product teams can manage routine operations without filing SRE tickets. Establish SLOs/SLIs for critical services and improve alert quality so every page is meaningful. • Ensure the SRE team is fully leveraging AI tooling in their workflows — using tools like Claude Code for IaC generation, log analysis, root cause investigation, and automating repetitive work — at the same level as the rest of engineering.
Senior Manager, Site Reliability Engineering
Counterpart HealthIn 2018, Clover Health set out to build a clinically intuitive, AI-enabled solution that fits within physicians' workflows to help support the earlier diagnosis and management of chronic conditions. Years later, that vision is a reality, with thousands of practitioners using Counterpart Assistant during patient visits. Counterpart Health is a subsidiary of Clover Health, committed to Diversity & Inclusion as key to our success. We are an Equal Opportunity Employer, valuing diverse strengths, experiences, perspectives, and backgrounds.
Role Description We're looking for a Senior Manager of Site Reliability Engineering to join our team. You'll lead a team of ~10 SREs across North America, UK, HK, and New Zealand — owning both the day-to-day operations and the long-term technical direction of the SRE organization. This role sits at the intersection of people leadership, technical depth, and strategic partnership: you're here to make Counterpart’s infrastructure reliable, scalable, and cost-efficient — and to transform the SRE team's engagement model from reactive support to proactive collaboration with our product engineering pillars. - Lead and grow our SRE team of ~10 engineers, including hiring, retention, career development, and performance management across multiple time zones (US, HK, NZ). - Build strategic partnerships with product engineering pillars — shifting SRE from reactive, ticket-based support to proactive co-ownership of reliability outcomes. - Scale our multi-tenant infrastructure to support new customer onboarding and growing patient populations. - Own cloud cost management and FinOps practices, building frameworks that balance cost control with reliability and performance. - Champion developer self-service and platform engineering. Build self-service capabilities so product teams can manage routine operations without filing SRE tickets. Establish SLOs/SLIs for critical services and improve alert quality so every page is meaningful. - Ensure the SRE team is fully leveraging AI tooling in their workflows — using tools like Claude Code for IaC generation, log analysis, root cause investigation, and automating repetitive work — at the same level as the rest of engineering. Qualifications - You have 6+ years managing an SRE team and 10+ years of hands-on SRE or infrastructure engineering experience. - You're deeply comfortable with our core stack: Kubernetes, GCP (GKE, Cloud SQL, Pub/Sub, GCS), Terraform, Helm, ArgoCD, PostgreSQL, and Prometheus/Grafana. - You have strong programming skills in Python and/or Go, and you're comfortable writing and reviewing infrastructure tooling code — including using AI coding tools to do so. - You have experience with CI/CD pipelines (GitHub Actions) and a track record of building or improving developer tooling and automation. - You have sound build vs. buy judgment — you default to the right answer, not the easiest one, and you're comfortable building internal tooling when existing solutions don't fit. - You have experience leading teams across multiple time zones and a track record of developing engineers into strong technical contributors. Benefits - Financial Well-Being: Competitive base salary and equity opportunities, performance-based bonus program, 401k matching, and regular compensation reviews. - Physical Well-Being: Comprehensive medical, dental, and vision coverage. - Mental Well-Being: Initiatives such as No-Meeting Fridays, monthly company holidays, access to mental health resources, and a generous flexible time-off policy. - Professional Development: Learning programs, mentorship, professional development funding, and regular performance feedback and reviews. - Additional Perks: Employee Stock Purchase Plan (ESPP), reimbursement for office setup expenses, monthly cell phone & internet stipend, remote-first culture, paid parental leave for all new parents, and much more!



