Empowering companies to work with the best engineers in the world
Senior DevOps Engineer, Cloud, MongoDB, Terraform
Location
India
Posted
58 days ago
Seniority
Senior
Job Description
Smart Working
• Design, implement, and operate cloud-native infrastructure across GCP, AWS, or Azure using Terraform.
• Take full ownership of MongoDB Atlas in production, including:
  - Cluster architecture and scaling
  - Replication and high availability
  - Backup and disaster recovery strategies
  - Performance tuning and query optimisation
  - Security and access control
• Architect and manage containerised and serverless workloads (e.g., Cloud Run, ECS, Kubernetes, or equivalents).
• Design and operate event-driven systems (e.g., Pub/Sub, SQS/SNS, EventBridge, or equivalents).
• Build and maintain CI/CD pipelines with a strong focus on automation, reliability, and scalability.
• Develop reusable Infrastructure as Code (Terraform) modules and manage multi-environment setups.
• Collaborate with engineering teams on system architecture, scalability, and performance optimisation.
• Implement robust monitoring, alerting, and observability across distributed systems.
• Lead incident response and root cause analysis, driving long-term improvements.
• Own infrastructure decisions end-to-end, including architecture, cost optimisation, and performance.
• Document systems, create runbooks, and establish best practices.
• Mentor engineers and promote DevOps best practices across the organisation.
Job Requirements
- 6+ years of DevOps / Infrastructure Engineering experience in production environments.
- Strong hands-on experience with at least one major cloud provider: GCP, AWS, or Azure using Terraform.
- Advanced experience with Terraform (modularisation, remote state, multi-environment setups).
- Proven experience designing and operating scalable cloud infrastructure.
- Mandatory: Deep MongoDB Atlas experience in production, including:
  - Cluster configuration and scaling
  - Replication and failover
  - Backup and recovery strategies
  - Performance tuning and indexing
  - Security and access management
- Experience with containerised environments (Docker, Kubernetes, or equivalents).
- Experience building and maintaining CI/CD pipelines.
- Solid understanding of event-driven architectures.
- Strong knowledge of monitoring, logging, and observability in distributed systems.
- Ability to operate at an architect/owner level, not just execute tasks.
- Strong communication skills and ability to work in a remote, async-first team.
Benefits
- Fixed Shifts: 12:00 PM - 9:30 PM IST (Summer) | 1:00 PM - 10:30 PM IST (Winter)
- No Weekend Work: Real work-life balance
- Day 1 Benefits: Laptop and full medical insurance provided
- Support That Matters: Mentorship, community, and collaboration
- True Belonging: A long-term career where your contributions are valued
More DevOps Engineer Jobs
About Smart Working
At Smart Working Solutions, we believe your job should not only look right on paper but also feel right every day. This isn’t just another remote opportunity - it’s about finding where you truly belong, no matter where you are. From day one, you’re welcomed into a genuine community that values your growth and well-being.

Our mission is simple: to break down geographic barriers and connect skilled professionals with outstanding global teams and products for full-time, long-term roles. We help you discover meaningful work with teams that invest in your success, where you’re empowered to grow personally and professionally.

Join one of the highest-rated workplaces on Glassdoor and experience what it means to thrive in a truly remote-first world.

About the Role
We are looking for a Senior DevOps Engineer with strong cloud infrastructure expertise (GCP / AWS / Azure) using Terraform and deep MongoDB Atlas ownership experience to design, operate, and scale a cloud-native infrastructure powering a large enterprise SaaS platform.

This is a high-ownership, architecture-level role, not just execution. You will be responsible for designing and running production systems end-to-end, with a particular focus on database infrastructure (MongoDB Atlas) and scalable cloud environments.

You will work in a fully remote, async-first environment, collaborating closely with engineering teams to ensure high availability, performance, and operational excellence across multiple environments.

Nice to Have
- Experience working across multiple cloud providers.
- Experience implementing GitOps practices.
- Familiarity with advanced observability tools (Datadog, APM, tracing).
- Experience supporting high-scale SaaS platforms.
- Interest in platform engineering and developer experience.

At Smart Working, you’ll never be just another remote hire. Be a Smart Worker - valued, empowered, and part of a culture that celebrates integrity, excellence, and ambition.
• Provide recruitment and staffing services to various industries.
• Understand hiring strategies and talent availability.
• Collaborate as business partners to deliver high value and return on investment for clients.
• Stay up to date with the latest industry trends and technologies.
• Responsible for designing, implementing, and maintaining robust CI/CD pipelines and infrastructure solutions.
• Lead the design, deployment, and operation of a new Multi-region Artifactory platform hosted in AWS.
• Design, implement, and maintain the Artifactory cloud architecture on AWS.
• Lead the migration of existing repositories from on-prem to AWS.
• Automate infrastructure provisioning and configuration management using tools such as Terraform, CloudFormation, or Ansible.
• Design and maintain robust pipelines using industry-standard tools.
• Implement and maintain orchestration solutions using Docker and Kubernetes.
• Monitor system performance, troubleshoot complex issues, and implement solutions.
• Enforce software supply chain policies and ensure security best practices are implemented throughout the CI/CD pipeline.
DevOps Engineer
Pavago
Pavago specializes in connecting businesses with top-tier offshore talent in operations, sales, and marketing, offering a comprehensive recruitment solution designed to reduce cost
Job Title: DevOps Engineer
Position Type: Full-Time, Remote
Working Hours: U.S. client business hours (with flexibility for deployments, incident response, and on-call rotations)

About the Role:
Our client is seeking a DevOps Engineer to build, maintain, and optimize infrastructure and deployment pipelines. This role requires expertise in cloud platforms, automation, monitoring, and CI/CD. The DevOps Engineer ensures systems are secure, scalable, and reliable, enabling development teams to ship code quickly and safely.

Responsibilities:

Infrastructure Management:
- Provision and manage infrastructure on AWS, GCP, or Azure.
- Implement Infrastructure-as-Code using Terraform, Pulumi, or CloudFormation.
- Configure networking, storage, and compute resources to scale with demand.

CI/CD Pipelines:
- Build and maintain pipelines with GitHub Actions, Jenkins, GitLab CI, or CircleCI.
- Automate builds, tests, and deployments across multiple environments.
- Ensure rollback strategies and zero-downtime deployments.

Containerization & Orchestration:
- Manage Docker containers and Kubernetes clusters.
- Deploy microservices and monitor cluster health.
- Optimize cost and performance for containerized workloads.

Monitoring & Incident Response:
- Implement observability with Prometheus, Grafana, Datadog, or New Relic.
- Configure logging and alerting pipelines (ELK stack, Splunk).
- Participate in on-call rotations, performing root cause analysis post-incident.

Security & Compliance:
- Apply cloud security best practices (IAM, least privilege, encryption).
- Support SOC 2, HIPAA, PCI, or GDPR compliance in infrastructure.
- Run vulnerability scans and patch systems proactively.

Collaboration & Process Improvement:
- Partner with developers to streamline deployments and remove bottlenecks.
- Document infrastructure, pipelines, and workflows for team knowledge.
- Identify opportunities for automation and performance optimization.

What Makes You a Perfect Fit:
- Problem solver who thrives at the intersection of development and operations.
- Calm and methodical in high-pressure incident scenarios.
- Passionate about automation, scalability, and reliability.
- Strong communicator who bridges technical and business needs.

Required Experience & Skills (Minimum):
- 3+ years experience in DevOps, SRE, or infrastructure engineering.
- Proficiency with at least one cloud provider (AWS, GCP, Azure).
- Strong knowledge of CI/CD tools and pipelines.
- Experience with Docker and Kubernetes.

Ideal Experience & Skills:
- Terraform or Pulumi Infrastructure-as-Code expertise.
- Industry background in SaaS, fintech, healthcare, or enterprise applications.
- Familiarity with serverless deployments (AWS Lambda, Google Cloud Functions).
- Security certifications or cloud certifications (AWS Certified DevOps Engineer, CKA, etc.).

What Does a Typical Day Look Like?
A DevOps Engineer’s day revolves around keeping systems secure, automated, and reliable. You will:
- Review monitoring dashboards for performance and incident alerts.
- Update CI/CD pipelines to improve build/test/deploy efficiency.
- Provision or modify infrastructure with Terraform or CloudFormation.
- Collaborate with developers to troubleshoot deployments or optimize services.
- Document workflows and update runbooks for incident response.
- End the day by analyzing logs, reviewing metrics, and planning optimizations.

In essence: you are the custodian of infrastructure and automation, ensuring systems scale, stay secure, and support fast product delivery.

Key Metrics for Success (KPIs):
- Deployment frequency (faster, more reliable releases).
- System uptime ≥ 99.9%.
- MTTR (Mean Time to Recovery) reduced for incidents.
- Cost efficiency of infrastructure usage.
- Positive developer feedback on deployment speed and reliability.

Interview Process:
- Initial Phone Screen
- Video Interview with Pavago Recruiter
- Technical Task (e.g., design a CI/CD pipeline or provision infrastructure with Terraform)
- Client Interview with Engineering/DevOps Leadership
- Offer & Background Verification


