Job Closed
This listing is no longer active.
Cloud Transformation for the Enterprise
Lead DevOps Engineer – Azure, Terraform
Location
India
Posted
153 days ago
Salary
₹2,800K - ₹3,100K / year
Seniority
Senior
Job Description
Lead DevOps Engineer – Azure, Terraform
NorthBay Solutions
• Design, implement, and manage CI/CD pipelines using tools such as Jenkins, GitHub Actions, or Azure DevOps • Develop and maintain Infrastructure-as-Code using Terraform • Manage and scale container orchestration environments using Kubernetes, including experience with larger production-grade clusters • Ensure cloud infrastructure is optimized, secure, and monitored effectively • Collaborate with data science teams to support ML model deployment and operationalization • Implement MLOps best practices, including model versioning, deployment strategies (e.g., blue-green), monitoring (data drift, concept drift), and experiment tracking (e.g., MLflow) • Build and maintain automated ML pipelines to streamline model lifecycle management
Job Requirements
- 8 to 12 years of experience in DevOps and/or MLOps roles
- Proficient in CI/CD tools: Jenkins, GitHub Actions, Azure DevOps
- Strong expertise in Terraform, including managing and scaling infrastructure across large environments
- Hands-on experience with Kubernetes in larger clusters , including workload distribution, autoscaling, and cluster monitoring
- Strong understanding of containerization technologies (Docker) and microservices architecture
- Solid grasp of cloud networking, security best practices, and observability
- Scripting proficiency in Bash and Python
- Experience with MLflow, TFX, Kubeflow, or SageMaker Pipelines (preferred)
- Knowledge of model performance monitoring and ML system reliability (preferred)
- Familiarity with AWS MLOps stack or equivalent tools on Azure/GCP (preferred)
Benefits
- Health insurance
- Flexible working hours
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevSecOps Engineer
DroneSense, Inc.Builders of the most comprehensive drone management and collaboration platform on the market.
• Work alongside DevOps and engineering teams to ensure our platforms, repositories and CI/CD pipelines are secure by default while remaining easy to build, test, and deploy against • Identify security risks through tools, audits, and monitoring, and drive them to resolution — whether that means changing a policy, updating infrastructure, or improving a pipeline • Take ownership of the security posture across multiple AWS accounts and continuously improve it over time • Design and maintain security guardrails around IAM, logging, monitoring, and encryption • Manage repository-level security scanning (SAST, dependency scanning, secrets detection) using tools such as Aikido or similar, and ensure findings are actionable • Support security and compliance initiatives (e.g., SOC 2, TX-RAMP, or similar) by implementing and maintaining required technical controls and automation, in partnership with a Security Analyst • Partner with DevOps teams to secure Kubernetes clusters, with a strong focus on Rancher • Improve security visibility through monitoring, logging, and reporting • Automate security controls and validations using Infrastructure as Code and scripting • Help document security standards, patterns, and operational runbooks
Site Reliability Engineer
EnsonoEnsono delivers complete Hybrid IT solutions, from mainframe to cloud, tailored to each client’s journey.
• We are seeking an experienced Site Reliability Engineer (SRE) with expertise in Infrastructure as Code tools like Terraform, core CI/CD tools such as Azure DevOps, and monitoring tools including DataDog and AWS CloudWatch. • The ideal candidate will have commercial experience in technologies like Dotnet or Java, and be skilled in troubleshooting, incident resolution, and improving service and change management processes. • Strong leadership in client-facing discussions and engagement with third-party suppliers is essential. • An SRE Foundation certificate and a cloud provider associate-level certification are highly beneficial.
Director, Cloud Operations – DevOps
MediSpendMediSpend solutions are designed to empower life sciences companies to grow their business compliantly.
• Own and operate AWS and Azure cloud environments, ensuring high availability, performance, and security. • Lead cloud administration including incident management, monitoring, patching, and system health. • Establish and enforce cloud governance, IAM, security controls, logging, and audit readiness. • Drive disaster recovery and business continuity planning and execution. • Own cloud cost management, forecasting, and optimization. • Partner with Finance to manage cloud budgets and implement best practices. • Lead DevOps and platform teams supporting CI/CD pipelines, Infrastructure-as-Code, and automation. • Champion DevOps best practices to improve deployment speed, reliability, and operational efficiency. • Partner with Engineering to enable self-service and developer productivity. • Support SOC 1&2, ISO 27001, GDPR, and other compliance requirements. • Collaborate with Security on IAM, SSO, encryption, monitoring, and incident response. • Lead and develop a high-performing, globally distributed team. • Manage relationships with AWS, Azure, and key vendors/MSPs. • Communicate cloud strategy, risks, and progress to executive leadership.
Senior DevOps Engineer, AWS, SQL
equivantequivant is dedicated to enhancing the justice system through a comprehensive array of solutions, offering services across a range of areas like court, pretrial, treatment court, a
• Design, build, and maintain secure, scalable, and highly available AWS infrastructure supporting our mission-critical applications. • Manage SQL Server environments hosted on AWS RDS and EC2, optimizing performance, tuning queries, and ensuring data reliability and availability. • Develop and maintain CI/CD pipelines (GitHub, Terraform, PowerShell, or your tool of choice) to automate build, test, and deployment workflows. • Collaborate closely with Development, QA, Product Management, and Information Security teams to ensure smooth, secure releases and stable environments. • Monitor and troubleshoot performance issues using tools like CloudWatch, DataDog, and SIEM dashboards, driving root-cause analysis and proactive prevention. • Contribute to security and compliance initiatives by adhering to SOC 2 and CJIS-aligned policies, managing IAM roles, patching systems, and safeguarding secrets. • Participate in incident response activities, assisting with mitigation, root-cause investigation, and long-term resolution. • Document infrastructure configurations, procedures, and architecture for operational continuity and knowledge sharing. • Support continuous improvement by proposing new automation, cost optimization, and resilience strategies across environments.



