Senior DevOps Engineer

Location

India

Posted

93 days ago

Salary

0

Seniority

Senior

No structured requirement data.

Job Description

Senior DevOps Engineer

AHL - Saaf AI

Role Description Saaf AI is building the infrastructure backbone for modern mortgage operations by combining advanced AI with scalable, reliable systems. As a Senior DevOps Engineer, you will own the infrastructure, deployment pipelines, and reliability practices that support our platform and enable engineering teams to ship quickly and safely. You will design and operate scalable systems, improve observability, and ensure high availability across critical workflows. We are an AI-native engineering team, where AI-assisted tools are a regular part of how we build, deploy, and maintain infrastructure. From writing infrastructure code to debugging production issues and optimizing system performance, you will use these tools to improve efficiency and reliability. You will also support the infrastructure required to run AI-driven workflows in production, ensuring they are robust, scalable, and maintainable. Key Responsibilities - Infrastructure & Cloud Operations - Design, build, and maintain production-grade AWS infrastructure using Infrastructure-as-Code (Terraform preferred). - Architect and manage serverless and containerized environments that balance cost, performance, and reliability. - Implement and maintain networking, security groups, IAM policies, and cloud resource configurations following least-privilege principles. - CI/CD & Deployment - Own and evolve the CI/CD pipeline ecosystem, primarily using GitHub Actions, to enable fast, safe, and repeatable deployments. - Implement deployment strategies (blue-green, canary, rolling) that minimize risk and downtime. - Automate build, test, and release workflows across multiple services and environments. - AI-Integrated DevOps - Leverage AI-assisted tools (code generation, intelligent autocomplete, automated IaC authoring) as a regular part of your infrastructure workflow to accelerate delivery and reduce configuration errors. - Use AI tools to support incident diagnosis, log analysis, runbook generation, and documentation of infrastructure decisions. - Evaluate and integrate emerging AI tools and practices into the team's DevOps processes. - Build and support the infrastructure layer for agentic workflows, including compute orchestration, autoscaling, and cost-efficient execution of AI-powered automation. - Monitoring, Observability & Incident Management - Design and maintain monitoring, logging, and alerting systems that provide clear visibility into platform health and performance. - Implement distributed tracing and structured logging across services and multi-step workflows. - Lead incident response, conduct post-mortems, and drive reliability improvements based on findings. - Security & Compliance - Apply cloud security best practices across all infrastructure, including secrets management, encryption, network segmentation, and access controls. - Design secure secrets and configuration management for agentic processes, including API keys, model tokens, and external service credentials. - Ensure infrastructure meets financial regulatory and compliance requirements with full auditability. - Data Infrastructure Support - Support and maintain infrastructure for data engineering workflows, including Snowflake environments, ETL/ELT pipelines, and dbt execution. - Manage serverless event-driven pipelines and orchestration tools (Step Functions, Temporal, or similar). - Team & Process - Collaborate with product engineers, data engineers, and founders to ensure infrastructure supports rapid iteration and reliable delivery. - Document infrastructure decisions, runbooks, and operational procedures to support team knowledge sharing and onboarding. - Regularly review and improve operational workflows, automation coverage, and infrastructure cost efficiency. Qualifications - 4+ years of experience in DevOps, SRE, or similar infrastructure-focused roles. - Proficient in AWS with strong Infrastructure-as-Code experience (Terraform preferred). - Strong CI/CD expertise with GitHub Actions. - Experience with containerization and serverless architectures. - Skilled in monitoring, logging, and incident management. - Strong scripting and automation skills in Bash, Python, or Node.js. - Knowledge of cloud security principles, least privilege, and compliance requirements. - Experience with Snowflake and data engineering workflows (ETL, dbt). - Exposure to Kubernetes and orchestration tools. - Understanding of serverless event-driven pipelines (Step Functions, Temporal). - Demonstrated, regular use of AI-powered development tools (e.g., Cursor, GitHub Copilot, Claude Code, or similar) to accelerate infrastructure authoring, debugging, or documentation workflows. - Startup mindset: hands-on, resourceful, and comfortable operating in a fast-paced environment. Preferred - Experience with event-driven workflow orchestration tools such as Step Functions, Temporal, Airflow, or Prefect. - Familiarity with agentic workflow patterns, including integrating API-based decision points, asynchronous task handling, and dynamic routing of requests. - Understanding of infrastructure requirements for AI-powered automation, including latency optimization, autoscaling strategies, and cost-efficient compute for high-throughput processes. - Ability to design secure secrets and configuration management systems for agentic processes, including API keys, model tokens, and external service credentials. - Experience implementing observability for multi-step workflows, including distributed tracing, structured logging, and audit-friendly data pipelines. - Experience with prompt engineering for IaC generation, incident analysis, or building AI-powered operational tooling. - Prior early-stage startup experience is highly preferred. Benefits - Competitive salary - Unlimited PTO - Remote-first with flexible hours - Yearly professional development budget - Home office setup stipend

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Continuum logo

Release Engineer

Continuum

Accelerating Digital at the Speed of Government

DevOps Engineer93 days ago
OtherRemoteTeam 11-50Since 2023H1B Sponsor

• Lead CI/CD practices for Dynamics 365, Power Platform, and related Azure components, driving consistent and reliable releases. • Manage solution builds, packaging, and deployments across Dev, Test, UAT, and Production using Azure DevOps and proven ALM methods. • Review and validate environment configurations, layering (managed/unmanaged), patch strategies, and dependencies for D365 apps, Dataverse, plugins, and integrations. • Ensure version consistency, connection references, environment variables, and API dependencies are aligned for each release. • Troubleshoot complex release issues such as solution import errors, plugin exceptions, or schema conflicts, and establish long-term prevention tactics. • Support change control by ensuring all Change Requests include accurate release notes, dependency documentation, and the required approvals aligned with governance standards. • Use Power Platform tools (Solution Checker, Plugin Trace Logs, Dataverse telemetry, Azure Application Insights) to diagnose and resolve deployment issues. • Champion automation by building and maintaining Azure DevOps Pipelines or GitHub Actions for exports, validation, versioning, and deployment. • Collaborate with Dynamics developers, architects, integration teams, and functional consultants to coordinate cross-system releases involving Azure APIs, Power Automate, and Power Pages. • Continuously improve release governance, environment management, and ALM practices to support a sustainable D365 delivery model.

Virginia
Job Closed
EverOps logo

Senior DevOps Engineer

EverOps

The Embedded Service Provider

DevOps Engineer93 days ago
OtherRemoteTeam 51-200H1B No Sponsor

• Develop and use automation tools effectively to operate, manage, and scale production and development environments in Azure quickly • Design, build, and maintain CI/CD pipelines using Azure DevOps Pipelines, including multi-stage YAML pipelines for infrastructure and application deployments • Author and maintain Azure infrastructure using Bicep templates and Terraform modules, following IaC best practices • Participate in regular customer and internal EverOps scrums • Monitor Azure environments using native tooling and third-party platforms while focusing on constant improvement • Implement new Azure services and technologies as customer requirements evolve • Design and execute new solutions while working to improve existing ones • Provide operational support and project deployments for our customer environments

United States
Job Closed
Jobgether logo

Senior Staff Site Reliability Engineer

Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

DevOps Engineer94 days ago
OtherRemoteH1B No Sponsor

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description This role is pivotal in ensuring the reliability, scalability, and performance of cloud-based enterprise software. As a Senior Staff Site Reliability Engineer, you will: - Design, deploy, and maintain robust infrastructure for mission-critical services - Collaborate closely with development teams to optimize CI/CD pipelines and automate operational workflows - Provide guidance on distributed systems, cloud architecture, and containerized environments - Influence both technical strategy and day-to-day operations - Combine hands-on engineering with leadership in best practices for deployment automation, observability, and cost optimization - Mentor peers and participate in technology evaluations - Ensure resilient, customer-focused infrastructure Qualifications - Strong experience in scalable, distributed systems architecture and cloud platforms - Proficiency in programming with Go (Golang) and containerization technologies such as Docker - Hands-on experience with Kubernetes and orchestration technologies - Expertise in CI/CD processes, deployment automation, and configuration management - Solid understanding of Git workflows in a collaborative team environment - Bachelor’s degree in Computer Science or equivalent experience - Strong analytical, problem-solving, and communication skills - Experience with networking fundamentals, identity and access management, and monitoring/observability tools is a plus - Ability to work independently and collaboratively in a fast-paced, fully remote environment Requirements - Participate in on-call rotation to maintain operational excellence for production systems - Evaluate emerging technologies and recommend solutions to enhance system reliability and security Benefits - Competitive salary range of $170,000–$230,000 - Flexible, fully remote U.S. work environment - Generous paid time off and holiday schedule - Parental leave and progressive healthcare options - Retirement savings programs - Education reimbursement opportunities - Team bonding events and global volunteering initiatives - Inclusive, collaborative culture emphasizing growth and development

United States
Job Closed
Modivcare logo

DevOps Engineer III

Modivcare

To bring equity, hope and healing to those who need it most. To make a world of difference, one member at a time.

DevOps Engineer94 days ago
OtherRemoteTeam 10,001+Since 2017H1B Sponsor

• Designs, builds, and maintains scalable and robust infrastructure using cloud platforms (e.g., AWS, Azure) and containerization technologies (e.g., Docker, Kubernetes, ECS). • Collaborates with InfoSec to ensure the team is building a secure, scalable cloud infrastructure. • Develops and maintains enterprise-grade CI/CD pipelines and components to automate the build, test, and deployment processes for applications. • Implements and manages version control systems (e.g., Git) and artifact repositories to ensure efficient code collaboration and artifact management. • Monitors and improves the performance and reliability of CI/CD pipelines, addressing bottlenecks and implementing proactive measures. • Implements monitoring and logging solutions (e.g., Datadog, Prometheus, ELK stack) to track system health, identify performance issues, and troubleshoot incidents. • Collaborates with development and operations teams to diagnose and resolve production issues, ensuring quick resolution and minimal disruption to services. • Continuously monitors system capacity, performance, and security, implementing proactive measures to optimize resource utilization and enhance system stability. • Develops automation scripts (e.g., TypeScript, Bash, Python) to streamline routine operational tasks, improve efficiency, and reduce manual intervention. • Automates the deployment and configuration of applications, services, and infrastructure components using Infrastructure as Code (IaC) tools such as Pulumi, Terraform, CDK, or CloudFormation. • Works closely with cross-functional teams, including developers, testers, and operations, to foster a collaborative DevOps culture and drive continuous improvement. • Creates and maintains detailed technical documentation, including system diagrams, architectural designs, and standard operating procedures (SOPs).

United States
$97.2K - $133.7K / year
Job Closed