The strongest procure-to-pay platform for mid-market and enterprise companies with integrations to your ERP system.
Senior MLOps Engineer
Location
India
Posted
42 days ago
Salary
0
Seniority
Senior
Job Description
Senior MLOps Engineer
PairSoft
• Design, implement, and maintain end-to-end ML pipelines covering training, validation, deployment, monitoring, and retraining — with a focus on production reliability and long-term maintainability. • Own and operate production ML infrastructure using Infrastructure as Code (IaC), making architectural tradeoffs and enforcing best practices. • Lead CI/CD practices for ML, including artifact/model versioning, promotion, rollout/rollback, and dev/test/prod parity. • Deploy and run ML/GenAI workloads on Azure using Azure App Service and Azure Container Apps, with monitoring via Application Insights. • Implement robust model observability: performance monitoring, data quality checks, drift detection, alerting, and dashboards. • Drive compute and cost optimization for training and inference (scaling policies, capacity planning, cost/performance tradeoffs). • Support GenAI operational needs, including LLM inference patterns, embeddings, and retrieval pipelines; enable hooks for evals/guardrails where required. • Ensure ML systems meet security and governance requirements (RBAC/least privilege, secrets management, audit logging, encryption, secure access patterns). • Partner with the Data & AI Architect to translate architecture standards into reusable pipeline templates and operational controls. • Collaborate with and mentor AI engineers; contribute to model development/experimentation as capacity allows.
Job Requirements
- 5+ years of experience in MLOps, ML engineering, platform engineering, or a closely related role, with at least 2 years in a senior or lead capacity.
- Strong proficiency in Python for ML workflows, automation, and pipeline development.
- Hands-on experience building and operating ML systems on Azure (OCI exposure is a plus).
- Proven track record of owning production-grade MLOps pipelines end-to-end (training → deployment → monitoring → retraining) with measurable reliability or efficiency outcomes.
- Strong experience with Infrastructure as Code (Terraform or equivalent).
- Experience with MLOps tooling such as MLflow (or equivalent experiment tracking) and CI/CD pipelines.
- Experience containerizing services using Docker in production environments.
- Hands-on experience deploying and monitoring services on Azure using Azure App Service, Azure Container Apps, and Application Insights.
- Solid understanding of GenAI/LLM-based systems (inference workflows, embeddings, retrieval/RAG components) and their operational considerations.
- Strong communication and collaboration skills; comfortable working across functions and influencing technical decisions without direct authority.
Benefits
- Company Paid Group Mediclaim Insurance for employee, spouse and up to 2 Kids of INR 400,000 per annum
- Company Paid Group Personal accidental insurance for employees of INR 1,000,000 per annum.
- Company Paid & Manager approved Career Advancement Opportunities
- Best in the Industry referral bonus policy.
- 29 Paid leaves throughout the year
- Company-paid Maternity leaves for female employees
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
• Work across the Campaign Monitor product to identify valuable opportunities in product and customer data, and turn them into predictive features that improve customer outcomes • Turn rich historical product and customer data into predictive features that improve customer outcomes • Identify high-impact opportunities for applied machine learning by analyzing product, behavioral, and content data, and translating ambiguous product questions into concrete ML use cases • Develop and deploy predictive machine learning models, including models for click-through rate, churn, recommendations, and related engagement signals • Design and build features and training datasets from structured product data, historical behavioral data, and content-derived signals • Own the applied ML lifecycle from data exploration and feature engineering through training, evaluation, deployment, monitoring, and iteration • Build production services and workflows for batch and real-time inference, with a pragmatic focus on reliability, maintainability, and speed to impact • Work hands-on in the codebase, contributing to backend systems and product workflows that consume predictions and recommendations • Partner closely with product, design, and engineering to turn customer needs into ML-driven product capabilities with measurable business impact • Establish pragmatic best practices for model evaluation, experimentation, monitoring, and continuous improvement • Help shape how applied machine learning is introduced into the product, while aligning with broader engineering architecture and delivery practices • Contribute to shared knowledge across the engineering organization to improve understanding and adoption of applied ML over time.
Software Development Engineer – MLOps
WillHireNow Magnit - Follow our new LinkedIn account https://www.linkedin.com/company/magnitglobal
• Design, implement, and deliver highly scalable features for our AI platform. • Partner with Agent Developers, Data Scientists, and other Software Engineers to create the technology that brings these features to life. • Support contracts with the U.S. Federal Government requiring US citizenship for personnel.
Machine Learning Engineer
Salesforce👋 We're Salesforce, the customer company. CRM + Data + AI + Trust.
• Build next-generation agentic AI platforms • Work with Data Scientists, Software Engineers, product managers, and other stakeholders to design, implement, and iterate agentic AI systems with customers • Innovate at the frontier of the field, having the opportunity to create new solutions and define new categories of products with meaningful impact to Salesforce customers and beyond.
Senior Applied Scientist - Machine Learning Systems Engineer- Photoshop
AdobeChanging the world through digital experiences.
Role Description Photoshop ART is seeking a Senior Machine Learning (ML) Systems & Efficiency Engineer to join our R&D team focused on delivering practical, production-ready improvements in inference performance, latency, and cost efficiency across image editing applications. This role sits at the intersection of model architecture, systems, inference runtimes, and services, with a clear mandate: deliver high-quality ML systems at substantially lower cost and higher efficiency. Individuals in this role are expected to have deep expertise in areas such as Artificial Intelligence (AI), ML systems, and computer vision. Strong preference will be given to candidates with experience in distributed inference, multimodal model profiling, and performance optimization. You will work closely with research, product, and infrastructure teams to influence model design decisions, improve GPU utilization, and build scalable, cost-aware ML systems deployed in production. This is a hands-on, high-leverage role where a single engineer can drive outsized impact, potentially saving millions of dollars in compute costs. The ideal candidate will have a strong interest in developing practical innovations that advance Adobe products. Qualifications - Master’s or PhD in Computer Science, Electrical Engineering, or a related field, with a focus on machine learning systems, distributed systems, or high-performance computing. - Hands-on experience implementing and scaling large-scale inference or serving workloads using distributed frameworks and runtime systems (e.g., Triton, vLLM, SGLang, xDiT, or similar). - Experience applying inference compilation and optimization tools (e.g., TensorRT, ONNX Runtime, AOTI), including techniques such as operator fusion and graph-level optimization. - Strong understanding of GPU architecture (e.g., memory hierarchy, compute throughput, communication bandwidth) and practical experience diagnosing performance bottlenecks across compute, memory, and I/O subsystems. - Proficiency in Python and C++, with experience building high-performance or distributed systems. - Familiarity with CUDA or Triton for performance-critical workloads is highly desirable. - Demonstrated ability to make engineering decisions based on rigorous measurement and benchmarking, with a focus on improving system efficiency, scalability, and reliability in production environments. Requirements - Design and optimize high-throughput, low-latency inference systems. - Optimize model architectures to improve deployment and runtime efficiency using techniques such as distillation, pruning, quantization, and Mixture-of-Experts (MoE). - Implement advanced serving strategies including batching, caching (KV, semantic, embedding), quantization (FP8/INT8), and distributed inference strategies. - Write and maintain high-performance GPU kernels using Triton or CUDA to accelerate custom model layers and critical workloads. - Conduct deep performance analysis using tools such as PyTorch Profiler and NVIDIA Nsight to identify bottlenecks in compute, memory, and communication. - Partner with infrastructure teams to design scalable and reliable distributed serving systems across heterogeneous hardware environments. - Establish and track efficiency metrics such as cost per million inferences. - Serve as a trusted technical advisor to research and product teams on efficiency tradeoffs. Benefits - Competitive salary and performance-based incentives. - Comprehensive benefits programs. - Opportunities for professional development and growth. Company Description Adobe empowers everyone to create through innovative platforms and tools that unleash creativity, productivity and personalized customer experiences. Adobe’s industry-leading offerings enable people and businesses to turn ideas into impact, powered by AI and driven by human ingenuity. Our 30,000+ employees worldwide are creating the future and raising the bar as we drive the next decade of growth. We’re on a mission to hire the very best and believe in creating a company culture where all employees are empowered to make an impact.




