Job Closed

This listing is no longer active.

ML Ops Engineer

Machine Learning EngineerMachine Learning EngineerOtherRemoteMid LevelTeam 1,001-5,000H1B SponsorCompany SiteLinkedIn

Location

New Jersey

Posted

137 days ago

Salary

$127K - $160.6K / year

Seniority

Mid Level

Job Description

ML Ops Engineer

Zelis

• Build and maintain monitoring infrastructure for conventional machine learning models, with capabilities for performance tracking, drift detection, and alerting. • Research, evaluate, and implement monitoring strategies and tools for Generative AI systems, including LLMs and Agentic AI architectures. • Collaborate with ML Engineers, Data Scientists, and DevOps teams to deploy, manage, and monitor models in production. • Develop and support scalable, secure, and automated data pipelines using Snowflake, SQL, and Python for training, serving, and monitoring ML and GenAI models. • Leverage AutoML tools and frameworks (e.g., MLflow, Kubeflow, SageMaker Autopilot) to streamline experimentation and deployment. • Design dashboards and reporting systems to visualize model health metrics and surface key operational insights. • Ensure auditability, reproducibility, and compliance for model performance and data flow in production environments, with consideration for regulatory standards like GDPR and HIPAA. • Maintain CI/CD workflows and version-controlled codebases (e.g., Git) for ML infrastructure and pipelines. • Utilize containerization and orchestration technologies (e.g., Docker) to manage scalable ML infrastructure. • Leverage tools such as Streamlit and Python visualization libraries to present insights from model and data monitoring. • Perform root cause analyses on model degradation or data quality issues, and proactively implement improvements. • Stay current on industry developments related to ML observability, model governance, responsible GenAI practices, and AI security. • Contribute to analytics projects and data engineering initiatives as needed. • Provide off-hours support for critical deployments or urgent data/model issues.

Job Requirements

  • 2–5 years of experience in ML Ops, ML Engineering, or a related role with a focus on production-level model monitoring, automation, and deployment.
  • Strong experience with ML observability tools or custom-built monitoring systems.
  • Experience with monitoring LLMs and Generative AI models, including prompt evaluation, hallucination tracking, and agent behavior auditing.
  • Experience in deploying and managing ML workloads using containerization and orchestration platforms such as Docker, Kubernetes, Kubeflow, or TensorFlow Extended.
  • Familiarity with AutoML pipelines and workflow management tools (e.g., MLflow, SageMaker Autopilot).
  • Experience working in cloud environments, preferably AWS (e.g., SageMaker, S3, Lambda, ECS/EKS).
  • Understanding of ML lifecycle tools (e.g., MLflow, SageMaker Pipelines) and CI/CD practices.
  • Strong security and compliance awareness, particularly related to model/data governance (e.g., HIPAA, GDPR).
  • Proficiency in Python and key data libraries (Pandas, Numpy, Matplotlib, etc.).
  • Advanced SQL skills and experience with Snowflake or similar data warehousing platforms.
  • Proficiency with version control (Git) and agile development methodologies.
  • Strong collaboration and communication skills, with the ability to explain technical issues to both technical and non-technical stakeholders.
  • Bachelor’s degree in Computer Science, Engineering, Data Science, or a related field—or equivalent industry experience.
  • Domain experience in healthcare data (claims, payments) is preferred.

Benefits

  • 401k plan with employer match
  • flexible paid time off
  • holidays
  • parental leaves
  • life and disability insurance
  • health benefits including medical, dental, vision, and prescription drug coverage

Related Job Pages

More Machine Learning Engineer Jobs

River Financial logo

Staff Software Engineer, Machine Learning, Full-stack

River Financial

Buy and mine Bitcoin. Zero-fee DCA. 100% reserve custody. Built for people who want more Bitcoin. www.River.com

Full TimeRemoteTeam 51-200Since 2020H1B No Sponsor

• Design, build, and own Elixir backend systems used across onboarding, fraud detection, compliance, and operations with a direct impact on the experience of hundreds of thousands of clients • Build and maintain data pipelines, integrations, and analytics infrastructure for a rapidly growing team • Develop internal tools used daily by operations and compliance teams • Build and maintain training and inference infrastructure for machine learning models and contribute to models where appropriate • Productionize outputs from machine learning models, heuristics, and LLM-based systems • Partner closely with product management and operations to plan and scope new projects and initiatives • Write high-quality, tested code • Participate in code reviews • Take long-term ownership of critical systems as River scales

North America
$200K - $250K / year
Job Closed
ghSMART logo

Machine Learning Engineer, LLMs / RAG

ghSMART

We help CEOs, boards and investors develop winning executive teams and make high-stakes leadership decisions.

OtherRemoteTeam 51-200Since 1995H1B No Sponsor

• Design, build, and extend ML models (LLMs and traditional ML) that deliver high-accuracy insights from ghSMART’s structured leadership dataset; own end-to-end experimentation, evaluation, and deployment. • Develop RAG-based agents and algorithms to unlock novel leadership insights from our research database. • Integrate advanced solutions and AI Agents into the Leadership Intelligence Platform and partner cross-functionally to align features with strategic objectives and user needs. • Optimize data pipelines and workflows to ensure robust, efficient data ingestion, transformation, and model serving across engineering teams. • Collaborate on research with academic partners and contribute to publications and thought leadership by validating findings with rigorous methods.

United States
$160K - $175K / year
Job Closed
Capital Technology Group, LLC logo

Lead AI/ML Engineer

Capital Technology Group, LLC

Simple Solutions for Complex Problems

OtherRemoteTeam 11-50Since 2010H1B No Sponsor

• Leads the team in evaluating new or emerging technologies using prototypes and/or proof of concepts • Analyzes and communicates the benefits and risks in implementing solutions using the new technologies • Shall have the ability to perform as a developer, operations, and security engineer in cloud environments • Lead teams to support the adoption of new technologies across the enterprise • Provides technical leadership by supervising and mentoring junior IT specialists

United States
$155K - $185K / year
Waymo logo

ML Engineer, Foundation Model Infrastructure

Waymo

Waymo is a company in the autonomous driving technology space offering self-driving vehicles with the potential to increase mobility and decrease lives lost in

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. This role follows a hybrid work schedule and you will report to a Senior Research Scientist. - Build and operate the petabyte-scale data systems and ML pipelines at the heart of Waymo's foundation model development - Shepherd cutting-edge foundation models from research prototypes to robust components within the Waymo Driver - Create the automated infrastructure for rigorously benchmarking, continuously monitoring, and safely releasing models - Wield large-scale compute and frameworks like Flume and JAX to process massive datasets and train/deploy complex models - Drive significant leaps in the speed, reliability, and efficiency of the end-to-end ML development lifecycle - Partner with AI Foundations, ML, and Platform experts to transform model innovations into tangible on-road improvements Qualifications - Masters degree in Computer Science, Machine Learning, Robotics, similar technical field of study, or equivalent practical experience - Proficiency in Python - Proficiency in C++ - Familiarity with one of the modern deep learning frameworks (e.g. Pytorch, JAX, Tensorflow) - Experience building or maintaining large-scale data pipelines or ML infrastructure (e.g., Flume, Spark, Borg, Kubeflow) Requirements - Strong hands-on SWE skills, able to drive development of large, complex shared codebases - Experience in AV planning and related research - Experience designing and building distributed systems or MLOps platforms (e.g., model versioning, experiment tracking, CI/CD for ML) - Prior work in an industrial or research setting developing methodologies for the evaluation of ML models Benefits - Eligible to participate in Waymo’s discretionary annual bonus program - Equity incentive plan - Generous Company benefits program, subject to eligibility requirements Salary Range $204,000 — $259,000 USD

United States
$204K - $259K / year
Job Closed