Job Closed

This listing is no longer active.

ghSMART logo
ghSMART

We help CEOs, boards and investors develop winning executive teams and make high-stakes leadership decisions.

Machine Learning Engineer, LLMs / RAG

Machine Learning EngineerMachine Learning EngineerOtherRemoteSeniorTeam 51-200Since 1995H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

134 days ago

Salary

$160K - $175K / year

Seniority

Senior

Bachelor Degree5 yrs expEnglishAzure

Job Description

Machine Learning Engineer, LLMs / RAG

ghSMART

• Design, build, and extend ML models (LLMs and traditional ML) that deliver high-accuracy insights from ghSMART’s structured leadership dataset; own end-to-end experimentation, evaluation, and deployment. • Develop RAG-based agents and algorithms to unlock novel leadership insights from our research database. • Integrate advanced solutions and AI Agents into the Leadership Intelligence Platform and partner cross-functionally to align features with strategic objectives and user needs. • Optimize data pipelines and workflows to ensure robust, efficient data ingestion, transformation, and model serving across engineering teams. • Collaborate on research with academic partners and contribute to publications and thought leadership by validating findings with rigorous methods.

Job Requirements

  • 5+ years of ML engineering experience building and shipping large-scale models and systems (training, tuning, inference, MLOps, monitoring).
  • Hands-on expertise with RAG frameworks and LLMs, including designing retrieval strategies, prompt orchestration, evaluation, and deployment at scale. Experience building AI agents via the LangChain, LangGraph framework is a plus.
  • Strong data engineering fundamentals across pipelines, data quality, and feature engineering to support reliable ML workflows. Experience with Databricks and Azure is a plus.
  • Security and privacy mindset, with experience applying best practices to protect sensitive data in ML systems.
  • Collaborative, remote-first working style with clear communication and ownership; familiarity with Salesforce (SFDC), Jira, Confluence, and Git.

Benefits

  • 401(k) plan with an annual employer contribution
  • Comprehensive benefits package
  • Flexible schedules to support life outside of work

Related Job Pages

More Machine Learning Engineer Jobs

Capital Technology Group, LLC logo

Lead AI/ML Engineer

Capital Technology Group, LLC

Simple Solutions for Complex Problems

OtherRemoteTeam 11-50Since 2010H1B No Sponsor

• Leads the team in evaluating new or emerging technologies using prototypes and/or proof of concepts • Analyzes and communicates the benefits and risks in implementing solutions using the new technologies • Shall have the ability to perform as a developer, operations, and security engineer in cloud environments • Lead teams to support the adoption of new technologies across the enterprise • Provides technical leadership by supervising and mentoring junior IT specialists

United States
$155K - $185K / year
Waymo logo

ML Engineer, Foundation Model Infrastructure

Waymo

Waymo is a company in the autonomous driving technology space offering self-driving vehicles with the potential to increase mobility and decrease lives lost in

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. This role follows a hybrid work schedule and you will report to a Senior Research Scientist. - Build and operate the petabyte-scale data systems and ML pipelines at the heart of Waymo's foundation model development - Shepherd cutting-edge foundation models from research prototypes to robust components within the Waymo Driver - Create the automated infrastructure for rigorously benchmarking, continuously monitoring, and safely releasing models - Wield large-scale compute and frameworks like Flume and JAX to process massive datasets and train/deploy complex models - Drive significant leaps in the speed, reliability, and efficiency of the end-to-end ML development lifecycle - Partner with AI Foundations, ML, and Platform experts to transform model innovations into tangible on-road improvements Qualifications - Masters degree in Computer Science, Machine Learning, Robotics, similar technical field of study, or equivalent practical experience - Proficiency in Python - Proficiency in C++ - Familiarity with one of the modern deep learning frameworks (e.g. Pytorch, JAX, Tensorflow) - Experience building or maintaining large-scale data pipelines or ML infrastructure (e.g., Flume, Spark, Borg, Kubeflow) Requirements - Strong hands-on SWE skills, able to drive development of large, complex shared codebases - Experience in AV planning and related research - Experience designing and building distributed systems or MLOps platforms (e.g., model versioning, experiment tracking, CI/CD for ML) - Prior work in an industrial or research setting developing methodologies for the evaluation of ML models Benefits - Eligible to participate in Waymo’s discretionary annual bonus program - Equity incentive plan - Generous Company benefits program, subject to eligibility requirements Salary Range $204,000 — $259,000 USD

United States
$204K - $259K / year
Job Closed
Runway logo

Member of Technical Staff, Inference

Runway

Business financials got stuck in the 15th century so we're showing them today’s computers 🖥

OtherRemoteTeam 11-50Since 2018H1B Sponsor

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description We're looking for an ML infrastructure engineer to bridge the gap between research and production at Runway. You'll work directly with our research teams to productionize cutting-edge generative models—taking checkpoints from training to staging to production, ensuring reliability at scale, and building the infrastructure that enables fast iteration. You'll be embedded within research teams, providing platform support throughout the entire model development lifecycle. Your work will directly impact how quickly we can ship new models and features to millions of users. A peek at our technical stack - API endpoints for real-time collaboration and media asset management written in TypeScript, running in ECS containers on AWS Fargate. - Leverage multiple AWS-native components, such as S3, CloudFront, Lambda, Kinesis, and SQS. - Inference backend written in Python (PyTorch, TorchScript), deployed across multiple clusters/cloud providers. - Use Kubernetes for container orchestration, with k8s-native components such as Flyte, Kueue, and Kyverno for efficient job orchestration. - Invest in Prometheus and Grafana for monitoring, and Terraform to manage infrastructure. Qualifications - 4+ years of experience running ML model inference at scale in production environments. - Strong experience with PyTorch and multi-GPU inference for large models. - Experience with Kubernetes for ML workloads—deploying, scaling, and debugging GPU-based services. - Comfortable working across multiple cloud providers and managing GPU driver compatibility. - Experience with monitoring and observability for ML systems (errors, throughput, GPU utilization). - Self-starter who can work embedded with research teams and move fast. - Strong systems thinking and pragmatic approach to production reliability. - Humility and open-mindedness; at Runway we love to learn from one another. Requirements - Experience building custom inference frameworks or serving systems (Nice to Have). - Deep understanding of distributed training and inference patterns (FSDP, data parallelism, tensor parallelism) (Nice to Have). - Ability to debug low-level issues: NCCL networking problems, CUDA errors, memory leaks, performance bottlenecks (Nice to Have). - Experience with diffusion models or video generation systems (Nice to Have). - Knowledge of real-time or latency-sensitive ML applications (Nice to Have). Benefits - Salary range: $240,000 - $290,000. - Commitment to creating a space where employees can bring their full selves to work and have equal opportunity to succeed. Company Description Runway strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on competitive market rates for our size, stage, and industry, and salary is just one part of the overall compensation package we provide. There are many factors that go into salary determinations, including relevant experience, skill level and qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range. Lastly, the provided range is the expected salary for candidates in the U.S. Outside of those regions, there may be a change in the range, which again, will be communicated to candidates. We're excited to be recognized as a best place to work by Crain's, InHerSight, BuiltIn NYC, and INC.

United States + 171 moreAll locations: United States | Canada | Brazil | Colombia | Argentina | Chile | Venezuela | Bolivia | Ecuador | French Guiana | Guyana | Paraguay | Peru | Suriname | Uruguay | Mexico | Costa Rica | El Salvador | Guatemala | Honduras | Nicaragua | Panama | Dominican Republic | Puerto Rico | Bahamas | Guadeloupe | Haiti | Jamaica | Martinique | Montserrat | United Kingdom | Germany | France | Estonia | Portugal | Hungary | Poland | Ukraine | Romania | Bulgaria | Czechia | Slovakia | Belarus | Moldova | Sweden | Greece | Belgium | Italy | Ireland | Switzerland | Netherlands | Finland | Malta | Denmark | Lithuania | Croatia | Spain | Austria | Bosnia And Herzegovina | Iceland | Luxembourg | North Macedonia | Montenegro | Norway | Serbia | Slovenia | Albania | Cyprus | Latvia | Monaco | South Africa | Egypt | Algeria | Angola | Benin | Botswana | Burkina Faso | Burundi | Cameroon | Cabo Verde | Central African Republic | Chad | Congo | Côte D'ivoire | Democratic Republic of the Congo | Equatorial Guinea | Eritrea | Ethiopia | Gabon | Gambia | Ghana | Guinea | Guinea-bissau | Kenya | Lesotho | Liberia | Libya | Madagascar | Malawi | Mali | Mauritania | Mauritius | Mayotte | Morocco | Mozambique | Namibia | Niger | Nigeria | Réunion | Rwanda | Senegal | Seychelles | Sierra Leone | Somalia | Sudan | Eswatini | Tanzania | Togo | Tunisia | Uganda | Zambia | Zimbabwe | Georgia | Turkey | Israel | United Arab Emirates | Armenia | Azerbaijan | Bahrain | Iraq | Jordan | Kuwait | Lebanon | Oman | Qatar | Saudi Arabia | Palestine | Yemen | India | Japan | Philippines | Pakistan | Thailand | Singapore | Vietnam | Taiwan | Indonesia | Cambodia | Laos | Malaysia | Myanmar | South Korea | China | Afghanistan | Bangladesh | Bhutan | Kazakhstan | Kyrgyzstan | Maldives | Mongolia | Nepal | Sri Lanka | Tajikistan | Turkmenistan | Uzbekistan | Australia | Papua New Guinea | Kiribati | Palau | French Polynesia | Tuvalu | New Zealand
$240K - $290K / year
Job Closed
Waymo logo

ML Engineer, Foundation Model Evaluation

Waymo

Waymo is a company in the autonomous driving technology space offering self-driving vehicles with the potential to increase mobility and decrease lives lost in

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. This role follows a hybrid work schedule and you will report to a Senior Research Scientist. - Develop and extend cutting-edge research in robotics and machine learning to advance state-of-the-art methodologies for evaluating the quality, safety, and realism of embodied AI agents - Partner within and across organizations to land disruptive and innovative tech in production - Work with a variety of state-of-the-art Foundation Models - Drive model development through defining evaluation and benchmarks - Implement and extend large scale data and evaluation pipelines Qualifications - Masters degree in Computer Science, Machine Learning, Robotics, similar technical field of study, or equivalent practical experience - Proficiency in Python - Familiarity with one of the modern deep learning frameworks (e.g. Pytorch, JAX, Tensorflow) - Prior work in an industrial or research setting developing methodologies for the evaluation of ML models Requirements - Strong hands-on SWE skills, able to design, implement, and extend large distributed pipelines - Track record of publications in top-tier conferences or leading open source projects in the related fields - Proficiency in C++ - Experience in AV planning and related research - Experience in labeling and curating data for ML eval and training Benefits - Eligibility to participate in Waymo’s discretionary annual bonus program - Equity incentive plan - Generous Company benefits program, subject to eligibility requirements Salary Range The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Salary Range: $170,000 — $216,000 USD

United States
$170K - $216K / year
Job Closed