Job Closed

This listing is no longer active.

Cantina logo
Cantina

Building the first social AI platform

Machine Learning Engineer, Core Evaluations

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteSeniorTeam 51-200Since founded by Sean ParkerH1B SponsorCompany SiteLinkedIn

Location

Europe

Posted

34 days ago

Salary

0

Seniority

Senior

Bachelor DegreeEnglish

Job Description

Machine Learning Engineer, Core Evaluations

Cantina

• Designing model evaluation pipelines for models in development and production • Designing user studies for subjective model evaluations. • Converting requirements into measurable metrics. • Designing and developing automated evaluation dashboard to see model performances and compare results. • Training new models to capture new and different evaluation metrics. • Communicating with the model team to help design better models based on the evaluation results. • Communicating with the data team to help decide the type of data necessary to improve model performance. • Communication with the product-manager to make sure product requirements are correctly measured. • Help grow the evaluation team as the founding member. • Lead the evaluation team in the future.

Job Requirements

  • Strong experience and intuition for designing metrics that capture model performance.
  • Strong experience with designing user studies on Mechanical Turk or similar platforms. .
  • Strong experience with model training and fine-tuning for model evaluation.
  • Strong statistical knowledge and experience to statistically compare evaluation results and take decisions.
  • Very strong engineering and programming skills.
  • Experience with training ASR, TTS models.
  • Experience at ML teams working on large-scale machine learning problems. (>3B models with >1m hours of data)

Benefits

  • Please Note: to ensure that candidates select the most relevant jobs for their skills, we have set up limits to the number of times candidates can apply. The following limits apply to all roles at Cantina:
  • Candidates may not apply more than 3 times in any 60 days for any job
  • Candidates may not re-apply to any role within 30 days if not presented with an offer (for re-applying to the same job, the limit is 90 days)

Related Job Pages

More Machine Learning Engineer Jobs

ContractRemoteTeam 11-50H1B No Sponsor

• Ship production LLM agents with tool use: orchestration (state, pause/resume, journey/mode routing, failure handling) • MCP or gateway-style tool integration (schemas, auth, idempotency, audit) • Customized evals • Build alongside internal team

Argentina
3Pillar Global logo

Senior AI/ML Engineer

3Pillar Global

Building digital businesses, together.

ContractRemoteTeam 1,001-5,000H1B Sponsor

• Design, build, and deploy AI agent workflows that enable measurable efficiencies across business functions • Implement orchestration patterns including single-agent and multi-agent systems with appropriate state management • Integrate AI capabilities with existing tools and platforms via APIs, plugins, and function calling • Define and enforce guardrails for agent behavior, capabilities, and permissions in partnership with product and security • Build reusable automation patterns and templates that can be adopted across teams • Work alongside our data platform team to enable AI-driven analytics and reporting tools • Implement monitoring, observability, and feedback loops for deployed AI workflows • Ensure responsible AI practices, including fairness, transparency, and risk mitigation • Evaluate and recommend AI tooling, frameworks, and model providers based on business needs and cost • Stay current on industry trends, emerging AI capabilities, and best practices in automation engineering • Mentor engineers on integrating AI into their development workflows

United States
3CON Consultoria e Sistemas logo

Engenheiro MLOps, AWS Sênior

3CON Consultoria e Sistemas

Enabler de inovação e transformação digital, construindo soluções de impacto para os negócios.

Full TimeRemoteTeam 501-1,000Since 1993H1B No Sponsor

• Desenvolver, validar e manter pipelines de MLOps, garantindo a automação de processos e integração com serviços da AWS. • Projetar e manter a arquitetura de sistemas, assegurando alinhamento com os objetivos dos projetos. • Garantir a sustentação e evolução da ferramenta de automação n8n. • Trabalhar em colaboração com cientistas de dados para operacionalizar modelos de machine learning. • Atuar na resolução de chamados relacionados ao time de MLOps. • Propor e desenvolver soluções técnicas (como aplicações web, APIs e agentes baseados em LLMs) para apoiar a área de Ciência de Dados.

Brazil
Job Closed
C-Serv logo

Senior Machine Learning Engineer, General AI, ML, Big Data

C-Serv

Tech Talent Solutions | Building High-Impact Teams Worldwide

Full TimeRemoteTeam 51-200Since 2013H1B No Sponsor

**What You'll Do** - Define and drive the technical vision for ML solutions across products and platforms - Own the end-to-end software development lifecycle — from design and code reviews through to deployment and operations - Architect high-performance, scalable microservices, including synchronous and asynchronous web services - Build real-time inference pipelines for complex models using Triton, TensorRT, and mixed-precision computing - Mentor engineers, set technical direction, and foster a strong team culture - Champion engineering excellence, system resilience, and continuous operational improvement

Canada