Cantina logo
Cantina

Building the first social AI platform

Applied ML Engineer, Data

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteSeniorTeam 51-200Since founded by Sean ParkerH1B SponsorCompany SiteLinkedIn

Location

Europe

Posted

58 days ago

Salary

$200K - $260K / year

Seniority

Senior

Bachelor Degree3 yrs expEnglishAWSCloudDynamoDBKubernetesPythonPyTorch

Job Description

Applied ML Engineer, Data

Cantina

• Build and maintain data pipelines for large video generation models, including data ingestion, parsing, filtering, preprocessing, and dataset curation at scale, using tools such as AWS S3 and DynamoDB. • Design and run annotation workflows across platforms such as MTurk, Prolific, and Mechanical Turk, including task design, quality control, and label validation. • Train, evaluate, and improve smaller supporting models used for data filtering, quality assessment, preprocessing, or other parts of the ML pipeline. • Partner closely with research and engineering teams to turn experimental workflows into scalable, repeatable systems that support model training and evaluation. • Own data quality across the pipeline by identifying bottlenecks, failure modes, and low-quality sources, and continuously improving tooling and processes. • Build internal tools and automation that make it easier to prepare datasets, launch annotation jobs, monitor outputs, and support model development end to end. • Drive larger pipeline projects from start to finish, such as new dataset creation efforts or upgrades to labeling and preprocessing infrastructure. • Work within a Kubernetes-based training infrastructure, ensuring datasets are properly prepared, formatted, and delivered to training clusters. • Profile and optimize research model inference scripts used in preprocessing steps, ensuring that model-driven filtering and transformation stages run within practical time and cost constraints when applied to large-scale raw data.

Job Requirements

  • 3+ years of experience in machine learning, applied ML, data pipelines, or related engineering roles, ideally working on large-scale multimodal, video, or vision-based systems.
  • Strong programming skills in Python and solid experience building reliable data processing and preprocessing pipelines for ML workflows.
  • Hands-on experience preparing training data for ML models, including parsing, filtering, dataset curation, quality control, and large-scale data handling using tools such as AWS S3 and DynamoDB.
  • Familiarity with annotation and labeling workflows, including task design, vendor or crowd-platform orchestration such as MTurk or Prolific, and methods for ensuring label quality.
  • Experience working with Kubernetes for orchestrating distributed workloads, including data preprocessing, pipeline execution, and dataset delivery to training clusters.
  • Comfort working across cloud and on-demand compute environments such as AWS and RunPod, with the ability to port and optimize pipelines across infrastructure.
  • Familiarity with distributed data processing frameworks and experience designing systems that operate reliably at scale across many nodes or workers.
  • Working knowledge of PyTorch and the broader deep learning stack, with the ability to read, debug, and optimize research model inference code for use in production preprocessing pipelines.
  • Ability to work cross-functionally with research and engineering teams and translate experimental ideas into robust, scalable systems.
  • Bachelor's, Master's, or PhD in Computer Science, Machine Learning, Engineering, Mathematics, or a related technical field; experience in generative video, computer vision, or multimodal ML is strongly preferred.
  • Bonus: Experience training, evaluating, or fine-tuning smaller ML models used for classification, filtering, ranking, quality assessment, or other supporting tasks in an ML pipeline.

Benefits

  • Competitive salary and generous company equity
  • Medical, dental, and vision insurance – 99.99% of premiums covered by Cantina
  • 42 days of paid time off, including:
  • 15 PTO days
  • 10 sick days
  • 15 company holidays
  • 2 floating holidays
  • Generous parental leave & fertility support
  • 401(k) retirement savings plan
  • Lifestyle spending account – $500/month to use however you’d like
  • Complimentary lunch and snacks for in-office employees
  • One Medical membership, and more!

Related Job Pages

More Machine Learning Engineer Jobs

Full TimeRemoteTeam 501-1,000

We are looking for a Machine Operator to join our dynamic and fast-growing team and be part of the programming, operating, monthly preventative maintenance, and keeping a clean work area while adhering to all company safety policies. Machine operators will work closely with other members of the shop to ensure the completion of correct and accurate parts What You'll do: - Operate forklifts safely and efficiently in a shop environment - Read and interpret blueprints, drawings, and work instructions accurately - Use measuring tools (tape measures, calipers, etc.) with precision to ensure quality standards - Operate a variety of shop equipment, including waterjet, plasma, laser, press brake, plate roll, shear, mill, and band saw - Perform MIG and/or TIG welding on mild steel to meet project specifications - Maintain a clean, safe, and organized work environment - Assume other duties as assigned The Experience we are looking for: - 5+ Years of experience operating multiple forms of relevant shop equipment - 2+ Years of welding experience preferred (Mig or Tig) - Hands-on experience in a metal fabrication or manufacturing shop environment - Strong ability to read blueprints and use measuring tools accurately Additional Requirements: - Willingness to travel up to 10% across the continental U.S. - High school diploma or GED required - Valid and current driver’s license with a clean driving record - Must successfully complete a background check and pre-employment/random drug tests - Legally authorized to work in the United States Bonus Points for: - Prior experience with waterjet - Aluminum welding experience Why Join RMS Energy: We’re not just another power services company. We’re a tight-knit, mission-driven team that values safety, teamwork, innovation, and continuous growth - Competitive Compensation – Overtime potential and merit-based raises - Flexible Work Environment – Remote work with project-based travel - Full Benefits – Medical, dental, and vision coverage fully paid for employees, starting the month after hire - Steady Employment & Career Growth – Be part of a fast-growing company with promotion potential - 401(k) with Company Match – Traditional & Roth options + free investment guidance - Top-Tier Equipment – Provided to support you in the field - Compensated Travel Time plus Per Diem – Earn while seeing new places - Education Support – Paid training, certifications, and industry memberships - Generous PTO – Paid vacation, holidays, and sick leave - Employee Assistance Program – Legal, financial, and mental wellness support Want to be part of something meaningful? Apply today and join a team where People, Purpose, and Power come together – your future starts here. RMS Energy is an Equal Opportunity Employer. We believe diverse teams drive better outcomes, and we’re committed to creating an inclusive environment where all employees feel valued and empowered. For more information about RMS Energy, please visit www.rmsenergy.com.

United States
Airbnb logo

Senior Machine Learning Engineer, Trust

Airbnb

Airbnb is a community based on connection and belonging.

Full TimeRemoteTeam 5,001-10,000Since 2007H1B Sponsor

• Work with large scale structured and unstructured data, build and continuously improve cutting edge Machine Learning models for Airbnb product, business and operational use cases. • Working together with a wide variety of business functions to stop critical life safety and property damage incidents in real time. • Creating new holistic machine learning model detection strategies by collaborating with other trust and safety prevention teams around the Trust Organization. • Work collaboratively with cross-functional partners including software engineers, product managers, operations and data scientists, identify opportunities for business impact, understand, refine, and prioritize requirements for fraud detection and mitigation. • Hands-on develop, productionize, and operate Machine Learning models and pipelines at scale, including both batch and real-time use cases.

United States
$191K - $223K / year
Job Closed
Torch Technologies, Inc. logo

AI/ML Engineer

Torch Technologies, Inc.

Lighting the pathway of freedom.

Part TimeRemoteTeam 501-1,000Since 2002H1B No Sponsor

• Develop methods for producing synthetic training data to make models robust and generalize-able. • Develop, train and optimize AI/ML models (including large generative models and LLM's) for various applications. • Develop and implement efficient deployment strategies for these models on constrained computing platforms. • Conduct research and contribute to the development and enhancement of our machine learning systems and frameworks. • Manage and monitor the performance of deployed models and adjust as necessary.

Florida
Job Closed
Full TimeRemoteTeam 1,001-5,000Since 2010H1B Sponsor

• Develops software using the KnowBe4 Software Development Lifecycle and Agile Methodologies • Designs, develops, and researches Machine Learning systems • Transforms data science prototypes by applying appropriate Machine Learning algorithms and tools • Performs statistical analysis and using results to improve models • Inference Engineering: Drive the deployment and optimization of both standard predictive models and LLM architectures, balancing trade-offs between low latency, high throughput, and cost-efficiency • Platform Hardening: Transition research prototypes into resilient, production-ready microservices that can handle massive traffic • Lifecycle Orchestration: Execute automated pipelines for data and model versioning, validation, and retraining • Observability: Implement advanced monitoring for model drift, data integrity, and system health to ensure production reliability • Collaborative Standards: Uphold clean code practices, thorough documentation, and participate in rigorous code reviews across the ML and Engineering teams

South Africa
Job Closed