Job Closed
This listing is no longer active.
Enterprise-Grade Deepfake Detection Platform
MLOps Engineer
Location
United States
Posted
71 days ago
Salary
$90K - $120K / year
Seniority
Junior
Job Description
MLOps Engineer
Reality Defender (YC W22)
• Work with ML engineers and researchers to ensure delivered datasets are usable and correctly scoped • Build and run batch data generation and preprocessing jobs for image, video, and audio data • Execute preprocessing pipelines using shared batch orchestration tools • Design and run ETL jobs to ingest, transform, and organize data in our warehouse. • Validate input and output datasets (schema, metadata, basic quality checks) • Collect, organize, and deliver processed datasets using established conventions • Support creation of development and prototype datasets ahead of large-scale backfills • Maintain version control of data processing repositories following industry best practices • Debug data pipeline failures, ETL issues, and data quality problems
Job Requirements
- Bachelor's degree in computer science, machine learning, or a related field
- 1+ year of experience working with large datasets in a production environment or academic setting
- Strong command of Python fundamentals and data wrangling (pandas, scikit-learn, matplotlib)
- Basic experience with batch data pipelines and ETL workflows
- Familiarity with cloud object storage (AWS S3 or equivalent) and structured data organization
- Basic understanding of structured data organization and common associated issues
- Ability to follow structured workflows and deliver reproducible results
- Attention to detail and strong ownership of data quality
- Basic experience working with cloud services
Benefits
- Healthcare plans with 100% premium coverage for employees and partial coverage available for dependents
- Dental and Vision plans with 100% premium coverage for employees and their dependents
- Short/Long-term disability and life insurance plans with 100% premium coverage for employees
- FSA/HSA and 401k programs
- Equity compensation
- 20 days of PTO per year
- 12 weeks of Parental Leave
- Learning and Development budget
- Monthly wellness benefits
- Annual company-sponsored offsite
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
• Lead and manage teams of AI & Machine Learning Engineers • Own, execute, and iteratively evolve Zapier’s AI/ML strategy • Coach and guide teams in model development • Collaborate with cross-functional teams for integration of AI capabilities • Foster a culture of innovation • Communicate AI/ML strategy and results clearly • Design and run operating rhythms for AI/ML execution
• Lead the design, development, and deployment of production, multi-turn LLM-powered features, including summarization tools and clinician-facing conversational agents that support follow-up questions and reasoning over clinical context • Own backend services in Python that integrate LLM agents with Fullscript’s platform and support reliable production use • Help define technical direction for prompting, grounding, safety, and orchestration strategies used across clinical AI workflows • Establish and improve evaluation approaches for LLM outputs, including accuracy, hallucinations, edge cases, and overall feature quality • Shape engineering patterns for model-related workflows, including testing, CI/CD, observability, and version control • Partner with medical, product, and engineering teams to identify high-value opportunities for AI and turn them into practical, scalable product capabilities • Work cross-functionally with engineering, analytics, and medical SMEs to refine requirements and ensure data and system design support clinical use cases • Provide technical leadership across projects by creating clarity in ambiguous problem spaces, guiding tradeoff decisions, and raising the quality bar for the team • Stay current with the latest LLM research and emerging AI technologies, and help assess where they can be applied effectively at Fullscript
Senior Machine Learning Engineer, Off-board Perception
Serve RoboticsMeet the future of sustainable, self-driving delivery.
• Design and implement production-grade auto-labeling pipelines that generate 3D and 4D annotations (objects, trajectories, maps) from multi-modal robot data at scale. • Develop data-centric learning workflows that connect auto-label outputs, Serve’s dataset infrastructure, and continuous E2E model training and evaluation pipelines. • Lead initiatives in self-training, weak supervision, and simulation-to-real adaptation to reduce manual labeling dependency and accelerate model iteration cycles. • Collaborate cross-functionally with multiple autonomy teams(e.g ML-infra, mapping, simulation) to align labeling infrastructure with model training and evaluation workflows. • Stay ahead of emerging trends in E2E autonomy and data-centric ML, identifying opportunities to productionize state-of-the-art techniques. • Mentor and support ML engineers and interns in developing robust data-centric practices, from dataset curation and labeling feedback loops to model monitoring and continuous improvement.
• Deployment of ML pipelines in production • Improving and researching algorithms for Video2Haptic generation • Creating algorithms for video metadata extraction and action recognition • Improvement of the algorithms for high-resolution video matting • Improvement of the algorithms of content censuring • Research and development of algorithms for spatiotemporal ROI extraction in videos for short clips and trailer creation • Research and development into the field of volumetric video generation and streaming



