Job Closed
This listing is no longer active.
Bringing real world currency to the blockchain.
Senior Research Engineer – Multimodal, Video Foundation Model
Location
Germany
Posted
78 days ago
Salary
0
Seniority
Senior
Job Description
Senior Research Engineer – Multimodal, Video Foundation Model
Tether.to
• Pioneer multimodal and video-centric research that moves fast and breaks ground, contributing directly to usable prototypes and scalable systems • Design and implement novel AI architectures for multimodal language models, integrating text, visual, and audio modalities • Engineer scalable training and inference pipelines optimized for large-scale multimodal datasets and distributed GPU systems across thousands of GPUs • Optimize systems and algorithms for efficient data processing, model execution, and pipeline throughput • Build modular tools for preprocessing, analyzing, and managing multimodal data assets (e.g., images, video, text) • Collaborate cross-functionally with research and engineering teams to translate cutting-edge model innovations into production-grade solutions • Prototype generative AI applications showcasing new capabilities of multimodal foundation models in real-world products • Develop benchmarking tools to rigorously evaluate model performance across diverse multimodal tasks
Job Requirements
- Bachelor’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
- Expertise in Python & Pytorch, including practical experience working with the full development pipeline from data processing & data loading to training, inference, and optimization
- Experience working with large-scale text data, or (bonus) interleaved data spanning audio, video, image, and/or text
- Direct hands-on experience in developing or benchmarking at least one of the following topics: LLMs, Vision Language Models, Audio Language Models, generative video models
- First-author publications at leading AI conferences such as CVPR, ICCV, ECCV, ICML, ICLR, NeurIPS etc.
Benefits
- Flexible working arrangements
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More Research Engineer Jobs
• Research and improve open-source video and multimodal video generation foundation models • Focus on one or more areas such as pre-training, supervised fine-tuning, post-training, inference, architecture design, or evaluation • Benchmark models against current state-of-the-art, identify bottlenecks, and propose novel improvements • Work with large-scale video datasets and distributed training systems • Collaborate with researchers and engineers on projects with clear research and publication potential
Senior Research Engineer – Multimodal, Video Foundation Model
Tether.toBringing real world currency to the blockchain.
• Pioneer multimodal and video-centric research that moves fast and breaks ground, contributing directly to usable prototypes and scalable systems. • Design and implement novel AI architectures for multimodal language models, integrating text, visual, and audio modalities. • Engineer scalable training and inference pipelines optimized for large-scale multimodal datasets and distributed GPU systems across thousands of GPUs. • Optimize systems and algorithms for efficient data processing, model execution, and pipeline throughput. • Build modular tools for preprocessing, analyzing, and managing multimodal data assets (e.g., images, video, text). • Collaborate cross-functionally with research and engineering teams to translate cutting-edge model innovations into production-grade solutions. • Prototype generative AI applications showcasing new capabilities of multimodal foundation models in real-world products. • Develop benchmarking tools to rigorously evaluate model performance across diverse multimodal tasks.
Electrodynamics Engineer
MercorCincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.
Role Description - Develop high-quality data by creating challenging problems in: - Advanced Quantum Mechanics - Advanced Electrodynamics - Advanced Classical Mechanics - Evaluate and refine AI model training with rigorous physics expertise. - Collaborate with AI research teams to enhance model outputs and innovation. - Work independently and asynchronously to meet task deadlines. - Contribute to a cutting-edge project involving state-of-the-art large language models. Qualifications - PhD in Physics with specialization in: - Advanced Quantum Mechanics - Advanced Electrodynamics - Advanced Classical Mechanics - Graduate degree from US/UK/Canada/Western Europe. - High attention to detail. - Exceptional written and verbal communication skills. - Excellent proficiency in English. Requirements - Contract position. - Compensation: $70–$90/hour. - Location: Remote. - Commitment: 4–6 tasks/week. Company Description Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
• Work with stakeholders to convert business requirements into technical specifications. • Train LLMs from scratch on a large GPU cluster. • Collect and process pre-training and fine-tuning datasets. • Support and improve existing subsystems.

