AI Research Engineer, Model Compression, Quantization at Tether.to

Bringing real world currency to the blockchain.

AI Research Scientist55 days ago

Full Time RemoteTeam 11-50Since 2014H1B No Sponsor

View details: AI Research Engineer – Multi-Modal Reinforcement Learning

• Conduct research on reinforcement learning algorithms for multimodal models, • Design and build reinforcement learning infrastructure that supports scalable training, • Develop and refine reward modeling strategies, • Create and curate multimodal simulation environments and datasets, • Analyze and optimize policy performance across modalities, • Investigate and develop next-generation reinforcement learning paradigms, • Publish research findings in top-tier conferences.

PyTorch

Italy

AI Research Engineer, Model Compression, Quantization

Bringing real world currency to the blockchain.

AI Research Scientist55 days ago

Full Time RemoteTeam 11-50Since 2014H1B No Sponsor

View details: AI Research Engineer, Model Compression, Quantization

• Drive innovation in model compression and efficient deployment for advanced multimodal AI systems, including large language models (LLMs) and vision-language models (VLMs). • Reduce model footprint and computational cost while preserving accuracy, enabling high-performance AI to run efficiently across resource-constrained edge devices. • Apply and advance compression techniques such as quantization, knowledge distillation, and pruning. • Build robust compression pipelines, establish performance and fidelity metrics, and address bottlenecks in production inference.

PyTorch

Italy

AI Research Engineer – Multi-Modal, Vision

Bringing real world currency to the blockchain.

AI Research Scientist55 days ago

Full Time RemoteTeam 11-50Since 2014H1B No Sponsor

View details: AI Research Engineer – Multi-Modal, Vision

• Conduct end-to-end research and engineering on vision-language models, covering training, evaluation, and optimization across the full model development lifecycle. • Design and implement post-training pipelines including supervised fine-tuning, knowledge distillation, and reinforcement learning from human feedback. • Develop and maintain high-quality multimodal datasets, including data curation, filtering, and balancing for domain-specific tasks. • Drive model efficiency and deployability, adapting models for resource-constrained environments using compression and optimization techniques. • Design and implement evaluation frameworks and benchmarks to measure model performance, robustness, and real-world task success. • Build and scale training workflows across distributed GPU infrastructure. • Identify and resolve bottlenecks in training pipelines to achieve state-of-the-art model quality on target benchmarks. • Contribute to and leverage open-source ecosystems including models, datasets, and tooling to accelerate development. • Stay current with the latest research in multimodal learning and vision-language systems, translating relevant findings into practical improvements. • Publish research findings in top-tier AI conferences and journals where applicable.

Italy

AI Research Engineer – Agentic Post-training

Bringing real world currency to the blockchain.

AI Research Scientist55 days ago

Full Time RemoteTeam 11-50Since 2014H1B No Sponsor

View details: AI Research Engineer – Agentic Post-training

• Conduct end-to-end research and engineering initiatives to advance post-training of agentic and tool-use models to achieve SOTA results. • Drive broad, cross-cutting model improvements, including factuality, instruction adherence, tool/function use, multi-agent coordination, and reasoning calibration. • Design and enhance large-scale post-training systems, including data pipelines, training workflows, evaluation frameworks, and benchmark infrastructure. • Develop rigorous evaluation suites and diagnostic tools to assess model readiness for deployment. • Strengthen feedback loops from real-world product usage, incorporating both explicit and implicit user signals into post-training. • Collaborate with tooling, product, and training teams to improve the usefulness, reliability, and agentic capabilities of frontier models. • Closely liaise with research, engineering and cross-functional teams to determine which integrations are production-ready for inclusion in major model releases.

Node.js

Worldwide