Brahma logo
Brahma

The only account you'll ever need to secure, transact, and explore onchain like never before.

Machine Learning Researcher – Speech/Audio

AI Research ScientistMachine Learning EngineerFull TimeRemoteSeniorTeam 11-50Since 2022H1B No SponsorCompany SiteLinkedIn

Location

United Kingdom

Posted

4 days ago

Salary

0

Seniority

Senior

Postgraduate DegreeEnglishPythonPyTorch

Job Description

Machine Learning Researcher – Speech/Audio

Brahma

• Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech). • Build and fine-tune models using frameworks like PyTorch and HuggingFace. • Design training pipelines and datasets for scalable voice model training. • Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation. • Work closely with product and creative teams to ensure models meet quality and production constraints. • Stay on top of academic and industrial trends in speech synthesis and related fields.

Job Requirements

  • Strong background in machine learning and deep learning, with focus on speech/audio.
  • Hands-on experience with TTS, voice cloning, or related voice synthesis tasks.
  • Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar.
  • Experience training models at scale and working with large audio datasets.
  • Familiarity with vocoders and transformer-based architectures.
  • Strong problem-solving skills, ability to work autonomously in a remote-first environment.
  • PhD degree in Computer Science/ Machine Learning and publications in top venues (Nice to Have).
  • Contributions to open-source speech research or participation in relevant benchmarks (Nice to Have).
  • Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control (Nice to Have).
  • Experience with voice datasets or proprietary pipelines (Nice to Have).

Related Job Pages

More AI Research Scientist Jobs

Cerence Inc. logo

Senior AI Research Scientist

Cerence Inc.

Cerence is the global industry leader in creating AI-powered user experiences for automotive and transportation.

Full TimeRemoteTeam 1,001-5,000Since 2019H1B No Sponsor

• Finetune LLMs using RL and other SOTA methods • Create conversational workflows involving tool-calls and static knowledge • Work with agent teams to build business logic into conversation • Employ multimodals to enrich the conversational UX • Innovate and build novel methods for faster inference

United States
Tether.to logo

AI Research Engineer, Model Compression, Quantization

Tether.to

Bringing real world currency to the blockchain.

Full TimeRemoteTeam 11-50Since 2014H1B No Sponsor

• Apply low-bit quantization to reduce model size and inference latency for generative AI models (LLMs, VLMs, multimodal) while maintaining accuracy and output quality • Leverage knowledge distillation to transfer capabilities from larger teacher models to smaller student models, enabling efficient multimodal reasoning across text, image, and audio inputs • Implement pruning techniques to remove redundant parameters and attention heads, reducing computational overhead without sacrificing task performance • Analyze trade-offs between model efficiency (size, latency, memory) and accuracy across quantization, distillation, and pruning methods; propose improvements based on empirical findings • Research and apply mixed-precision quantization and other advanced compression strategies (e.g., adaptive pruning schedules, distillation with intermediate feature matching) to optimize the accuracy–performance balance • Stay current with the latest research in model compression, including emerging techniques for multimodal and generative architectures • Document methodologies, experiments, and results clearly to support reproducibility, internal collaboration, and stakeholder communication • Author technical papers and publish findings in top-tier conferences (e.g., NeurIPS, ICML, ICLR, CVPR, ACL, AAAI) to advance the field of model compression for multimodal AI.

Brazil
Full TimeRemoteTeam 11-50Since 2014H1B No Sponsor

• Drive innovation in model compression and efficient deployment for advanced multimodal AI systems, including large language models (LLMs) and vision-language models (VLMs). • Focus on reducing model footprint and computational cost while preserving accuracy. • Apply and advance compression techniques such as quantization, knowledge distillation, and pruning. • Develop, test, and implement novel compression strategies that balance model size, latency, throughput, and accuracy. • Build robust compression pipelines, establishing performance and fidelity metrics, and addressing bottlenecks in production inference.

Switzerland
Tether.to logo

AI Research Engineer – Kernel, Inference Optimization

Tether.to

Bringing real world currency to the blockchain.

Full TimeRemoteTeam 11-50Since 2014H1B No Sponsor

• Drive innovation in model serving and inference architectures for advanced AI systems • Focus on optimizing model deployment and inference strategies to deliver highly responsive, efficient, and scalable performance across real-world applications • Work on a wide spectrum of systems, ranging from resource-efficient models designed for limited hardware environments to complex, multi-modal architectures that integrate data such as text, images, and audio • Adopt a hands-on, research-driven approach to develop, test, and implement novel serving strategies and inference algorithms • Engineer robust inference pipelines, establishing comprehensive performance metrics, and identifying and resolving bottlenecks in production environments • Enable high-throughput, low-latency, low-memory footprint, and scalable AI performance that delivers tangible value in dynamic, real-world scenarios

Switzerland