Research Scientist - Decentralized LLM Pre-Training

Location

United States + 171 moreAll locations: United States | Canada | Brazil | Colombia | Argentina | Chile | Venezuela | Bolivia | Ecuador | French Guiana | Guyana | Paraguay | Peru | Suriname | Uruguay | Mexico | Costa Rica | El Salvador | Guatemala | Honduras | Nicaragua | Panama | Dominican Republic | Puerto Rico | Bahamas | Guadeloupe | Haiti | Jamaica | Martinique | Montserrat | United Kingdom | Germany | France | Estonia | Portugal | Hungary | Poland | Ukraine | Romania | Bulgaria | Czechia | Slovakia | Belarus | Moldova | Sweden | Greece | Belgium | Italy | Ireland | Switzerland | Netherlands | Finland | Malta | Denmark | Lithuania | Croatia | Spain | Austria | Bosnia And Herzegovina | Iceland | Luxembourg | North Macedonia | Montenegro | Norway | Serbia | Slovenia | Albania | Cyprus | Latvia | Monaco | South Africa | Egypt | Algeria | Angola | Benin | Botswana | Burkina Faso | Burundi | Cameroon | Cabo Verde | Central African Republic | Chad | Congo | Côte D'ivoire | Democratic Republic of the Congo | Equatorial Guinea | Eritrea | Ethiopia | Gabon | Gambia | Ghana | Guinea | Guinea-bissau | Kenya | Lesotho | Liberia | Libya | Madagascar | Malawi | Mali | Mauritania | Mauritius | Mayotte | Morocco | Mozambique | Namibia | Niger | Nigeria | Réunion | Rwanda | Senegal | Seychelles | Sierra Leone | Somalia | Sudan | Eswatini | Tanzania | Togo | Tunisia | Uganda | Zambia | Zimbabwe | Georgia | Turkey | Israel | United Arab Emirates | Armenia | Azerbaijan | Bahrain | Iraq | Jordan | Kuwait | Lebanon | Oman | Qatar | Saudi Arabia | Palestine | Yemen | India | Japan | Philippines | Pakistan | Thailand | Singapore | Vietnam | Taiwan | Indonesia | Cambodia | Laos | Malaysia | Myanmar | South Korea | China | Afghanistan | Bangladesh | Bhutan | Kazakhstan | Kyrgyzstan | Maldives | Mongolia | Nepal | Sri Lanka | Tajikistan | Turkmenistan | Uzbekistan | Australia | Papua New Guinea | Kiribati | Palau | French Polynesia | Tuvalu | New Zealand

Posted

87 days ago

Salary

0

Seniority

Mid Level

Job Description

Research Scientist - Decentralized LLM Pre-Training

Rethink recruit

Role Description Templar is looking for a Research Scientist to perform novel research in decentralized LLM pre-training. You will develop and evaluate ideas for scaling large-scale pre-training in bandwidth-constrained, heterogeneous, and error-prone environments — the exact conditions that define real-world decentralized infrastructure. This is a research role with direct production impact. Your work will be ported to Templar's pre-training platform and published at major venues. If you want your research to advance the field and ship to a real system, this is that role. What You'll Do - Perform novel research in decentralized training of LLMs with a focus on pre-training - Develop, implement, and evaluate ideas for scaling large-scale pre-training in bandwidth-constrained, heterogeneous, and error-prone environments - Contribute to porting methodology to Templar's pre-training platform - Publish and present work at major conferences Qualifications - PhD in Machine Learning or equivalent research experience - Publications at top-tier venues such as NeurIPS, ICML, ICLR, CVPR, COLM, or EMNLP - Strong programming skills in PyTorch or JAX, with experience training models across multiple devices Requirements - Experience with distributed training or federated learning - Familiarity with efficient LLM techniques including quantization, optimization, inference, or architecture design - Background in optimization theory

Job Requirements

  • PhD in Machine Learning or equivalent research experience
  • Publications at top-tier venues such as NeurIPS, ICML, ICLR, CVPR, COLM, or EMNLP
  • Strong programming skills in PyTorch or JAX, with experience training models across multiple devices
  • Experience with distributed training or federated learning
  • Familiarity with efficient LLM techniques including quantization, optimization, inference, or architecture design
  • Background in optimization theory

Related Job Pages

More AI Research Scientist Jobs

Toloka Annotators logo

Freelance Annotator (English) - AI Trainer

Toloka Annotators

Be a key player in crafting the high-quality data essential for AI innovation. Perfect for aspiring freelancers

OtherRemoteTeam 51-200

Please submit your resume in English and indicate your level of English. At Toloka, we connect smart, curious people from around the world with freelance online tasks that train and improve artificial intelligence. What we do The Toloka Annotators connects individuals with Generative AI projects from leading tech innovators. Our mission is to unlock the full potential of AI by involving real people from around the world in the development process. About the Role Annotation is what helps AI make sense of the world. As an annotator, you may be invited to take part in online projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses - when projects are available. Responsibilities: - Carefully review provided data (text, images, or videos) - Label or classify content based on project guidelines - Identify and flag factually incorrect, sensitive, inappropriate, or unclear material Important note: This is project-based work. Tasks are available only when projects are active. You may be invited to one or more projects depending on your profile and current opportunities. Each project has its own compensation level based on scope and expertise required. On this project, AI trainers earn up to $23 per hour equivalent.

Minnesota
Job Closed
Mercor logo

Physics Expert

Mercor

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

OtherRemoteH1B No Sponsor

Role Description - Develop high-quality data sets by creating challenging problems in General Relativity, Astrophysics, and Cosmology. - Evaluate and improve AI model performance using rigorous physics expertise. - Collaborate with AI research teams to enhance the quality of training data. - Work independently and asynchronously to meet deadlines and contribute effectively. - Follow complex instructions to ensure the accuracy and depth of AI-generated content. Qualifications - PhD in Physics with specialization in General Relativity, Astrophysics, or Cosmology. - Graduate degree from US/UK/Canada/Western Europe. - High attention to detail. - Exceptional written and verbal communication skills. - Excellent proficiency in English. Requirements - Contract position. - Compensation: $70–$90/hour. - Location: Remote. - Commitment: 4–6 tasks/week. Company Description

United States + 10 moreAll locations: United States | United Kingdom | Canada | Germany | France | Belgium | Switzerland | Netherlands | Austria | Luxembourg | Monaco
$70 - $90 / hour
Job Closed
Mercor logo

Chemistry AI Evaluator

Mercor

Cincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.

OtherRemoteH1B No Sponsor

Role Description - Write and refine prompts to guide model behavior in chemistry contexts. - Evaluate LLM-generated responses to chemistry-related queries for scientific accuracy and quantitative reasoning. - Conduct fact-checking using authoritative public sources and domain knowledge. - Annotate model responses by identifying strengths, areas of improvement, and inaccuracies. - Assess clarity, structure, and appropriateness of explanations for different audiences. - Apply consistent evaluation standards by following clear taxonomies and benchmarks. Qualifications - PhD in Chemistry or a closely related field - Deep expertise in Organic & Biological Chemistry, Inorganic & Materials Chemistry, Physical & Theoretical Chemistry, or Analytical & Instrumental Chemistry - Significant experience using large language models (LLMs) - Excellent writing skills and strong attention to detail - Experience reviewing or editing technical or academic writing - Prior experience with RLHF, model evaluation, or data annotation work (Preferred) - Experience teaching or explaining chemistry concepts to non-expert audiences (Preferred) - Familiarity with evaluation rubrics or structured review frameworks (Preferred) Company Description Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

United States
$73 / hour
Job Closed
Ensemble Health Partners logo

Lead, AI Research Engineer

Ensemble Health Partners

Ensemble Health Partners is a hospital and healthcare company that partners with client hospitals to help them develop processes, train teams, reach their finan

Role Description The Lead AI Research Scientist designs and analyzes machine learning experiments and develops and validates new AI Techniques, and collaborates with cross-functional teams to apply research to real use cases. This role supports model optimization, creates reusable research assets, and contributes to advancing the Company’s AI initiative. - Design, execute, and analyze machine learning experiments, establishing strong baselines and selecting appropriate evaluation metrics. - Stay up to date with the latest AI research; identify, adapt, and validate novel techniques for company-specific use cases. - Define rigorous evaluation protocols, including offline metrics, user studies, and adversarial (red team) testing to ensure statistical soundness. - Specify data and annotation requirements; develop annotation guidelines and oversee quality control processes. - Collaborate closely with domain experts, product managers, and engineering teams to refine problem statements and operational constraints. - Develop reusable research assets such as datasets, modular code components, evaluation suites, and comprehensive documentation. - Work alongside ML Engineers to optimize training and inference pipelines, ensuring seamless integration into production systems. - Contribute to academic publications and represent the company in research communities, as needed. Qualifications - Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a related field is strongly preferred. - Candidates with a master’s degree and exceptional research or industry experience will also be considered. - 5–7 years of experience in AI/ML research roles, ideally in applied or product-focused environments. - Demonstrated success in delivering research-driven solutions that have been deployed in production. - Experience collaborating in cross-functional teams across research, engineering, and product. - Publications in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ACL, CVPR) are a plus. - Strong foundational knowledge in machine learning and deep learning algorithms. - Hands-on experience with PEFT/LoRA, adapters, fine-tuning techniques, and RLHF/RLAIF (e.g., PPO, DPO, GRPO). - Ability to read, implement, and adapt state-of-the-art research papers to real-world use cases. - Proficiency in hypothesis-driven experimentation, ablation studies, and statistically sound evaluations. - Advanced programming skills in Python (preferred), C++, or Java. - Experience with deep learning frameworks such as PyTorch, Hugging Face, NumPy, etc. - Strong mathematical foundations in probability, linear algebra, and calculus. - Domain expertise in one or more areas: natural language processing (NLP), symbolic reasoning, speech processing, etc. - Ability to translate research insights into roadmaps, technical specifications, and product improvements. Requirements - 5 to 7 Years of relevant work experience. Benefits - Comprehensive benefits package designed to support the physical, emotional, and financial health of you and your family, including healthcare, time off, retirement, and well-being programs. - Professional development opportunities, including earning a professional certification relevant to your field and tuition reimbursement. - Quarterly and annual incentive programs for all employees who exceed expectations.

United States
Job Closed