Sully.ai

AI Medical Employees for Healthcare

Applied Research Scientist

Research Scientist | Other | Remote | Senior | Team 11-50 | Since 2023 | H1B No Sponsor

Location

United States

Posted

99 days ago

Salary

0

Seniority

Senior

Job Description

  • Build and scale automated evaluation pipelines (LLM-as-judge + human review) with clinical-grade benchmarks.
  • Audit existing evaluation approaches for clinical and agentic tasks.
  • Define initial benchmarks and build early automated pipelines.
  • Partner with engineering to land the first set of CI gates for accuracy, factuality, and safety.
  • Deliver a repeatable evaluation framework with automated pipelines in production.
  • Demonstrate measurable improvements in robustness, hallucination reduction, or safety.
  • Publish or present internal research findings that directly shape product reliability.

Job Requirements

  • Proven experience designing agentic processes and LLM evaluation/benchmarking frameworks.
  • Strong Python and ML background (PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex).
  • Demonstrated ability to design rigorous experiments and translate findings into production.
  • Track record of published research or deep applied work in LLMs and agent evaluation.
  • Strong communication and technical writing skills to articulate complex findings clearly.

Benefits

  • Speed matters - we operate with urgency, autonomy, and ownership
  • You’ll work on real, first-of-their-kind problems at the edge of AI and medicine
  • Your work helps doctors reclaim their time - and patients get better, faster care

More Research Scientist Jobs

Senior Research Scientist

The Ohio State University

Based in Columbus, Ohio, The Ohio State University (OSU) is one of the largest universities in the nation. OSU annually enrolls more than 60,000 students and has been recognized nu

  • The Senior Research Scientist will provide a review of past research on wide-band microwave radiometric measurements of ice sheets and sea ice.
  • Assist in consolidating and refining Cryorad mission objectives and requirements related to measurements of ice sheet temperature and sea ice thickness and salinity.
  • Expand on the glaciological rationale for these objectives and help illustrate how mission requirements and objectives flow from the scientific goals.
  • Develop case studies for both the ice sheet and sea ice cases, aimed at understanding measurement sensitivities and providing a basis for testing inversion methods.

United States
Job Closed

Led by Michael Antonov, a co-founder of Oculus, and well-funded by Formic Ventures, Deep Origin is poised to reinvent the way scientists work and life science innovations come to life. We see a future largely free of diseases, with a 150-year lifespan being the norm. To get there, we are building an operating system for science, enabling scientists to be more productive and to bring tomorrow's ideas to life quickly and at a reasonable cost. About the role: Deep Origin is seeking a Scientist with strong expertise in small-molecule docking and benchmarking, molecular dynamics (MD) simulations, free energy perturbation (FEP), and machine learning to support a transformative ARPA-H initiative. You'll lead the design of robust simulation workflows and analyze protein-ligand structures across a large target panel to support predictive modeling for therapeutic discovery.

Oregon
Job Closed

Led by Michael Antonov, a co-founder of Oculus, and well-funded by Formic Ventures, Deep Origin is poised to reinvent the way scientists work and life science innovations come to life. We see a future largely free of diseases, with a 150-year lifespan being the norm. To get there, we are building an operating system for science, enabling scientists to be more productive and to bring tomorrow's ideas to life quickly and at a reasonable cost. About the role: Deep Origin is seeking a Scientist with strong expertise in small-molecule docking and benchmarking, molecular dynamics (MD) simulations, free energy perturbation (FEP), and machine learning to support a transformative ARPA-H initiative. You'll lead the design of robust simulation workflows and analyze protein-ligand structures across a large target panel to support predictive modeling for therapeutic discovery.

District of Columbia
Job Closed

Led by Michael Antonov, a co-founder of Oculus, and well-funded by Formic Ventures, Deep Origin is poised to reinvent the way scientists work and life science innovations come to life. We see a future largely free of diseases, with a 150-year lifespan being the norm. To get there, we are building an operating system for science, enabling scientists to be more productive and to bring tomorrow's ideas to life quickly and at a reasonable cost. About the role: Deep Origin is seeking a Scientist with strong expertise in small-molecule docking and benchmarking, molecular dynamics (MD) simulations, free energy perturbation (FEP), and machine learning to support a transformative ARPA-H initiative. You'll lead the design of robust simulation workflows and analyze protein-ligand structures across a large target panel to support predictive modeling for therapeutic discovery.

United States
Job Closed