Job Closed

This listing is no longer active.

Solventum logo
Solventum

Enabling better, smarter, safer healthcare to improve lives.

Data Scientist – Medical Document Analysis

Data ScientistData ScientistOtherRemoteSeniorTeam 10,001+H1B No SponsorCompany SiteLinkedIn

Location

Minnesota

Posted

136 days ago

Salary

$106K - $145.8K / year

Seniority

Senior

Postgraduate Degree5 yrs expExperience acceptedEnglishAWSETLPySparkPythonPyTorchApache Spark

Job Description

Data Scientist – Medical Document Analysis

Solventum

• As a Data Scientist specializing in medical document analysis, you will work at the forefront of healthcare NLP. • You will design, implement, and evaluate advanced natural language understanding (NLU) systems that interpret complex clinical text and support data-driven medical decision making. • This includes developing models grounded in transformers, generative AI, and research-oriented neural architectures. • You will collaborate closely with clinical domain experts, fellow data scientists, ML engineers, and product teams to bring research prototypes into production systems. • You will explore new modeling approaches, optimize training pipelines, and contribute to the long-term direction of Solventum’s deep learning portfolio.

Job Requirements

  • Master's degree or PhD in computer science, mathematics, or related fields or Bachelor’s degree with at least 5 years of IT experience
  • Solid experience in Python especially in deep learning for text analysis and libraries such as PyTorch and Transformers
  • Solid grasp of statistics and exploratory data analysis
  • US citizenship or permanent resident required
  • Experience conducting research-driven NLP/NLU work involving representation learning, attention mechanisms, or hybrid neural architectures.
  • Ability to self-organize across multiple technical and business contexts, communicating complex findings with clarity and confidence.
  • Experience extracting insights from complex clinical datasets and presenting those insights to varied audiences.
  • Familiarity with AWS, GitHub, CI/CD, and scalable ML deployment practices.
  • Hands-on experience with LLMs, prompting, fine-tuning, or agentic AI frameworks.
  • Experience with ETL of large-scale text using tools such as PySpark, Spark NLP, or distributed data frameworks.
  • Exposure to clinical coding systems or medical terminologies (e.g., ICD, CPT, SNOMED) is a plus.

Benefits

  • Medical
  • Dental & Vision
  • Health Savings Accounts
  • Health Care & Dependent Care Flexible Spending Accounts
  • Disability Benefits
  • Life Insurance
  • Voluntary Benefits
  • Paid Absences
  • Retirement Benefits

Related Categories

Related Job Pages

More Data Scientist Jobs

Canva logo

Head of Data – Platform

Canva

Founded in 2012, Canva offers an online graphic design and publishing platform used by millions of people across the globe. As an employer, Canva offers flexibl

Data Scientist136 days ago

• Coaching a team of ~20 data specialists across three teams: Product Platform Data, Infrastructure Data, and Central Data Science • Partnering closely with Platform and Infrastructure Engineering leaders to shape and execute Canva’s Platform and Infrastructure data strategy • Responsible for overseeing the data science roadmap of core data infrastructure such as Canva's Experimentation Platform • Driving technical excellence and championing data-informed decision-making around platform performance, system reliability, and developer productivity • Leading analytics projects that influence how Canva builds scalable systems to serve billions of users • Enabling strategic investments in AI and advanced analytics to improve cost optimization, capacity planning, and system observability • Owning project prioritisation and capacity planning across three highly specialized and interconnected data teams

Australia
Job Closed
Canva logo

Head of Data – Platform

Canva

Founded in 2012, Canva offers an online graphic design and publishing platform used by millions of people across the globe. As an employer, Canva offers flexibl

Data Scientist136 days ago

• Coaching a team of ~20 data specialists across three teams: Product Platform Data, Infrastructure Data, and Central Data Science • Partnering closely with Platform and Infrastructure Engineering leaders to shape and execute Canva’s Platform and Infrastructure data strategy • Responsible for overseeing the data science roadmap of core data infrastructure such as Canva's Experimentation Platform • Driving technical excellence and championing data-informed decision-making around platform performance, system reliability, and developer productivity • Leading analytics projects that influence how Canva builds scalable systems to serve billions of users • Enabling strategic investments in AI and advanced analytics to improve cost optimization, capacity planning, and system observability • Owning project prioritisation and capacity planning across three highly specialized and interconnected data teams

New Zealand
Job Closed
HighLevel logo

Senior Director – Data Science & Analytics

HighLevel

The all-in-one sales & marketing platform that agencies can white-label. CRM, Email, 2-way SMS, Funnel Builder, & more!

Data Scientist137 days ago
OtherRemoteTeam 201-500Since 2018H1B No Sponsor

• Own HighLevel’s end-to-end data science and product analytics strategy, focused on modeling, experimentation, and insight generation, built on the company’s governed data platform. • Build and lead a global team spanning data science, applied ML, decision science, and product analytics, partnering closely with data engineering and platform teams to ensure scalability and reliability. • Collaborate cross-functionally with Product, Growth, Marketing, and Engineering to ensure experiments, models, and insights directly inform product development, GTM decisions, and customer outcomes. • Leverage the modern data stack (Snowflake, dbt, Atlan, Hex, etc.) to enable advanced analytics, causal inference, and machine learning at scale. • Oversee product analytics, defining how user behavior, engagement, and retention are measured, instrumented, and interpreted. • Build and scale experimentation and A/B testing frameworks, ensuring statistical rigor and consistent methodology across 50+ product and marketing teams. • Establish self-serve experimentation tools and centralized KPI definitions to accelerate data-driven product development. • Partner with product leadership to translate analytics insights into roadmap prioritization, UX improvements, and feature impact assessments. • Design, train, and productionize predictive and prescriptive models that optimize retention, churn, pricing, lead scoring, and campaign automation. • Collaborate with platform teams to build and maintain feature stores, model registries, and evaluation pipelines for reproducibility and compliance. • Integrate machine learning and generative AI into the HighLevel platform to enhance personalization, automation, and user productivity. • Define and monitor model performance metrics (e.g., precision, recall, uplift, business ROI) and ensure continuous retraining and quality control. • Partner with GTM, Finance, and Operations to quantify the impact of models, experiments, and analytics on revenue, efficiency, and customer lifetime value. • Deliver predictive dashboards, simulations, and causal analyses that complement BI reporting and drive strategic decisions. • Build forecasting and optimization systems that connect directly to core business metrics like MRR, churn, LTV/CAC, and NPS. • Provide the analytical backbone for IPO-readiness through measurable, model-driven insights and defensible forecasting. • Define success metrics for all data science and analytics initiatives and track performance against strategic goals. • Collaborate with the data platform organization to ensure model governance, lineage, and data quality are enforced within existing pipelines. • Evangelize statistical literacy, experimental rigor, and causal thinking across all functions to raise decision-making maturity company-wide. • Foster a culture of curiosity, reproducibility, and accountability in every analytics and modeling effort.

Texas
Job Closed
OtherRemoteTeam 501-1,000H1B Sponsor

• Design and scale backend services that integrate Generative AI and Retrieval-Augmented Generation (RAG) for production use. • Develop AI agents to automate workflows and improve product intelligence. • Build pipelines that transform structured and unstructured data into actionable insights. • Optimize retrieval systems for speed, scalability, and user relevance. • Collaborate with product and engineering teams to deliver AI features that directly improve customer experience.

United States
$120K - $160K / year