Job Closed
This listing is no longer active.
Enabling better, smarter, safer healthcare to improve lives.
Data Scientist – Medical Document Analysis
Location
Minnesota
Posted
136 days ago
Salary
$106K - $145.8K / year
Seniority
Senior
Job Description
Data Scientist – Medical Document Analysis
Solventum
• As a Data Scientist specializing in medical document analysis, you will work at the forefront of healthcare NLP. • You will design, implement, and evaluate advanced natural language understanding (NLU) systems that interpret complex clinical text and support data-driven medical decision making. • This includes developing models grounded in transformers, generative AI, and research-oriented neural architectures. • You will collaborate closely with clinical domain experts, fellow data scientists, ML engineers, and product teams to bring research prototypes into production systems. • You will explore new modeling approaches, optimize training pipelines, and contribute to the long-term direction of Solventum’s deep learning portfolio.
Job Requirements
- Master's degree or PhD in computer science, mathematics, or related fields or Bachelor’s degree with at least 5 years of IT experience
- Solid experience in Python especially in deep learning for text analysis and libraries such as PyTorch and Transformers
- Solid grasp of statistics and exploratory data analysis
- US citizenship or permanent resident required
- Experience conducting research-driven NLP/NLU work involving representation learning, attention mechanisms, or hybrid neural architectures.
- Ability to self-organize across multiple technical and business contexts, communicating complex findings with clarity and confidence.
- Experience extracting insights from complex clinical datasets and presenting those insights to varied audiences.
- Familiarity with AWS, GitHub, CI/CD, and scalable ML deployment practices.
- Hands-on experience with LLMs, prompting, fine-tuning, or agentic AI frameworks.
- Experience with ETL of large-scale text using tools such as PySpark, Spark NLP, or distributed data frameworks.
- Exposure to clinical coding systems or medical terminologies (e.g., ICD, CPT, SNOMED) is a plus.
Benefits
- Medical
- Dental & Vision
- Health Savings Accounts
- Health Care & Dependent Care Flexible Spending Accounts
- Disability Benefits
- Life Insurance
- Voluntary Benefits
- Paid Absences
- Retirement Benefits
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
Head of Data – Platform
CanvaFounded in 2012, Canva offers an online graphic design and publishing platform used by millions of people across the globe. As an employer, Canva offers flexibl
• Coaching a team of ~20 data specialists across three teams: Product Platform Data, Infrastructure Data, and Central Data Science • Partnering closely with Platform and Infrastructure Engineering leaders to shape and execute Canva’s Platform and Infrastructure data strategy • Responsible for overseeing the data science roadmap of core data infrastructure such as Canva's Experimentation Platform • Driving technical excellence and championing data-informed decision-making around platform performance, system reliability, and developer productivity • Leading analytics projects that influence how Canva builds scalable systems to serve billions of users • Enabling strategic investments in AI and advanced analytics to improve cost optimization, capacity planning, and system observability • Owning project prioritisation and capacity planning across three highly specialized and interconnected data teams
Head of Data – Platform
CanvaFounded in 2012, Canva offers an online graphic design and publishing platform used by millions of people across the globe. As an employer, Canva offers flexibl
• Coaching a team of ~20 data specialists across three teams: Product Platform Data, Infrastructure Data, and Central Data Science • Partnering closely with Platform and Infrastructure Engineering leaders to shape and execute Canva’s Platform and Infrastructure data strategy • Responsible for overseeing the data science roadmap of core data infrastructure such as Canva's Experimentation Platform • Driving technical excellence and championing data-informed decision-making around platform performance, system reliability, and developer productivity • Leading analytics projects that influence how Canva builds scalable systems to serve billions of users • Enabling strategic investments in AI and advanced analytics to improve cost optimization, capacity planning, and system observability • Owning project prioritisation and capacity planning across three highly specialized and interconnected data teams
Senior Director – Data Science & Analytics
HighLevelThe all-in-one sales & marketing platform that agencies can white-label. CRM, Email, 2-way SMS, Funnel Builder, & more!
• Own HighLevel’s end-to-end data science and product analytics strategy, focused on modeling, experimentation, and insight generation, built on the company’s governed data platform. • Build and lead a global team spanning data science, applied ML, decision science, and product analytics, partnering closely with data engineering and platform teams to ensure scalability and reliability. • Collaborate cross-functionally with Product, Growth, Marketing, and Engineering to ensure experiments, models, and insights directly inform product development, GTM decisions, and customer outcomes. • Leverage the modern data stack (Snowflake, dbt, Atlan, Hex, etc.) to enable advanced analytics, causal inference, and machine learning at scale. • Oversee product analytics, defining how user behavior, engagement, and retention are measured, instrumented, and interpreted. • Build and scale experimentation and A/B testing frameworks, ensuring statistical rigor and consistent methodology across 50+ product and marketing teams. • Establish self-serve experimentation tools and centralized KPI definitions to accelerate data-driven product development. • Partner with product leadership to translate analytics insights into roadmap prioritization, UX improvements, and feature impact assessments. • Design, train, and productionize predictive and prescriptive models that optimize retention, churn, pricing, lead scoring, and campaign automation. • Collaborate with platform teams to build and maintain feature stores, model registries, and evaluation pipelines for reproducibility and compliance. • Integrate machine learning and generative AI into the HighLevel platform to enhance personalization, automation, and user productivity. • Define and monitor model performance metrics (e.g., precision, recall, uplift, business ROI) and ensure continuous retraining and quality control. • Partner with GTM, Finance, and Operations to quantify the impact of models, experiments, and analytics on revenue, efficiency, and customer lifetime value. • Deliver predictive dashboards, simulations, and causal analyses that complement BI reporting and drive strategic decisions. • Build forecasting and optimization systems that connect directly to core business metrics like MRR, churn, LTV/CAC, and NPS. • Provide the analytical backbone for IPO-readiness through measurable, model-driven insights and defensible forecasting. • Define success metrics for all data science and analytics initiatives and track performance against strategic goals. • Collaborate with the data platform organization to ensure model governance, lineage, and data quality are enforced within existing pipelines. • Evangelize statistical literacy, experimental rigor, and causal thinking across all functions to raise decision-making maturity company-wide. • Foster a culture of curiosity, reproducibility, and accountability in every analytics and modeling effort.
• Design and scale backend services that integrate Generative AI and Retrieval-Augmented Generation (RAG) for production use. • Develop AI agents to automate workflows and improve product intelligence. • Build pipelines that transform structured and unstructured data into actionable insights. • Optimize retrieval systems for speed, scalability, and user relevance. • Collaborate with product and engineering teams to deliver AI features that directly improve customer experience.



