CVS Health

Bringing our heart to every moment of your health.

Senior Data Scientist - Clinical Informatics (Analytics Enablement)

Data Scientist | Full Time | Remote | Senior | Team 10,001+ | Since 1963 | H1B No Sponsor

Location

United States

Posted

46 days ago

Salary

$101K - $203K / year

Seniority

Senior

Job Description

We’re building a world of health around every individual, shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold themselves accountable and prioritize safety and quality in everything they do. Join us and be part of something bigger: helping to simplify health care one person, one family and one community at a time.

Position Summary

CVS Health's Analytics & Behavior Change (A&BC) team is an organization working to solve some of the most challenging problems at the intersection of technology and healthcare. A&BC leverages advanced analytics, clinical informatics, and hypothesis-driven approaches to transform data into actionable, customer-centric insights that drive growth, improve health outcomes, and expand access to healthcare across all CVS Health businesses. Our teams build next-generation data and AI products that help power CVS Health to make healthier happen for 100+ million customers.

The A&BC organization is looking to grow its Clinical Data Science & AI team. Join us as we embark on an exciting journey to drive a transformational shift in how CVS Health leverages clinical data and analytics to become the leader in consumer healthcare in the U.S.

As a Senior Data Scientist - Clinical Informatics (Analytics Enablement), you are tasked with activating CVS Health's clinical data repository to improve outcomes across multiple lines of business and use cases. You will serve as a bridge between clinical data assets and the analysts, data scientists, and business partners who consume them, ensuring data is accessible, well-documented, fit for purpose, and aligned with clinical and regulatory standards.

You will:

- Serve as a subject matter expert in clinical data, including CCD data, claims, pharmacy, lab results, and clinical documentation, with deep understanding of how to structure and apply this data to solve healthcare problems.
- Design and maintain clinical data models, taxonomies, and classification frameworks that enable consistent interpretation and use of clinical data across the organization.
- Build the clinical data feature store, establishing standards, documentation, and best practices that accelerate adoption of clinical data for downstream analytics, reporting, and AI/ML use cases.
- Develop analytics by building well-documented, validated, and reusable data assets (tables, views, features) that empower analysts and data scientists to work independently with clinical data.
- Create and maintain comprehensive data documentation, including data dictionaries, lineage, business logic, known limitations, and appropriate use guidelines for clinical datasets.
- Build queries, dashboards, and data visualizations to effectively communicate data quality metrics, data availability, and clinical insights to technical and non-technical stakeholders.
- Partner with clinical, operational, and business stakeholders to understand their data needs, translate requirements into data solutions, and ensure clinical data assets meet their analytical objectives.
- Maintain data quality frameworks for clinical data, including validation rules, anomaly detection, and monitoring processes to ensure data integrity and reliability.
- Translate clinical concepts into analytical frameworks, ensuring that business partners understand the capabilities and limitations of available clinical data.
- Collaborate with data engineering teams to inform data pipeline development, ensuring clinical data is ingested, transformed, and stored in ways that support downstream analytics needs.
- Contribute to data governance initiatives, including compliance with HIPAA, data privacy regulations, and internal data stewardship policies.
- Develop and deliver training, presentations, and consultations to existing and prospective data consumers on clinical data assets, appropriate use, and analytics opportunities.
- Stay current with clinical data standards (HL7, FHIR, ICD-10, SNOMED-CT, LOINC, CPT, NDC, RxNorm) and industry best practices in clinical informatics.

Required Qualifications

- 4+ years of relevant experience in clinical informatics, healthcare analytics, or clinical data management.
- Expertise in clinical data types and structures, including CCD data, lab results, clinical notes, and administrative healthcare data.
- Strong knowledge of clinical coding systems and terminologies, such as ICD-10, CPT, HCPCS, SNOMED-CT, LOINC, NDC, and RxNorm.
- Experience designing and documenting data models, taxonomies, or classification frameworks for clinical or healthcare data.
- Proven ability to enable and support downstream data consumers (analysts, data scientists, business users) through documentation, training, and consultative support.
- Proficiency with SQL and experience working with large-scale healthcare datasets.
- Experience using cloud-based data platforms, preferably Google Cloud Platform (GCP) tools including BigQuery, for querying, transforming, and managing data.
- Strong understanding of data quality principles, including validation, profiling, and monitoring of healthcare data.
- Excellent written and verbal communication skills, including the ability to explain complex clinical data concepts to both technical and non-technical audiences.

Preferred Qualifications

- Proven experience integrating clinical (CCD/OMOP/FHIR) and administrative (claims) data into unified, patient-centric data models, with deep understanding of the strengths, limitations, and complementary nature of each data type.
- Experience with patient data normalization and standardization for patient attributes and cross-source harmonization.
- Hands-on experience reconciling clinical and claims data, including diagnosis alignment, medication reconciliation (prescribed vs. dispensed), and encounter/visit matching.
- Experience integrating third-party and enrichment data sources, including SDOH indices (ADI, SVI), consumer/demographic data, mortality data, and provider reference data into patient-level datasets.
- Expert knowledge of clinical and administrative coding systems, including ICD-10-CM/PCS, CPT/HCPCS, SNOMED-CT, RxNorm, NDC, LOINC, and NPI.
- Experience with classification and grouping systems such as HCC, CCS, DRG, and therapeutic class hierarchies.
- Experience designing patient-centric data models, feature stores, and dashboards that aggregate longitudinal data across sources, including demographics, encounters, conditions, medications, labs, utilization, cost, and enrichment attributes.
- Proven ability to enable downstream data consumers through analytics and well-documented, validated, and reusable data assets, with experience creating data dictionaries, lineage documentation, and self-service analytics layers.
- Understanding of healthcare business contexts such as care management, value-based care, quality measurement (HEDIS, Stars), and population health.

Education

- Bachelor’s degree in Health Informatics, Public Health, Nursing, Health Information Management, Computer Science, Statistics, or a related quantitative or clinical field, or an equivalent combination of formal education and experience.
- Master's degree or higher in Health Informatics, Biomedical Informatics, Clinical Informatics, Public Health, Epidemiology, or a related field is strongly preferred.
- Clinical background (RN, PharmD, MD, or similar) with transition into informatics/analytics is highly valued.

Anticipated Weekly Hours

40

Time Type

Full time

Pay Range

The typical pay range for this role is: $101,970.00 - $203,940.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.

Great benefits for great people

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full-time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well-being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments.

We anticipate the application window for this opening will close on: 06/29/2026

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

Related Categories

Related Job Pages

More Data Scientist Jobs

Full TimeRemoteTeam 10,001+Since 1963H1B No Sponsor

We’re building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time. Position Summary CVS Health's Analytics & Behavior Change (A&BC) team is an organization working to solve some of the most challenging problems at the intersection of technology and healthcare. A&BC leverages advanced analytics, clinical informatics, and hypothesis-driven approaches to transform data into actionable, customer-centric insights that drive growth, improve health outcomes, and expand access to healthcare across all CVS Health businesses. Our teams build next-generation data and AI products that help power CVS Health to make healthier happen for 100+ million customers. The A&BC organization is looking to grow its Clinical Data Science & AI team. Join us as we embark on an exciting journey to drive a transformational shift in how CVS Health leverages clinical data and analytics to become the leader in consumer healthcare in the U.S. As a Lead Data Scientist - Clinical Informatics (Clinical Data Standards), you are tasked with activating CVS Health's clinical data repository to improve outcomes across multiple lines of business and use cases. You will serve as a bridge between clinical data assets and the analysts, data scientists, and business partners who consume them—ensuring data is accessible, well-documented, fit for purpose, and aligned with clinical and regulatory standards. You will: - Serve as a subject matter expert in clinical data, including CCD data, with deep understanding of how to structure and apply this data to solve healthcare problems. 
- Design and maintain clinical data models, taxonomies, and classification frameworks that enable consistent interpretation and use of clinical data across the organization. - Develop and govern the clinical data feature store, establishing standards, documentation, and best practices that accelerate adoption of clinical data for downstream analytics, reporting, and AI/ML use cases. - Enable self-service analytics by building well-documented, validated, and reusable data assets (tables, views, features) that empower analysts and data scientists to work independently with clinical data. - Create and maintain comprehensive data documentation, including data dictionaries, lineage, business logic, known limitations, and appropriate use guidelines for clinical datasets. - Build queries, dashboards, and data visualizations to effectively communicate data quality metrics, data availability, and clinical insights to technical and non-technical stakeholders. - Partner with clinical, operational, and business stakeholders to understand their data needs, translate requirements into data solutions, and ensure clinical data assets meet their analytical objectives. - Lead and mentor data scientists, data analysts, and data engineers, providing guidance on clinical data interpretation, appropriate use, and best practices for working with healthcare data. - Establish data quality frameworks for clinical data, including validation rules, anomaly detection, and monitoring processes to ensure data integrity and reliability. - Translate clinical concepts into analytical frameworks, ensuring that business partners understand the capabilities and limitations of available clinical data. - Collaborate with data engineering teams to inform data pipeline development, ensuring clinical data is ingested, transformed, and stored in ways that support downstream analytics needs. 
- Contribute to data governance initiatives, including compliance with HIPAA, data privacy regulations, and internal data stewardship policies. - Develop and deliver training, presentations, and consultations to existing and prospective data consumers on clinical data assets, appropriate use, and analytics opportunities. - Stay current with clinical data standards (HL7, FHIR, ICD-10, SNOMED-CT, LOINC, CPT, NDC, RxNorm) and industry best practices in clinical informatics. Required Qualifications - 7+ years of relevant experience in clinical informatics, healthcare analytics, or clinical data management. - Deep expertise in clinical data types and structures, including CCD data, lab results, clinical notes, and administrative healthcare data. - Strong knowledge of clinical coding systems and terminologies, such as ICD-10, CPT, HCPCS, SNOMED-CT, LOINC, NDC, and RxNorm. - Experience designing and documenting data models, taxonomies, or classification frameworks for clinical or healthcare data. - Proven ability to enable and support downstream data consumers (analysts, data scientists, business users) through documentation, training, and consultative support. - Experience leading cross-functional projects from concept to delivery by coordinating across clinical, technical, and business stakeholders. - Proficiency with SQL and experience working with large-scale healthcare datasets. - Experience using cloud-based data platforms, preferably Google Cloud Platform (GCP) tools including BigQuery, for querying, transforming, and managing data. - Strong understanding of data quality principles, including validation, profiling, and monitoring of healthcare data. - Excellent written and verbal communication skills, including the ability to explain complex clinical data concepts to both technical and non-technical audiences. - Ability to anticipate and resolve roadblocks throughout a project lifecycle, balancing competing priorities across multiple stakeholders. 
Preferred Qualifications - Healthcare data platform experience with strong understanding of interoperability standards and harmonization at scale (OMOP/CCDA/FHIR) - Familiarity with clinical workflows and HIEs. - Experience using standardized clinical code systems (e.g., ICD-10, SNOMED CT, LOINC, RxNorm, UMLS) and their application within common data models (e.g., OMOP). - Experience in ETL design & implementation from heterogeneous clinical sources into different data standards preferably into the OMOP CDM. - Experience designing and implementing data quality frameworks, preferred to have experience with tools like Achilles, Data Quality Dashboard (DQDB), or equivalent custom frameworks. - Privacy, security, and compliance: HIPAA/HITRUST experience, de-identification/tokenization, PHI handling, and data access controls (column-level, row-level security). Education - Bachelor’s degree in health informatics, Public Health, Nursing, Health Information Management, Computer Science, Statistics, or a related quantitative or clinical field—or an equivalent combination of formal education and experience. - Master's degree or higher in Health Informatics, Biomedical Informatics, Clinical Informatics, Public Health, Epidemiology, or a related field is strongly preferred. - Clinical background (RN, PharmD, MD, or similar) with transition into informatics/analytics is highly valued. Pay Range The typical pay range for this role is: $130,295.00 - $260,590.00 This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program. 
Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong. Great benefits for great people We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments. We anticipate the application window for this opening will close on: 06/29/2026 Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

United States
$130K - $260K / year
Data Scientist46 days ago
Full TimeRemoteTeam 10,001+Since 1963H1B No Sponsor

We’re building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time. Position Summary CVS Health's Analytics & Behavior Change (A&BC) team is an organization working to solve some of the most challenging problems at the intersection of technology and healthcare. A&BC leverages advanced analytics, clinical informatics, and hypothesis-driven approaches to transform data into actionable, customer-centric insights that drive growth, improve health outcomes, and expand access to healthcare across all CVS Health businesses. Our teams build next-generation data and AI products that help power CVS Health to make healthier happen for 100+ million customers. The A&BC organization is looking to grow its Clinical Data Science & AI team. Join us as we embark on an exciting journey to drive a transformational shift in how CVS Health leverages clinical data and analytics to become the leader in consumer healthcare in the U.S. As a Lead Data Scientist - Clinical Informatics (Claims Specialization), you are tasked with activating CVS Health's clinical data repository to improve outcomes across multiple lines of business and use cases. You will serve as a bridge between clinical data assets and the analysts, data scientists, and business partners who consume them—ensuring data is accessible, well-documented, fit for purpose, and aligned with clinical and regulatory standards. You will: - Serve as a subject matter expert in clinical data, including claims, pharmacy, lab results, and clinical documentation, with deep understanding of how to structure and apply this data to solve healthcare problems. 
- Design and maintain clinical data models, taxonomies, and classification frameworks that enable consistent interpretation and use of clinical data across the organization. - Develop and govern the claims data feature store, establishing standards, documentation, and best practices that accelerate adoption of clinical data for downstream analytics, reporting, and AI/ML use cases. - Enable self-service analytics by building well-documented, validated, and reusable data assets (tables, views, features) that empower analysts and data scientists to work independently with clinical data. - Create and maintain comprehensive data documentation, including data dictionaries, lineage, business logic, known limitations, and appropriate use guidelines for clinical datasets. - Partner with clinical, operational, and business stakeholders to understand their data needs, translate requirements into data solutions, and ensure clinical data assets meet their analytical objectives. - Lead and mentor data scientists, data analysts, and data engineers, providing guidance on clinical data interpretation, appropriate use, and best practices for working with healthcare data. - Establish data quality frameworks for clinical data, including validation rules, anomaly detection, and monitoring processes to ensure data integrity and reliability. - Translate clinical concepts into analytical frameworks, ensuring that business partners understand the capabilities and limitations of available clinical data. - Collaborate with data engineering teams to inform data pipeline development, ensuring clinical data is ingested, transformed, and stored in ways that support downstream analytics needs. - Contribute to data governance initiatives, including compliance with HIPAA, data privacy regulations, and internal data stewardship policies. 
- Develop and deliver training, presentations, and consultations to existing and prospective data consumers on clinical data assets, appropriate use, and analytics opportunities. - Stay current with clinical data standards (HL7, FHIR, ICD-10, SNOMED-CT, LOINC, CPT, NDC, RxNorm) and industry best practices in clinical informatics. Required Qualifications - 7+ years of relevant experience in clinical informatics, healthcare analytics, or clinical data management. - Deep expertise in clinical data types and structures, including medical claims, pharmacy claims, lab results, clinical notes, and administrative healthcare data. - Knowledge of clinical coding systems and terminologies, such as ICD-10, CPT, HCPCS, SNOMED-CT, LOINC, NDC, and RxNorm. - Experience designing and documenting data models, taxonomies, or classification frameworks for clinical or healthcare data. - Proven ability to enable and support downstream data consumers (analysts, data scientists, business users) through documentation, training, and consultative support. - Experience leading cross-functional projects from concept to delivery by coordinating across clinical, technical, and business stakeholders. - Proficiency with SQL and experience working with large-scale healthcare datasets. - Experience using cloud-based data platforms, preferably Google Cloud Platform (GCP) tools including BigQuery, for querying, transforming, and managing data. - Strong understanding of data quality principles, including validation, profiling, and monitoring of healthcare data. - Excellent written and verbal communication skills, including the ability to explain complex clinical data concepts to both technical and non-technical audiences. - Ability to anticipate and resolve roadblocks throughout a project lifecycle, balancing competing priorities across multiple stakeholders. 
Preferred Qualifications - Strong experience with medical claims (professional and institutional), pharmacy claims, and eligibility/enrollment data, including understanding of adjudication, adjustments, and claims completeness considerations. - Familiarity with claims-based analytics, including total cost of care, utilization metrics, risk adjustment (HCC), and episode groupers. - Strong understanding of interoperability and large‑scale data harmonization across administrative sources (e.g., medical & pharmacy claims, enrollment/eligibility files, provider files) and across common standards such as X12, NCPDP, FHIR, and OMOP. - Expertise in claims lifecycle and payer workflows, including claim submission, adjudication, pricing, remittance, utilization management, and benefits configuration. - Experience working with standardized administrative code systems (e.g., ICD‑10‑CM, CPT/HCPCS, DRG, NDC). - Hands-on experience with ETL pipelines from payer sources into normalized data standards, preferably OMOP CDM with cost and payer domains. Education - Bachelor’s degree in health informatics, Public Health, Nursing, Health Information Management, Computer Science, Statistics, or a related quantitative or clinical field—or an equivalent combination of formal education and experience. - Master's degree or higher in Health Informatics, Biomedical Informatics, Clinical Informatics, Public Health, Epidemiology, or a related field is strongly preferred. - Clinical background (RN, PharmD, MD, or similar) with transition into informatics/analytics is highly valued. Pay Range The typical pay range for this role is: $130,295.00 - $260,590.00 This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. 
This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program. Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong. Great benefits for great people We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families. This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility. Additional details about available benefits are provided during the application process and on Benefits Moments. We anticipate the application window for this opening will close on: 06/29/2026 Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.

United States
$130K - $260K / year

Senior Data Scientist II - Agentic AI

Remitly

Remitly is a global digital financial services company providing fast, affordable, and secure remittance services with the aim of making it easier for people to

Data Scientist46 days ago

About the Team: LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of RELX (www.relx.com), a global provider of information-based analytics and decision tools for professional and business customers. Our company has been a long-time leader in deploying AI and advanced technologies to the legal market to improve productivity and transform the overall business and practice of law, deploying ethical and powerful generative AI solutions with a flexible, multi-model approach that prioritizes using the best model from today’s top model creators for each individual legal use case. The company employs over 2,000 technologists, data scientists, and experts to develop, test, and validate solutions in line with RELX Responsible AI Principles (https://stories.relx.com/responsible-ai-principles/index.html). About the Role: Are you looking to develop your Data Scientist career? Would you enjoy training more junior team members? We are seeking a Senior Data Scientist II to help lead the design and validation of Agentic AI-driven product capabilities within the legal domain. This role focuses on defining what to build and why—leveraging machine learning, NLP, and large language models (LLMs) to solve complex legal workflows. You will help drive experimentation, modeling, and evaluation, partnering closely with engineers to translate validated approaches into scalable, customer-facing solutions. Responsibilities: - Develop and implement NLP, LLM, and generative AI approaches (e.g., RAG, prompt strategies). - Define agentic workflows and reasoning strategies for multi-step legal tasks. - Develop retrieval strategies, including hybrid search (semantic + lexical), and evaluation metrics (e.g., relevance, ranking quality). - Analyze large-scale legal datasets to extract insights and improve model performance. - Establish best practices for model evaluation, validation, and benchmarking. 
- Translate experimental results into clear product recommendations and business impact. - Collaborate with product, legal experts, and engineers to align solutions with user needs. - Mentor team members and provide technical leadership in data science and AI. Requirements: - Education: Master’s degree or above in a quantitative or technical field (Statistics, Computer Science, Mathematics, Data Science, etc.). - Strong experience in machine learning, NLP, and LLM-based modeling. - Strong experience designing and running experiments, including model evaluation and iteration. - Experience with generative AI techniques (e.g., prompt engineering, RAG). - Experience designing and evaluating hybrid search (semantic + lexical) using embeddings and vector databases. - Experience designing agentic workflows and reasoning strategies, with hands-on experience applying agent frameworks (e.g., LangChain, LangGraph, AutoGen) in real-world use cases. - Proficiency in Python and data analysis tools. - Strong foundation in statistics, modeling, and large-scale text processing. #AIFluent U.S. National Base Pay Range: $104,900 - $174,700. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus. We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120. Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here. Please read our Candidate Privacy Policy. 
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. USA Job Seekers: EEO Know Your Rights.

United States
$104K - $174K / year

Senior Data Scientist II - Agentic AI

Reed Technology

LexisNexis Legal & Professional® provides legal, regulatory, and business information and analytics that help customers increase their productivity, improve decision-making, achieve better outcomes, and advance the rule of law around the world.

Data Scientist | Posted 46 days ago
Full Time | Remote | Team 1,001-5,000

About the Team:
LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of RELX (www.relx.com), a global provider of information-based analytics and decision tools for professional and business customers. Our company has been a long-time leader in deploying AI and advanced technologies to the legal market to improve productivity and transform the overall business and practice of law, deploying ethical and powerful generative AI solutions with a flexible, multi-model approach that prioritizes using the best model from today’s top model creators for each individual legal use case. The company employs over 2,000 technologists, data scientists, and experts to develop, test, and validate solutions in line with RELX Responsible AI Principles (https://stories.relx.com/responsible-ai-principles/index.html).

About the Role:
Are you looking to develop your Data Scientist career? Would you enjoy training more junior team members? We are seeking a Senior Data Scientist II to help lead the design and validation of Agentic AI-driven product capabilities within the legal domain. This role focuses on defining what to build and why, leveraging machine learning, NLP, and large language models (LLMs) to solve complex legal workflows. You will help drive experimentation, modeling, and evaluation, partnering closely with engineers to translate validated approaches into scalable, customer-facing solutions.

Responsibilities:
- Develop and implement NLP, LLM, and generative AI approaches (e.g., RAG, prompt strategies).
- Define agentic workflows and reasoning strategies for multi-step legal tasks.
- Develop retrieval strategies, including hybrid search (semantic + lexical), and evaluation metrics (e.g., relevance, ranking quality).
- Analyze large-scale legal datasets to extract insights and improve model performance.
- Establish best practices for model evaluation, validation, and benchmarking.
- Translate experimental results into clear product recommendations and business impact.
- Collaborate with product, legal experts, and engineers to align solutions with user needs.
- Mentor team members and provide technical leadership in data science and AI.

Requirements:
- Education: Master’s degree or above in a quantitative or technical field (Statistics, Computer Science, Mathematics, Data Science, etc.).
- Strong experience in machine learning, NLP, and LLM-based modeling.
- Strong experience designing and running experiments, including model evaluation and iteration.
- Experience with generative AI techniques (e.g., prompt engineering, RAG).
- Experience designing and evaluating hybrid search (semantic + lexical) using embeddings and vector databases.
- Experience designing agentic workflows and reasoning strategies, with hands-on experience applying agent frameworks (e.g., LangChain, LangGraph, AutoGen) in real-world use cases.
- Proficiency in Python and data analysis tools.
- Strong foundation in statistics, modeling, and large-scale text processing.

#AIFluent

U.S. National Base Pay Range: $104,900 - $174,700. Geographic differentials may apply in some locations to better reflect local market rates. This job is eligible for an annual incentive bonus.

We know your well-being and happiness are key to a long and successful career. We are delighted to offer country-specific benefits. Click here to access benefits specific to your location.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or by calling 1-855-833-5120.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here. Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. USA Job Seekers: EEO Know Your Rights.

United States
$104K - $174K / year