Senior Applied Scientist, Document Understanding

Data ScientistData ScientistFull TimeRemoteSeniorTeam 10,001+Since 2008H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

39 days ago

Salary

$127K - $236K / year

Seniority

Senior

Job Description

Senior Applied Scientist, Document Understanding

Thomson Reuters

Senior Applied Scientist, Document Understanding About the Role This role sits within the applied science function. You will design, build, and deploy document understanding systems that directly power Westlaw, PracticalLaw, and CoCounsel. The problems are real, the scale is large, and the expectation is shipped, reliable, measurable impact. You will work across semantic chunking, document enrichment, knowledge graph construction, and synthetic data generation for complex legal, tax, and accounting content. Multiple product teams depend on what this function delivers. About You You hold a PhD or Master's in Computer Science, AI, NLP, or a related field, with 5+ years of post-degree industry experience taking NLP and document understanding systems from development to production at scale. You have hands-on depth across model development, distillation, evaluation, and deployment. You publish, you work independently, lead through influence in an applied research setting, and measure success by what ships and performs in production. What You'll Do - Design and deploy semantic chunking models for lengthy, non-uniformly structured legal documents with adjustable granularity across use cases - Build document enrichment systems using legal and customer-defined taxonomies - Develop LLM-based knowledge graph construction pipelines that extract and link citations, entities, and legal concepts across diverse legal content - Build scalable synthetic data generation systems for model training, multi-hop query simulation, and hallucination-free answer generation - Apply knowledge distillation techniques to compress large models into latency-constrained, production-ready SLMs - Design evaluation frameworks — component-level and end-to-end — using expert annotation and synthetic data - Drive technical decisions on architecture, chunking strategy, classification approach, and knowledge extraction methods - Partner with engineering on delivery, reliability, and scale across multiple product lines - Contribute to published research at venues such as ACL, EMNLP, ICLR, NeurIPS, SIGIR, and KDD, and to intellectual property Required Qualifications - PhD or Master's in Computer Science, AI, NLP, or a related field - 5+ years of post-degree industry experience shipping document understanding, information extraction, or knowledge graph systems into production — not research-only experience - Publications at ACL, EMNLP, ICLR, NeurIPS, SIGIR, KDD, or equivalent - Experience leading through influence in an applied research setting - Production Python and experience with PyTorch, Hugging Face Transformers, and DeepSpeed Hands-on production depth required in: - Document layout analysis and semantic chunking beyond fixed-size or paragraph-based methods - Hierarchical, multi-label document classification with domain-specific and customer-defined schemas - Entity recognition and linking, relation extraction, citation parsing, and knowledge graph construction from unstructured text - LLM-based information extraction, few-shot and multi-task learning, and post-training - Knowledge distillation, model compression, and SLM deployment under latency constraints - Synthetic data generation and annotation workflow design - End-to-end evaluation framework design for document understanding Preferred Qualifications - Legal document understanding, legal IE, or legal AI experience - Complex document structures: nested hierarchies, cross-references, non-uniform formatting - Retrieval or QA systems over large document collections - RAG and agentic workflows in enterprise settings - Knowledge graph frameworks for legal or enterprise applications - AzureML or AWS SageMaker #LI-LP2 What’s in it For You? - Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, empowering employees to achieve a better work-life balance. - Career Development and Growth: By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow’s challenges and deliver real-world solutions. Our Grow My Way programming and skills-first approach ensures you have the tools and knowledge to grow, lead, and thrive in an AI-enabled future. - Industry Competitive Benefits: We offer comprehensive benefit plans to include flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing. - Culture: Globally recognized, award-winning reputation for inclusion and belonging, flexibility, work-life balance, and more. We live by our values: Obsess over our Customers, Compete to Win, Challenge (Y)our Thinking, Act Fast / Learn Fast, and Stronger Together. - Social Impact: Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives. - Making a Real-World Impact: We are one of the few companies globally that helps its customers pursue justice, truth, and transparency. Together, with the professionals and institutions we serve, we help uphold the rule of law, turn the wheels of commerce, catch bad actors, report the facts, and provide trusted, unbiased information to people all over the world. In the United States, Thomson Reuters offers a comprehensive benefits package to our employees. Our benefit package includes market competitive health, dental, vision, disability, and life insurance programs, as well as a competitive 401k plan with company match. In addition, Thomson Reuters offers market leading work life benefits with competitive vacation, sick and safe paid time off, paid holidays (including two company mental health days off), parental leave, sabbatical leave. These benefits meet or exceeds the requirements of paid time off in accordance with any applicable state or municipal laws. Finally, Thomson Reuters offers the following additional benefits: optional hospital, accident and sickness insurance paid 100% by the employee; optional life and AD&D insurance paid 100% by the employee; Flexible Spending and Health Savings Accounts; fitness reimbursement; access to Employee Assistance Program; Group Legal Identity Theft Protection benefit paid 100% by employee; access to 529 Plan; commuter benefits; Adoption & Surrogacy Assistance; Tuition Reimbursement; and access to Employee Stock Purchase Plan. Thomson Reuters complies with local laws that require upfront disclosure of the expected pay range for a position. The base compensation range varies across locations. For any eligible US locations, unless otherwise noted, the base compensation range for this role is $127,400 USD - $236,600 USD. Base pay is positioned within the range based on several factors including an individual’s knowledge, skills and experience with consideration given to internal equity. Base pay is one part of a comprehensive Total Reward program which also includes flexible and supportive benefits and other wellbeing programs. This role may also be eligible for an Annual Bonus based on a combination of enterprise and individual performance. This job posting will close 05/13/2026. About Us Thomson Reuters informs the way forward by bringing together the trusted content and technology that people and organizations need to make the right decisions. We serve professionals across legal, tax, accounting, compliance, government, and media. Our products combine highly specialized software and insights to empower professionals with the data, intelligence, and solutions needed to make informed decisions, and to help institutions in their pursuit of justice, truth, and transparency. Reuters, part of Thomson Reuters, is a world leading provider of trusted journalism and news. We are powered by the talents of 26,000 employees across more than 70 countries, where everyone has a chance to contribute and grow professionally in flexible work environments. At a time when objectivity, accuracy, fairness, and transparency are under attack, we consider it our duty to pursue them. Sound exciting? Join us and help shape the industries that move society forward. As a global business, we rely on the unique backgrounds, perspectives, and experiences of all employees to deliver on our business goals. To ensure we can do that, we seek talented, qualified employees in all our operations around the world regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under applicable law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace. We also make reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs in accordance with applicable law. More information on requesting an accommodation here. Learn more on how to protect yourself from fraudulent job postings here. More information about Thomson Reuters can be found on thomsonreuters.com

Related Categories

Related Job Pages

More Data Scientist Jobs

Full TimeRemoteTeam 10,001+Since 2008H1B Sponsor

New Position: This position is open due to an existing vacancy to support our evolving business needs. Senior Applied Scientist, Document Understanding About the Role This is an applied science position focused on designing, building, and deploying production-grade document understanding systems that power Westlaw, PracticalLaw, and CoCounsel. You will work across semantic chunking, document enrichment, and knowledge graph construction for complex legal, tax, and accounting content — delivering foundational intelligence that multiple product teams depend on at scale. About You You hold a PhD or Master's in Computer Science, AI, NLP, or a related field, with 5+ years of post-degree industry experience shipping document understanding, information extraction, or knowledge graph systems into production. You have hands-on depth across model development, distillation, evaluation, and deployment. You work independently, lead through influence in an applied research setting, and measure success by what ships and performs in production. What You'll Do - Design and deploy semantic chunking models for lengthy, non-uniformly structured legal documents with adjustable granularity across use cases - Build document enrichment systems that classify documents according to legal and customer-defined taxonomies and extract rich metadata - Develop LLM-based knowledge graph construction pipelines that extract and link citations, entities, and legal concepts across diverse legal content - Build scalable synthetic data generation systems for model training, multi-hop query simulation, and hallucination-free answer generation - Apply knowledge distillation techniques to compress large models into latency-constrained, production-ready SLMs - Design evaluation frameworks — component-level and end-to-end — using expert annotation and synthetic data - Drive independent technical decisions on chunking strategy, classification approach, knowledge extraction methods, and multi-document reasoning architecture - Partner with engineering on delivery, reliability, and scale across multiple product lines - Contribute to published research at venues such as ACL, EMNLP, ICLR, NeurIPS, SIGIR, and KDD, and to intellectual property Required Qualifications - PhD or Master's in Computer Science, AI, NLP, or a related field - 5+ years of post-degree industry experience shipping document understanding, information extraction, or knowledge graph systems into production — not research-only experience - Publications at ACL, EMNLP, ICLR, NeurIPS, SIGIR, KDD, or equivalent - Experience leading through influence in an applied research setting - Production Python and experience with PyTorch, Hugging Face Transformers, and DeepSpeed Hands-on production depth required in: - Document layout analysis and semantic chunking beyond fixed-size or paragraph-based methods - Hierarchical, multi-label document classification with domain-specific and customer-defined schemas - Entity recognition and linking, relation extraction, citation parsing, and knowledge graph construction from unstructured text - LLM-based information extraction, few-shot and multi-task learning, and post-training - Knowledge distillation, model compression, and SLM deployment under latency constraints - Synthetic data generation for NLP: query-answer generation with verification and scalable data augmentation - Annotation workflow design and evaluation framework development for document understanding tasks Preferred Qualifications - Legal document understanding, legal information extraction, or legal AI applications - Complex document structures common in legal content: nested hierarchies, cross-references, non-uniform formatting, and embedded elements - Retrieval, QA, or analysis systems over large document collections - Knowledge graph frameworks for legal or enterprise applications - RAG and agentic workflows for enterprise knowledge systems - AzureML or AWS SageMaker #LI-LP2 What’s in it For You? - Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, empowering employees to achieve a better work-life balance. - Career Development and Growth: By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow’s challenges and deliver real-world solutions. Our Grow My Way programming and skills-first approach ensures you have the tools and knowledge to grow, lead, and thrive in an AI-enabled future. - Industry Competitive Benefits: We offer comprehensive benefit plans to include flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing. - Culture: Globally recognized, award-winning reputation for inclusion and belonging, flexibility, work-life balance, and more. We live by our values: Obsess over our Customers, Compete to Win, Challenge (Y)our Thinking, Act Fast / Learn Fast, and Stronger Together. - Social Impact: Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives. - Making a Real-World Impact: We are one of the few companies globally that helps its customers pursue justice, truth, and transparency. Together, with the professionals and institutions we serve, we help uphold the rule of law, turn the wheels of commerce, catch bad actors, report the facts, and provide trusted, unbiased information to people all over the world. Our use of AI within the recruitment process Thomson Reuters utilizes Artificial Intelligence (AI) to support parts of our global recruitment process. Unless you opt-out, our AI system will assess the information provided by you and compare it to the requirements listed for the role, and present the result to our recruitment personnel for further review. The AI system acts as a supporting tool, but there is always a human making the decision if you will be considered for the role. In the United States, Thomson Reuters offers a comprehensive benefits package to our employees. Our benefit package includes market competitive health, dental, vision, disability, and life insurance programs, as well as a competitive 401k plan with company match. In addition, Thomson Reuters offers market leading work life benefits with competitive vacation, sick and safe paid time off, paid holidays (including two company mental health days off), parental leave, sabbatical leave. These benefits meet or exceeds the requirements of paid time off in accordance with any applicable state or municipal laws. Finally, Thomson Reuters offers the following additional benefits: optional hospital, accident and sickness insurance paid 100% by the employee; optional life and AD&D insurance paid 100% by the employee; Flexible Spending and Health Savings Accounts; fitness reimbursement; access to Employee Assistance Program; Group Legal Identity Theft Protection benefit paid 100% by employee; access to 529 Plan; commuter benefits; Adoption & Surrogacy Assistance; Tuition Reimbursement; and access to Employee Stock Purchase Plan. Thomson Reuters complies with local laws that require upfront disclosure of the expected pay range for a position. The base compensation range varies across locations. For any eligible US locations, unless otherwise noted, the base compensation range for this role is $127,400 USD - $236,600 USD. For Ontario, Canada, the base compensation range for this role is $100,000 CAD - $145,000 CAD. Base pay is positioned within the range based on several factors including an individual’s knowledge, skills and experience with consideration given to internal equity. Base pay is one part of a comprehensive Total Reward program which also includes flexible and supportive benefits and other wellbeing programs. This role may also be eligible for an Annual Bonus based on a combination of enterprise and individual performance. This job posting will close 05/13/2026. About Us Thomson Reuters informs the way forward by bringing together the trusted content and technology that people and organizations need to make the right decisions. We serve professionals across legal, tax, accounting, compliance, government, and media. Our products combine highly specialized software and insights to empower professionals with the data, intelligence, and solutions needed to make informed decisions, and to help institutions in their pursuit of justice, truth, and transparency. Reuters, part of Thomson Reuters, is a world leading provider of trusted journalism and news. We are powered by the talents of 26,000 employees across more than 70 countries, where everyone has a chance to contribute and grow professionally in flexible work environments. At a time when objectivity, accuracy, fairness, and transparency are under attack, we consider it our duty to pursue them. Sound exciting? Join us and help shape the industries that move society forward. As a global business, we rely on the unique backgrounds, perspectives, and experiences of all employees to deliver on our business goals. To ensure we can do that, we seek talented, qualified employees in all our operations around the world regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under applicable law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace. We also make reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs in accordance with applicable law. More information on requesting an accommodation here. Learn more on how to protect yourself from fraudulent job postings here. More information about Thomson Reuters can be found on thomsonreuters.com

United States + 1 moreAll locations: United States | Canada
$127K - $236K / year
For People logo

Data Scientist, Healthcare

For People

Data for a better society.

Data Scientist39 days ago
Full TimeRemoteTeam 11-50H1B Sponsor

• Develop and implement data models to find efficiencies and identify trends using statistical techniques. • Analyze large-scale healthcare datasets, specifically focusing on the Transformed Medicaid Statistical Information System (T-MSIS) Analytic Files (TAF), to generate actionable insights. • Collaborate within a dedicated team of analysts and program experts to build interactive self-service dashboards that translate complex metrics into user-centric visualizations. • Support the incremental development of data governance frameworks to ensure the highest standards of data quality, integrity, and security.

United States
$110K - $140K / year
Job Closed
GeoYeti logo

Senior Geospatial Data Scientist

GeoYeti

BCore is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law.

Data Scientist39 days ago

Overview At Bcore, our strength comes from how we deliver impact to the mission. Whether it’s architecting critical IT solutions, producing actionable intelligence, or developing cutting edge technology, we succeed because of the expertise, collaboration, and agility of our teams. Our Insight Solutions division delivers intelligence analysis, advanced data science, and strategic decision support. Bcore accelerates decisive advantage for warfighters and intelligence professionals by fusing human insight, rapid-fire engineering, precision-measured outcomes, and relentless grit into mission-ready solutions. Are you ready to lean into analytic approaches that show customers the power of both technical and methodological innovation? Join our growing team supporting customer missions as a Senior Geospatial Data Scientist in Reston, VA and remote. Responsibilities - Identify corollary datasets to compare against model outputs, especially those with which were unlikely used as part of the training set. - Assist in the evaluation of the effectiveness of ML models. Qualifications Required Qualifications: - Active TS/SCI clearance with CI poly required to start - Expert level understanding of Python, including geospatial libraries like Fiona and Shapely. - Expert level understanding of spatial data storage formats such as ESRI GeoDatabases, shapefiles, GeoPackage, and text-based storage in spatial databases such as PostGIS. - Ideally an understanding of storage formats and methodologies for ML models. What you can expect from us BCore is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law.

United States
US Foods logo

Lead Data Scientist (Remote/Virtual)

US Foods

US Foods is a foodservice distributor, partnering with restaurants and operators to help their businesses succeed.

Data Scientist39 days ago
Full TimeRemoteTeam 10,001+H1B Sponsor

ARE YOU A CURRENT US FOODS EMPLOYEE? PLEASE APPLY DIRECTLY THROUGH OUR INTERNAL WORKDAY CAREER SITE Join Our Community of Food People! The Lead Data Scientist – Commercial will lead a team of data scientists responsible for designing, developing, and deploying advanced analytics and AI solutions that drive commercial growth, seller efficiency improvements and customer engagement. This includes seller effectiveness solutions, eCommerce AI capabilities such as personalization and product recommendations, and advanced marketing and merchandising analytics. This leader owns the Commercial Advanced Analytics portfolio, with accountability for analysis, development, and implementation of AI / ML solutions and delivery of measurable business outcomes across Sales, Marketing, and Digital channels. Responsibilities span the full lifecycle of initiatives, from problem framing and solution design through production deployment and adoption. This position is remote which means the work can be completed from anywhere except Hawaii or United States Territories. ESSENTIAL DUTIES AND RESPONSIBILITIES Delivery and Impact - Partner with senior Sales, Merchandising, Marketing, and Digital stakeholders to identify, prioritize, and frame high-impact business problems suited for advanced analytics and AI. - Oversee delivery of solutions including eCommerce personalization and recommendation systems, seller effectiveness and productivity tools, and advanced marketing and merchandising analytics. - Ensure solutions are production-ready, scalable, and embedded into commercial workflows to drive sustained and measurable revenue, margin, and customer experience impact. Analytical Leadership - Lead, develop, and retain high-performing teams of data scientists, with a strong focus on innovation, execution, and talent development. - Shape and deliver the commercial analytics and AI roadmap aligned to growth priorities, customer strategy, and measurable business outcomes. - Influence decision-making by leading statistical experimentation and driving adoption of data-driven decision making across Sales, Merchandising, Marketing, and Digital leadership teams. Technical Excellence - Provide technical and analytical leadership across applied AI, optimization, and statistical modeling. - Set standards for analytical rigor, model performance, reliability, and commercial business impact. - Collaborate with ML Engineering, Digital, and Platform teams to ensure robust code development, scalable deployment, and stable production operations across the full model lifecycle. SUPERVISION: - Team of five data scientists. RELATIONSHIPS - Internal: Analytics and Data Science teams; Executive Leadership Team; Sales, Marketing, Merchandising, Digital, and Technology leaders. - External: Vendors including cloud infrastructure providers, analytics and AI solution partners, and other strategic partners. WORK ENVIRONMENT (Select one) - Remote: This role is fully remote, and the associate is expected to perform assigned responsibilities from a home-based environment. MINIMUM QUALIFICATIONS - Six years of experience or greater in advanced analytics, data science, or applied machine learning, with progressive leadership responsibility. - Experience deploying applied AI solutions on cloud platforms (e.g., AWS SageMaker and Bedrock), including LLM-based and agentic architectures, with production-grade hosting, monitoring, governance, and end-to-end MLOps. - Experience guiding teams in Python-based data science ecosystems and collaborative development practices, including code quality, testing, and reproducibility. Comfort with modern agentic coding tools (e.g., Claude Code, GitHub Copilot). - Strong command of machine learning, optimization methods, and statistical experimentation at scale, particularly in commercial, customer, or growth-oriented use cases. - Business-oriented analytical thinker with a high bar for rigor, execution, and reliability. - Clear, concise communicator able to influence senior leaders with data-driven insights. EDUCATION - Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related quantitative field required. - PhD in a quantitative field a plus. TRAVEL REQUIREMENT - 10% CERTIFICATIONS/TRAINING - N/A LICENSES - N/A PREFERRED QUALIFICATIONS - Experience leading applied data science or AI teams in commercial, sales, marketing, or eCommerce environments (especially B2B). - Strong consultative, business-facing background with demonstrated success driving adoption of analytics products. - Able to communicate clearly and influence stakeholders through storytelling and public speaking. - Experience in complex, SKU-heavy or distribution-style businesses. This role is also eligible for Benefits for this role include health insurance, pre-tax spending accounts, retirement benefits, paid time off, short-term and long-term disability, employee stock purchase plan, and life insurance. To review available benefits, please click here: https://www.usfoods.com/careers/benefits.html. #LI-EC1 Compensation depends on relevant experience and/or education, specific skills, function, geographic location, and other factors as applicable by law (for example: state or local minimum wage thresholds). The expected base rate for this role is between $100,000 - $160,000 ***EOE – Race/Color/Religion/Sex/Sexual Orientation/Gender Identity/National Origin/Age/Genetic Information/Protected Veteran/Disability Status***

United States
$100K - $160K / year