MeridianLink logo
MeridianLink

Connecting You to Better: MeridianLink is the developer of the industry's first multi-channel loan origination system.

Data Scientist, AI Data Foundations

Data ScientistData ScientistFull TimeRemoteSeniorTeam 501-1,000Since 1998H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

23 days ago

Salary

$114.6K - $195.4K / year

Seniority

Senior

Job Description

Data Scientist, AI Data Foundations

MeridianLink

• Build and maintain vector stores for RAG: Design embedding pipelines, chunking strategies, indexing approaches, and refresh patterns for the vector stores powering retrieval-augmented generation across MeridianLink products. • Own the feature store: Design, build, and operate feature store assets used for model training and online/offline inference, including feature definitions, freshness SLAs, lineage, point-in-time correctness, and reuse across teams. • Design graph data structures: Build graph databases that model relationships between applicants, applications, products, lenders, decisions, and outcomes — and make them queryable for both AI use cases and analytical investigations. • Lead data discovery: Profile our lending, deposit, and behavioral datasets to identify hidden trends, segments, anomalies, and potential model drivers; turn findings into actionable hypotheses for product, risk, and growth teams. • Engineer for AI consumption: Build the curated, AI-ready datasets that downstream model builders, application engineers, and analysts rely on — with appropriate quality, documentation, and governance baked in. • Evaluate retrieval and feature quality: Define and run evaluation frameworks for RAG retrieval quality, feature drift, embedding quality, and graph completeness; iterate based on what the metrics tell you. • Partner with model builders: Work closely with ML engineers and applied scientists to make sure the data structures you build accelerate their work rather than slow it down. • Champion responsible data use: Partner with governance, security, and compliance to ensure that AI-facing data assets respect data classification, customer consent, and regulatory boundaries from day one. • Communicate findings: Translate discovery work into clear narratives — write-ups, notebooks, dashboards, and short presentations — that help non-technical stakeholders act on what the data is showing.

Job Requirements

  • 4–7 years of experience in a data science, ML engineering, or applied data role, with a meaningful portion of that time spent building data assets that other people's models or applications consumed.
  • Hands-on experience designing and operating vector stores for RAG or semantic search, including embedding generation, chunking, indexing, and retrieval evaluation.
  • Experience building or operating a feature store (e.g., Databricks Feature Store, Feast, or a custom internal platform), including offline training and online serving patterns and point-in-time correctness.
  • Experience modeling and building graph data structures using Neo4j, TigerGraph, Azure Cosmos DB Gremlin, or similar graph databases — and writing graph queries to answer real questions.
  • Strong proficiency in Python (pandas, NumPy, scikit-learn, PySpark) and SQL; comfortable working day-to-day in Databricks notebooks and jobs.
  • Practical experience with embedding models and LLM tooling (e.g., Hugging Face transformers, OpenAI / Azure OpenAI APIs, LangChain or similar) in a production or near-production context.
  • Demonstrated data discovery skills: profiling messy real-world datasets, surfacing non-obvious patterns, validating findings statistically, and explaining them clearly.
  • Solid grounding in classical ML concepts — supervised vs. unsupervised learning, train/test discipline, leakage, evaluation metrics — even though you will not own model training day-to-day.
  • Strong written and verbal communication skills; able to write up findings for both technical and business audiences.

Benefits

  • Insurance coverage (medical, dental, vision, life, and disability)
  • Flexible paid time off
  • Paid holidays
  • 401(k) plan with company match
  • Remote work

Related Categories

Related Job Pages

More Data Scientist Jobs

Full TimeRemoteTeam 51-200H1B No Sponsor

**Job Description:** - Provide guidance and best practices for data collection, storage, analysis and visualization in facility data management. - Develop and maintain comprehensive data governance policies, ensuring compliance with regulatory and organizational standards. - Verify the implementation of facility management software (e.g., CMMS) to track assets, maintenance schedules, and space utilization. - Ensure data integrity by conducting audits, validating information, and implementing data quality control measures. - Collaborate with IT and facility teams to integrate data systems and improve reporting capabilities. - Analyze facility-related data to identify trends, inefficiencies, and areas for improvement. - Generate reports and dashboards to support decision-making and strategic planning. - Train facility management teams on data entry protocols, reporting tools, and system best practices. - Stay current with industry trends, emerging technologies, and data management advancements. - Translate stakeholder requirements into actionable technical solutions using Power BI dashboards and Power Apps - Proficiency in Extract, Transform, Load (ETL) processes to ensure efficient data integration and transformation. - Implement and oversee data governance strategies to ensure compliance with industry standards and organizational policies. - Establish and maintain data quality assurance protocols to ensure reporting accuracy and reliability. - Define, configure, update, and track Key Performance Indicators (KPIs) related to organizational requirements and strategic goals.

District Of Columbia + 3 moreAll locations: District Of Columbia | Maryland | Texas | Virginia
Humana logo

Senior Data Scientist – Generative AI

Humana

Humana Inc. (NYSE: HUM) is a leading U.S. healthcare company. Through our Humana insurance services and our CenterWell healthcare services, we make it easier for the millions of people we serve to achieve their best health – delivering the care and service they need, when they need it. These efforts are leading to a better quality of life for people with Medicare and Medicaid, families, individuals, military service personnel, and communities at large. Learn more about what we offer at Humana.com and at CenterWell.com.

Data Scientist23 days ago
Full TimeRemoteTeam 10,001+Since 1961H1B Sponsor

• Design, build, and deploy LLM-based applications for conversational intelligence and insight generation. • Develop multi-step AI workflows, including retrieval, orchestration, routing, and tool-using patterns. • Define and implement evaluation approaches for LLM applications, including response quality, reliability, safety, and value. • Work with large-scale structured and unstructured data using modern data platforms (e.g., Spark, Databricks, Snowflake). • Partner with engineering teams to move GenAI solutions from prototype to production. • Write clean, maintainable code and contribute to shared development practices such as version control, documentation, and code review. • Ensure solutions align with enterprise standards for security, privacy, compliance, and responsible AI.

California + 3 moreAll locations: California | Illinois | Montana | South Dakota
$117.6K - $161.7K / year
Job Closed
Full TimeRemoteTeam 501-1,000Since 2005H1B No Sponsor

• Set Long-Term Marketing Strategy: Define the 2–3 year technical roadmap for marketing measurement. Establish the "North Star" metrics and frameworks that determine how Reddit allocates hundreds of millions in marketing budget. • Optimize Marketing ROI: Build and refine Media Mix Models (MMM) and Multi-Touch Attribution (MTA) systems to mathematically quantify the incremental impact of marketing spend. • Bridge B2B & B2C Growth: Design unified frameworks to optimize marketing aimed at both new advertisers (driving revenue) and new users (driving engagement), identifying synergies where brand awareness for one fuels growth for the other. • Advance Causal Inference & Experimentation: Lead the design of complex "always-on" incrementality testing and geo-holdout experiments. Develop methodologies to measure the long-term "halo effect" of brand marketing on organic growth. • Lead Through Influence: Collaborate deeply with Marketing, Growth, and Finance to translate complex data insights into actionable budget shifts. • Be the Technical Guide for the Team: Mentor junior scientists and set the technical bar for code quality and statistical rigor across the Marketing organization.

United States
$217K - $303.9K / year
Microsoft logo

Principal Data Scientist

Microsoft

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to any characteristic protected by applicable local laws, regulations, and ordinances.

Data Scientist23 days ago
Full TimeRemoteTeam 10,001+H1B Sponsor

Role Description Do you enjoy solving problems, looking at problems through a different lens, and working closely with customers to innovate new solutions to complex problems? Do you jump with excitement at the opportunity to identify trends and provide unique business solutions? Do you want to join a team where learning about a new technology or solution is part of our work every day? Then, come join us! The Industry Solutions Engineering (ISE) team is a global engineering organization that works directly with customers looking to leverage the latest technologies to address their toughest challenges. - Work closely with customers’ engineers to jointly develop code for cloud-based solutions. - Collaborate with Microsoft product teams, partners, and open-source communities. - Develop solutions side-by-side with customers through collaborative innovation. - Drive high-impact engineering projects using AI and other cloud-based solutions. - Be part of a cross-functional team of software engineers, data scientists, technical program managers, and designers. As part of our team, you will thrive in working with a variety of technologies, not just Microsoft technology. You will solve exciting business problems, contribute to open source, and collaborate with Microsoft product teams. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Responsibilities - Oversee data acquisition and ensure data is properly formatted and described. - Coach engineers in data cleaning and analysis best practices. - Identify gaps in current data sets and drive ethics and privacy discussions. - Evaluate team’s models and recommend improvements. - Drive best practices for models and develop operational models that run at scale. - Conduct thorough reviews of data analysis and modeling techniques. - Research and maintain a deep knowledge of the industry, including trends and technologies. - Define business, customer, and solution strategy goals. - Partner with other teams to identify and explore new opportunities. - Commit to a customer-oriented focus by acknowledging their needs. Qualifications - Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 5+ years data-science experience. - OR Master's Degree in the same fields AND 7+ years data-science experience. - OR Bachelor's Degree in the same fields AND 10+ years data-science experience. - OR equivalent experience. Requirements - Microsoft is unable to sponsor a work visa for this role due to the nature of the role’s job duties. - Enjoy travel and are comfortable with travel up to 25%. - Customer-facing, project-delivery experience, professional services, and/or consulting experience. Benefits - Competitive package including a wide range of benefits built around personal needs. - Typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. - Different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, is USD $188,000 - $304,200 per year.

United States
$139.9K - $304.2K / year
Job Closed