Job Closed
This listing is no longer active.
Changing the way people find what they love.
Principal Data Scientist – Foundational Models
Location
California
Posted
88 days ago
Salary
$142.1K - $237K / year
Seniority
Lead
Job Description
Stitch Fix
• Build the next generation ML systems that transform the way people find what they love.
• Focus on building, productionizing and scaling machine learning models, and promoting engineering excellence.
• Work with data scientists and lead cross-functional collaboration with Strategy, Product, Merchandising, Marketing, and Engineering partners to enhance capabilities.
• Drive step-change improvements in machine learning models through novel research, large-scale experimentation, and production deployment.
• Improve how we serve real-time recommendations in production at scale.
Job Requirements
- 5+ years of experience developing or improving machine learning models and taking them from prototype to production.
- 5+ years of advanced Python development experience, including familiarity with data manipulation using Spark and SQL.
- Solid foundation in machine learning; experience evaluating recommender systems is a bonus.
- Familiarity with modern ML frameworks (e.g. PyTorch, TensorFlow) and model deployment frameworks leveraging container orchestration.
- Hands-on experience with observability tools (e.g., Datadog, Prometheus) and a deep understanding of how to measure and optimize system latency.
- Ability to translate complex technical concepts into clear, actionable insights for cross-functional partners, including Product, Design, and Engineering, to build consensus and drive projects forward.
- Driven by curiosity to explore new technologies and take calculated risks, with a focus on identifying and implementing best practices that elevate the team's engineering standards.
Benefits
- Medical, dental and vision benefits
- Annual bonus
- Restricted stock units
- Comprehensive compensation packages
- Inclusive health and wellness benefits
More Data Scientist Jobs
Data Scientist (Remote)
ICF
Founded in 1969, ICF is a global advisory and technology services company headquartered in Reston, Virginia. It delivers data-driven solutions across energy, en
Description
ICF is a rapidly growing, entrepreneurial, multi-faceted consulting company seeking a Data Scientist to support our work across NIH. We look for someone who loves to learn and is passionate about their work. The ideal candidate will have a strong foundation in machine learning principles, experience with relevant tools and frameworks, and a passion for continuous learning and innovation. The ability to work standard Eastern Time is expected.
What you’ll be doing:
- Conceptualize and develop custom AI-driven data pipelines which address client problems, maximize efficiency, and maintain high standards of quality.
- Design and develop statistical analyses, visualize the output of statistical models, present and interpret the output of predictive models, and perform quality assurance tasks on model code and output.
- Develop solutions to a variety of client problems using both generative and non-generative AI models.
- Identify important and interesting questions about large datasets, then translate those questions into concrete analytical tasks.
- Build quantitative models with data and communicate the results of those models to stakeholders.
- Help conceptualize, as well as perform, data analyses to summarize trends.
- Develop and execute database queries that in turn support developing/formatting modeling inputs.
Basic Qualifications:
- Bachelor’s degree in a technical field: Computer Science, Engineering, or related discipline
- 3 or more years of experience with modeling and quantitative analysis using standard statistical software.
- Highly proficient in Python.
- Experience with GenAI and LLM tools and frameworks.
- 1 or more years’ experience working with any major public cloud service (experience preferred with Bedrock, Foundry, Vertex AI or similar).
- 1 or more years’ experience with NLP, machine learning, deep learning principles and practices.
- Strong communication and teamwork abilities.
- Ability to obtain and maintain a public trust clearance
- U.S. citizenship or lawful permanent residency (Green Card holder) is required due to federal contract requirements
Preferred Qualifications:
- Experience with incorporating Generative AI into data transformation pipelines.
- Demonstrated prompt engineering expertise to design, test, and iterate LLM prompts that improved answer quality and reduced hallucinations.
- Understanding of Deep Learning methodology.
- Experience with data warehouse design and development as well as data modeling for both relational and dimensional data models.
- Demonstrated experience showing strong critical thinking and problem-solving skills paired with a desire to take initiative.
- Excellent listening, written, and oral communication skills.
- Ability to exercise independent judgement while effectively prioritizing and executing tasks while under pressure.
- Team player with the ability to work in a fast-paced environment.
Working at ICF
ICF is a global advisory and technology services provider, but we’re not your typical consultants. We combine unmatched expertise with cutting-edge technology to help clients solve their most complex challenges, navigate change, and shape the future.
We can only solve the world's toughest challenges by building a workplace that allows everyone to thrive. We are an equal opportunity employer. Together, our employees are empowered to share their expertise and collaborate with others to achieve personal and professional goals. For more information, please read our EEO policy.
We will consider for employment qualified applicants with arrest and conviction records.
Reasonable Accommodations are available, including, but not limited to, for disabled veterans, individuals with disabilities, and individuals with sincerely held religious beliefs, in all phases of the application and employment process.
To request an accommodation, please email Candidateaccommodation@icf.com and we will be happy to assist. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.
Read more about workplace discrimination rights or our benefit offerings which are included in the Transparency in (Benefits) Coverage Act.
Candidate AI Usage Policy
At ICF, we are committed to ensuring a fair interview process for all candidates based on their own skills and knowledge. As part of this commitment, the use of artificial intelligence (AI) tools to generate or assist with responses during interviews (whether in-person or virtual) is not permitted. This policy is in place to maintain the integrity and authenticity of the interview process.
However, we understand that some candidates may require accommodation that involves the use of AI. If such an accommodation is needed, candidates are instructed to contact us in advance at candidateaccommodation@icf.com. We are dedicated to providing the necessary support to ensure that all candidates have an equal opportunity to succeed.
Pay Range
There are multiple factors that are considered in determining final pay for a position, including, but not limited to, relevant work experience, skills, certifications and competencies that align to the specified role, geographic location, education and certifications as well as contract provisions regarding labor categories that are specific to the position.
The pay range for this position based on full-time employment is: $89,649.00 - $152,404.00
Nationwide Remote Office (US99)
Head of Data
HappyCo
Real-time property operations for property management. Inspect, manage and monitor your assets from anywhere, instantly.
• Define and execute HappyCo’s overall data strategy aligned with company and product goals
• Establish the roadmap for analytics, machine learning, and data platform evolution
• Collaborate with Product and Engineering leadership to identify high-value opportunities for data-driven insights and AI features
• Build and mentor a small data team including data engineers and analysts
• Design and implement a canonical entity model that connects data across operational systems
• Lead the development of HappyCo’s property graph / knowledge layer, linking entities such as businesses, properties, units, inspections, work orders, and maintenance performance
• Develop entity resolution approaches using deterministic rules, fuzzy matching, geospatial matching, and machine learning where appropriate
• Oversee the architecture and evolution of HappyCo’s data platform, including the data warehouse, pipelines, and transformation layers
• Guide development of scalable data ingestion and transformation pipelines using tools such as BigQuery, Databricks, Dataflow, and Pub/Sub
• Establish best practices for data quality, lineage, and governance
• Enable internal teams with reliable analytics and insights derived from unified data
• Develop benchmark metrics and analytical models that help customers understand performance and opportunities for improvement
• Support the creation of self-service analytics capabilities across the company
• Prototype and implement predictive models that leverage unified data sets
Sr. Manager, Biopharma Real World Data Analyst
Natera
We are a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health.
Sr. Manager, Real World Data Analyst
Natera is the global leader in the development of blood-based diagnostics and has built considerable momentum in the utilization of our Signatera (tumor informed) and Latitude (tumor naïve) assays. 2025 reflected a significant increase in the utilization of Signatera with >700k samples processed across our labs. Our WGS/WES repertoire and the integration of Foresight’s phased variants technique and expertise in Lymphomas have enabled Natera to offer the most experienced, published, and highest ordered MRD test available with global reach. In addition, our best-in-class MRD assays combined with ultra-sensitivity have positioned Natera to lead this area of diagnostics.
Our business model has adapted and expanded to offer our customers real world data and insights to support their research objectives. We believe that our unique, multi-time point molecular data, matched with curated clinical data from the EHR and matched H&E digital pathology, provides our customers with the most biologically informed dataset for Oncology. These insights are ideally suited to support translational and clinical development teams with the evidence necessary to improve trial design, select the right patients and understand the correlation across treatments, outcomes and molecular signatures across several data modalities (MRD / ctDNA, WGS, WES & WTS, Claims, EMR curated clinical data and digpath). In addition, our multi-modal datasets are uniquely designed to understand response, quantify resistance and build models defining recurrence and the time horizons when high risk patients may relapse/recur.
Natera’s Mission for Life Science Companies: Integrate and utilize Natera’s portfolio of products, services and solutions to transform the R&D methodology and improve the probability of success across all of pharma’s primary verticals (Discovery, Development & Commercial) while bringing new products to market faster while compressing costs.
Summary Of The Role:
The Real-World Data & Analytics (RWDA) group within the Biopharma team at Natera works with major Biopharmaceutical partners to provide best-in-class data, analysis, and methodological guidance for Natera’s real-world data offering. We are seeking a highly motivated and solutions-oriented RWDA / Data Scientist with experience and interest in oncology, molecular analysis and epidemiological study design to join our team. This role requires the individual to engage customers to generate new hypotheses and research questions, iterate and interrogate existing hypotheses and research questions, design research projects, conduct and iterate on the analysis as well as derive insights from complex real-world multi-modal, multi-time point molecular and clinical data. The individual will also implement advanced statistical methods, and utilize internal analytical tools, interfaces as well as AI tools to generate, visualize and enhance insights.
Responsibilities:
● Team Reporting: data analysts will report directly to the Sr. Manager. This role will be responsible for identifying, hiring, mentoring, training and supervising the team across a number of customers.
● Biopharma Partnerships: design, prepare data and conduct analysis and delivery of RWE analyses for biopharma clients. You will be responsible for translating complex clinical development and translational research questions into actionable research plans and insights utilizing Natera data for trial design, outcomes research and other bespoke projects.
● Multi-modal Real World Data Expertise: generate insights from complex real-world endpoints using extensive coding, demonstrating deep comprehension of Natera clinical and molecular data structures and complexity, while also serving as an expert on the methodological nuances and limitations of real-world data.
● Methodological Standards & Mentorship: Build technical standards by implementing advanced methods in survival analysis, machine learning and predictive modeling.
● AI-Enhanced Applications: Integrate Natera tools, including LLMs and agentic tools, into your own daily workflow. Your focus will be on using these technologies to improve the speed and accuracy of code development, documentation, and review.
● Scientific Leadership & Influence: Own the communication of high impact/value results to internal stakeholders and external partners. You will be responsible for the scientific integrity of all deliverables, including manuscripts, conference abstracts, and technical reports where appropriate.
● Cross-Functional Collaboration: Collaborate with internal product, oncology, and clinical abstraction, medical, scientific, business development and consortia teams to continually enhance Natera data quality, products, and analytical best practices.
● Product / Work-Flow Enhancement: Proactively identify gaps in current products and ensure that customer feedback is collected and integrated into the Natera product road map.
● Oncology & RWE Domain Expertise: Maintain deep expertise in oncology clinical guidelines (e.g., NCCN) and emerging RWE methodologies. You will be responsible for translating these external shifts into internal strategy, ensuring that our research designs and data modeling stay ahead of the evolving oncology landscape and reflect the most current standard of care.
Qualifications:
● Education: Advanced education in epidemiology, biostatistics, data science, public health, or related fields, to the level of either:
  o Master’s degree, or PhD and 4+ years of additional work experience
● Technical & Statistical Mastery:
  o Expert-level proficiency in observational real-world healthcare data, specifically in designing and implementing complex time-to-event methodologies (survival analysis).
  o Track record of leading RWD analytical studies from initial scoping through to publication or dissemination.
  o Proficient in using R and SQL, especially statistical tools and packages.
  o Proficiency applying machine learning, LLM-based coding assistants (e.g., Copilot, Cursor) and agentic frameworks to support data analysis, code review, or scientific documentation workflows.
  o Adherence to good software engineering practices (version control, modular code, documentation).
  o Experience with code review.
● Communication & Client Ownership: Experience as a primary technical point of communication for biopharma clients, with a proven ability to collaborate on study design and translate highly technical findings into strategic recommendations for senior-level stakeholders.
● Leadership & Soft Skills: Strong project leadership and the ability to manage multiple high-priority workstreams simultaneously in a fast-paced environment.
● Soft Skills: Strong project leadership with excellent written and verbal communication skills. Ability to thrive in a fast-paced, dynamic environment working with multi-disciplinary scientists on complex problems.
● Solution Oriented: Ability to see through initial research questions to the high impact / valuable insights customers require (need vs. want). Understand the question behind the question as well as identify solutions to generate meaningful insights when the data does not exist.
Preferred Skillsets:
● Experience working with Pharma or drug development.
● Experience in clinical trial design in the clinical development space.
● Extensive proficiency with EMR raw and curated data, claims, and registry data.
● Practical experience building, fine-tuning, or configuring LLM-based tools and agentic workflows specifically for scientific discovery.
● High-level familiarity with NCCN guidelines and the ability to interpret real-world outcomes within the context of the current oncology standard of care.
● Significant experience analyzing multi-modal data including biomarker, genomic, or other high-dimensional molecular data alongside clinical datasets.
● Proficiency in managing large-scale data projects within cloud environments such as AWS or Google Cloud Platform (GCP).
Travel: As necessary (up to 20%)
This is a remote position.
The pay range is listed and actual compensation packages are based on a wide array of factors unique to each candidate, including but not limited to skill set, years & depth of experience, certifications and specific office location. This may differ in other locations due to cost of labor considerations.
Remote USA
$143,100 - $178,900 USD
OUR OPPORTUNITY
Natera™ is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health. Our aim is to make personalized genetic testing and diagnostics part of the standard of care to protect health and enable earlier and more targeted interventions that lead to longer, healthier lives.
The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions, who care deeply for our work and each other. When you join Natera, you’ll work hard and grow quickly. Working alongside the elite of the industry, you’ll be stretched and challenged, and take pride in being part of a company that is changing the landscape of genetic disease management.
WHAT WE OFFER
Competitive Benefits - Employee benefits include comprehensive medical, dental, vision, life and disability plans for eligible employees and their dependents. Additionally, Natera employees and their immediate families receive free testing in addition to fertility care benefits. Other benefits include pregnancy and baby bonding leave, 401k benefits, commuter benefits and much more. We also offer a generous employee referral program!
For more information, visit www.natera.com.
Natera is proud to be an Equal Opportunity Employer. We are committed to ensuring a diverse and inclusive workplace environment, and welcome people of different backgrounds, experiences, abilities and perspectives. Inclusive collaboration benefits our employees, our community and our patients, and is critical to our mission of changing the management of disease worldwide.
All qualified applicants are encouraged to apply, and will be considered without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, age, veteran status, disability or any other legally protected status. We also consider qualified applicants regardless of criminal histories, consistent with applicable laws.
If you are based in California, we encourage you to read this important information for California residents.
Link: https://www.natera.com/notice-of-data-collection-california-residents/
Please be advised that Natera will reach out to candidates with a @natera.com email domain ONLY. Email communications from all other domain names are not from Natera or its employees and are fraudulent. Natera does not request interviews via text messages and does not ask for personal information until a candidate has engaged with the company and has spoken to a recruiter and the hiring team. Natera takes cyber crimes seriously, and will collaborate with law enforcement authorities to prosecute any related cyber crimes.
For more information:
- BBB announcement on job scams
- FBI Cyber Crime resource page
Data Scientist
ICF
Founded in 1969, ICF is a global advisory and technology services company headquartered in Reston, Virginia. It delivers data-driven solutions across energy, en
• Conceptualize and develop custom AI-driven data pipelines which address client problems
• Maximize efficiency and maintain high standards of quality
• Design and develop statistical analyses
• Visualize the output of statistical models
• Present and interpret the output of predictive models
• Perform quality assurance tasks on model code and output
• Develop solutions to a variety of client problems using both generative and non-generative AI models
• Identify important and interesting questions about large datasets
• Translate those questions into concrete analytical tasks
• Build quantitative models with data and communicate the results of those models to stakeholders
• Help conceptualize and perform data analyses to summarize trends
• Develop and execute database queries that in turn support developing/formatting modeling inputs



