The breakthrough AI production platform that allows anyone to create compelling commercials and spec spots in minutes.
Principal Data Scientist, Health Informatics
Location
California + 12 moreAll locations: California | District Of Columbia | Florida | Illinois | New York | North Carolina | Oregon | Maryland | Massachusetts | Minnesota | Texas | Virginia | Washington
Posted
6 days ago
Salary
$128K - $229K / year
Seniority
Lead
Job Description
Principal Data Scientist, Health Informatics
Waymark
• Own clinical data quality across claims, EHR, and ADT: Define standards for how clinical data is structured, normalized, and validated as modeling inputs across payer claims (medical, pharmacy, eligibility), EHR data (Epic, Cerner, Athena), and real-time ADT feeds. Bring deep familiarity with EHR data formats (FHIR, HL7, C-CDA) and how data from systems like Epic, Cerner, and Athena maps to clinical reality. Hold the bar for clinical accuracy and completeness across all three sources. • Build and ship production ML/AI models: Develop, validate, and deploy risk stratification, care gap prediction, treatment effect estimation, and LLM/foundation model applications — with rigor around leakage, calibration, fairness, and clinical face validity. • Apply health economics and outcomes methods: Translate raw clinical and claims data into decision-grade evidence through risk adjustment, utilization measurement, cost attribution, quasi-experimental evaluation, and outcomes measurement aligned with CMS, NCQA, and MCO reporting standards. • Advance machine and AI products: Bring senior modeling judgment to the product roadmap, owning the clinical and methodological soundness of what ships. • Set standards and mentor: Make architectural trade-offs, drive alignment across data science, engineering, product, and clinical stakeholders, and mentor junior data scientists to raise the technical bar of the team.
Job Requirements
- Healthcare Data Expertise: Deep, hands-on fluency with claims, EHR, and ADT data, and strong command of clinical terminologies (ICD-10, SNOMED CT, LOINC, RxNorm, CPT/HCPCS) and value set curation.
- Standards Fluency: Working experience with healthcare data standards and exchange formats — FHIR, HL7v2, and C-CDA.
- Education: Master's degree in Data Science, Biostatistics, Health Informatics, Computer Science, or a related field.
- Python Proficiency: 7-8+ years of hands-on experience in Python, including data science and ML libraries.
- Applied ML/AI Experience: Demonstrated ability to build, validate, and deploy production ML models on healthcare data, with end-to-end ownership from development through deployment and maintenance in a live environment. Experience with ML pipelines, model versioning, and reproducible workflows at scale.
- Project Ownership: Proven ability to manage complex technical projects independently, align multiple stakeholders, and deliver on timelines.
Benefits
- Stock Options: Opportunity to invest in the company’s growth.
- Work-from-Home Stipend: A dedicated stipend for your first year to help set up your home office.
- Medical, Vision, and Dental Coverage: Comprehensive plans to keep you and your family healthy.
- Life Insurance: Basic life insurance to give you peace of mind.
- Paid Time Off: 20 vacation days, accrued over the year, plus 11 paid holidays.
- Parental Leave: 16 weeks of paid leave for birthing parents after six months of employment, and 8 weeks of bonding leave for non-birthing parents.
- Retirement Savings: Access to a 401(k) plan with a company contribution, subject to a vesting schedule.
- Commuter Benefits: Convenient options to support your commute needs.
- Professional Development Stipend: A dedicated stipend supports professional development and growth.
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
• Build and maintain new data models that are accurate and reliable • Work closely with a wide range of stakeholders from Data Science, Commercial Leadership, Sales Operations, Product, Design, and Engineering • Define, prototype, implement and improve ML models • Design experiments to test hypotheses (e.g. which customers should we focus on in our outbound calls) • Build/maintain reports, dashboards, and metrics to support our north star metrics • Teach and learn from their teammates. We value making others successful
Senior Data Scientist
3Play MediaAt 3Play Media, we’re transforming the way organizations approach video accessibility and localization—making it faster, easier, and more reliable to reach every viewer. From captions and subtitles to audio descriptions and dubs, our platform streamlines workflows that once took days into just a few clicks. We support thousands of customers across media & entertainment, education, enterprise, and sports—helping them expand their reach and meet accessibility standards with speed and accuracy. But we’re more than a tech company—we’re a mission-driven team that believes access to content is a right, not a privilege. Headquartered on Boston’s waterfront, our team of passionate problem-solvers, engineers, linguists, and accessibility advocates is driving the future of inclusive and global video. At 3Play, you’ll find a culture that values curiosity, collaboration, and continuous learning. Whether you’re passionate about accessibility, excited by media tech, or simply love working with smart, motivated people, you’ll have the opportunity to make an impact here.
Role Description We are looking to grow our core Applied Science team by adding a “Senior Applied Data Scientist”. This is an individual contributor role on a highly collaborative team. The Applied Science team is a self-sufficient, product development team that applies research and experimentation to fulfill its goals of developing and maintaining business-critical software systems. In short, you’ll be a full-stack applied data scientist and software developer. 3Play Media has diverse business and technical challenges related to media transcription, captioning, audio description, dubbing, subtitling, and translation. This team makes major contributions in many technical domains, including: - Software engineering - Systems engineering - Machine learning - Natural language processing - Generative AI - Signal processing - Applied probability and statistics - Data analysis - Controlled experimentation You’ll be expected to contribute to and learn about several of these domains. We are looking for a curious, communicative, collaborative, humble, self-sufficient, and flexible data scientist. The ideal candidate is energized by the craft of building efficient, reliable software systems alongside sensible, robust machine learning and AI models. You’ll approach engineering and data science as disciplines that are always evolving, and you actively seek out and weigh new approaches rather than settling into the familiar. Above all, you understand that this work is a team sport: you know that strong collaboration, durable relationships, and genuinely diverse perspectives aren't nice-to-haves but the foundation of doing it well. Qualifications - 2+ years full time working experience in signal processing, data science, or machine learning roles, including predictive modeling, data processing, and data analysis responsibilities - Extremely strong communication skills, both verbally and in writing - Strong familiarity with Python coding, utilizing industry standard libraries (e.g., Pandas, NumPy, Scikit-learn, Matplotlib/Seaborn) - Interest in presenting technical and business-oriented information to a variety of audiences, including senior management and customers - BS/ MS / PhD in Data Science, Engineering, Computer Science or a related field Requirements - Strong academic and professional experience with audio signal processing, speech synthesis, and computer vision - Experience writing, deploying, and maintaining Python and SQL code in a production setting - Experience building and maintaining production machine learning systems in a cloud-based platform, such as AWS SageMaker - Experience developing and maintaining custom deep learning models using pytorch and HuggingFace, especially models involving text, audio, and video data - Experience using third-party generative AI systems, including prompt engineering and fine-tuning - Experience in the accessibility and localization industries Benefits - Base salary range: $165k - $200k - Location: Remote — United States - Employment type: Full-time Company Description 3Play Media is a technology company making video accessible to everyone. Our captioning, transcription, audio description, and localization solutions serve over 10,000 customers — universities, media companies, enterprises, and government organisations across the US and Canada. We were founded at MIT Sloan in 2007 and have been building deliberately ever since. The team is small, fast-moving, and low on politics. The work you do here is visible from day one. Headquarters in Boston, MA with offices in Minneapolis, MN and Calgary, AB. 3Play Media is an equal opportunity employer. We are committed to building a diverse and inclusive team and encourage applications from candidates of all backgrounds.
Head of Data Operations
Greenlight PlanetPowering access to brighter lives in Africa, Asia, and beyond
- **Manage the Talent Pipeline:** Oversee the rotation curriculum and technical onboarding systems that transform entry-level graduates into high-impact Analysts and Analytics Engineers. You will empower your Data Operations Leads to execute this curriculum daily. - **Strategic Recruitment:** Lead the selection and onboarding of regional Data Operations Leads and oversee the data-driven recruitment engine used to filter high-volume campus applications and secure top-tier talent. - **Define Technical Standards:** Establish the Sun King Data Standard, ensuring every graduate masters SQL, LookML, and the business logic required for specialized roles. - **Service Accountability:** Set the global strategy for the Data Desk. While your regional Leads manage daily ticket assignments, you are responsible for overall SLA performance, quality of output, and the health of escalation pathways. - **Operationalize Quality:** Partner with the Data Product team to define high-level SOPs. You will guide your Data Operations Leads in testing these procedures to ensure they are realistic for the junior workforce. - **Systemic Automation:** Lead the transition of manual tasks into automated self-service tools by working with the Engineering Hub. Your goal is to provide the Data Product team with high-quality assets that reduce the support burden on the desk. - **Management of Regional Leads:** Provide mentorship and professional development to the Data Operations Leads. You will ensure they are utilizing their squads effectively and maintaining technical rigor across all cohorts. - **Hands-On Mentorship:** While your focus is on the system, you will remain close to the technical work. You will not build executive dashboards daily, but you will oversee their development and get involved in troubleshooting and fixing code when pipelines or logic fail. **Empowerment over Centralization:** Success is measured by the speed at which you graduate talent and how effectively you enable the Data Product team to deploy new tools through the Data Ops infrastructure.
• Own analytical problems end to end from problem framing and metric definition through data acquisition, modeling, validation, and stakeholder rollout • Build predictive and inferential models such as forecasts, propensity and uplift models, and survival or duration analyses, where they meaningfully improve a product or operational decision • Develop statistical methods and reusable analyses for measuring performance, including detection accuracy, intervention impact, and downstream operational outcomes • Partner with Product, Engineering, and Operations to define key metrics, implement new features, and translate business questions into well-scoped analytical work • Audit system outputs and provide structured feedback that improves product accuracy and reliability • Communicate findings through dashboards, written analyses, and stakeholder presentations that lead to concrete decisions • Help define best practices around data quality, experimentation, modeling, and reproducibility, and contribute to the team’s data contracts and SOPs



