Job Closed
This listing is no longer active.
Protege is an AI training platform committed to enabling the ethical sourcing of hard-to-find, multimodal, and real-world AI training data at scale. The company positions itself as
Forward Deployed Data Scientist
Location
United States
Posted
98 days ago
Salary
0
Seniority
Mid Level
Job Description
Forward Deployed Data Scientist
Protege
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description As a Forward Deployed Data Scientist (Healthcare Solutions Lead) in the Healthcare vertical, you will guide prospects and customers through the definition and delivery of healthcare datasets. Your job will be to understand what customers are building, identify the data that best fits their needs, and assemble and QA high-quality samples and final deliveries that meet their technical and conceptual specs. Along the way, you’ll ensure timelines and milestones are clearly communicated from the first stages of feasibility to the final data delivery. What You Will Own - Serve as the primary point of contact for customers, building long-term strategic relationships with them via collaboration around data and transparency around its delivery from Protege's network. - Lead end-to-end program management from data specification and preparation through QA and delivery, ensuring cross-functional coordination and on-time execution. - Work with Protege data partners to source cutting edge healthcare data into the Protege ecosystem. - Oversee the QA, packaging, and delivery of complex datasets (EHR, claims, radiology, pathology, unstructured text), ensuring HIPAA compliance in collaboration with privacy partners. Qualifications - Proven customer-facing experience: skilled at managing expectations, leading customer conversations, and delivering technical outcomes with clarity and confidence. - Bring an analyst-first mindset to challenges. You are an expert in using SQL and Python to query data to construct complex patient cohorts, analyze data readiness for model training, validate clinical coverage, and support other customer-specific needs. - Find satisfaction by bringing order to multiple simultaneous projects and masterfully juggle competing (and sometimes changing) priorities. - Deep expertise in various healthcare data modalities ranging from EHR, claims, radiology, pathology, and unstructured text. - Familiarity with privacy-preserving techniques of healthcare data. - Experience in healthcare AI, ML products, or enterprise data platforms. - Prior startup experience. - You treat those around you with kindness. Benefits - Be the connective tissue between Protege’s platform, our data, and our customers. - Build datasets that directly power the next generation of AI models. - Operate at the cutting edge of multimodal data — where human judgment meets machine intelligence.
Job Requirements
- Proven customer-facing experience: skilled at managing expectations, leading customer conversations, and delivering technical outcomes with clarity and confidence.
- Bring an analyst-first mindset to challenges. You are an expert in using SQL and Python to query data to construct complex patient cohorts, analyze data readiness for model training, validate clinical coverage, and support other customer-specific needs.
- Find satisfaction by bringing order to multiple simultaneous projects and masterfully juggle competing (and sometimes changing) priorities.
- Deep expertise in various healthcare data modalities ranging from EHR, claims, radiology, pathology, and unstructured text.
- Familiarity with privacy-preserving techniques of healthcare data.
- Experience in healthcare AI, ML products, or enterprise data platforms.
- Prior startup experience.
- You treat those around you with kindness.
Benefits
- Be the connective tissue between Protege’s platform, our data, and our customers.
- Build datasets that directly power the next generation of AI models.
- Operate at the cutting edge of multimodal data — where human judgment meets machine intelligence.
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
• Support the development of statistical and machine learning models, both supervised and unsupervised. • Prepare, clean, and transform datasets (ETL/ELT processes). • Perform exploratory data analysis (EDA) to identify patterns, trends, and opportunities for improvement. • Create dashboards and automated reports using Power BI, Python, or similar tools. • Participate in the development and maintenance of data pipelines in collaborative environments. • Document processes, code, analyses, and results to ensure traceability and knowledge sharing. • Collaborate with business teams to understand problems and translate needs into analytical solutions. • Contribute analyses and studies that support the company’s strategic decisions.
• Building strong relationships with cross-functional partners across Product, Design, Engineering, and Analytics to drive collaboration and innovation. • Contributing directly to launching new data-driven products to help the Airbnb guest and host community. • Writing software in Python, SQL, or R to model, simulate, and measure the impact of new product features for both guests and hosts. • Analyzing structured or unstructured data to uncover meaningful insights and craft actionable proposals. • Presenting findings and recommendations to leaders and stakeholders in a clear, compelling manner that drives informed data-driven decision making.
• Contribute to the development of a pricing guidance system for hosts. • Drive innovative pricing science through strategic initiatives, including early-stage and ambiguous 0-to-1 projects. • Collaborate with product and cross-functional teams to pioneer pricing strategies and guidance for hosts. • Develop foundational models and experimental approaches that balance supply and demand in the marketplace. • Craft compelling data narratives to surface actionable insights.
Data Scientist III
Grupo BoticárioCriamos oportunidades para a beleza transformar a vida das pessoas, e assim transformar o mundo ao nosso redor.
• Elicit technical requirements and propose efficient data-oriented solutions. • Define and implement data models to meet business requirements. • Develop Data Science solutions involving supervised and unsupervised models, predictive modeling, and exploratory analyses. • Execute all stages of the Data Science solution lifecycle: business understanding, feature/variable preparation, model development, and performance validation. • Assist in building and maintaining data engineering pipelines, ensuring scalability and security. • Design and implement model usage and monitoring practices together with business areas. • Implement systems and routines that ensure data quality and consistency. • Maintain technical documentation for implemented solutions. • Communicate complex insights simply and effectively to non-technical stakeholders. • Interact with professionals across different profiles and seniority levels in a dynamic, changing environment.


