We power commerce through conversation by enabling brands to sell, market, and engage their customers—all via text.
Data Scientist - AI Evaluation
Location
United States
Posted
64 days ago
Salary
$225K - $280K / year
Seniority
Mid Level
Job Description
Data Scientist - AI Evaluation
Wizard
About Wizard Wizard is the top-performing AI Shopping Agent, delivering the best products from across the web with unmatched accuracy, quality, and trust. The Role We’re looking for a Data Scientist to own how we measure, understand and improve the accuracy of our AI agent. This role sits at the intersection of data science, machine learning and product and is focused on evaluation, experimentation and insight generation. You won’t be building models but you will make sure they work in real world scenarios. You will build the systems to measure what good looks like and partner closely with ML, AI Engineering and Product to continuously improve the agent’s performance. What You’ll Do - Define and evolve accuracy metrics across the full shopping experience (retrieval, ranking, recommendations and outcomes) - Design and run experiments to measure improvements and regressions - Build and maintain evaluation datasets, benchmarks and scoring frameworks - Translate ambiguous product questions into clear, measurable hypotheses and analysis - Partner with ML Engineers to validate model changes and guide iteration - Identify failure modes and edge cases and drive improvements through data - Create dashboards and reporting that make agent performance visible, trusted and actionable What Success Looks like - Clear, trusted accuracy metrics are consistently used across product and engineering - A robust automated evaluation framework exists for both offline and live experiments - Model and product changes are consistently measured before and after launch Ideal Background - 4-6+ years in Data Science, ML Evaluation or Applied AI or similar roles - Deep experience evaluating AI/ML systems (ranking, recommendations, LLMs, etc) - Strong experience with experimentation (A/B testing, causal inference) - Experience working on consumer products or user facing systems and exposure to marketplace or e-commerce systems - Ability to translate messy problems into structured analysis and metrics - Strong product mindset, you care about real user outcomes - Clear communication with the ability to influence across engineering and product Compensation & Benefits The expected base salary range for this role is $225,000 - $280,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities. In addition to base salary, Wizard offers: - Equity in the form of stock options - Medical, dental, and vision coverage - 401(k) plan - Flexible PTO and company holidays - Fully remote work within the United States - Periodic company offsites and team gatherings Wizard is committed to fair, transparent, and competitive compensation practices.
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
Our mission at Oura is to empower every person to own their inner potential. Our award-winning products help our global community gain a deeper knowledge of their readiness, activity, and sleep quality by using their Oura Ring and its connected app. We've helped millions of people understand and improve their health by providing daily insights and practical steps to inspire healthy lifestyles. Empowering the world starts with living our values and empowering our team. As a quickly growing company focused on helping people live healthier and happier lives, we ensure that our team members have what they need to do their best work — both in and out of the office. We’re seeking a Machine Learning Data Scientist to drive the development of next‑generation Large Physiology Models and other foundation models for wearable biosignals that power Oura’s health sensing features. You’ll contribute end to end by designing and validating deep learning approaches for large‑scale wearable time‑series and running rigorous evaluations on research and clinical datasets. You’ll partner with cross‑functional teams to translate evidence into shipped features and scientific publications. Your work will improve outcomes for Oura members and advance the state-of-the-art of physiological modeling from real‑world wearable data. This is a remote US role with a preference for candidates based on the East coast. We have offices in San Francisco, San Diego and Los Angeles for those who prefer hybrid or office settings. Oura employees in other major cities (like Boston and New York) occasionally gather informally at local co-working locations. What you will do: - Drive foundation models for wearable biosignals, setting standards for reusable representations, tooling, and evaluation adopted across multiple health features. - Design, implement, and validate deep learning approaches for large‑scale physiological time‑series (e.g., PPG, motion, temperature), including self‑supervised and multimodal pretraining; deliver adapters/probes for diverse downstream tasks. - Lead scientific publications from core modeling and validation work that advance understanding of human physiology and support Oura’s mission to improve health. - Build and operate scalable data generation, training, and inference pipelines to improve efficiency and reproducibility. - Lead collaborations with scientists, clinicians, engineers, and product teams to translate research advances into production features that bring value to Oura members. We would love to consider you for this role if you have: - PhD (ideally with industry internship experience), or MSc with 3+ years of industry experience, in machine learning, computer science, statistics, electrical engineering, or a related field; healthcare or health‑tech experience preferred. - Demonstrated success developing deep learning models for large‑scale time‑series or sensor data, ideally physiological or wearable signals, including foundation‑model and self‑supervised/multimodal approaches. - Advanced proficiency in Python and modern ML frameworks (e.g., PyTorch or Jax), with experience building scalable training and evaluation pipelines. - Strong grounding in probability, statistics, and experimental design; proven ability to design rigorous evaluations and interpret results from large research or clinical datasets. - Excellent scientific writing with a record of first‑author, peer‑reviewed publications in top machine learning and digital health venues. - Collaborative, highly autonomous working style; comfortable operating in an interdisciplinary, globally distributed team and communicating effectively with cross-functional partners. - Flexibility for occasional meetings across European time zones and willingness to travel for onsite meetings a few times per year. Benefits At Oura, we care about you and your well-being. Everyone here at Oura has a ring of their own and we are continually looking to improve employee health. What we offer: - Competitive salary and equity packages - Health, dental, vision insurance, and mental health resources - An Oura Ring of your own plus employee discounts for friends & family - 20 days of paid time off plus 13 paid holidays plus 8 days of flexible wellness time off - Paid sick leave and parental leave Oura takes a market-based approach to pay, which may vary depending on your location. US locations are categorized into tiers based on a cost of labor index for that geographic area. While most offers will be closer to the starting range, successful candidates' pay will be determined based on job-related skills, experience, qualifications, work location, internal peer equity, and market conditions. These ranges may be modified in the future. - Region 1 $151,300 - $178,000 - Region 2 $138,550 - $163,000 - Region 3 $127,500 - $150,000 A recruiter can determine your zones/tiers based on your US location. We are not considering candidates residing in the following states: Alaska (AK), Delaware (DE), Iowa (IA), Mississippi (MS), Nebraska (NE), South Dakota (SD), West Virginia (WV), and Wisconsin (WI) Oura is proud to be an equal opportunity workplace. We celebrate diversity and are committed to creating an inclusive environment for all employees. Individuals seeking employment at Oura are considered without regard to age, ancestry, color, gender (including pregnancy, childbirth, or related medical conditions), gender identity or expression, genetic information, marital status, medical condition, mental or physical disability, national origin, protected family care or medical leave status, race, religion (including beliefs and practices or the absence thereof), sexual orientation, military or veteran status, or any other characteristic protected by federal, state, or local laws. We will not tolerate discrimination or harassment based on any of these characteristics. We will work to ensure individuals with disabilities are provided reasonable accommodation to participate in the interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Disclaimer: Beware of fake job offers! We’ve been alerted to scammers posing as ŌURA recruiters, especially for remote roles. Please note: - Our jobs are listed only on the ŌURA Careers page and trusted job boards. - We will never ask for personal information like ID or payment for equipment upfront. - Official offers are sent through Docusign after a verbal offer, not via text or email. Stay cautious and protect your personal details. To all recruitment agencies: Oura does not accept agency resumes. Please do not forward resumes to our jobs alias, Oura employees, or any other organization's location. Oura is not responsible for any fees related to unsolicited resumes.
Data Science Intern (Summer 2026)
Hone HealthHone Health is a digital clinic dedicated to helping men achieve their best health by optimizing their hormones. Past jobs at Hone Health have offered a range of flexible jobs, inc
About Hone Hone is an online medical clinic at the forefront of transforming healthcare and enhancing longevity. We use cutting-edge scientific advancements to empower men and women to take control of their health and unlock their full potential. Our people are the heart of everything we do and drive our success. We approach every project through our brand values: - Fight for the Customer - Execute Ruthlessly - Communicate Candidly, Clearly, and Kindly - Collaborate Selflessly - Practice Calculated Risk-Taking - Maximize Joy and Gratitude Hone has been fully virtual from day one and will continue to be a remote-first employer. Our Ideal Candidate Our ideal candidate is a mission-driven, motivated multi-tasker who is invested in work that is fulfilling and impactful. They embrace change and tackle challenges with enthusiasm. They have an “all-in” disposition towards work, understanding that we are a fast-paced, high-growth organization with evolving priorities. They can excel at both independent tasks and collaborative work, leading with clear and candid communication. They exhibit humble leadership—the ability to drive initiatives forward while remaining excited about continuous learning and development opportunities. They feel strongly about being part of a team that advocates for people to live longer and better lives. The Role We are looking for a Data Scientist Intern to join our Data, Analytics & Machine Learning team for Summer 2026. In this role, you will work on high-impact problems across product, clinical, and marketing domains. You will partner with data scientists, engineers, and business stakeholders to translate ambiguous problems into data-driven solutions and deliver measurable impact. This is a hands-on role where you will contribute across the full data science lifecycle—from data exploration and experimentation to modeling and stakeholder communication. What You’ll Do - Analyze large, complex datasets to uncover trends, patterns, and opportunities for product and clinical improvement - Build and evaluate predictive models (e.g., classification, regression, forecasting) to drive decision-making - Design and analyze experiments (A/B tests) to measure the impact of product and growth initiatives - Partner with cross-functional teams to define problems, scope solutions, and deliver actionable insights - Develop dashboards and visualizations to communicate key metrics and findings to stakeholders - Collaborate with data engineers to ensure reliable data pipelines and scalable model deployment - Present results and recommendations to technical and non-technical audiences Minimum Qualifications - Currently pursuing a Bachelor’s or Master’s degree in Data Science, Computer Science, Statistics, Applied Mathematics, or a related field - Strong foundation in statistics, probability, and machine learning fundamentals - Proficiency in Python and common data science libraries (e.g., Pandas, NumPy, Scikit-learn) - Experience writing SQL queries to analyze and manipulate data - Strong problem-solving skills and attention to detail - Ability to communicate complex ideas clearly and effectively Preferred Qualifications - Experience with experimentation, A/B testing, or causal inference - Familiarity with machine learning frameworks (e.g., PyTorch, TensorFlow) - Experience with data visualization tools (e.g., Plotly, Power BI, Looker) - Exposure to working with large-scale or production datasets - Interest in healthcare, longevity, or personalized medicine Tools & Technologies You may work with: - Python (PySpark, Pandas, NumPy, Scikit-learn, PyTorch) - SQL (Spark SQL or similar) - Data transformation tools (e.g., dbt) - BI & visualization tools (e.g., Power BI) What You’ll Gain - Hands-on experience working on production-grade data science problems - Mentorship from experienced data scientists and exposure to industry best practices - Ownership of projects with real business and clinical impact - Experience working in a fast-paced, remote, cross-functional environment Compensation $20–$25/hour, 40 hours per week Additional Requirements - Must be able to work full-time (40 hours per week) during the internship - This is a remote role based within the United States; candidates must be located within the U.S. - This role is not eligible for current or future visa sponsorship (including CPT, OPT, or H-1B) Benefits* Hone wants our team to be in the best condition of their lives, so we offer a range of benefits including: - A remote-first work environment - Competitive compensation and equity options - Health, dental, and vision insurance coverage - Short-term disability and basic life coverage - Flexible Spending Accounts (FSAs) - Lifestyle Spending Accounts (LSAs) - We follow federal holidays and have uncapped time off - Budget for the technology tools you need (laptop, monitor, and/or special software) - A focus on company-sponsored activities to foster engagement (both virtual and in-person) - Waived membership fees for any Hone team members utilizing Hone products *These benefits are available to full-time, regular employees, and not to independent contractors, hourly or temporary employees, or interns. We are proud to be an equal-opportunity workplace committed to building a team culture that celebrates diversity and inclusion. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions. Please contact us to request accommodation.
• розробка та реалізація візії впровадження ШІ, що трансформує дані у вимірювану бізнес-цінність згідно зі стратегією розвитку ШІ у компанії • проєктування та впровадження автономних AI-агентів, інтегрованих у корпоративну екосистему (Tableau, SQL, Slack, CRM) • повний цикл виведення AI-прототипів у стабільний продакшн, включаючи архітектурний нагляд та контроль якості • побудова автоматизованих циклів оцінки (Eval Loops) для моніторингу точності, галюцинацій та перформансу моделей • оптимізація семантичного шару даних для забезпечення стабільної роботи Text-to-SQL та аналітичних систем • формування «культури лабораторії», де команда швидко тестує гіпотези та постійно еволюціонує разом із технологіями
About the company: Seek, an Eldridge company, is on a mission to connect analytics developers and consumers like never before - driving business value in minutes, not months. Insight Cloud, the analytics app store for your business, was created to make it easy for everyone to build, distribute, and consume data & analytics apps in the thriving cloud analytics economy. Insight Cloud delivers instant access to curated, ready-to-use analytics apps from the leading providers in the market, accelerating your speed to insights by 15X+ while eliminating the pain of building, hosting, and maintaining an entire analytics ecosystem on your own. Built for developers and consumers alike, Insight Cloud is not just another tool to work with data - it is the platform connecting leading analytics developers with innovative consumers - driving business value faster than ever. About the role: Seek’s Data Science team is central to our mission making advanced predictive analytics and machine learning applications accessible to businesses of all sizes. Your work will involve exploring customer and partner data sets, developing predictive models, and applying machine learning techniques to uncover trends, patterns, and insights that can generate significant improvements to industry-wide practices. You will collaborate closely with stakeholders to build and launch new analytics apps, ensuring that the insights generated are accessible through Seek’s app store. In this role you’ll get to: - Analyze large (and sparse) datasets to extract meaningful patterns, trends, and insights using advanced statistical methods and machine learning algorithms. - Construct and maintain data processing workflows, ensuring efficient mapping and transformation of client source data. - Perform data enrichment by sourcing additional datasets and integrating them with client data to enhance analytical depth and accuracy. - Develop complex SQL queries and Python scripts to transform and model data, tailoring configurations and fine-tuning parameters for optimal performance. - Translate business challenges into analytical frameworks, proposing and implementing data-driven solutions that have a tangible impact on our clients’ decision-making processes. - Create and maintain interactive dashboards that provide intuitive visualizations of data insights, enabling users to derive clear, actionable information. - Work closely with analytics engineers, other data scientists and business stakeholders to ensure that ML models are aligned with business goals and technical requirements. - Write clear and comprehensive documentation for ML models, pipelines, and workflows to facilitate collaboration and reproducibility. Qualifications: - 5+ years of experience in data science, with a proven track record of deploying and training machine learning models - Advanced proficiency in Python ML libraries such as Scikit-learn, TensorFlow, and PyTorch for regression, classification, clustering, and deep learning techniques - Strong understanding of gradient boosting frameworks such as XGBoost and LightGBM for building high-performance models - Strong expertise in data and feature engineering including data cleaning, normalization, transformation, and creating/selecting relevant features from raw data - Strong expertise working with large-scale datasets directly within cloud data warehouses such as Snowflake, DataBricks, or Redshift - Advanced SQL skills: joins, subqueries, window functions, common table expressions (CTEs), aggregations, complex string manipulations, stored procedures and functions - Bachelor's degree in data science, statistics, computer science, or a related field. We’re especially interested in you if you have: - Experience with SnowPark ML (Snowflake) for Python: DataFrame API, user-defined functions (UDFs), and ML Pipelines - ML performance optimization experience: efficient computation, parallel processing, hyperparameter turning, model pruning, and quantization - Experience with data visualization tools and techniques - Excellent problem-solving and creative thinking skills with ability to drive complex data analysis and development forward independently - Strong communication skills with the ability to present data and articulate complex technical concepts clearly and concisely Benefits: - Medical Insurance, Dental Insurance, PTO, 16 annual company holidays, 401K, Vision Insurance Tools we use: - SQL, Python ML, Scikit-learn, TensorFlow, PyTorch, Snowflake, DataBricks, Redshift Salary - The annual salary range for this position is between $145,000 - $175,000, depending on experience and qualifications. Location - This is an entirely remote position for candidates based in the US. No Visa Sponsorship Available - We do not offer visa sponsorship at this time. What Sets Us Apart Seek is committed to a culture that values ownership, flexibility, and collaboration. We prioritize work-life balance and encourage our team to explore new ideas, test hypotheses, and innovate continuously. You’ll join a diverse group of professionals dedicated to pushing the boundaries of business analytics development, with ample opportunities to grow alongside talented peers who value your expertise and vision




