Data Scientist – Infrastructure Diagnostics
Location
California
Posted
41 days ago
Salary
$101.3K - $163.9K / year
Seniority
Senior
Job Description
Data Scientist – Infrastructure Diagnostics
Phaidra
• Utilize our existing data ingestion and delivery platforms to "teach" our models to understand the physical world, filling a critical expertise gap in the data center industry. • Multidisciplinary Diagnostic Analysis: Use telemetry tools to analyze sensor data across mechanical (chillers, pumps) and electrical (UPS, switchgear, power feeds) systems to identify "failure signatures" for our LLM-driven monitoring tool. • Refining the Logic Engine: Act as a primary user of our platforms, identifying gaps in our current mechanisms and collaborating with Engineering to influence future features and data quality. • Operational Insight Generation: Translate raw telemetry into the "SME-level" logic and directions used by our LLM tool to guide data center operators in real-time. • SME Development: Cultivate deep domain expertise in all facets of data center infrastructure. You will be expected to master the nuances of both mechanical and electrical dependencies to ensure our product reflects operational reality. • Customer Guidance: Move from shadowing peers to directly supporting customers, using our platform to provide clear, data-backed direction on complex problems. • Model Validation: Oversee pilot projects to test how our AI-driven SME tool interprets real-world stressors, ensuring the output is operationally realistic, accurate, and actionable. • Adaptability: Remain agile and proactive. As a member of a fast-moving team, you will encounter challenges and scopes not explicitly defined here; we expect you to lean in and solve them.
Job Requirements
- Educational Background: Bachelor’s degree in Mechanical Engineering, Electrical Engineering, Control Theory, or a related field that provides a foundation in physical systems and thermodynamics.
- Analytical Grit: A deep, innate interest in using data to diagnose how and why systems fail. You are a "tinkerer" who prefers solving real-world problems over theoretical research.
- Technical Proficiency: Strong Python skills and experience with data manipulation libraries (Pandas/NumPy) to perform custom analysis outside of standard tooling.
- Communication Mastery: Ability to explain complex diagnostic findings clearly and persuasively to both technical peers and non-domain stakeholders.
- Unbiased Problem-Solving: A proven ability to look at a problem without preconceived notions and figure out solutions either independently or via team collaboration.
- Alignment with Values: Demonstrated commitment to Transparency, Collaboration, and Ownership—especially in environments where reliability and learning from failure are paramount.
Benefits
- Fast-paced, team-oriented environment where your work directly shapes the company’s direction.
- We are a 100% remote company.
- Competitive compensation & meaningful equity.
- Outsized responsibilities & professional development.
- Training is foundational; functional, customer immersion, and development training.
- Medical, dental, and vision insurance (exact benefits vary by region).
- Unlimited paid time off, with a required minimum of 20 days per year.
- Paid parental leave (exact benefits vary by region).
- Flexible stipends to support your workspace, well-being, and continued professional development.
- Company MacBook.
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
Senior Statistician
RocheA healthier future drives us to innovate. Together, more than 100,000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact. Let’s build a healthier future, together. Roche is an Equal Opportunity Employer.
Role Description Genentech, Inc. seeks a Senior Statistician at its South San Francisco, CA location. Within biotechnology/pharmaceutical organization, develop and implement data-driven drug development strategies spanning all phases of clinical trials for life sciences. Responsibilities include: - Lead statistical design, planning, and analysis of clinical studies and trials through application of statistical methods including group-sequential and combination-test procedures, hierarchical testing strategies for biomarker subgroups, and event-driven analysis planning. - Prepare statistical methodology sections of clinical protocols, Statistical Analysis Plans, and regulatory briefing materials. - Monitor statistical integrity, adequacy, and accuracy in clinical studies settings within assigned therapeutic area. - Prepare Independent Data Monitoring Committee (IDMC) and Integrated Management Committee (IMC) charters, defining statistical decision rules, interim-data-review processes, and safety-monitoring boundaries. - Prepare, integrate, and interpret data used to support internal governance and regulatory decision-making, including comprehensive efficacy, safety, and pharmacokinetic summaries to inform dose-selection and progression decisions at key governance meetings. - Liaise with global health authorities and build collaborative partnerships with cross-affiliate stakeholders to support product development and related programs, processes, systems, and compliance initiatives. May telecommute 100% from any US location. Qualifications - Master’s degree in statistics or closely related field. - 3 years of experience as statistical scientist, biostatistician or a closely related role supporting clinical trials. Requirements - Applying principles of experimental design to plan and analyze clinical studies for diseases, ensuring appropriate control, randomization, and statistical validity, in conformance with global health authority requirements. - Statistical modeling and data modeling to evaluate relationships between treatment, biomarkers, and patient outcomes. - Applying statistical strategies for end-to-end drug development lifecycle. - Analyzing clinical trial data and summarizing efficacy and safety results. - Programming for statistical analysis using R and SAS. - Developing statistical analysis plans and study protocols. - Preparing data summaries and reports for data monitoring. Benefits - The expected annual salary for this position based on the primary location for this position of South San Francisco, California is $151,560 per year. - Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. - A discretionary annual bonus may be available based on individual and Company performance. - This position also qualifies for the benefits detailed at the link provided below. Company Description Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including but not limited to, discrimination on the basis of Protected Veteran status, individuals with disabilities status, and consistent with all federal, state, or local laws. If you have a disability and need an accommodation in relation to the online application process, please contact us by completing this form.
Data Scientist II – Big Data R&D, Identity Graph, KYC
SocureThe leading provider of digital identity verification and fraud solutions. Salesinfo@socure.com
• Contribute to the design and implementation of machine learning, data mining, statistical, and graph-based algorithms to analyze very large datasets for identity verification and anomaly detection. • Analyze large datasets to help develop and refine entity-resolution and identity-matching algorithms that drive Socure’s KYC and compliance solutions. • Build and maintain components of data-processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3). • Support senior data scientists with feature engineering, data exploration, error analysis, and A/B test setup for new models and signals. • Help evaluate new third‑party and internal data sources: profile data quality, design offline experiments, and summarize impact on coverage and model performance. • Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing. • Provide analytical support to compliance and regulatory product teams, including ad hoc investigations, simple dashboards, and data deep dives. • Communicate findings in a clear, structured way to peers and cross‑functional partners (Product, Engineering, Client Analysis), focusing on key insights and trade‑offs. • Work effectively in a fast‑paced, cross‑functional environment; demonstrate ownership of well-scoped tasks and follow through to completion.
• Query, curate, and analyze large, complex healthcare datasets • Generate insights that improve cancer care delivery • Apply statistical modeling, machine learning, clinical NLP, and large language model-enabled information extraction • Identify patterns in longitudinal data • Transform clinical notes into structured, research- and operationally useful variables • Build predictive models • Deliver actionable solutions for clinical, research, and decision-support workflows • Collaborate with administrative leaders, clinicians and IT specialists • Design and implement pilots for validation and optimization of models in real time • Produce technical documentation, visualizations and presentations • Lead projects with multiple work streams or complex tasks • Project management for complex tasks • Complete individual or collaborative projects independently • Provide key subject matter expertise and leadership in advancing the use of AI technologies in precision medicine and clinical decision support • Provide mentorship and guidance to more junior staff • Performs other related duties as assigned or requested
• Utilize advanced mathematical and statistical methods to solve business problems under the supervision of senior data scientists • Execute exploratory data analysis to validate and quantify trends identified by partner organizations. • Develop insights, visualizations, and other data artifacts that support broader data science initiatives • Wrangle and analyze large structured and unstructured datasets in a highly repeatable fashion • Demonstrate strong communication skills (both written and verbal) and a positive, solution-oriented attitude




