Data Scientist - Research & Analytics

Location

India

Posted

84 days ago

Salary

0

Seniority

Mid Level

Job Description

Data Scientist - Research & Analytics

Pratham International

Role Description Pratham International, through its wholly-owned subsidiary in India (PULS Pvt. Ltd.), is looking to hire a Data Scientist to join our Research & Analytics team, focused on using data-driven insights to inform programmatic decisions and tackle complex social challenges. Candidates should have strong foundations in statistics, data science, and Python programming and a passion for applying these skills to education and development contexts. This role involves designing and executing rigorous analyses, working across teams to translate needs into tractable data problems, and communicating findings in clear, actionable ways to diverse stakeholders. As a Data Scientist, you will: - Manage project-level data operations to ensure timely and accurate reporting. - Undertake regular analyses to inform program design, monitoring, and decision-making. - Apply statistical and machine learning methods to identify patterns, trends and predictors of educational outcomes. - Design and evaluate predictive models, with attention to fairness, interpretability and program relevance. - Lead data wrangling and cleaning, including SQL queries, API calls, survey datasets. - Ensure data quality, integrity, and documentation for reproducibility. - Build dashboards, reports, and visualizations (Excel, PowerBI, Tableau, or Python-based tools). - Translate technical findings into clear, accessible insights for program, content, and management teams. - Partner with AI/ML team where advanced modeling or deployment is required. Qualifications - Bachelor’s or higher degree in Data Science, Computer Science, Statistics, Research Methodologies or a related field. - Proficiency in Python with strong command of standard data science packages (numpy, pandas, scikit-learn, matplotlib, etc.). - Strong foundations in statistics and probability. - Ability to implement supervised and unsupervised ML methods, and key components of the data pipeline (cleaning, feature selection, cross-validation, parameter tuning). - Working knowledge of SQL. Requirements - Experience working on Data Science projects using machine learning and deep learning. - Orientation towards research. Benefits - Impact at a Global Scale: Your work will directly support the learning journeys of millions of children. - Unique Technical Challenges: Solve "last-mile" problems rarely addressed in the commercial sector. - A Culture of Growth and Evidence: Value rigorous testing and continuous learning. - High-Calibre, Mission-Driven Team: Work alongside a global team of experts from prestigious institutions. - Inclusive and Collaborative Environment: Join a diverse team that values innovative thinking and local expertise. Employment Details - Contract Type: Full-Time Employment - Date of Joining: Immediate; applications reviewed on a rolling basis. - Work from Anywhere in India: Flexible, remote-first work environment with opportunities to travel to field locations. - Competitive Remuneration: Salary benchmarked against industry standards for AI/ML roles, commensurate with experience.

Related Categories

Related Job Pages

More Data Scientist Jobs

Data Scientist84 days ago
OtherRemoteTeam 51-200H1B No Sponsor

• Lead an elite team of data scientists • Define and direct the research frameworks and model architectures for clinical risk identification and value attribution • Architect a multi-year roadmap for research into clinical performance • Act as a primary consultant for a majority of the management team • Translate complex technical study designs into clear narratives for the management team • Grow and empower a team of data scientists • Drive the enhancement of Pearl’s predictive product capabilities

New York + 2 moreAll locations: New York | Massachusetts | Washington
$210K - $250K / year
Job Closed
OtherRemoteTeam 51-200Since 2009H1B Sponsor

At Sage Bionetworks, we believe in the power of open science and interdisciplinary collaboration to create a new vision of human health. We strive for actionable biomedical insights through the responsible sharing and reuse of data, all guided by our deep scientific expertise and diligent governance controls. We're a dynamic, adaptable team, and at Sage, we highly value individuals who can flexibly navigate various roles and functions. Diversity in experiences, backgrounds, and identities is essential for our rich culture of learning and collaboration. Our partnerships, both within Sage and with external communities, are founded on principles of trust, transparency, and a commitment to growth. Together, we are leaving a lasting imprint on the fields of science and medicine. We are hiring Biomedical Data Managers to support open science efforts across two critical initiatives at Sage Bionetworks: the Rare Disease team, empowering community-driven efforts such as the Neurofibromatosis Open Science Initiative and related data initiatives that facilitate collaborative sharing of rare disease data—and the Advanced Data Analytics team, which applies a variety of computational approaches to accelerate re-use of shared biomedical data for various disease domains. As a biomedical data manager, you will coordinate and curate multimodal biomedical datasets alongside scientists and investigators, applying the FAIR (Findable, Accessible, Interoperable, Reusable) principles to enable broad sharing and reuse. The role will involve working with external researchers (biologists and clinician scientists) to guide data ingestion into data hubs, contributing to data model development, determining appropriate annotations for specialized biomedical data types and applying them, and providing input on data governance workflows that support discovery-driven science. You will also engage internally with a variety of roles—including bioinformatics engineers, project managers, governance experts, and community coordinators—to promote team science and open collaboration. We’re looking for motivated professionals who enjoy interacting with researchers across diverse domains, have a passion for open science and data transparency, and appreciate the essential impact that careful curation, annotation, and modeling have on the usability and scientific value of shared biomedical data. Job Summary: As a Professional Biomedical Data Manager, you will leverage your skill set to manage and coordinate data management efforts with some latitude for independent judgment. You will support data collection, maintenance, and sharing efforts, ensuring the accuracy and completeness of scientific data. Key Responsibilities: - Data Standards and Plans: Develop standards, plans, and tools for storing, describing, and sharing heterogeneous data, including clinical, genomic, and imaging datasets. Create data management plans to deliver accurate, timely, and consistent scientific data for the rare disease and advanced data analytics research communities. - Collaboration and Coordination: Collaborate with project managers, scientists, research governance experts, and bioinformatics engineers to coordinate data-sharing efforts. Work with scientific research and operations teams to ensure data and metadata are compliant with data modeling standards - Data Management and Maintenance: Build, review, and maintain scientific databases, data collection tools, and applications. Facilitate data sharing by developing tools, writing documentation, curating data, and maintaining project websites or data portals through which scientific resources are disseminated. Work with research governance experts to implement legal and ethical frameworks for data sharing. - Community and Support: Support a community of researchers by onboarding data contributors to tools and procedures for sharing data, and conducting data management office hours. Develop standard operating protocols and streamline workflows. - Data Analysis and Reporting: Compile, analyze, clean, and validate scientific data. Identify and resolve discrepancies and issues with scientific data. Develop data management reports and monitor study status to ensure timelines are met. We’d Love to Hear from You If You: - Hold a bachelor’s degree in public health, library science, genetics, neuroscience, health informatics, health information management, or other relevant areas. - Have 2+ years of experience in biomedical data management or a related area. - Are proficient in data management tools and techniques. - Have experience with biomedical data models and data management tools. - Have experience working on multidisciplinary teams. - Maintain strong organizational and time-management skills. - Have clear, effective, and inclusive verbal and written communication skills. - Recognize the importance of applying critical thinking and thoughtful practices in your role and business approach, striving to ensure inclusivity, accessibility, and relevance for everyone, crafting solutions that cater to a diverse range of needs. - Experience managing complex multi-omic data types (including transcriptomics, whole genome sequences, proteomics, metabolomics, epigenetics, and clinical/phenotypic data). Preferred skills: - Hold a Master’s or PhD in public health, library science, computational biology, neuroscience, genetics, health informatics & health information management, or other relevant area. - Experience with Jira for project tracking and cross-team communications. - Ability to create, refine and update ontology data models for human and model systems (xenografts, organoids, cell systems etc). - Have familiarity with R or Python and collaborative development and version control systems (e.g., git). - Curiosity towards, and critical assessment of AI tools to support all aspects of a technical role at Sage, from co-pilots for software development, to aids to data analysis and interpretation - Experience with metadata harmonization. - Experience presenting your work for a scientific audience. - Familiarity with AWS or other cloud resources is highly desirable. - Familiarity with ontology databases and management. - Familiarity with multiple disease domains like rare disease, neurodegenerative disease, cancer, immunology. Job Functions and Physical Requirements The following cognitive and physical activities outline the essential functions required for successful job performance. Reasonable accommodations may be provided to enable individuals with disabilities to fulfill these functions. Physical Demands - Extended periods of sitting at a desk and using a computer. - Repeated wrist, hand, and finger movements. - Requires clear vision for tasks like data analysis, transcribing, computer use, and reading. - Occasionally lifting or moving objects weighing up to 10 pounds. Cognitive Demands - Able to work effectively under deadlines. - Strong problem-solving and analytical skills. - Attention to detail and accuracy. - Flexibility to adapt to changing environments, priorities and multitask. Additional Requirements - Travel is required a minimum of two times per year for on-site events in Seattle, WA. - Effective verbal and written communication skills. - Works well in a team and maintains professionalism. - Follows company policies, procedures, and relevant laws and regulations. Note: This list is not exhaustive and may be subject to modification as job duties evolve. Compensation & Total Rewards Sage Bionetworks implements equitable workplace strategies to ensure fair pay. Actual compensation is determined by market data, specific experience, and internal parity. - Job Level: Professional - Annual Salary Range: $93,300 - $121,700 - Note: New hires typically receive an offer between the minimum and the midpoint of the range to allow for future growth within the role. Higher offers may be considered for exceptional experience or specific market conditions. Comprehensive Benefits: We offer a package competitive with both commercial biotech and nonprofit sectors: - Health & Wellness: Comprehensive medical, dental, vision, life, AD&D, and long-term disability. - Future Security: Robust retirement plan and flexible spending accounts (FSA). - Work-Life Harmony: Paid time off and flexible work arrangements. - Learn more at: Benefits – Sage Bionetworks Equal Opportunity & Inclusion Sage Bionetworks is an Equal Opportunity Employer. We prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Accommodation Request: If you require a reasonable accommodation during the application or interview process, please contact hr@sagebase.org. Modern Hiring & Responsible AI We value your talent and your time. To ensure a fair and inclusive experience, we operate with the following principles: - Responsible AI in Recruiting: To help us find the best talent, we may utilize AI tools within our recruiting platforms. We use these technologies responsibly to avoid the pitfalls of bias and discrimination. We believe technology should enhance human judgment, not replace it, and we regularly audit our processes to ensure every candidate is evaluated fairly based on their unique skills. - Work from (Almost) Anywhere: We are a distributed workforce and support remote or hybrid arrangements within the United States. - Virtual-First Interviews: All interviews are conducted virtually to ensure accessibility and flexibility for all candidates. About Sage Bionetworks: Science for the Common Good At Sage Bionetworks, we believe the greatest barriers to medical progress aren't just biological—they are structural. Since 2009, our Seattle-founded nonprofit has been on a mission to tear down the "silos" of traditional science. We don’t just conduct research; we redefine how it’s done through open science, radical collaboration, and ethical data sharing. When you join Sage, you aren't just taking a job; you’re joining a bold collective of scientists, engineers, governance experts, and visionaries. We empower patients to be partners, not just data points, and we provide the platforms that allow the global scientific community to accelerate life-saving discoveries. The Sage Way: Our Core Values We don’t just talk about change; we live it through these five pillars: - Science Driven: We use data and evidence to ensure our work creates a measurable, positive impact on human health. - Accountable: We deliver on our promises through a foundation of trust and radical transparency. - Growth Oriented: We stay curious, seek out the hardest challenges, and constantly aspire to improve ourselves and our field. - Empathetic + Inclusive: We embrace our differences and build environments where every voice is empowered. - Radically Collaborative: We believe the best solutions come from teamwork and building diverse, global communities.

United States
Torc Robotics logo

Staff, Data Scientist - Data Operations & Enrichment

Torc Robotics

Leading autonomous vehicle technology since 2007, Torc develops automated Level 4, Class 8 trucks with Daimler.

Data Scientist84 days ago
OtherRemoteTeam 501-1,000Since 2007H1B Sponsor

About the Company At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business. A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight. Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer. Meet the Team: We are seeking an experienced Staff Data Scientist to join our Data Operations & Enrichment team. Our team is responsible for identifying and coordinating the long-term data needs for the company and ensuring efficient use of quality datasets across the organization. This includes dataset profiling, data retention strategies, integration and partnership opportunities, and conducting data collection campaigns that enable product development. What You’ll Do: - Lead and actively participate in conceptualizing, designing, and deploying cutting-edge machine learning and data science models, algorithms, and predictive analytics to address intricate challenges in autonomous vehicle technology. - Collaborate closely with multidisciplinary teams to identify opportunities for data-driven solutions, utilizing your data science expertise to optimize our self-driving truck systems and refine operational processes. - Analyze extensive datasets to unveil significant trends, patterns, and insights, leveraging sophisticated statistical techniques and data manipulation. - Partner across engineering departments including Data Engineering and ML Ops teams to develop and manage robust data pipelines that ensure data integrity, reliability, and consistency throughout the data lifecycle, supporting the advancement of our self-driving truck capabilities. - Provide mentorship and guidance to junior data scientists and data analysts, fostering skill development and knowledge sharing that uplifts the entire team. - Present complex findings and insights in a clear, compelling manner to technical and non-technical stakeholders, influencing strategic decisions and driving the progress of autonomous vehicle technology. - Remain at the forefront of industry trends, emerging technologies, and best practices in data science and machine learning, proactively applying your knowledge to elevate existing methodologies. What You’ll Need to Succeed: - Bachelor’s, Master's, or Ph.D. degree in Computer Science, Data Science, Statistics, Mathematics, or a related field. - 8+ years of professional experience in data science or related roles, demonstrating hands-on experience in developing and deploying machine learning models within real-world applications. - Proficiency in pertinent programming languages such as Python, along with expertise in utilizing data analysis libraries (e.g., NumPy, pandas, scikit-learn). - Strong grounding in statistics and mathematics, encompassing skills in hypothesis testing, regression analysis, clustering, and time series analysis. - A proven track record of successfully leading and executing intricate data science projects, spanning problem formulation, data preprocessing, model selection, and evaluation. - Expertise in diverse machine learning techniques, including supervised and unsupervised learning, deep learning, natural language processing, and reinforcement learning. - Proficiency in working with big data technologies like Hadoop, Spark, or other distributed computing frameworks. - Demonstrated experience processing large volumes of sensor data and applying time series data analysis techniques. - Deep understanding of SQL querying in data warehouses and experience leveraging business intelligence tools (e.g. Tableau, PowerBI, AWS Quicksight, Ploty, Matplotlib) for data visualization. - Excellent analytical and problem-solving prowess, coupled with the ability to approach challenges with inventiveness and creativity. - Excellent communication skills to effectively convey technical findings and concepts to both technical and non-technical stakeholders. - Adaptability to thrive within a fast-paced, collaborative environment, managing multiple priorities and deadlines with finesse. - Excellent interpersonal, verbal, and written communication skills to build trust and strong working relationships, effectively create and proofread documents and reports, and communicate to a diverse workforce. - Keen attention to detail to identify problems and processes that don’t comply with protocol. - Critical/logical thinking to identify problems and provide solutions to ensure efficiency, safety, and quality. - Excellent business insight and judgment, team orientation and collaborative style. - Keen time management and organizational skills to plan, develop, coordinate resources, prioritize effectively, and maintain competing demands simultaneously with frequent interruptions and in a fast-paced environment. - Ability to ethically handle sensitive and confidential information with impartiality and professionalism. Bonus Points! - Past work experience in the automotive ADAS or Autonomous Vehicle domain. - Past work experience with AWS Lamba and SageMaker services. - The ability to write complex SQL queries and work effectively in any functional programming language for numerical analysis purposes. - Solid adherence to data driven development and design, with an experimental and analytical approach to product improvement. At Torc, we’re committed to building a diverse and inclusive workplace. We celebrate the uniqueness of our Torc’rs and do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, veteran status, or disabilities. Even if you don’t meet 100% of the qualifications listed for this opportunity, we encourage you to apply. Our compensation reflects the cost of labor across several geographic markets. Pay is based on a number of factors and may vary depending on job-related knowledge, skills, and experience. Torc's total compensation package will also include our corporate bonus and stock option plan. Dependent on the position offered, sign-on payments, relocation, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. Job ID: R-102496 Hiring Range for Job Opening US Pay Range $186,200—$223,400 USD

United States
$186K - $223K / year
Job Closed
Affirm logo

Analytics Lead, Full Stack

Affirm

Affirm is a financial services company that is on a mission to provide its customers with “honest financial products that improve lives.” As an employer, Af

Data Scientist84 days ago

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. We are seeking a highly skilled and action-oriented Analytics Lead to join the People Analytics organization and drive the next evolution of our data infrastructure, modeling, and reporting capabilities.The People Analytics team builds and maintains the data foundations that power key talent programs across Affirm, including Talent Acquisition, Total Rewards, Feedback & Development, and core employee data. The team drives the design, modeling, and reporting of high-quality, trusted people data, partnering closely with the business to deliver insights and data products that enable informed decisions. In this role, you will serve as the technical lead for our people data ecosystem—designing scalable architecture, setting engineering standards, and partnering across the business to deliver high-value, trusted data products. The ideal candidate brings strong analytical engineering expertise, a strategic mindset, an innate sense of curiosity, and the ability to lead through influence. You will play a key role in shaping our data roadmap, introducing modern tooling and automation, uplifting engineering best practices, and mentoring other analytics engineers. Join our team and help build the next generation of People Analytics at Affirm. What You'll Do: - Design and deliver relational and non-relational database models, data pipelines, reporting, and visualization solutions while supporting all phases of the analytics development life cycle (ADLC), including requirements gathering, design, development, testing, and deployment. - Develop, maintain, and scale robust ETL/ELT pipelines across HR data sources (e.g., Workday, Greenhouse Recruiting, internal tools such as Arbor), ensuring reliability, performance, and extensibility. - Architect and implement scalable data models optimized for analytical querying and long-term maintainability. - Ensure data quality, integrity, and reliability across all data assets, introducing automation and best practices for monitoring and validation. - Collaborate with People Analytics stakeholders to translate requirements into sound technical solutions and influence longer-term data architecture decisions. - Manage and optimize cloud data warehouse infrastructure (e.g., Snowflake), including performance tuning, cost management, and secure access patterns. - Leverage AI and LLMs to automate data quality checks, enhance metadata management, and extract insights from unstructured HR data. - Stay current on technology best practices and advocate for engineering excellence across the People Analytics team. - Own and manage data governance, security, privacy, and retention standards across all People Analytics systems. What We Look For: - 5+ years and expertise with dbt (Data Build Tool), SQL and Python required; including: - Experience writing clean, computationally efficient code involving ETL processes and data manipulation via dbt, SQL, and Python. - Demonstrated ability to design and build efficient, analytics-ready data models in dbt, transforming raw or unstructured data into well-defined marts. - Comfort with production level IDEs (e.g., Cursor, Visual Studio) and Version Control (e.g., git, specifically GitHub). - Experience using standard Python analysis packages (e.g., Pandas, NumPy). - Experience with the following required: - Snowflake or other cloud data warehouse. - Sigma or other modern BI platform. - Fivetran or similar integration platform for integrating structured, unstructured, or unclear data formats. - Airflow or other orchestration platform. - Experience with the following preferred: - Leveraging LLMs for data transformation or analysis; MLOps. - Data lakes and/or Iceberg table format. - Custom data application development using Python or similar language (i.e. Streamlit dashboards, Slackbots). - Additional Qualifications: - Strong sense of ownership, intellectual curiosity, and the ability to think creatively and critically in a dynamic, fast-paced, and ambiguous environment. - Demonstrated ability to provide technical leadership, influence cross-functional partners, and mentor other team members. - Excellent communication skills and comfort translating technical topics for non-technical audiences. Pay Grade - M Equity Grade - 8 Employees new to Affirm typically come in at the start of the pay range. Affirm focuses on providing a simple and transparent pay structure which is based on a variety of factors, including location, experience and job-related skills. Base pay is part of a total compensation package that may include equity rewards, monthly stipends for health, wellness and tech spending, and benefits (including 100% subsidized medical coverage, dental and vision for you and your dependents.) USA base pay range (CA, WA, NY, NJ, CT) per year: $180,000 - $230,000 USA base pay range (all other U.S. states) per year: $160,000 - $210,000 #LI-Remote Affirm is proud to be a remote-first company! The majority of our roles are remote and you can work almost anywhere within the country of employment. Affirmers in proximal roles have the flexibility to work remotely, but will occasionally be required to work out of their assigned Affirm office. A limited number of roles remain office-based due to the nature of their job responsibilities. We’re extremely proud to offer competitive benefits that are anchored to our core value of people come first. Some key highlights of our benefits package include: - Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents - Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses - Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge - ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount We believe It’s On Us to provide an inclusive interview experience for all, including people with disabilities. We are happy to provide reasonable accommodations to candidates in need of individualized support during the hiring process. [For U.S. positions that could be performed in Los Angeles or San Francisco] Pursuant to the San Francisco Fair Chance Ordinance and Los Angeles Fair Chance Initiative for Hiring Ordinance, Affirm will consider for employment qualified applicants with arrest and conviction records. By clicking "Submit Application," you acknowledge that you have read Affirm's Global Candidate Privacy Notice and hereby freely and unambiguously give informed consent to the collection, processing, use, and storage of your personal information as described therein.

United States
$160K - $230K / year
Job Closed