UP.Labs logo
UP.Labs

Transforming the moving world, one startup at a time

Lead Data Science Engineer

Data EngineerData EngineerFull TimeRemoteLeadTeam 11-50H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

8 days ago

Salary

0

Seniority

Lead

Job Description

Lead Data Science Engineer

UP.Labs

Role Description We’re partnering with one of the world’s leading construction equipment manufacturers to reinvent how the industrial world builds, operates, and makes decisions. This is a rare chance to work shoulder-to-shoulder with a global market leader, tackling high-impact problems with technology and taking bold ideas from 0 to 1. As a Data Science lead on this venture, you’ll have the mandate and autonomy to turn raw concepts into real businesses - scoping, building, and launching software that can transform an industry. You'll act as the principal expert while leading data science for the venture. You will collaborate with your venture team to design and implement data pipelines, guide data science best practices, oversee data monitoring, and build AI/ML solutions. Your duties will include: - Assisting in validating the initial business concepts - Ideating and validating AI/ML use cases - Developing prototypes and proof of concepts - Working closely with the product and engineering team members to drive practical outcomes You’ll have the support you need to drive new innovation in impactful domains ripe for change and growth and be at the forefront of data science and technology advancements in the field. In this role, you will: - Act as the primary owner of Data Science, Analytics, and Data Engineering as a subject matter expert - Build out a team and incubate expertise amongst several venture start-ups - Create rapid proofs of concept, then scale into functional MVPs to turn concepts into tangible reality - Mentor and support early-stage venture teams to achieve bigger outcomes at a greater scale in data density and system complexity - Cross-pollinate learnings, best practices, and insights between multiple ventures to drive continuous improvement of Engineering and Data Science at UP.Labs - Enjoy working in a diverse, dynamic, collaborative, transparent, and inclusive environment where all ideas and opinions are equally valued Qualifications - 8 years experience within Data Science, Data Engineering, and Machine Learning domains and their practical applications - Experience working in a startup environment; member of a founding team ideal - Hands-on experience with machine learning algorithms like random forest, linear and logistic regressions, gradient boosting, classification - Familiarity and preference for working in ambiguous, fast-paced environments such as startups and growth-phase tech companies - Hands-on and end-to-end product build, development, and delivery experience - Experience working with or managing and leading remote, distributed teams including full-time data scientists, engineers and vendors/contractors - Awareness of the latest in Data Science and Data Engineering trends, as well as new use cases within the ML Space - Experience working with major cloud environments (Azure, GCP, AWS) and cloud-native software architectures - Experience with Database and Warehouse solutions, e.g. Data Bricks, Snowflake, Fivetran, DBT and respective public cloud data infrastructure service offerings from AWS, GCP, and Azure - Strong experience in collaborating with Product teams to find effective solutions - Strong communication skills put to use by explaining technical vision and deeply technical concepts to a variety of multidisciplinary team members - An open, curious, and humble mindset that builds on to our open, inclusive, and collaborative environment Additional Desired Competencies - Experience with systems planning in the domains of transportation, aviation, or digital simulation would be valuable - Containerized deployment Tools like Kubernetes, and Docker - Infrastructure as Code Tools like Terraform and OpenTofu - Familiarity with AB testing setup and analysis - Familiar with Reinforcement Learning for practical use cases Location Santa Monica, CA or Remote Travel 6+ weeks per year

Related Categories

Related Job Pages

More Data Engineer Jobs

Nimble Gravity logo

Senior Data Architect

Nimble Gravity

Data Science, Digital Transformation and eCommerce Strategy from experienced eCommerce and AI/ML experts

Data Engineer8 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Collaborate with cross-functional teams to understand business requirements and translate them into effective cloud-based data architecture solutions. • Design data models, data flow diagrams, and schema structures that optimize data storage, retrieval, and processing on cloud platforms. • Design and develop data integration strategies to consolidate data from diverse sources into a unified and coherent format. • Evaluate and select appropriate cloud services and tools for various data-related tasks. • Work closely with data engineers, data scientists, and other stakeholders to understand their needs and provide architectural guidance. • Responsible for overseeing the successful execution of data projects and ensuring the outcomes meet business objectives and technical requirements. • Lead and mentor junior team members, promoting knowledge sharing and continuous learning.

California + 1 moreAll locations: California | New York
Ceresti Health logo

Senior Data Engineer

Ceresti Health

Everyone else treats the patient. We activate the caregiver—because that’s where dementia care really begins.

Data Engineer8 days ago
Full TimeRemoteTeam 11-50Since 2013H1B No Sponsor

• Design and own Ceresti’s end-to-end data architecture: a landing zone with secure cloud object storage for raw partner files and API payloads, validated ingestion pipelines into our transactional Postgres, and a curated analytics layer that decouples reporting and AI workloads from production • Build ingestion pipelines for the data we receive today, including partner data files (CSV/JSON/XML/HL7/X12 as applicable) and REST/SFTP API integrations with schema validation, quarantine of bad records, and full lineage from raw bytes to curated row • Stand up and operate the curated layer (data warehouse / lakehouse-lite) so analytics and ML models can consume data without slowing down the transactional system • Choose, integrate, and operate the smallest set of tools needed, including object storage, an orchestrator (Dagster, Prefect, Airflow, etc.), dbt or similar for transformations, a single validation library (Great Expectations / Pandera / Soda) • Design and enforce data governance for a HIPAA-regulated environment: PHI/PII classification, encryption in transit and at rest, role-based access, audit logging, retention and minimum-necessary policies, and de-identification where appropriate • Partner with backend, ML, product, and clinical stakeholders to define data contracts with our health plan and ACO partners and hold the line on data quality • Build and maintain reliable feature data for ML models, including embeddings (e.g., pgvector) and curated feature tables for risk stratification, engagement, and outcomes work • Instrument the data platform for observability including pipeline SLAs, data freshness, schema drift, quality metrics, and act on what the data tells you • Participate fully in our Agile process: backlog grooming, sprint planning, demos, and retrospectives • Mentor engineers across the team on SQL, schema design, and the craft of building data systems that are boring in the best possible way

United States
Ocean Technologies Group logo

Data Engineering Team Lead

Ocean Technologies Group

Powering teams that deliver for people & planet, with maritime learning, crew and fleet management and GRC solutions

Data Engineer8 days ago
Full TimeRemoteTeam 201-500Since 2020H1B No Sponsor

• Lead a team of data engineers, ensuring alignment on goals, quality and delivery timelines. • Mentor and coach team members to support their technical and professional growth. • Drive engineering excellence by promoting best practices in coding, architecture, testing and observability. • Plan and manage team capacity, sprints and milestones to ensure predictable delivery. • Own the design, evolution and operation of ingestion and transformation pipelines on Apache Airflow and the analytical serving layer on Apache Druid. • Make architectural calls on concurrency, partitioning, memory sizing and cost — including JVM heap and direct-memory tuning on the Druid cluster. • Collaborate closely with DevOps on the Kubernetes / EKS platform that hosts our Druid and Airflow workloads. • Ensure robust data validation, reconciliation and verification so that reporting is trustworthy. • Collaborate with other Team Leaders, Development Managers, Architects and Product Owners to align engineering execution with business objectives. • Contribute to the evolution of development processes, CI/CD pipelines and DevOps practices. • Foster a culture of continuous improvement, innovation and knowledge sharing.

Philippines
Nuvitek logo

Data Engineer

Nuvitek

Speed Up True Modernization

Data Engineer8 days ago
Full TimeRemoteTeam 51-200Since 2012H1B No Sponsor

• Design, develop, and maintain scalable RAG/CAG pipelines for AI-powered applications • Build and optimize document ingestion workflows for structured and unstructured data sources • Manage and maintain vector stores to support semantic search and retrieval capabilities • Develop OCR processing pipelines for historical and modern document collections spanning 1781–2025 • Optimize retrieval performance, relevance tuning, and ranking strategies for LLM-based systems • Build reliable data pipelines that support integrations with large language models and AI services • Collaborate with engineers, UX teams, product owners, and stakeholders to deliver scalable AI solutions • Ensure data quality, integrity, security, and performance across ingestion and retrieval systems • Implement monitoring, logging, and troubleshooting for AI and data processing workflows • Contribute to architecture decisions, technical documentation, and engineering best practices • Participate in agile pod-based development teams and continuous improvement initiatives

United States
$115K - $125K / year
Job Closed