Torc Robotics

Leading autonomous vehicle technology since 2007, Torc develops automated Level 4, Class 8 trucks with Daimler.

Senior Autonomy Data Engineer

Data EngineerData EngineerFull Time Remote SeniorTeam 501-1,000Since 2007H1B SponsorCompany Site LinkedIn

Location

Virginia

Posted

1 day ago

Salary

$160.8K - $193K / year

Seniority

Senior

Bachelor Degree6 yrs expEnglishAmazon Redshift AWS Cloud Python SQL Terraform

Job Description

• Own the design and organization of the program’s data lake, including schema definitions, partitioning strategy and metadata indexing. • Design and maintain end-to-end pipelines that ingest high-bandwidth sensor logs from vehicles into cloud storage with high reliability and tolerant of ad-hoc and intermittent connectivity mechanisms. • Develop data validation and integrity checks that can detect corrupted information, missing sensors, and inconsistent calibration prior to the data being processed by downstream systems. • Implement retention, tiering and lifecycle policies for data to balance storage costs with development value. • Build tooling to query raw logs to produce curated training and evaluation datasets. • Build automation to run cost-effective pseudo-labeling workflows at the scale of data ingest. • Implement data quality and model performance metrics that are used to direct labeling effort toward the highest-value examples. • Deploy and maintain data visualization tooling to support log review, annotation QA, and autonomy debugging workflows. • Build integrations between the visualization tooling and the data lake so engineers can navigate from a dataset entry or model failure directly to the origin log data • Work with autonomy engineers to define and surface custom visualization panels and implement metrics for analyzing unstructured operating environments. • Build dashboards that provide the autonomy engineers visibility into data coverage by terrain type, operating environment and geographic region. • Establish and document data contracts between the data services and model training consumers. • Partner with perception, planning and embedded engineers across the data lifecycle: from shaping the logging schemas and collection triggers to defining the dataset interfaces that supply model training and evaluation. • Define data engineering standards, best practices, and tooling choices for an innovative and fast-paced team. • Contribute to the data roadmap and provide input to technical leadership on investment priorities. • Mentor junior engineers and raise the team’s capabilities in data infrastructure scalability and operational hygiene.

Job Requirements

Bachelor’s degree in Computer Science, Computer Engineering, Software Engineering, Electrical Engineering or a related field with 6+ years of data engineering experience or a Master’s with 4+ years.
Strong proficiency in Python and SQL, with demonstrated ability to build production-quality data pipelines
Deep experience with cloud data infrastructure (AWS preferred: S3, Glue Athena, redshift, or equivalent) and infrastructure-as-code tools (Terraform, Cloud Formation).
Solid understanding of data partitioning strategies and columnar storage formats (Parquet, Orc, etc.)
Experience building and operating data pipelines that process time-series and binary data.
Proven ability to evaluate and integrate open-source tooling when appropriate versus building from scratch.
Strong instincts for delivering data quality through first-class implementations of monitoring, validation and lineage tracking.

Benefits

A competitive compensation package that includes a bonus component and stock options
100% paid medical, dental, and vision premiums for full-time employees
401K plan with a 6% employer match
Flexibility in schedule and generous paid vacation (available immediately after start date)
Company-wide holiday office closures
AD+D and Life Insurance

Related Categories

Data Engineer

Related Job Pages

Data Engineer Jobs in Virginia Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Analista de Engenharia de Dados, Júnior

Experian

We're unlocking the power of data to help create a better tomorrow.

Data Engineer1 day ago

Full Time RemoteTeam 10,001+Since 1996H1B Sponsor

Company Site LinkedIn

• Apoiar o desenvolvimento de soluções que utilizam IA/ML, como APIs inteligentes e automações. • Implementar e manter pipelines de dados para treinamento e consumo de modelos. • Auxiliar na integração de modelos de IA em aplicações e serviços. • Colaborar na construção de componentes reutilizáveis (APIs, serviços, bibliotecas internas). • Apoiar a automação de processos utilizando IA (ex: classificação, recomendação, análise de texto). • Trabalhar em conjunto com times de dados, MLOps e engenharia. • Contribuir com melhorias contínuas em performance, qualidade e confiabilidade das soluções. • Apoiar na monitoração básica de aplicações e resolução de incidentes.

AWS Cloud Docker NoSQL Pandas Python PyTorch Scikit-Learn Spark SQL Tensorflow

View details: Analista de Engenharia de Dados, Júnior

Brazil

Apply

Data Engineer – IT-Abteilung

DAMPSOFT GmbH

Die marktführende Abrechnungs- und Praxismanagement-Software vom Zahnarzt für den Zahnarzt.

Data Engineer1 day ago

Full Time RemoteTeam 201-500Since 1986H1B No Sponsor

Company Site LinkedIn

• Entwicklung und Betrieb von Datenpipelines und ETL-/ELT-Prozessen • Integration und Verarbeitung interner und externer Datenquellen • Nutzung von KI-gestützten Entwicklungstools • Sicherstellung des Betriebs der Datenplattform inklusive Monitoring und Fehleranalyse • Umsetzung technischer Anforderungen und Tickets • Analyse und Umsetzung fachlicher Anforderungen gemeinsam mit den Fachbereichen • Optimierung bestehender Datenprozesse und Datenlogiken • Durchführung von Deployments sowie Pflege technischer Dokumentationen • Mitarbeit bei Tests, Releases sowie der Einhaltung von Daten-, Sicherheits- und Governance-Vorgaben • Analyse und Lösung technischer Probleme im Datenplattformbetrieb

Cloud ETL Python SQL

View details: Data Engineer – IT-Abteilung

Germany

Apply

Senior Software Engineer – Data Platform

Samsara

Pioneer of the Connected Operations Cloud

Data Engineer1 day ago

Full Time RemoteTeam 1,001-5,000Since 2015H1B Sponsor

Company Site LinkedIn

• Design, build, and operate high-scale data ingestion and replication systems from Samsara’s primary production data stores, including RDS, DynamoDB, internal APIs, and event-driven systems, into our data lakehouse. • Build and maintain reliable, scalable, and modern data platform infrastructure capable of handling petabytes of data across Samsara’s analytics, AI, product, and operational use cases. • Improve the reliability, observability, scalability, security, and developer experience of Samsara’s Spark and Databricks-based data processing platform. • Develop internal libraries, APIs, frameworks, and tooling in languages such as Go and Python to help teams across Samsara move, process, discover, and access data safely and efficiently. • Work on foundational data lake and lakehouse technologies, including Delta Lake on S3, data catalogs, metadata services, orchestration systems, and platform automation. • Collaborate closely with infrastructure, product engineering, data science, analytics, security, and data engineering teams to understand platform needs and deliver durable, scalable solutions. • Stay connected to modern data platform technologies and help shape Samsara’s long-term data infrastructure roadmap, including support for AI, privacy, security, global scale, and customer-facing data products. • Champion, role model, and embed Samsara’s cultural principles as we scale globally and across new offices.

Apache AWS Distributed Systems DynamoDB Java Python Scala Spark Go

View details: Senior Software Engineer – Data Platform

United States

$130.9K - $220K / year

Apply