tvScientific

Performance TV Advertising Platform

Staff Data Engineer

Data EngineerData EngineerOther Remote LeadTeam 51-200Since 2020H1B No SponsorCompany Site LinkedIn

Location

California

Posted

96 days ago

Salary

$155.6K - $320.3K / year

Seniority

Lead

Bachelor Degree5 yrs expEnglishAWS Scala Apache Spark

Job Description

• Design and maintain a scalable identity resolution platform • Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources • Ensure deterministic and probabilistic matching logic that is transparent, auditable, and measurable • Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets • Build and operate batch and streaming pipelines using modern data stack tools • Create clear documentation, standards, and runbooks for identity and governance systems • Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls • Implement privacy-by-design principles (PII handling, consent enforcement, retention policies) • Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA) • Establish monitoring and alerting for data quality, freshness, and integrity

Job Requirements

Data engineering experience with proven track record building data infrastructure using Spark with Scala
Proven experience building data infrastructure using Spark with Scala for at least 5 years
Experience in delivering significant technical initiatives and building reliable, large scale services
Experience in delivering APIs backed by relationship-heavy datasets
Experience implementing data governance practices, including data quality, metadata management, and access controls
Strong understanding of privacy-by-design principles and handling of sensitive or regulated data
Familiarity with data lakes, cloud warehouses, and storage formats
Strong proficiency in AWS services
Successful design and implementation of scalable and efficient data infrastructure
High attention to detail in implementation of automated data quality checks
Effective collaboration with cross-functional teams
Excellent written and verbal communication skills
Bachelor's degree in Computer Science or a related field.

Benefits

Information regarding the culture at Pinterest and benefits available for this position can be found here.

Related Categories

Data Engineer

Related Job Pages

Data Engineer Jobs in California More Remote Jobs

More Data Engineer Jobs

Senior Data Engineer, DBT, Airflow, Snowflake

Sunscrapers

Top Python Developers

Data Engineer96 days ago

Full Time RemoteTeam 11-50Since 2010H1B No Sponsor

Company Site LinkedIn

• Modeling datasets and schemes for consistency and easy access, • Design and implement data transformations and data marts, • Integrating third-party systems and external data sources into data warehouse, • Building data flows for fetching, aggregation and data modeling using batch pipelines.

Airflow Python SQL

View details: Senior Data Engineer, DBT, Airflow, Snowflake

Poland

Apply

Job Closed

Data Engineer, Azure Databricks

Tech Mahindra

Data Engineer96 days ago

Contract RemoteTeam 10,001+H1B Sponsor

Company Site LinkedIn

• Diseñar e implementar pipelines ETL/ELT escalables utilizando Azure Databricks y PySpark • Desarrollar soluciones de transformación de datos robustas usando Python y SQL • Construir y mantener notebooks en Databricks para procesos de ingestión y transformación • Orquestar flujos de datos mediante Azure Data Factory • Gestionar la gobernanza y control de accesos utilizando Databricks Unity Catalog • Crear y administrar Databricks Jobs para cargas productivas • Trabajar con soluciones de almacenamiento en Azure (Data Lake, Blob Storage, etc.) • Optimizar queries y pipelines para mejorar performance y escalabilidad • Garantizar la calidad, consistencia y confiabilidad de los datos • Colaborar con equipos de analytics y negocio para habilitar soluciones data-driven • Documentar procesos técnicos, arquitectura y despliegues • Utilizar herramientas de AI para optimizar desarrollo, debugging y productividad técnica

Azure ETL PySpark Python SQL Unity

View details: Data Engineer, Azure Databricks

Mexico

Apply

Job Closed

Data Engineer I

TransUnion

TransUnion is a global information and insights company that makes trust possible by ensuring that each consumer is reliably and safely represented in the marketplace. We do this by having an accurate and comprehensive picture of each person. This picture is grounded in our legacy as a credit reporting agency which enables us to tap into both credit and public record data; our data fusion methodology that helps us link, match and tap into the awesome combined power of that data; and our knowledgeable and passionate team, who stewards the information with expertise, and in accordance with local legislation around the world. Because of our work, organizations can better understand consumers in order to make more informed decisions, and earn their trust through great, personalized experiences, and the proactive extension of the right opportunities, tools and offers. In turn, consumers can be confident that their data identities will result in the opportunities they deserve. We make trust possible, so businesses and consumers can transact with confidence and achieve great things. We call this Information for Good®—it’s our purpose, and what drives us every day.

Data Engineer96 days ago

Full Time RemoteTeam 10,001+Since 1968H1B Sponsor

Company Site LinkedIn

• Be part of an autonomous, cross-functional agile/scrum team • Drive building the next generation suite of products and platform • Designing, coding, building, and deploying highly scalable and robust solutions • Work on core services of big data solutions by streamlining design • Collaborate with the team to build orchestration platform in cloud

GCP Java Python Scala Apache Spark SQL

View details: Data Engineer I

Costa Rica

Apply

Job Closed

Data Engineer

Iluma Alliance

Data Engineer96 days ago

Full Time RemoteTeam 501-1,000H1B No Sponsor

Company Site LinkedIn

• Diseñar y construir pipelines de datos: Diseñar, construir y mantener pipelines para la ingestión, limpieza, transformación y carga (ETL/ELT) de datos desde múltiples fuentes hacia almacenes de datos y plataformas analíticas. • Administración de la infraestructura y almacenamiento de datos: Configurar y administrar bases de datos, sistemas de almacenamiento y herramientas de procesamiento. • Optimización de rendimiento y eficiencia: Identificar y resolver cuellos de botella en pipelines y sistemas de almacenamiento. • Gestión de metadatos y documentación: Desarrollar y mantener mecanismos de catalogación y documentación para conjuntos de datos, pipelines y procesos de transformación. • Seguridad, control y cumplimiento de datos: Implementar medidas de seguridad y cumplimiento para proteger la confidencialidad, integridad y disponibilidad de la información. • Colaboración e integración con equipos y productos: Trabajar de forma articulada con equipos de ciencia de datos, producto, ingeniería de software y operaciones. • Entendimiento del negocio para orientar el uso del dato: Profundizar en el conocimiento del negocio y del sector.

Airflow Amazon Redshift AWS Azure BigQuery ETL GCP MySQL PostgreSQL Python SQL Tableau

View details: Data Engineer

Colombia

Apply

Staff Data Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Senior Data Engineer, DBT, Airflow, Snowflake

Data Engineer, Azure Databricks

Data Engineer I

Data Engineer