Performance TV Advertising Platform
Staff Data Engineer
Location
California
Posted
96 days ago
Salary
$155.6K - $320.3K / year
Seniority
Lead
Job Description
Staff Data Engineer
tvScientific
• Design and maintain a scalable identity resolution platform • Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources • Ensure deterministic and probabilistic matching logic that is transparent, auditable, and measurable • Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets • Build and operate batch and streaming pipelines using modern data stack tools • Create clear documentation, standards, and runbooks for identity and governance systems • Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls • Implement privacy-by-design principles (PII handling, consent enforcement, retention policies) • Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA) • Establish monitoring and alerting for data quality, freshness, and integrity
Job Requirements
- Data engineering experience with proven track record building data infrastructure using Spark with Scala
- Proven experience building data infrastructure using Spark with Scala for at least 5 years
- Experience in delivering significant technical initiatives and building reliable, large scale services
- Experience in delivering APIs backed by relationship-heavy datasets
- Experience implementing data governance practices, including data quality, metadata management, and access controls
- Strong understanding of privacy-by-design principles and handling of sensitive or regulated data
- Familiarity with data lakes, cloud warehouses, and storage formats
- Strong proficiency in AWS services
- Successful design and implementation of scalable and efficient data infrastructure
- High attention to detail in implementation of automated data quality checks
- Effective collaboration with cross-functional teams
- Excellent written and verbal communication skills
- Bachelor's degree in Computer Science or a related field.
Benefits
- Information regarding the culture at Pinterest and benefits available for this position can be found here.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Modeling datasets and schemes for consistency and easy access, • Design and implement data transformations and data marts, • Integrating third-party systems and external data sources into data warehouse, • Building data flows for fetching, aggregation and data modeling using batch pipelines.
• Diseñar e implementar pipelines ETL/ELT escalables utilizando Azure Databricks y PySpark • Desarrollar soluciones de transformación de datos robustas usando Python y SQL • Construir y mantener notebooks en Databricks para procesos de ingestión y transformación • Orquestar flujos de datos mediante Azure Data Factory • Gestionar la gobernanza y control de accesos utilizando Databricks Unity Catalog • Crear y administrar Databricks Jobs para cargas productivas • Trabajar con soluciones de almacenamiento en Azure (Data Lake, Blob Storage, etc.) • Optimizar queries y pipelines para mejorar performance y escalabilidad • Garantizar la calidad, consistencia y confiabilidad de los datos • Colaborar con equipos de analytics y negocio para habilitar soluciones data-driven • Documentar procesos técnicos, arquitectura y despliegues • Utilizar herramientas de AI para optimizar desarrollo, debugging y productividad técnica
Data Engineer I
TransUnionTransUnion is a global information and insights company that makes trust possible by ensuring that each consumer is reliably and safely represented in the marketplace. We do this by having an accurate and comprehensive picture of each person. This picture is grounded in our legacy as a credit reporting agency which enables us to tap into both credit and public record data; our data fusion methodology that helps us link, match and tap into the awesome combined power of that data; and our knowledgeable and passionate team, who stewards the information with expertise, and in accordance with local legislation around the world. Because of our work, organizations can better understand consumers in order to make more informed decisions, and earn their trust through great, personalized experiences, and the proactive extension of the right opportunities, tools and offers. In turn, consumers can be confident that their data identities will result in the opportunities they deserve. We make trust possible, so businesses and consumers can transact with confidence and achieve great things. We call this Information for Good®—it’s our purpose, and what drives us every day.
• Be part of an autonomous, cross-functional agile/scrum team • Drive building the next generation suite of products and platform • Designing, coding, building, and deploying highly scalable and robust solutions • Work on core services of big data solutions by streamlining design • Collaborate with the team to build orchestration platform in cloud
• Diseñar y construir pipelines de datos: Diseñar, construir y mantener pipelines para la ingestión, limpieza, transformación y carga (ETL/ELT) de datos desde múltiples fuentes hacia almacenes de datos y plataformas analíticas. • Administración de la infraestructura y almacenamiento de datos: Configurar y administrar bases de datos, sistemas de almacenamiento y herramientas de procesamiento. • Optimización de rendimiento y eficiencia: Identificar y resolver cuellos de botella en pipelines y sistemas de almacenamiento. • Gestión de metadatos y documentación: Desarrollar y mantener mecanismos de catalogación y documentación para conjuntos de datos, pipelines y procesos de transformación. • Seguridad, control y cumplimiento de datos: Implementar medidas de seguridad y cumplimiento para proteger la confidencialidad, integridad y disponibilidad de la información. • Colaboración e integración con equipos y productos: Trabajar de forma articulada con equipos de ciencia de datos, producto, ingeniería de software y operaciones. • Entendimiento del negocio para orientar el uso del dato: Profundizar en el conocimiento del negocio y del sector.




