tvScientific logo
tvScientific

Performance TV Advertising Platform

Staff Data Engineer

Data EngineerData EngineerOtherRemoteLeadTeam 51-200Since 2020H1B No SponsorCompany SiteLinkedIn

Location

California

Posted

96 days ago

Salary

$155.6K - $320.3K / year

Seniority

Lead

Bachelor Degree5 yrs expEnglishAWSScalaApache Spark

Job Description

Staff Data Engineer

tvScientific

• Design and maintain a scalable identity resolution platform • Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources • Ensure deterministic and probabilistic matching logic that is transparent, auditable, and measurable • Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets • Build and operate batch and streaming pipelines using modern data stack tools • Create clear documentation, standards, and runbooks for identity and governance systems • Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls • Implement privacy-by-design principles (PII handling, consent enforcement, retention policies) • Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA) • Establish monitoring and alerting for data quality, freshness, and integrity

Job Requirements

  • Data engineering experience with proven track record building data infrastructure using Spark with Scala
  • Proven experience building data infrastructure using Spark with Scala for at least 5 years
  • Experience in delivering significant technical initiatives and building reliable, large scale services
  • Experience in delivering APIs backed by relationship-heavy datasets
  • Experience implementing data governance practices, including data quality, metadata management, and access controls
  • Strong understanding of privacy-by-design principles and handling of sensitive or regulated data
  • Familiarity with data lakes, cloud warehouses, and storage formats
  • Strong proficiency in AWS services
  • Successful design and implementation of scalable and efficient data infrastructure
  • High attention to detail in implementation of automated data quality checks
  • Effective collaboration with cross-functional teams
  • Excellent written and verbal communication skills
  • Bachelor's degree in Computer Science or a related field.

Benefits

  • Information regarding the culture at Pinterest and benefits available for this position can be found here.

Related Categories

Related Job Pages

More Data Engineer Jobs

Full TimeRemoteTeam 11-50Since 2010H1B No Sponsor

• Modeling datasets and schemes for consistency and easy access, • Design and implement data transformations and data marts, • Integrating third-party systems and external data sources into data warehouse, • Building data flows for fetching, aggregation and data modeling using batch pipelines.

Poland
Job Closed
ContractRemoteTeam 10,001+H1B Sponsor

• Diseñar e implementar pipelines ETL/ELT escalables utilizando Azure Databricks y PySpark • Desarrollar soluciones de transformación de datos robustas usando Python y SQL • Construir y mantener notebooks en Databricks para procesos de ingestión y transformación • Orquestar flujos de datos mediante Azure Data Factory • Gestionar la gobernanza y control de accesos utilizando Databricks Unity Catalog • Crear y administrar Databricks Jobs para cargas productivas • Trabajar con soluciones de almacenamiento en Azure (Data Lake, Blob Storage, etc.) • Optimizar queries y pipelines para mejorar performance y escalabilidad • Garantizar la calidad, consistencia y confiabilidad de los datos • Colaborar con equipos de analytics y negocio para habilitar soluciones data-driven • Documentar procesos técnicos, arquitectura y despliegues • Utilizar herramientas de AI para optimizar desarrollo, debugging y productividad técnica

Mexico
Job Closed
TransUnion logo

Data Engineer I

TransUnion

TransUnion is a global information and insights company that makes trust possible by ensuring that each consumer is reliably and safely represented in the marketplace. We do this by having an accurate and comprehensive picture of each person. This picture is grounded in our legacy as a credit reporting agency which enables us to tap into both credit and public record data; our data fusion methodology that helps us link, match and tap into the awesome combined power of that data; and our knowledgeable and passionate team, who stewards the information with expertise, and in accordance with local legislation around the world. Because of our work, organizations can better understand consumers in order to make more informed decisions, and earn their trust through great, personalized experiences, and the proactive extension of the right opportunities, tools and offers. In turn, consumers can be confident that their data identities will result in the opportunities they deserve. We make trust possible, so businesses and consumers can transact with confidence and achieve great things. We call this Information for Good®—it’s our purpose, and what drives us every day.

Data Engineer96 days ago
Full TimeRemoteTeam 10,001+Since 1968H1B Sponsor

• Be part of an autonomous, cross-functional agile/scrum team • Drive building the next generation suite of products and platform • Designing, coding, building, and deploying highly scalable and robust solutions • Work on core services of big data solutions by streamlining design • Collaborate with the team to build orchestration platform in cloud

Costa Rica
Job Closed
Full TimeRemoteTeam 501-1,000H1B No Sponsor

• Diseñar y construir pipelines de datos: Diseñar, construir y mantener pipelines para la ingestión, limpieza, transformación y carga (ETL/ELT) de datos desde múltiples fuentes hacia almacenes de datos y plataformas analíticas. • Administración de la infraestructura y almacenamiento de datos: Configurar y administrar bases de datos, sistemas de almacenamiento y herramientas de procesamiento. • Optimización de rendimiento y eficiencia: Identificar y resolver cuellos de botella en pipelines y sistemas de almacenamiento. • Gestión de metadatos y documentación: Desarrollar y mantener mecanismos de catalogación y documentación para conjuntos de datos, pipelines y procesos de transformación. • Seguridad, control y cumplimiento de datos: Implementar medidas de seguridad y cumplimiento para proteger la confidencialidad, integridad y disponibilidad de la información. • Colaboración e integración con equipos y productos: Trabajar de forma articulada con equipos de ciencia de datos, producto, ingeniería de software y operaciones. • Entendimiento del negocio para orientar el uso del dato: Profundizar en el conocimiento del negocio y del sector.

Colombia