Job Closed

This listing is no longer active.

Egen

Engineering new possibilities with platforms, data, and generative AI

Principal Data Architect – Streaming, Data Platforms

Data Engineer | Other | Remote | Lead | Team 501-1,000 | Since 2000 | H1B Sponsor

Location

United States

Posted

122 days ago

Salary

$200K - $220K / year

Seniority

Lead

8 yrs exp | English | Apache Kafka

Job Description

  • Design and implement **scalable streaming data platforms** to support real-time ingestion, processing, and analytics
  • Architect and guide the development of **end-to-end data platforms** across batch and streaming workloads
  • Lead and contribute to **Master Data Management (MDM)** solutions, including:
      • Golden record design
      • Data matching, survivorship, and hierarchy management
      • Integration patterns with downstream consumers
  • Define and implement **data governance frameworks**, including:
      • Data ownership and stewardship models
      • Data quality rules and monitoring
      • Metadata, lineage, and access controls
  • Collaborate with application teams to expose data via **APIs and event-driven architectures**
  • Provide architectural guidance for **cloud-native deployments**, including containerization and orchestration
  • Establish **data architecture standards, patterns, and best practices**
  • Partner with DevOps teams to enable CI/CD, infrastructure automation, and platform reliability
  • Review designs, mentor engineers, and help drive technical decisions across projects

Job Requirements

  • Experience: 8–12+ years in data engineering and architecture roles
  • Strong experience designing **streaming data platforms** (Kafka, MSK, Kinesis, or similar)
  • Proven hands-on experience building or leading **MDM implementations**
  • Solid understanding of **data governance principles** and real-world implementation
  • Experience designing **event-driven and API-first data architectures**
  • Deep knowledge of data modeling (canonical models, domain-driven design concepts)
  • Ability to translate business requirements into scalable technical solutions
  • Good communication skills

Benefits

  • Comprehensive Health Insurance
  • Paid Leave (Vacation/PTO)
  • Paid Holidays
  • Sick Leave
  • Parental Leave
  • Bereavement Leave
  • 401(k) Employer Match
  • Employee Referral Bonuses

More Data Engineer Jobs

Senior Data Engineer

Zeta Global

We deliver better experiences for consumers and better results for your brand.

Data Engineer | 122 days ago
Other | Remote | Team 1,001-5,000 | Since 2007 | H1B Sponsor

  • Build data pipelines: Develop robust batch and streaming pipelines (Kafka/Kinesis) to ingest, transform, and enrich large-scale event data (impressions, clicks, conversions, costs, identity signals).
  • Create data aggregates & marts: Design and maintain curated aggregates and dimensional models for multiple consumers: prediction models, agents, BI dashboards, and measurement workflows.
  • Data modeling & contracts: Define schemas, data contracts, and versioning strategies to keep downstream systems stable as sources evolve.
  • Data quality & reliability: Implement validation, anomaly detection, backfills, and reconciliation to ensure completeness, correctness, and timeliness (SLAs/SLOs).
  • Performance & cost optimization: Optimize compute/storage for scale (partitioning, file sizing, incremental processing, indexing), balancing latency, throughput, and cost.
  • Orchestration & automation: Build repeatable workflows with scheduling/orchestration (e.g., Airflow, Dagster, Step Functions) and CI/CD for data pipelines.
  • Observability for data systems: Instrument pipelines with metrics, logs, lineage, and alerting to accelerate detection and root-cause analysis of data issues.
  • Security & governance: Apply least-privilege access, PII-aware handling, and governance controls aligned with enterprise standards.

United States
$165K - $175K / year
Job Closed
Data Engineer – Pipelines, Structured Markup

Vulcury

Vulcury invests in early stage startups and advises companies of all sizes on strategy, growth, and efficiency

Data Engineer | 122 days ago
Full Time | Remote | Team 1-10 | Since 2022 | H1B No Sponsor

  • Design and maintain ingestion pipelines (Python-based ETL/ELT)
  • Design structured transformation workflows using dbt, SQLMesh, or equivalent
  • Convert unstructured transcripts and documents into normalized database records
  • Maintain PostgreSQL architecture (structured tables, JSONB, indexing strategy)
  • Develop attribute extraction frameworks for technical, commercial, and risk signals
  • Ensure data quality, consistency, and lineage from raw interaction to structured output
  • Collaborate with AI/ML engineers to ensure clean model inputs

India
Data Engineer

Koala Health

Koala Health works to provide people with all of the medications and health products their pets need. The company packages medication and health products by date and time to help m

Data Engineer | 122 days ago

  • Own and evolve Koala Health’s end-to-end data infrastructure, including ingestion, transformation, modeling, and delivery.
  • Design and maintain reliable data pipelines from production systems (e.g., application databases, third-party tools, vendors).
  • Build and manage data models that support analytics, reporting, and operational use cases.
  • Establish and enforce best practices for data quality, testing, monitoring, and documentation.
  • Partner with stakeholders across product, operations, finance, and marketing to understand data needs and translate them into scalable solutions.
  • Improve the reliability, performance, and cost-efficiency of the data stack as the business grows.
  • Own incident response and debugging for data issues, proactively identifying and resolving root causes.
  • Create and maintain clear documentation so data assets are understandable and usable across the company.
  • Evaluate and implement tooling improvements where they meaningfully improve developer velocity or data quality.
  • Act as a thought partner to leadership on how data can better support decision-making and operational efficiency.

United States
$125K - $150K / year
Data Engineer | 122 days ago
Other | Remote | Team 51-200 | Since 2016 | H1B No Sponsor

This description is a summary of our understanding of the job description.

Role Description

ROIT is a disruptive, innovative company. We develop and bring to market disruptive technology solutions built on the principle that human involvement is the exception. We offer intelligent transformation for companies that want to evolve and achieve remarkable results with robotization, artificial intelligence, and analytics. Come be part of this transformation!

Responsibilities

  • Implement monitoring systems and routines (data, applications, queries, etc.)
  • Evolve data models, architecture, and data pipeline construction to meet new engineering and business requirements
  • Implement data migration, processing, and storage routines (ETL)
  • Develop integrations between different data sources (RDS, external APIs, etc.) to centralize them in a data lake
  • Implement and run load tests
  • Monitor running data pipelines

Qualifications

  • Experience with Python
  • Experience with GCP
  • Experience with continuous integration and cloud deployment
  • Experience with data lakes, data warehouses, and data marts
  • Knowledge of SQL and NoSQL databases
  • Knowledge of cloud platform resources
  • Knowledge of event-driven architecture

Requirements

Experience in the following is a plus:

  • Hands-on experience with streaming projects
  • Experience with Airflow
  • Distributed data processing (Spark or similar)

All locations: United States | Canada
Job Closed