Building a #BetterWorkingWorld by providing trust through assurance and helping organizations grow, transform & operate.
Senior GCP Data Engineer
Location
India
Posted
3 days ago
Salary
0
Seniority
Senior
Job Description
Senior GCP Data Engineer
EY
• Develop and implement data pipelines (batch and streaming) using Dataflow (Apache Beam) • Design and optimize BigQuery-based data warehouses and data marts • Work on data ingestion, transformation, and loading (ETL/ELT) using GCP services • Collaborate with business teams to understand requirements and convert them into scalable data solutions • Ensure data quality, performance, and cost optimization across GCP workloads • Support data modeling activities (dimensional and normalized models) • Assist in solution design and architecture discussions • Troubleshoot pipeline performance issues and optimize processing • Contribute to CI/CD and deployment automation for data pipelines • Work within agile teams and support sprint-based delivery
Job Requirements
- 3–6 years of experience in GCP Data Engineering
- Hands-on experience in BigQuery
- Dataflow (batch pipelines; exposure to streaming is a plus)
- GCP services: Cloud Storage, Pub/Sub
- Strong understanding of ETL/ELT processes and data pipeline design
- Data warehousing concepts
- Exposure to Cloud Composer (Airflow) or any workflow orchestration tools
- GCP certification (Associate or Professional Data Engineer) is a plus
Benefits
- Health insurance
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Senior Data Engineer
First Help FinancialWe provide auto loans to the underserved and care for our customers and partners with exceptional service.
• Lead data engineering delivery to build reliable, scalable pipeline leveraging Snowflake, Sigma and other applications to support complex data analytics and self-service reporting. • Drive integration with popular tools and services in the broader data warehousing ecosystem. • Optimize architectural performance across the Data stack by refining execution parameters and enforcing best practice strategies to maximize resource efficiency and throughput. • Establish and evolve a data governance operating model across data quality, lineage, metadata management, stewardship, cataloging, and lifecycle management. • Perform advanced code-level analysis and resource profiling to identify and mitigate systemic root causes, ensuring the stability and reliability of high-scale production workloads. • Coordinate with internal stakeholders to ensure effective communication, issue resolution, and on-time delivery of projects. • Participate in weekend and weekday on call rotation.
Data Architect
Cooper ParryThe Rebels of Accountancy ⚡ B Corp Certified | Top 30 Best Companies to Work For | #1 Accountancy Firm to Work For
• Owning and evolving the end-to-end data architecture, aligned to Cooper Parry’s strategy and enterprise standards • Defining architectural patterns, principles, and standards for: Data ingestion, Data transformation, Data modelling, Analytics and reporting • Acting as a trusted advisor to senior stakeholders on data architecture decisions • Designing and oversee solutions using Microsoft Fabric, including: OneLake Lakehouse and Warehouse architectures, Data Pipelines, Spark / Notebooks, Semantic models • Ensuring platform scalability, performance optimisation, cost efficiency, and security • Guiding the transition from legacy platforms where applicable • Providing technical leadership across data engineering practices, including: Batch and streaming data pipelines, ELT/ETL frameworks, CI/CD for data platforms • Architecting robust integrations from internal and third-party systems • Defining data quality, validation, and monitoring standards • Defining and enforcing: Data governance frameworks, Master and reference data strategies, Access control and role-based security • Ensuring compliance with GDPR and organisational security standards • Promoting high data quality and lineage transparency • Working closely with: Data Engineers, Analytics Engineers, BI Developers, Product and Business stakeholders • Mentoring and upskill team members • Reviewing solution designs and providing constructive architectural guidance
Senior Python Developer (data-focused)
íliaSomos especialistas em tecnologia, dados e design, impulsionando a transformação digital de grandes players do mercado há mais de 10 anos, nos setores financeiro, seguros e mobilidade. Com mais de 450 profissionais, estamos presentes no Brasil e Europa, atendendo aos mercados da América Latina, Europa e América do Norte, desenvolvendo produtos digitais de alta qualidade e com foco em resultados de negócios. Certificada pelo 7° ano consecutivo como Great Place to work aqui na ília acreditamos que pessoas mudam o mundo, e investimos nelas. Nossas awesome deliveries são feitas de pessoas para pessoas, afinal awesome people make awesome deliveries!
Role Description Buscamos um desenvolvedor Python com sólida experiência em dados que saiba escrever código de produção — APIs, pipelines, integrações — e que entenda profundamente o dado que trafega por ele. Buscamos quem já viveu os dois lados e se move entre eles com naturalidade. O time trabalha com um fluxo completo de dados: - Extração de bases relacionais, processamento em camadas na AWS (arquitetura medalhão) e disponibilização para microsserviços e BI. - A pessoa precisará atuar com autonomia, propor o melhor caminho técnico e transitar entre diferentes stacks conforme a demanda do projeto. Responsibilities - Ingestão e Extração: Desenvolver scripts (Python puro ou PySpark) para consultar bases SQL Server, tratar os dados e publicá-los em buckets AWS S3. - Processamento (Arquitetura Medalhão): Construir pipelines para mover e refinar dados entre as camadas Bronze e Prata utilizando Python, PySpark e AWS Glue. - Disponibilização de Dados: Garantir a entrega de dados em OpenSearch (Fast Data) e no Amazon Redshift (para suporte ao time de BI). - Integração com Microsserviços: Colaborar com a camada de consumo, onde microsserviços em Node.js consultam o OpenSearch para servir as aplicações finais. - Visão Consultiva: Atuar de forma autônoma, indicando os melhores caminhos técnicos e arquiteturais para o projeto. - Criar e consumir APIs dentro do ecossistema AWS (Lambda, Glue, S3). Qualifications - Python sólido: código estruturado, orientado a objetos e com testes (Pytest ou similar). - Experiência real com pipelines de dados: PySpark, AWS Glue, Mage.ai ou equivalente. - AWS na prática: S3, Lambda, Redshift, Glue. - Vivência com microsserviços e APIs — criação, integração e arquitetura. - SQL e integração com bases relacionais (SQL Server, PostgreSQL ou similar). - CI/CD e versionamento: GitHub Actions, GitLab, Terraform ou equivalente. - Perfil adaptável: confortável em trabalhar com diferentes stacks conforme o projeto exige. Differentials - Node.js — o microsserviço de consulta de dados do projeto está em Node. - Apache Kafka ou experiência com mensageria assíncrona. - OpenSearch ou Elasticsearch. - Atuação em contextos de alta criticidade ou volume de dados. Benefits - Esta contratação será no modelo PJ, atuando no modelo remoto.
• Design, implement, and optimize modern data and analytics platforms using Microsoft Fabric, Azure, and Power BI • Develop scalable cloud and data platforms and build high-performance ETL/ELT processes • Design and evolve modern data models, data pipelines, and self-service analytics solutions • Define technical solution architecture and take responsibility for complex client projects • Coordinate and align interdisciplinary project teams, stakeholders, and clients in agile project environments



