Pwc CEE

PwC is a global network of more than 370,000 professionals in 149 countries that turns challenges into opportunities. We create innovative solutions in audit, consulting, tax and technology, combining knowledge from all over the world. PwC SDC Lviv, opened in 2018, is part of this global space. It is a place where technology is combined with team spirit, and ambitious ideas find their embodiment in real projects for Central and Eastern Europe.

Data Engineer

Location

Ukraine

Posted

27 days ago

Salary

0

Seniority

Mid Level

Job Description

Data Engineer

Pwc CEE

Role Description PwC Poland’s Cloud & Digital team supports global cloud projects across Cloud Strategy, Migrations, Data Analytics, and DevOps. We help clients adopt cloud infrastructure, modernize data platforms, and optimize operations. This hybrid role is based in Warsaw with flexible working options. - Building data platforms (data warehouses, lakes, lakehouses) using Azure Databricks, MS Fabric, Azure Data Factory. - Designing data flows using medallion architecture and supporting data migrations from legacy systems. - Conducting data assessments with Data Modelers/Analysts and addressing data quality issues. - Working with senior stakeholders to gather requirements and support modern data architecture design. - Supporting Data Governance activities, especially around MDM and Data Quality. - Understanding Data Science/Analytics use cases in industries like Consumer Goods, Retail, Manufacturing, or Financial Services. Qualifications - 3 years of experience in Data Engineering or related fields (BI, Data Science). - Hands‑on experience with Databricks, Snowflake, BigQuery, MS Fabric. - Experience building data warehouses/lakehouses with Azure Data Factory / Synapse. - Skills in PySpark/Python and SQL. - Some exposure to CI/CD (Azure DevOps, Jira). - B2+ English and ability to prepare technical documentation. - Experience using modern tech and AI tools in daily work. Requirements - Cloud certifications (Azure/AWS/GCP) and Data Governance certifications. - Training via platforms like Coursera/Udemy/Datacamp. - Knowledge of data modelling (e.g., Kimball), Data Strategy, Data Quality, and MDM. - Experience with Agile (Scrum), ML/GenAI use cases, or Agentic AI for data automation. - Work with senior stakeholders (e.g., CDOs) in Manufacturing, Consumer Goods, or Financial Services. Benefits - Remote or in a comfortable office in Lviv - you choose. - Personal development plan, mentoring, English and Polish language courses. - Official employment from day one, annual review of salary and career prospects. - Events that unite the team and a space where everyone can be themselves.

Related Categories

Related Job Pages

More Data Engineer Jobs

Role Description Siamo Corposostenibile, il principale centro online di Nutrizione Integrativa in Italia, in forte crescita nel settore health & wellness, con un team di oltre 150 professionisti. Stiamo costruendo una piattaforma tecnologica proprietaria con forte componente data-driven: raccolta, normalizzazione e analisi dei dati provenienti da molteplici sistemi (CRM, marketing, prodotto, AI). Cerchiamo un Senior Data Engineer responsabile della progettazione e realizzazione della nostra data platform su GCP. Il focus non è solo analisi: il ruolo prevede la costruzione end-to-end del sistema, dall’acquisizione dei dati alla generazione di dashboard utilizzate in azienda. Avrai ownership diretta su come i dati vengono raccolti, strutturati e resi disponibili al business. Modalità di lavoro: - Full remote (Europa) - Collaborazione asincrona + allineamenti periodici - Richiesta sovrapposizione operativa: GMT-1 / GMT+3 Responsibilities - Progettare e sviluppare un data lake su GCP (Cloud Storage + BigQuery) - Costruire pipeline di ingestione dati da fonti eterogenee: - API (CRM, marketing tools, SaaS esterni) - database interni - eventi applicativi - Sviluppare pipeline ETL/ELT scalabili e monitorabili - Modellare i dati per analytics (star schema, layer semantico) - Garantire qualità, consistenza e affidabilità dei dati - Esporre dati tramite query layer, API o strumenti BI - Sviluppare dashboard per business e operations - Ottimizzare costi e performance su BigQuery - Definire standard e best practice per la gestione dei dati Stack Tecnologico (GCP) - Storage: Google Cloud Storage (data lake) - Data Warehouse: BigQuery - Processing: Python (batch jobs), SQL - Orchestrazione: Cloud Composer (Airflow) o equivalenti - Ingestion/Eventi: Pub/Sub (nice to have) - BI: Looker / Metabase / Superset - Infra: Docker, CI/CD Qualifications - 5+ anni di esperienza in data engineering o backend engineering con forte componente dati - Esperienza nella progettazione di data lake o data warehouse - Ottima conoscenza di Python e SQL - Esperienza con pipeline ETL/ELT e integrazione dati da API esterne - Data modeling per analytics (fact tables, dimensioni, ecc.) - Esperienza con database relazionali (PostgreSQL o simili) Requirements - Esperienza con sistemi distribuiti o scalabili - Versionamento codice (Git) - Esperienza con Docker e pipeline CI/CD - Familiarità con orchestrazione workflow (Airflow o equivalenti) Nice to have - Esperienza con streaming/event-based systems - Esperienza con strumenti BI (Metabase, Superset, Looker, etc.) - Esperienza con sistemi di tracking (event analytics) - Esperienza in ambienti startup / high-growth Soft Skills - Forte autonomia e senso di ownership - Capacità di lavorare su problemi poco definiti - Approccio pragmatico e orientato al risultato - Comunicazione chiara con team tecnici e non tecnici Benefits - Ownership completa della data platform - Impatto diretto su decisioni di business - Ambiente veloce e senza burocrazia - Collaborazione diretta con leadership tecnica - Full remote in Europa - Retribuzione commisurata all’esperienza e alle competenze del candidato. Disponibilità a discutere nel corso del colloquio.

Europe
Job Closed

Role Description Clear Fracture is building AI-driven data integration systems that enable organizations to connect, transform, and reason over complex data using agentic workflows. Our platform operates across cloud and on-prem environments and is designed to support multi-tenant, production-scale use cases. We are looking for a Data Engineer who operates as a software engineer first, with strong experience in data modeling and data systems. You will play a key role in building the core data layer that powers our agentic platform—designing schemas, implementing data services, and enabling reliable, scalable data flows. In addition to building core data infrastructure, you will also develop real use cases on the platform itself, helping shape how users interact with data. This includes designing data interfaces, abstractions, and tooling that make it easier to understand, model, and work with data across the system. This is not a traditional ETL-only role. You will write production code, design systems, and help define how data is represented, accessed, and understood across the platform. Qualifications - Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent practical experience. - 6+ years of professional experience in software engineering and/or data engineering roles. - Due to the nature of the work, U.S. Citizenship and the ability to obtain a Secret Clearance are required. - Strong programming skills in Python (or similar backend language). - Experience designing and implementing data models for production systems, with advanced knowledge of dimensional modeling topics like slowly changing dimensions and entity relationship diagrams. - Proficiency in SQL and experience with relational databases (e.g., PostgreSQL). - Experience building backend services or APIs that interact with data systems. - Experience designing and operating data pipelines (ETL/ELT). - Familiarity with NoSQL databases and different data storage paradigms. - Experience working with large datasets and performance optimization. - Experience with Docker and containerized development workflows. - Familiarity with Kubernetes-based environments. - Strong understanding of software engineering fundamentals (testing, version control, system design). Requirements - Design and implement logical and physical data models for complex, evolving datasets. - Define schemas and access patterns that support multi-tenant usage and application-level workflows. - Balance normalization, performance, and flexibility across different storage systems. - Partner with product and engineering teams to translate requirements into scalable data designs. - Develop real-world data use cases on top of the platform to validate and extend its capabilities. - Design and build data interfaces and abstractions that help users understand and work with data. - Contribute to systems such as data glossaries, semantic layers, and metadata and schema discovery tools. - Help define how users explore, model, and interact with data within the platform. - Translate complex data structures into intuitive, usable representations. - Build backend services and APIs that expose and operate on data models. - Implement data access layers that are reliable, maintainable, and performant. - Contribute to core application architecture where data and services intersect. - Write clean, testable, production-grade code. - Design and implement pipelines for ingesting, transforming, and validating data. - Support both batch and near-real-time processing workflows. - Build systems that handle structured, semi-structured, and unstructured data. - Enable data flows that support AI-driven and agent-based workflows. - Work with embeddings, context retrieval, and data representations used in modern AI systems. - Help design systems that make data accessible and useful for autonomous agents. - Implement validation, monitoring, and testing for data systems. - Ensure correctness, consistency, and observability of data pipelines and services. - Diagnose and resolve data-related issues in production environments. Benefits - Engineering mindset: You approach data systems as software systems, not just pipelines. - Data intuition: You understand how to model real-world complexity into clear, usable structures. - Product thinking: You care about how users interact with and understand data, not just how it is stored. - Systems thinking: You see how data flows through services, APIs, and AI systems. - Ownership: You take responsibility for the reliability and usability of what you build. - Pragmatism: You balance ideal design with real-world constraints. - Collaboration: You work effectively across engineering disciplines.

United States
$120K - $160K / year
Full TimeRemoteTeam 10,001+H1B Sponsor

• Lead a team of data engineers transforming data from disparate systems to enable insights and analytics for business stakeholders. • Create technical roadmaps and recommend strategies for data pipelines and integration. • Leverage cloud-based infrastructure to implement scalable, resilient, and efficient data engineering solutions. • Collaborate with data analysts, data scientists, database administrators, cross-functional teams, and business stakeholders to solve problems. • Influence architectural decisions and design patterns across the data platform. • Provide technical leadership across the software development lifecycle, from design to deployment, including hands-on contribution. • Develop project plans, facilitate prioritization timelines, allocate resources, and take ownership of assigned technical projects in a fast-paced environment. • Perform code reviews and ensure data engineers follow best-practice coding standards. • Define and validate test cases to ensure data quality, reliability, and a high level of confidence. • Continuously improve quality, efficiency, and scalability of data pipelines, reducing gaps and inconsistencies.

United States
$138K - $200.1K / year
Job Closed
Full TimeRemoteTeam 1,001-5,000Since 1966H1B No Sponsor

• Own and deliver impactful data products within WSI’s medallion architecture. • Transform raw and conformed data into governed, high-quality datasets for analytics, AI, and operational use. • Design, build, and optimize data solutions on Microsoft Fabric, including pipelines, Lakehouse/Warehouse structures, PySpark notebooks, and semantic models. • Evolve and implement data architecture patterns (medallion, SCD, CDC, orchestration, CI/CD), adapting them to real-world scale, performance, and business needs. • Ensure data quality, observability, and performance at scale. • Implement validation frameworks, monitoring, SLAs, and cost-optimized storage and compute strategies. • Partner with stakeholders to translate requirements into reusable, scalable data models and curated data products. • Drive consolidation of legacy reporting and BI tools into a unified, governed analytics platform. • Embed security, governance, and best practices, including role-based access, cataloging, and release management. • Act as a technical leader, contributing to standards, mentoring peers, and elevating overall engineering quality. • Leverage modern dev tools (e.g., GitHub Copilot or similar) to accelerate delivery and engineering efficiency.

Wisconsin
$125K - $200K / year