Data Engineer

Location

Poland

Posted

1 day ago

Salary

0

Seniority

Mid Level

Job Description

Data Engineer

InPost

Role Description

At InPost, Data & AI is not a support function; it is the engine behind our decisions. We process billions of events daily across nine European markets, and our data platform is what makes that intelligence possible. As a Data Engineer in our Data & AI area, you will be one of the builders: designing the pipelines, streaming systems, and lake architectures that turn raw operational data into reliable, high-quality data products powering ML models, analytics, and business decisions.

You will work in cross-functional squads alongside Data Scientists, Analytics Engineers, and Product Managers, shipping real data products, not internal tooling that no one sees. The scale is real, the data is complex, and the impact is immediate.

Success looks like:

- Data products that are trusted, fresh, and easy to consume
- Pipelines that run reliably at scale with no manual intervention
- A codebase that your colleagues are proud to contribute to

Main Activities:

- Data Platform & Lake Engineering: Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of structured and semi-structured data.
- Streaming Solutions: Build and operate real-time data streaming pipelines using Apache Kafka and its ecosystem (Kafka Streams, Kafka Connect).
- ETL/ELT Design and Maintenance: Architect and maintain ETL and ELT pipelines with a focus on data quality, idempotency, and observability.
- Spark and Databricks Development: Develop distributed data processing applications using Apache Spark (PySpark, Scala), running on Databricks.
- Database Engineering: Design and manage both SQL and NoSQL databases used in our data products.
- Cloud-Native Solutions: Build data solutions on cloud infrastructure (GCP, Azure, or AWS).
- CI/CD and Engineering Excellence: Apply software engineering best practices to data pipelines.
- Performance Monitoring and Optimisation: Own the operational health of the data infrastructure and ETL processes you build.
- API and System Integration: Integrate data from internal and external sources via REST and SOAP APIs.
- Knowledge Sharing and Community: Actively contribute to InPost's data engineering community.

Qualifications

- At least 3 years of experience in a Data Engineering or similar role
- Hands-on experience with Apache Spark (Streaming, Spark SQL, MLlib) and Databricks (PySpark, Scala)
- Practical experience with Apache Kafka, including Kafka Streams and Kafka Connect
- Proficiency in Python; working knowledge of Scala or Java
- Experience designing and operating SQL databases (e.g., PostgreSQL, BigQuery, Spark SQL) and NoSQL databases (e.g., MongoDB, Cassandra, or similar)
- Experience building and maintaining data lake environments (Delta Lake, Parquet, or equivalent)
- Familiarity with cloud platforms (GCP, Azure, or AWS) and their managed data services
- Experience integrating data via REST and/or SOAP APIs
- Working knowledge of CI/CD tooling (GitLab CI, Jenkins, or equivalent) and software engineering practices (testing, versioning, code review)
- Experience building and running Docker containers
- Willingness to share knowledge and actively contribute to engineering best practices
- Professional working proficiency in both English and Polish

Requirements

- Experience in an international, multi-market environment
- Exposure to ML pipeline engineering or feature store design
- Familiarity with data orchestration tools (Apache Airflow, Prefect, or Databricks Workflows)
- Experience with Infrastructure as Code (Terraform, Ansible)
- Contributions to open-source data engineering projects

Benefits

- The option to work from the office or 100% remotely
- Opportunity to work in a diverse, international, and cross-functional environment alongside leading experts
- Fulfilling careers with a range of benefits for employees, and investment in training opportunities for their development
- Involvement in technology monitoring and choices
- Your impact will be visible instantly, and you will be making a difference in our users' lives

More Data Engineer Jobs

Full Time | Remote | Team 11-50 | Since 2016 | H1B No Sponsor

Role Description

We are looking for an experienced Senior Data Engineer to join a fast-growing technology company building data-driven solutions at scale. In this role, you will design and maintain robust data infrastructure, process high-volume real-time data streams, and collaborate with cross-functional teams to support analytics and machine learning initiatives.

📍 Location: Philippines (Remote)
💰 Salary: Negotiable
📈 Seniority: Senior Level (5-10 years of experience)

Key Responsibilities

- Design, build, and maintain scalable data pipelines for both real-time and historical data processing.
- Develop and optimize databases, data warehouses, and ETL workflows handling large-scale time-series data.
- Process and manage high-volume data streams (thousands of records per second).
- Collaborate with Product, Customer Success, Installation, and Technical Support teams to ensure data quality and business alignment.
- Implement monitoring, alerting, and logging systems to ensure platform reliability.
- Support Data Science teams by improving data accessibility and facilitating model deployment.
- Troubleshoot and optimize data infrastructure performance.

Qualifications

- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 5+ years of experience as a Data Engineer.
- Strong experience designing and scaling data infrastructure.
- Hands-on experience working with time-series data.
- Expert-level SQL skills, especially PostgreSQL and PL/pgSQL.
- Ability to analyze and optimize database query execution plans.
- Experience with ETL and data pipeline tools such as Airflow, Metaflow, or similar platforms.
- Experience with TimescaleDB or other time-series databases.
- Strong Python programming skills.
- Understanding of data warehousing concepts and best practices.

Preferred Qualifications

- Excellent problem-solving skills.
- Self-driven, proactive, and quick learner.
- Experience supporting data science and machine learning workflows.

Benefits

- Fully remote working environment.
- Opportunity to work with cutting-edge data technologies.
- Collaborative international team.
- Competitive compensation package.
- Performance-based incentives.
- Comprehensive benefits package.
- Strong focus on learning, development, and career growth.
- Flexible and autonomous working culture.

Recruitment Process

- HR Interview
- Hiring Manager Interview
- Take-home Assignment
- Culture Fit Interview

Philippines

ETL Developer

Coopers Group AG

recruiting in good company

Data Engineer | 1 day ago
Full Time | Remote | Team 51-200 | Since 2010 | H1B No Sponsor

Role Description

For our client in the healthcare sector in Bern, we are looking for an experienced, motivated, and open-minded ETL Database Developer. To strengthen the team, the client is seeking an experienced professional with solid ETL engineering expertise to support the integration and further development of modern data landscapes.

- Design, implementation, and continuous development of ETL load pipelines
- Connection and integration of new source systems into an MS SQL-based data warehouse
- Development, deployment, and operation of ETL processes with SSIS
- Transformation and harmonization of data from different sources
- Contribution to the further development of the existing DWH architecture
- Support in ensuring data quality and performance
- Analysis and resolution of errors in ETL processes, and monitoring of data flows

Qualifications

- Several years of experience in data warehousing and ETL development
- Very good SQL skills in the Microsoft SQL Server environment
- Solid experience with SSIS (development, deployment, and operation)
- Practical experience integrating and transforming heterogeneous data sources
- Good knowledge of data modeling for DWH and reporting solutions
- Experience in monitoring and error analysis of ETL processes
- Knowledge of BIML is an advantage
- Industry knowledge in healthcare or experience with clinical data is an advantage
- Business-fluent German is mandatory

Company Description

Coopers Group AG is an agile Swiss recruiting agency that places specialists and executives in IT, Life Sciences, Engineering, and Finance. With flexible approaches, we bring together candidates and companies that fit not only professionally but also on a personal level.

Switzerland
Contract | Remote | Team 501-1,000 | Since 2003 | H1B No Sponsor

• Plan and execute business-side data migration activities for Finance data objects in line with global, deployment, and country-specific requirements and timelines.
• Perform and coordinate data cleansing activities, ensuring completion within agreed schedules and quality standards.
• Complete data collection activities for manual and construction data objects.
• Collaborate with IT teams to prepare, validate, and maintain value mappings between source and target systems.
• Create and maintain master data lists for assigned Finance data objects where applicable.
• Provide business insights and detailed information to technical teams to support data extraction and conversion from legacy systems.
• Work closely with IT teams and Business Data Owners to identify and confirm country-specific data objects in scope.
• Ensure data readiness and data quality throughout the entire migration lifecycle for assigned Finance data objects.
• Verify that data is fit for purpose and aligned with business requirements and stakeholder expectations.
• Review and formally approve upload files before and after data loads.
• Perform manual (type-in) data loads into target systems where required.
• Execute dual maintenance activities to ensure data consistency across systems.
• Execute and approve data verification scripts and validation activities.
• Act as the functional Single Point of Contact (SPoC) for assigned Finance data objects during migration and Hypercare phases.
• Support defect management activities, ensuring timely resolution of data-related issues.

Brazil
Full Time | Remote | Team 201-500 | H1B No Sponsor

• You will plan, develop, and support the data engineering solutions that prepare, integrate, and make data accessible for analytical and operational use across ASD.
• You will work across multiple data sources and formats, build reusable NiFi components, and shape the data flow and integration strategies that the broader analytical capability depends on.
• Plan and drive the development of data engineering solutions, balancing functional and non-functional requirements.
• Build scalable and reusable NiFi templates and processors for data ingestion and transformation.
• Optimise performance and ensure high availability of data pipelines.
• Integrate data from multiple sources and formats into centralised systems.
• Perform data modelling to support efficient data storage, retrieval, and analytics.
• Provide expert advice on data integration and uplift the capability of others in applying good integration practice.
• Monitor application of data standards, architectures, and security to ensure compliance and scalability.
• Develop standards and guidelines for data protection, disaster recovery, and storage management.
• Contribute to organisational policies and guidelines for data engineering.
• Collaborate with cross-functional teams to define data flow and integration strategies.
• Lead the selection and development of data engineering methods, tools, and techniques.
• Develop and promote continuous integration, deployment, and monitoring practices.

Australia
$175 - $195 / hour