We support enterprises, product houses, and startups with custom software solutions development and IT consulting.
Senior Data Engineer
Location
Poland
Posted
2 days ago
Salary
0
Seniority
Senior
Job Description
Senior Data Engineer
Sigma Software Group
• Design and build scalable, cloud-native data platforms from greenfield to production • Implement near-real-time ingestion pipelines using event-driven patterns • Define and enforce platform standards, including Data Lake / Lakehouse principles, medallion architecture, and data contracts • Refactor and optimise existing Spark and PySpark scripts for performance and maintainability • Introduce best practices for code quality, testing, and CI/CD across data pipelines • Drive adoption of AI tooling and agentic workflows within the data engineering team • Ensure data quality, observability, and reliability across all pipelines and platforms • Develop self-service tooling and microservices to simplify platform usage for other teams
Job Requirements
- 5+ years of professional experience in Data Engineering
- Strong Python and SQL development skills for pipeline development and optimisation
- Proficiency in Apache Spark / PySpark, including query optimisation and performance tuning
- Hands-on experience with Databricks (preferred) or Snowflake
- Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
- Experience with stream processing technologies (Kafka, Spark Structured Streaming)
- Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
- Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
- Knowledge of Infrastructure as Code (Terraform or equivalent)
- Understanding of production-grade system requirements: reliability, scalability, observability, and performance
- Upper-Intermediate English level WILL BE A PLUS
- Familiarity with RAG pipeline design and LLM integration patterns
- Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
- Experience with dbt for data transformation and modelling
- Familiarity with MLflow, Feature Stores, or ML platform integration
Benefits
- Employees can work remotely
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Work with one of our U.S.-based clients. • Develop and implement data pipelines using PySpark. • Collaborate with cross-functional teams including machine learning engineers, software developers, analytics engineers, and product managers. • Translate business requirements into highly available data solutions. • Maintain and improve existing data pipelines and systems.
• Design, develop, and maintain robust ELT/ETL pipelines • Build and optimize data models in cloud-based data warehouses • Implement and manage orchestration frameworks • Develop streaming and real-time data pipelines • Architect and provision cloud data infrastructure on GCP • Ensure all data systems comply with HIPAA, HITECH, and applicable state privacy regulations • Implement data quality frameworks • Mentor junior data engineers
• Collaboration with U.S-based client stakeholders on solution design/definition. Identify solutions that solve business problems, translating requirements into technical specifications and actionable work. • Writing code and implementing the proposed solutions • Creating data pipelines, versioning and change management • Manage the complexity inherent in versioned data pipelines • Develop ETL/ELT processes to help extract and manipulate data from multiple sources. • Design, build and maintain batch or real-time data pipelines in production. • Automate data workflows such as data ingestion, aggregation, and ETL processing. • Logging and instrumentation of pipelines and services. • Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures.
Senior Data Engineer – PowerBI, Data Integration
3Pillar GlobalBuilding digital businesses, together.
• Collaboration with U.S-based client stakeholders on solution design/definition. Identify solutions that solve business problems, translating requirements into technical specifications and actionable work. • Writing code and implementing the proposed solutions • Creating data pipelines, versioning and change management • Manage the complexity inherent in versioned data pipelines • Develop ETL/ELT processes to help extract and manipulate data from multiple sources. • Design, build and maintain batch or real-time data pipelines in production. • Automate data workflows such as data ingestion, aggregation, and ETL processing. • Logging and instrumentation of pipelines and services. • Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures.



