Sigma Software Group

We support enterprises, product houses, and startups with custom software solutions development and IT consulting.

Senior Data Engineer

Data EngineerData EngineerFull Time Remote SeniorTeam 1,001-5,000Since 2002H1B No SponsorCompany Site LinkedIn

Location

Poland

Posted

2 days ago

Salary

Seniority

Senior

Bachelor Degree5 yrs expEnglishAirflow Apache AWS Azure Cloud ETL Google Cloud Platform Kafka Microservices PySpark Python Spark SQL Terraform Unity Vault

Job Description

• Design and build scalable, cloud-native data platforms from greenfield to production • Implement near-real-time ingestion pipelines using event-driven patterns • Define and enforce platform standards, including Data Lake / Lakehouse principles, medallion architecture, and data contracts • Refactor and optimise existing Spark and PySpark scripts for performance and maintainability • Introduce best practices for code quality, testing, and CI/CD across data pipelines • Drive adoption of AI tooling and agentic workflows within the data engineering team • Ensure data quality, observability, and reliability across all pipelines and platforms • Develop self-service tooling and microservices to simplify platform usage for other teams

Job Requirements

5+ years of professional experience in Data Engineering
Strong Python and SQL development skills for pipeline development and optimisation
Proficiency in Apache Spark / PySpark, including query optimisation and performance tuning
Hands-on experience with Databricks (preferred) or Snowflake
Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
Experience with stream processing technologies (Kafka, Spark Structured Streaming)
Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
Knowledge of Infrastructure as Code (Terraform or equivalent)
Understanding of production-grade system requirements: reliability, scalability, observability, and performance
Upper-Intermediate English level WILL BE A PLUS
Familiarity with RAG pipeline design and LLM integration patterns
Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
Experience with dbt for data transformation and modelling
Familiarity with MLflow, Feature Stores, or ML platform integration

Benefits

Employees can work remotely

Related Categories

Data Engineer

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Lead Data Engineer

Bertoni Solutions

Translating technology into your success

Data Engineer2 days ago

Contract RemoteTeam 11-50Since 2016H1B No Sponsor

Company Site LinkedIn

• Work with one of our U.S.-based clients. • Develop and implement data pipelines using PySpark. • Collaborate with cross-functional teams including machine learning engineers, software developers, analytics engineers, and product managers. • Translate business requirements into highly available data solutions. • Maintain and improve existing data pipelines and systems.

Azure Cloud IoT PySpark SQL

View details: Lead Data Engineer

Brazil

$30 / hour

Apply

Senior Data Engineer

IKS Health

Data Engineer2 days ago

Full Time RemoteTeam 10,001+H1B No Sponsor

Company Site LinkedIn

• Design, develop, and maintain robust ELT/ETL pipelines • Build and optimize data models in cloud-based data warehouses • Implement and manage orchestration frameworks • Develop streaming and real-time data pipelines • Architect and provision cloud data infrastructure on GCP • Ensure all data systems comply with HIPAA, HITECH, and applicable state privacy regulations • Implement data quality frameworks • Mentor junior data engineers

Amazon Redshift BigQuery Cloud ETL Google Cloud Platform Python SQL

View details: Senior Data Engineer

United States

$135K - $165K / year

Apply

Senior Data Engineer, PowerBI, Data Integration

3Pillar Global

Building digital businesses, together.

Data Engineer2 days ago

Full Time RemoteTeam 1,001-5,000H1B Sponsor

Company Site LinkedIn

• Collaboration with U.S-based client stakeholders on solution design/definition. Identify solutions that solve business problems, translating requirements into technical specifications and actionable work. • Writing code and implementing the proposed solutions • Creating data pipelines, versioning and change management • Manage the complexity inherent in versioned data pipelines • Develop ETL/ELT processes to help extract and manipulate data from multiple sources. • Design, build and maintain batch or real-time data pipelines in production. • Automate data workflows such as data ingestion, aggregation, and ETL processing. • Logging and instrumentation of pipelines and services. • Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures.

Airflow Amazon Redshift AWS Azure BigQuery Cloud EC2 ETL Google Cloud Platform Hadoop Numpy Pandas Python Spark SQL Tableau

View details: Senior Data Engineer, PowerBI, Data Integration

Costa Rica

Apply