Sigma Software Group logo
Sigma Software Group

We support enterprises, product houses, and startups with custom software solutions development and IT consulting.

Senior Data Engineer

Data EngineerData EngineerFull TimeRemoteSeniorTeam 1,001-5,000Since 2002H1B No SponsorCompany SiteLinkedIn

Location

Poland

Posted

2 days ago

Salary

0

Seniority

Senior

Job Description

Senior Data Engineer

Sigma Software Group

• Design and build scalable, cloud-native data platforms from greenfield to production • Implement near-real-time ingestion pipelines using event-driven patterns • Define and enforce platform standards, including Data Lake / Lakehouse principles, medallion architecture, and data contracts • Refactor and optimise existing Spark and PySpark scripts for performance and maintainability • Introduce best practices for code quality, testing, and CI/CD across data pipelines • Drive adoption of AI tooling and agentic workflows within the data engineering team • Ensure data quality, observability, and reliability across all pipelines and platforms • Develop self-service tooling and microservices to simplify platform usage for other teams

Job Requirements

  • 5+ years of professional experience in Data Engineering
  • Strong Python and SQL development skills for pipeline development and optimisation
  • Proficiency in Apache Spark / PySpark, including query optimisation and performance tuning
  • Hands-on experience with Databricks (preferred) or Snowflake
  • Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
  • Experience with stream processing technologies (Kafka, Spark Structured Streaming)
  • Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
  • Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
  • Knowledge of Infrastructure as Code (Terraform or equivalent)
  • Understanding of production-grade system requirements: reliability, scalability, observability, and performance
  • Upper-Intermediate English level WILL BE A PLUS
  • Familiarity with RAG pipeline design and LLM integration patterns
  • Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
  • Experience with dbt for data transformation and modelling
  • Familiarity with MLflow, Feature Stores, or ML platform integration

Benefits

  • Employees can work remotely

Related Categories

Related Job Pages

More Data Engineer Jobs

Bertoni Solutions logo

Lead Data Engineer

Bertoni Solutions

Translating technology into your success

Data Engineer2 days ago
ContractRemoteTeam 11-50Since 2016H1B No Sponsor

• Work with one of our U.S.-based clients. • Develop and implement data pipelines using PySpark. • Collaborate with cross-functional teams including machine learning engineers, software developers, analytics engineers, and product managers. • Translate business requirements into highly available data solutions. • Maintain and improve existing data pipelines and systems.

Brazil
$30 / hour
Full TimeRemoteTeam 10,001+H1B No Sponsor

• Design, develop, and maintain robust ELT/ETL pipelines • Build and optimize data models in cloud-based data warehouses • Implement and manage orchestration frameworks • Develop streaming and real-time data pipelines • Architect and provision cloud data infrastructure on GCP • Ensure all data systems comply with HIPAA, HITECH, and applicable state privacy regulations • Implement data quality frameworks • Mentor junior data engineers

United States
$135K - $165K / year
Full TimeRemoteTeam 1,001-5,000H1B Sponsor

• Collaboration with U.S-based client stakeholders on solution design/definition. Identify solutions that solve business problems, translating requirements into technical specifications and actionable work. • Writing code and implementing the proposed solutions • Creating data pipelines, versioning and change management • Manage the complexity inherent in versioned data pipelines • Develop ETL/ELT processes to help extract and manipulate data from multiple sources. • Design, build and maintain batch or real-time data pipelines in production. • Automate data workflows such as data ingestion, aggregation, and ETL processing. • Logging and instrumentation of pipelines and services. • Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures.

Costa Rica
Full TimeRemoteTeam 1,001-5,000H1B Sponsor

• Collaboration with U.S-based client stakeholders on solution design/definition. Identify solutions that solve business problems, translating requirements into technical specifications and actionable work. • Writing code and implementing the proposed solutions • Creating data pipelines, versioning and change management • Manage the complexity inherent in versioned data pipelines • Develop ETL/ELT processes to help extract and manipulate data from multiple sources. • Design, build and maintain batch or real-time data pipelines in production. • Automate data workflows such as data ingestion, aggregation, and ETL processing. • Logging and instrumentation of pipelines and services. • Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures.

Guatemala