CEOX

Breaking Barriers is Easier With a Strong Circle. More than a network—it’s a highly vetted, mission-driven community.

Data Engineer

Data Engineer | Contract | Remote | Senior | Team 1-10 | Since 2019 | H1B No Sponsor

Location

United Kingdom

Posted

145 days ago

Salary

0

Seniority

Senior

Bachelor Degree | English | Azure | Cloud | ETL | PySpark | Python | SQL

Job Description

  • Build and maintain ETL pipelines and code in cloud data platforms.
  • Support ingestion activity and onboarding of new data sources.
  • Work with the Databricks Data Intelligence Platform and develop data engineering workloads.
  • Construct raw, refined and curated data layers; catalogue assets appropriately.
  • Validate solutions against functional and non-functional requirements.
  • Deliver datasets, transformations and performance-optimised data products.
  • Improve processes, engineering patterns, and reusable tooling.
  • Monitor and measure pipeline performance; support incident resolution.
  • Ensure documentation meets acceptance standards and is approved centrally.
  • Actively engage in Agile ceremonies and governance forums.

Job Requirements

  • Strong experience with Python, PySpark & SQL for data engineering.
  • Hands-on experience with Azure Databricks.
  • Strong knowledge of Azure Data Factory & Fabric.
  • Experience with Microsoft Purview.
  • Familiarity with event-driven data ingestion (Event Grid / Pub/Sub).
  • Understanding of SOLID principles, async programming, and Mediator/Factory patterns.
  • Experience delivering unit and integration testing in Databricks.
  • Knowledge of secure ETL design with Entra IMID/SCIM integration.
  • Understanding of Azure best practice, APIM, and platform governance.
  • Ability to build and serve Power BI models via Databricks data sources.
  • Prior experience working within UK Public Sector environments.

More Data Engineer Jobs

Lead Data Engineer

Greenbox Capital

Greenbox Capital offers funding solutions for small and mid-sized businesses with the aim of making working capital more accessible to all, even those considered at high risk. Emph

Data Engineer | 145 days ago

  • Design, develop, and maintain scalable data pipelines and ETL processes using Azure Data Factory, Azure Databricks, and other Azure services.
  • Own the data engineering framework, including pipeline patterns, orchestration standards, and reusable components.
  • Collaborate with data scientists, software engineers, analysts, and other stakeholders to understand data requirements and deliver high-quality data solutions.
  • Define, document, and enforce best practices for ADF, Databricks, Spark, and data modeling.
  • Implement and maintain data storage solutions using Azure SQL Database, Azure Data Lake Storage, and Azure Cosmos DB.
  • Ensure data quality and integrity by implementing data validation, cleansing, and transformation processes.
  • Implement data quality checks, validation frameworks, and monitoring for critical data assets.
  • Design and support governance patterns leveraging Databricks Unity Catalog and Azure-native controls.
  • Develop and maintain documentation for data engineering processes and solutions.

Florida
$150K - $170K / year
Job Closed

Data Architect – Microsoft Fabric

Arbi Arredobagno

The perfect design for every bathroom

Data Engineer | 145 days ago
Other | Remote | Team 51-200 | Since 1987 | H1B No Sponsor

  • Architect and deliver complex, multi-year data unification and Customer 360 solutions.
  • Design and implement Master Data Management (MDM) capabilities: identity resolution, match/merge logic, survivorship rules, and stewardship workflows.
  • Build and operate scalable, automated data pipelines across cloud and on-prem sources; hands-on experience with Profisee (MDM) and Fivetran (integration).
  • Develop and optimize solutions on Azure Data Lake, Azure Synapse, Azure Data Factory, Power BI, and Microsoft Fabric (including lakehouse/medallion patterns).
  • Establish and operationalize data governance frameworks: data cataloging, lineage tracking, stewardship operating models, access controls, and regulatory/compliance policies; experience integrating Microsoft Purview is a plus.
  • Define and monitor data quality baselines (accuracy, consistency, completeness, timeliness) and drive continuous improvement.
  • Create and evolve semantic models to support analytics, reporting, and AI scenarios; enforce standardized definitions and calculations.
  • Lead change management and stakeholder engagement to drive adoption and value realization.
  • Apply DataOps practices for observability, performance tuning, hypercare, and post–go-live optimization.
  • Collaborate with executive, business, and IT stakeholders; translate business needs into clear technical designs and roadmaps.

Texas
$118.4K - $185K / year
Job Closed
Full Time | Remote | Team 51-200 | H1B Sponsor

  • Develop, enhance, and troubleshoot Mainframe batch processes using JCL, Easytrieve, and SAS.
  • Build and maintain automation and data processing scripts using Python.
  • Support distributed data processing workloads using Apache Spark and the PySpark API.
  • Write efficient SQL queries for data extraction, analysis, and transformation.
  • Work with Google Cloud Platform (GCP) services - primarily Cloud Storage - for data movement and storage management.
  • Collaborate with data analysts, engineers, and business teams to support data initiatives and enhance data workflows.
  • Participate in documentation, code reviews, and best practices for data and code quality.
  • Investigate data issues, perform root-cause analysis, and implement corrective actions.

India
Job Closed

Remote Sensing Data Engineer

Living Carbon

Public benefit company with a mission to fight climate change by enhancing CO2 capture and storage in trees

Data Engineer | 145 days ago
Other | Remote | Team 11-50 | Since 2020 | H1B No Sponsor

  • Conduct remote sensing analytics and modeling.
  • Develop scalable analytical and reporting tools.
  • Manage GIS data collection, storage, and version control across team members.
  • Engage in strategic planning and process improvement.
  • Support collaboration with Land and Forestry teams.
  • Manage and analyze large datasets offline and in cloud computing and storage platforms.
  • Analyze large and complex geospatial datasets and remote sensing data.
  • Design and implement novel predictive, statistical, and machine learning models related to forestry, land use, carbon sequestration, biodiversity, conservation planning, and climate resilience.
  • Automate statistical and geospatial analysis processes using Python, R, or other programming languages.
  • Create clear and impactful reporting tools to communicate geospatial information and insights.
  • Maintain and update internal geospatial databases, ensuring data quality, consistency, and version control.
  • Integrate data from Land, Forestry, and Carbon teams to support commercial initiatives.
  • Ensure high standards of data accuracy and ethical use in all geospatial analyses and models.
  • Conduct quality control checks on geospatial datasets.
  • Provide technical mapping support to other team members as needed.
  • Work closely with Land, Forestry, and Carbon teams to uncover new operational insights.
  • Identify opportunities to improve geospatial workflows and contribute to the development of best practices.
  • Support research and development efforts in geospatial analytics and remote sensing applications.

United States
$130K - $140K / year
Job Closed