BHFT

Step into the state-of-the-art algorithmic trading solutions

Market Data Engineer – Domain Trading Expertise Required

Data Engineer | Full Time | Remote | Senior | Team 11-50 | H1B No Sponsor

Location

Spain

Posted

6 days ago

Salary

0

Seniority

Senior

Job Description

  • Capture & Ingestion: Own the full capture path from wire to lake
  • Build batch + stream pipelines (Airflow, Spark, dbt) for tick and reference data
  • Own L2/L3 order-book reconstruction with gap handling
  • Storage & Modeling: Own the Iceberg-over-S3 lakehouse
  • Drive storage cost optimisation via compaction, tiering, and snapshot expiry
  • Tooling & Libraries: Build libraries for schema management, data contracts, validation, and lineage
  • Reliability & Observability: Embed monitoring, alerting, SLAs/SLOs, and CI/CD across capture and pipeline layers on Kubernetes
  • Collaboration: Partner with Quant Research, Data Science, Backend, and DevOps to translate requirements into platform capabilities

Job Requirements

  • 5+ years building production-grade data systems
  • Proven expertise architecting and launching data lakes / lakehouses from scratch
  • Hands-on experience with Apache Iceberg (or comparable table formats — Delta / Hudi)
  • Experience with market data and/or network packet capture
  • Experience normalizing market data from multiple vendors
  • Expert-level Python (incl. Polars and/or PySpark); Rust a strong plus
  • Modern orchestration (Airflow) and distributed processing (Apache Spark)
  • Advanced SQL: complex aggregations, window functions, query optimization, partition pruning
  • Solid fundamentals in Linux, containerization (Docker, Kubernetes / EKS), and cloud object storage (AWS S3)
  • DevOps & observability: CI/CD, infrastructure-as-code (Terraform), GitOps (ArgoCD)
  • Strong grasp of structured + unstructured / binary data, and storage optimization
  • English fluency for documentation and collaboration in an international team

Benefits

  • Compensation for health insurance
  • Compensation for sports
  • Compensation for professional development
  • Fully remote work from anywhere in the world, on a flexible schedule

More Data Engineer Jobs


Data Engineer Manager

Blend360

Optimizing business performance through people, data, tech & analytics

Data Engineer | 6 days ago
Full Time | Remote | Team 501-1,000 | H1B Sponsor

  • Lead, design, and scale data solutions to support Journey Analytics initiatives, focusing on code quality, reusability, and reliable data platforms.
  • Set the technical direction, overseeing the evolution of data architectures.
  • Lead a team of data engineers to deliver high-quality, performant datasets for analytics and reporting use cases.
  • Mentor a team of data engineers, fostering best practices in coding, architecture, and data engineering standards.
  • Drive the technical strategy for Journey Analytics data platforms, ensuring scalability, maintainability, and performance.
  • Oversee maintenance, optimization, and automation of code repositories in GitHub.
  • Guide the refactoring of legacy codebases to improve maintainability, scalability, and reusability.
  • Drive the design and implementation of modular, reusable data components.
  • Oversee development and management of automated data pipelines in Databricks.
  • Ensure data quality, governance, performance, and reliability across all data pipelines and datasets.

Colombia

Data Engineer Manager

Blend360

Optimizing business performance through people, data, tech & analytics

Data Engineer | 6 days ago
Full Time | Remote | Team 501-1,000 | H1B Sponsor

  • Lead, design, and scale data solutions to support Journey Analytics initiatives, with a strong focus on code quality, reusability, and reliable data platforms.
  • Set the technical direction, oversee the evolution of data architectures, and lead a team of data engineers to deliver high-quality, performant datasets for analytics and reporting use cases.
  • Mentor a team of data engineers, fostering best practices in coding, architecture, and data engineering standards.
  • Define and drive the technical strategy for Journey Analytics data platforms, ensuring scalability, maintainability, and performance.
  • Oversee maintenance, optimization, and automation of code repositories in GitHub, ensuring high-quality and consistent development practices.
  • Guide the refactoring of legacy codebases to improve maintainability, scalability, and reusability across multiple use cases.
  • Drive the design and implementation of modular, reusable data components to support multiple journeys and reduce duplication.
  • Oversee development and management of automated data pipelines in Databricks, ensuring reliability and scalability for downstream consumption.
  • Establish and enforce standards for scalable data modeling to support current and future analytics use cases.
  • Ensure data quality, governance, performance, and reliability across all data pipelines and datasets.
  • Partner with analytics, product, and engineering stakeholders to align data solutions with business needs and priorities.
  • Proactively identify risks, bottlenecks, and improvement opportunities, and drive mitigation strategies at a team and platform level.
  • Promote continuous improvement of data processes, documentation, and engineering practices.

Argentina

Data Engineer (Databricks + PySpark)

Capco

Capco, a Wipro company, is a management & technology consultancy dedicated to the financial services & energy industries

Data Engineer | 6 days ago
Full Time | Remote | Team 1,001-5,000 | Since 1998 | H1B Sponsor

Job Title: Data Engineer (PySpark / Databricks)
Experience: 5–9 Years
Location: Pune (Hybrid – Capco Office)

Job Summary

We are looking for a skilled Data Engineer with strong expertise in PySpark, Databricks, and modern data engineering practices. The ideal candidate will have hands-on experience in building scalable data pipelines, working with large datasets, and leveraging cloud-based data platforms.

Key Responsibilities

  • Design, develop, and maintain scalable ETL/ELT data pipelines
  • Work extensively with PySpark and Apache Spark for large-scale data processing
  • Build and manage workflows using Apache Airflow
  • Develop and optimize data solutions on Databricks (Jobs, Delta Lake)
  • Work with cloud-based data lakes (S3 or equivalent)
  • Write efficient and complex SQL queries for data transformation and analysis
  • Run and manage Spark workloads on EMR Serverless or other managed Spark platforms
  • Ensure data quality, reliability, and performance optimization of pipelines

Must Have Skills

  • Strong hands-on experience with PySpark and Apache Spark internals
  • Experience with Databricks (Jobs, Delta Lake)
  • Proficiency in Apache Airflow for workflow orchestration
  • Solid experience building ETL/ELT pipelines at scale
  • Strong SQL skills and experience with Data Warehouse (DWH) systems
  • Experience running Spark workloads on EMR Serverless or managed Spark platforms
  • Hands-on experience with cloud data lakes (S3 or equivalent)

Good to Have Skills

  • Experience with Delta Lake / Apache Iceberg
  • Exposure to streaming frameworks (Spark Structured Streaming, Kafka)
  • Familiarity with CI/CD pipelines for data engineering workflows
  • Knowledge of data governance, cataloging, and lineage tools

India

Data Engineer

Reply

Reply designs and implements innovative solutions in the areas of Digital Services, Technology, and Consulting.

Data Engineer | 6 days ago
Full Time | Remote | Team 10,001+ | Since 1996 | H1B Sponsor

  • Design, develop, and maintain data pipelines and transformations in Palantir Foundry (PySpark, SQL, Code Workbook).
  • Build and manage ontologies, object types, and relationships within the Foundry Ontology layer.
  • Implement and support AI/ML workflows and agents using Palantir AIP.
  • Collaborate with business stakeholders to translate requirements into scalable data solutions.
  • Ensure data quality, governance, and security best practices across all pipelines.
  • Create and maintain dashboards and data applications within Foundry Workshop.
  • Provide technical guidance and documentation for data architecture decisions.

Brazil