Huzzle.com

The human intelligence platform for training and evaluating AI

Data Engineer

Data Engineer | Full Time | Remote | Senior | Team 51-200 | H1B No Sponsor

Location

South Africa

Posted

2 days ago

Salary

Seniority

Senior

Bachelor Degree | 3 yrs exp | English

Airflow Amazon Redshift AWS Azure BigQuery Cloud Distributed Systems ETL Google Cloud Platform Java Python Scala SQL

Job Description

• Design, develop, and maintain scalable ETL and ELT pipelines.
• Build and optimize data architectures, databases, and data warehouses.
• Integrate data from multiple sources, APIs, and third-party platforms.
• Ensure data quality, consistency, reliability, and security.
• Monitor and troubleshoot data pipelines and workflows.
• Collaborate with analytics, engineering, and business teams to understand data requirements.
• Implement data governance and best practices for data management.
• Optimize data storage, processing performance, and query efficiency.
• Support reporting, business intelligence, and analytics initiatives.
• Document data models, workflows, and technical processes.

Job Requirements

3+ years of experience in data engineering, data warehousing, or related roles.
Strong proficiency in SQL and database management.
Experience with Python, Java, Scala, or similar programming languages.
Hands-on experience with ETL/ELT tools and data pipeline development.
Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform.
Experience with modern data warehouses such as Snowflake, Redshift, BigQuery, or Databricks.
Familiarity with orchestration tools such as Airflow or Prefect.
Understanding of data modeling, data governance, and data quality principles.
Experience working with large datasets and distributed systems is preferred.
Strong analytical, problem-solving, and communication skills.
Ability to work independently in a remote, collaborative environment.

Benefits

💰 **Competitive salary:** Based on experience, technical expertise, and location.
🌎 **Fully remote role:** Work from anywhere with flexible working arrangements.
🚀 **Career growth opportunities:** Join innovative companies and work on impactful projects.
📈 **Long-term opportunities:** Build your career with growing global organizations.
🧠 **Continuous learning:** Exposure to modern data technologies, cloud platforms, and large-scale data systems.
🤝 **Collaborative culture:** Work alongside talented engineers, analysts, and business leaders.
⚙️ **Cutting-edge technology:** Gain experience with modern data stacks and cloud-native solutions.

More Data Engineer Jobs

Senior Data Engineer – Surveillance & Interoperability

inventYOU IT Consulting

Empowering Tech

Data Engineer | 2 days ago

Full Time | Remote | Team 11-50 | Since 2017 | H1B No Sponsor

• Design and implement data integration and interoperability solutions across multiple systems and platforms
• Design and maintain scalable data architectures and integration frameworks
• Develop and maintain ETL/ELT pipelines for data ingestion, transformation, validation, and harmonisation
• Design and implement interoperability middleware and API-based integration services
• Develop and maintain data exchange mechanisms between heterogeneous systems
• Design and implement canonical data models and metadata harmonisation processes
• Support the implementation of surveillance, monitoring, and reporting data solutions
• Develop and maintain cloud-based and hybrid data integration platforms
• Ensure data quality, consistency, reliability, and performance across data ecosystems
• Contribute to technical architecture decisions and interoperability standards implementation
• Produce technical documentation, architecture artefacts, and implementation guidelines

AWS Azure Cloud Distributed Systems ETL Google Cloud Platform Python SQL Go

Greece

Software Developer / Data Engineer

Sigma Software

Our Customer is a Sweden-based AdTech company specializing in advanced self-serve advertising platforms that automate direct transactions between advertisers and major global publishers. Their technology removes traditional friction in ad sales by enabling automation, transparency, and operational efficiency at scale. Platforms are trusted by internationally recognized publishers including TripAdvisor, Bloomberg, The Washington Post, Opera, and Dow Jones, handling millions of transactions worldwide. The project is a strategic architectural transformation toward a Platform-First approach. The company is transitioning from monolithic, client-specific implementations to a standardized, API-driven, multi-tenant ecosystem of reusable microservices. These services power the entire product suite, remaining independent, scalable, and decoupled from frontend or customer-specific customization.

Data Engineer | 2 days ago

Full Time | Remote | Team 1,001-5,000

Role Description

Are you passionate about building scalable, high-performance data platforms? We are looking for a Senior Software Developer / Data Engineer to join our team and work remotely from Europe or Ukraine. In this role, you will contribute to a cutting-edge education technology initiative, designing and implementing a unified data integration platform that will serve as the foundation for an entire ecosystem.

Why join us?

You will work with modern technologies, collaborate with top engineers, and make an impact on a project that improves data governance, privacy, and compliance across the education sector.

Job Description

- Design and evolve the platform’s canonical data model
- Architect and implement scalable data ingestion and synchronization frameworks
- Build reliable, replayable, and highly scalable data processing pipelines
- Design entity resolution and identity management capabilities
- Define and implement data governance, lineage, and provenance standards
- Drive architectural decisions related to scalability, resiliency, security, and operability
- Collaborate with product and engineering teams to define platform capabilities and roadmap
- Design APIs and delivery mechanisms for downstream consumers
- Support customer migrations from legacy systems into the new platform
- Mentor engineers and contribute to technical leadership across the team
- Ensure platform reliability, performance, and operational excellence

Qualifications

- 5+ years of experience as a Software Developer or Data Engineer
- Strong expertise in Java
- Deep understanding of data platforms, integration platforms, or distributed systems
- Hands-on experience with stream processing technologies (Apache Beam, Spark, Flink, Kafka Streams)
- Experience designing and operating event-driven architectures
- Strong data modeling and schema design skills
- Experience with cloud-native architectures (GCP preferred)
- Experience designing multi-tenant SaaS platforms
- Strong understanding of data governance, lineage, data quality, and observability concepts
- Proven ability to drive architectural decisions and technical initiatives
- Strong communication and collaboration skills
- Upper-Intermediate+ English level

Requirements

- Experience with Scala
- Experience building Master Data Management (MDM) platforms
- Experience with Customer Data Platforms (CDP)
- Experience with identity resolution systems
- Experience with Apache Beam/Dataflow in production environments
- Experience with Cloud Spanner
- Experience migrating large-scale legacy systems to cloud-native architectures
- Experience leading platform engineering or data platform initiatives
- Background in education technology or enterprise integration domains

Benefits

- Strong architectural thinking and technical leadership skills
- Passion for building high-performance, scalable platforms
- Comfortable mentoring peers and collaborating across teams
- Proactive problem-solver with attention to detail
- Committed to quality and operational excellence

Java Distributed Systems Apache Beam Apache Spark Apache Flink Apache Kafka GCP Observability/Monitoring Scala

Europe

Senior AI Data Engineer

Accertify, Inc.

Data Engineer | 2 days ago

Full Time | Remote

Role Description

We are seeking an experienced Senior AI Engineer to lead the design, development, and deployment of intelligent systems that drive business impact. You will own end-to-end AI solutions, set technical direction, and mentor other engineers, working across machine learning, LLMs, retrieval systems, agentic workflows, and the data layers that feed them. This role requires deep technical expertise and a track record of shipping production AI systems at scale.

- AI Architecture: Architect AI systems end-to-end. Set technical direction. Review and mentor others’ designs.
- LLM Strategy: Define how we use LLMs, routing, agentic orchestration, prompt patterns, fine-tune vs. prompt trade-offs, and model selection.
- Production Ownership: Own deployment patterns across on-prem and cloud (Kubernetes, container infrastructure, secure integration). Drive operational excellence.
- Data Architecture: Consult on the data layer, medallion design (bronze/silver/gold), RAG retrieval stores, schema evolution, drift detection, and query performance at scale.
- Evaluation Framework: Define how we measure AI quality, use golden datasets, conduct judge-based eval, enable production observability (OpenTelemetry, Phoenix, or similar), and perform behavior testing.
- Technical Leadership: Mentor AI Engineers, lead code and design reviews, set engineering standards for the AI team, and drive cross-team initiatives with Data Science and Data Engineering teams.
- Security & PII: Set the bar for handling sensitive customer data, secure system design, authentication, PII safety, and compliance-aware patterns.
- Collaboration: Partner with product managers, data scientists, engineers, and stakeholders to align AI solutions with business needs.

Qualifications

- Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field (or equivalent experience).
- 5+ years of production software experience.
- Strong proficiency with AI/ML frameworks (OpenAI SDK, Hugging Face transformers, agentic orchestration frameworks).
- Strong Python AND strong SQL, both are first-class requirements. Comfortable with query optimization, indexing strategy, and data modeling.
- Demonstrated experience designing and operating medallion architectures (bronze/silver/gold) at scale.
- Experience handling data at scale, partitioning, batch pipeline design, query performance, and schema evolution.
- Production LLM experience, prompt engineering, retrieval (RAG), agentic orchestration, fine-tuning trade-offs.
- Experience with vector databases (pgvector, Pinecone, Weaviate, etc.) and production RAG patterns.
- Experience with AI observability and evaluation frameworks, OpenTelemetry, Phoenix or equivalent, golden-set eval, and judge-based eval methodology.
- Strong understanding of secure system design, authentication, API integrations, and PII-sensitive data handling.
- Track record of mentoring engineers and shipping cross-team initiatives.
- Strong problem-solving skills and ability to explain technical solutions to non-technical audiences.

Preferred Skills

- Hands-on experience with Model Context Protocol (MCP), building tools, OAuth flows, or MCP server integrations.
- Experience leading data engineering or data platform initiatives end-to-end.
- Experience with agentic AI frameworks (e.g., the OpenAI Agents SDK) in production.
- Experience deploying production systems across both cloud (AWS, GCP, Azure) and on-prem environments.
- Experience in fraud detection, financial services, or other PII-sensitive domains.
- Knowledge of MLOps tools and CI/CD for AI.
- Open-source contributions, conference talks, or technical writing in the AI/ML space.

Benefits

- Health & Wellness: Medical, dental, and vision coverage for you and your family.
- Time Off: Paid time off, holidays, and personal days to maintain work-life balance.
- Financial Growth: Competitive compensation, performance-based rewards, and local retirement or savings plans where applicable, along with financial education resources.
- Career Development: Training programs, mentorship opportunities, and growth potential within the company.
- Wellness Support: Mental health resources, fitness perks, and wellness programs.
- Extras & Perks: Commuter benefits, employee discounts, and company-sponsored events.

Additional Details

- Potential Salary Range: $140-180k. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations.
- This role is eligible for remote US-based (work-from-home) locations. Chicago-based candidates would work in a hybrid capacity from the Accertify HQ located in Itasca, IL.
- Visa Sponsorship: employment eligibility to work for Accertify in the U.S. is required, as Accertify will not pursue Visa sponsorship for this position.

AI AI/ML LLM Kubernetes Observability/Monitoring OpenTelemetry Phoenix Data Engineering OpenAI API SDK Hugging Face Python SQL Pinecone Weaviate OAuth AWS GCP Azure CI/CD

United States

$140K - $180K / year

Staff Data Engineer

Jellyfish

Your Platform to Perform

Data Engineer | 2 days ago

Full Time | Remote | Team 1,001-5,000 | Since 2017 | H1B No Sponsor

About Jellyfish

Jellyfish is rewriting the manual on how high-performing engineering teams actually work within an AI world. We're seeking a Staff Data Engineer who's passionate about building and scaling robust data infrastructure, defining integration patterns, and driving technical strategy across large-scale systems. Ideally, you combine deep hands-on expertise with the ability to influence architecture and direction across teams. This is a high-agency role where you'll help shape not just what gets built, but how we build it, and raise the bar for the engineers around you.

What You'll Do

- Define the architecture and long-term technical direction, design and implementation of our data pipelines and integrations platform for storage, transformation and export at scale.
- Drive integrations with third-party tools and establish the standards that make future integrations faster and more reliable
- Own our infrastructure as code strategy using Terraform, and contribute to how we evolve our cloud data infrastructure
- Architect workflow orchestration systems for near-real-time and batch data processing, with a focus on reliability and scalability
- Lead the design and implementation of data export pipelines that serve diverse customer needs
- Spearhead development of internal tooling and agentic workflows that meaningfully accelerate engineering velocity across the org
- Serve as a technical anchor: leading design reviews, mentoring engineers, and elevating how the team approaches complex problems
- Partner with engineering leadership on roadmap prioritization, cross-team dependencies, and org-wide technical strategy
- Communicate clearly about technical decisions, tradeoffs, and project status to both engineers and non-technical stakeholders

About You

- You have deep expertise building and scaling data pipelines, and a track record of owning complex data infrastructure end-to-end, not just contributing to it
- AI-driven developer who dives deep into AI-first workflows
- Expert Python engineer with strong opinions about how to build reliable, maintainable systems at scale
- Designed and led large-scale third-party API integration work, and you know what separates a maintainable integration from a brittle one
- Fluent in infrastructure as code (Terraform, CloudFormation, or similar) and think about infrastructure as a product, not an afterthought
- Experienced with AI/LLM tool integrations (Claude, Copilot, etc.) and understand the unique infrastructure demands they create
- You have production experience with workflow orchestration tools (Prefect, Airflow, Dagster) and can make the hard architectural calls
- A natural technical leader: you build consensus, unblock others, and make the engineers around you better
- You think in systems: you consider user needs, business impact, and long-term maintainability when designing solutions
- Exceptional communicator who can write a design doc, lead a review, and distill a complex tradeoff for any audience
- Located in EST or CST time zone

Bonus points if:

- You've thrived at a rapidly scaling startup and know how to balance speed with technical rigor
- You have hands-on experience with Delta Lake and lakehouse architectures
- You've built deep integrations with developer tools (Jira, GitHub, GitLab, CI/CD systems) and have strong opinions about how to do it well
- You bring strong perspectives on how engineering teams work best, and the tools that help or hurt

A list of job experiences and qualification requirements is great, but humility, a performance-driven attitude, and a team-player approach are most important to us. We love to have fun and win in the process. We only hire people who have a passion for building great companies in an environment where a sense of humor is a must.

Occasional travel may be required.

Applicants must be authorized to work for any employer in the US. We are unable to sponsor or take over sponsorship of an employment visa at this time.

Let’s talk about us!

This is all about you, but you want to know a little about us. Jellyfish enables leaders to effectively build AI-integrated engineering teams, align engineering decisions with business initiatives and deliver the right software efficiently and on time. AI tools alone won’t transform your org; Jellyfish shows you what’s working, what’s not, and how to build high-performing teams that know how to use AI the right way.

United States

$200K - $260K / year
