Datafold logo
Datafold

Automated testing for data engineers

Forward Deployed Data Engineer

Data EngineerData EngineerFull TimeRemoteSeniorTeam 11-50H1B No SponsorCompany SiteLinkedIn

Location

Europe

Posted

14 hours ago

Salary

€85K - €135K / year

Seniority

Senior

Bachelor Degree3 yrs expEnglishETL

Job Description

Forward Deployed Data Engineer

Datafold

• Own 1–4 concurrent migration projects end-to-end: scoping, planning, execution, and customer handoff • Be the primary customer contact: run weekly check-ins, manage stakeholder expectations, and escalate risks early before they compound • Configure Datafold's Migration Agent and oversee the migration execution • Partner with Datafold's engineering team to execute migrations • Help refine and scale our product and delivery playbook as the team grows

Job Requirements

  • 3–6 years in data consulting, professional services, or a customer-facing data engineering role
  • Excellent communication skills — equally comfortable in an exec check-in and a technical design session
  • Strong grasp of the modern data stack: dbt, Snowflake, Databricks, orchestration tools, and major patterns (stored procedures, streaming, incremental processing)
  • Extreme ownership mentality — you identify, surface, and fix problems and rally the team to help without being told
  • AI power user — using AI every day and always learning and improving on how to use it more effectively
  • Exposure to legacy data stack and patterns (ETL, stored procedures, etc.) and data platform migration projects is a strong plus

Benefits

  • Competitive salary + equity

Related Categories

Related Job Pages

More Data Engineer Jobs

ContractRemoteTeam 501-1,000Since 2006H1B No Sponsor

• Serve as the strategic architect and technical anchor for Masdar’s global digital ecosystem. • Spearhead the Enterprise Digital function, designing a flexible, best-of-breed composable architecture that seamlessly bridges modern corporate systems with agile, field-level renewable asset operations. • Own and steer the global Data Management & Data Governance strategy, defining the universal data taxonomies that transform cross-border operational insight into a competitive advantage. • Build up high-performing domain architects, to safeguard and scale our Digital Architecture and infrastructure, as well as a robust team of competent data management specialists. • Empower Masdar to continue scaling and expanding securely, rapidly, and intelligently, and powering the world's clean energy future.

United Arab Emirates
H&R Block logo

Senior Data Engineer - Marketing Technologies

H&R Block

With expert guidance, upfront pricing, and more ways to file, it’s #BetterWithBlock.

Data Engineer15 hours ago
Full TimeRemoteTeam 10,001+Since 1955

• Design, develop and test enterprise MarTech platforms for data engineering using SQL, Azure Data engineering skills including Azure Data Factory, Databricks/Fabric technologies • Proficiency in Azure-based cloud technologies to support data needs, along with working in marketing projects (Adobe Experience Platform and/or Salesforce Marketing Cloud platform) • Leverage cutting-edge data technologies, programming languages, and industry-standard coding practices to innovate new features and optimize existing product/marketing functionalities • Design, develop, and maintain high-quality software components • Create and execute unit tests, troubleshoot issues, and resolve defects efficiently • Collaborate with Product, architects and cross-functional teams to align on requirements and implementation strategies • Translate business and functional requirements into clear technical specifications and product deliverables • Participate in technical design discussions and conduct code reviews to ensure quality and consistency • Document system architecture, design approaches, and development processes for future reference • Develop and maintain unit test plans and alpha test plans to support product validation • Stay current with emerging technologies, tools, and methodologies to continuously improve design, development, and deployment practices

Missouri
$101.2K - $161.9K / year

Role Description We are seeking an AI Data Engineer to build and operate the large-scale data systems that power modern AI training and evaluation pipelines. The role combines deep data engineering expertise with a strong understanding of AI workloads, focusing on ingestion, transformation, quality assurance, lineage, and high-throughput delivery of data to training jobs across diverse modalities. The ideal candidate has experience operating petabyte-scale data systems, strong software engineering fundamentals, and a clear understanding of how data infrastructure choices propagate into model quality and training efficiency. Key Responsibilities - Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows. - Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals. - Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale. - Develop dataset versioning, lineage, and provenance tracking systems suitable for reproducible training. - Build high-throughput data loading systems that maximize GPU utilization during training. - Implement labeling workflows, active learning pipelines, and human-in-the-loop data improvement systems. - Design storage architectures balancing cost, throughput, and latency across data tiers. - Build evaluation dataset construction pipelines with strict integrity and contamination controls. - Implement data privacy, redaction, and consent enforcement throughout the pipeline. - Collaborate with ML researchers and engineers to align data systems with model development needs. - Drive observability of data quality, drift, and pipeline health across the AI data estate. - Optimize cost and performance through compression, format selection, and caching strategies. - Document data systems, schemas, and operational procedures for broad internal use. - Stay current with AI data infrastructure research and emerging open-source tools. Qualifications - Bachelor’s or Master’s degree in Computer Science or a related field. - Six or more years of data engineering experience, with significant work supporting ML or AI workloads. - Strong proficiency in Python and at least one JVM or systems language. - Deep experience with modern data processing frameworks such as Spark, Ray, or Beam. - Hands-on experience operating petabyte-scale storage and pipeline systems. - Strong understanding of distributed systems, data modeling, and storage formats. - Experience with dataset versioning, lineage, and reproducibility for ML workflows. - Familiarity with high-throughput data loading for accelerator-based training. - Strong software engineering practices including testing, CI/CD, and code review. - Excellent communication and cross-functional collaboration skills. Preferred Qualifications - Experience with multimodal datasets at large scale. - Familiarity with data quality tooling and dataset evaluation methodology. - Exposure to privacy-preserving data systems and regulated data handling. - Open-source contributions to data infrastructure projects. - Experience supporting frontier model training pipelines. How to Apply Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] . Learn more about Bright Vision Technologies at www.bvteck.com .

United States
$100K - $150K / year
Samsara logo

Senior Data Engineer

Samsara

Pioneer of the Connected Operations Cloud

Data Engineer15 hours ago
Full TimeRemoteTeam 1,001-5,000Since 2015H1B Sponsor

Who we are Samsara (NYSE: IOT) is the pioneer of the Connected Operations™ Cloud, which is a platform that enables organizations that depend on physical operations to harness Internet of Things (IoT) data to develop actionable insights and improve their operations. At Samsara, we are helping improve the safety, efficiency and sustainability of the physical operations that power our global economy. Representing more than 40% of global GDP, these industries are the infrastructure of our planet, including agriculture, construction, field services, transportation, and manufacturing — and we are excited to help digitally transform their operations at scale. Working at Samsara means you’ll help define the future of physical operations and be on a team that’s shaping an exciting array of product solutions, including Video-Based Safety, Vehicle Telematics, Apps and Driver Workflows, and Equipment Monitoring. As part of a recently public company, you’ll have the autonomy and support to make an impact as we build for the long term. About the role: Data and Analytics is a critical team within Business Technology. Our mission is to enable integrated data layers for all of Samsara with the insights, tools, infrastructure to make data-driven decisions. We are a growing team that loves all things data — composed of data engineers, architects, analysts, and data scientists. We are looking for a Senior Data Engineer who brings a software engineer's mindset to data infrastructure. This isn't just a pipeline-builder role — we're looking for someone who thinks in systems, builds platforms others can extend, and is excited about pushing the boundaries of what data engineering looks like in an AI-first world. You'll architect Spark-driven workflows at scale, design data platforms as products, and build the next generation of intelligent tooling including MCP servers and AI agents that automate and accelerate data engineering workflows. Our team promotes an agile, collaborative, and supportive environment where diverse thinking, innovative design, and experimentation are welcomed and encouraged. This is a remote position open to candidates residing in Canada. You should apply if: - You want to impact the industries that run our world: Your efforts will result in real-world impact — helping keep the lights on, get food into grocery stores, reduce emissions, and ensure workers return home safely. - You are the architect of your own career: If you put in the work, this role won't be your last at Samsara. We set up our employees for success and have built a culture that encourages rapid career development and mastery in a hyper-growth environment. - You're energized by our opportunity: The vision we have to digitize large sectors of the global economy requires your full focus and best efforts to bring forth creative, ambitious ideas. - You want to build platforms, not just pipelines: You think about data infrastructure as a product, care deeply about developer experience, and want to shape how an engineering team works with data at scale. - You're excited about AI-augmented engineering: You want to be at the frontier of how AI agents and intelligent tooling change the way data engineers work. In this role, you will: Data Platform Engineering - Develop and maintain end-to-end data pipelines and backend ingestion workflows, and participate in the build of Samsara's Data Platform to enable advanced automation and analytics. - Work with data from a variety of sources including ERP(Netsuite), CRM(Salesforce), Product, Order Flow, and Support ticket data. - Manage critical data pipelines to enable growth initiatives and advanced analytics. - Facilitate data integration and transformation for moving data between applications, ensuring interoperability with data layers and the data lake. - Develop and improve data architecture, data quality, monitoring, observability, and data availability. - Write data transformations in SQL/Python to generate data products consumed by Analytics, Marketing Operations, and Sales Operations teams. Spark & Distributed Systems - Design, build, and operate large-scale Spark and PySpark workflows for batch and streaming data processing across Databricks and cloud environments. - Optimize Spark job performance — tuning partitioning, shuffle, caching, and resource allocation for production-grade reliability and efficiency. Platform & Systems Thinking - Define and enforce data engineering standards, patterns, and best practices across the team. - Design systems with long-term maintainability in mind: clear contracts, testable components, and thoughtful failure modes. - Collaborate with platform and infrastructure teams to evolve the underlying architecture of Samsara's enterprise data ecosystem. MCP Servers & AI Agents - Build and maintain MCP (Model Context Protocol) servers that expose Samsara's data assets and engineering workflows to AI models and internal tooling. - Collaborate with platform teams to integrate agentic workflows into the data engineering lifecycle. - Evaluate and adopt emerging AI-native tooling for data engineering, staying ahead of the curve on how LLMs and agents can accelerate data work. Leadership & Collaboration - Champion, role model, and embed Samsara's cultural principles (Focus on Customer Success, Build for the Long Term, Adopt a Growth Mindset, Be Inclusive, Win as a Team) as we scale globally. - Provide mentorship to junior team members and deliver technical guidance, training, and knowledge-sharing across teams. - Engage directly with internal cross-functional stakeholders to understand their data needs and design scalable solutions. - Lead end-to-end projects as the central point of contact for stakeholders. Minimum requirements for the role: - Bachelor's degree in computer science, data engineering, data science, information technology, or an equivalent engineering program. - 8+ years of work experience as a Software Engineer with data focus or as Data Engineer. - 5+ years of experience building and maintaining large-scale, production-grade end-to-end data pipelines, including Data Modeling. - 5+ years of hands-on Spark / PySpark in a production environment, including job optimization and performance tuning. - Core Engineering Fundamentals: Strong programming capabilities in Python and SQL, combined with cloud data warehouse/lakehouse experience (e.g., Snowflake, Google BigQuery, Databricks, or Apache Iceberg). - Exposure to ETL tools such as Fivetran, DBT, or equivalent. - API experience: Python-based API frameworks for data pipeline ingestion. - RDBMS experience: MySQL, AWS RDS/Aurora, PostgreSQL, Oracle, MS SQL Server, or equivalent. - Cloud: AWS, Azure, and/or GCP. An ideal candidate also has experience in: - Designing and governing a centralized semantic layer for reliable AI and analytics - Logging and monitoring experience: Splunk, DataDog, AWS CloudWatch, or equivalent. - AWS Serverless: API Gateway, Lambda, S3, SNS, SQS, SecretsManager. The range of annual base salary for full-time employees for this position is below. Please note that base pay offered may vary depending on factors including your city of residence, job-related knowledge, skills, and experience. This role is also eligible for an initial RSU grant with no vesting cliff, and ongoing refresh opportunities tied to performance, subject to plan terms and conditions. Learn more about our total rewards and benefits below. Annual Base Salary $119,000—$154,000 CAD Total Rewards At Samsara, we build for the people who keep the global economy moving. We want owners, not passengers, which is why our rewards are designed to fuel high-impact builders. Our compensation program delivers above-market total compensation through a combination of base salary, performance-based bonus/variable pay, and equity (for eligible roles) in a high-growth public company. We meaningfully differentiate pay for our top performers, who have the opportunity to earn above-market compensation that can outpace the broader market over time. Beyond compensation, we provide the foundations that enable long-term success: a flexible, employee-led remote model, a professional development stipend, comprehensive health and parental leave plans, and more. If you’re ready to build for the long term and own the outcome, your journey starts here. Flexible Working At Samsara, we embrace a flexible working model that caters to the diverse needs of our teams. Our offices are open for those who prefer to work in-person and we also support remote work where it aligns with our operational requirements. For certain positions, being close to one of our offices or within a specific geographic area is important to facilitate collaboration, access to resources, or alignment with our service regions. In these cases, the job description will clearly indicate any working location requirements. Our goal is to ensure that all members of our team can contribute effectively, whether they are working on-site, in a hybrid model, or fully remotely. All offers of employment are contingent upon an individual’s ability to secure and maintain the legal right to work at the company and in the specified work location, if applicable. Belonging at Samsara At Samsara, we welcome everyone regardless of their background. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, gender, gender identity, sexual orientation, protected veteran status, disability, age, and other characteristics protected by law. We depend on the unique approaches of our team members to help us solve complex problems and want to ensure that Samsara is a place where people from all backgrounds can make an impact. Accommodations Samsara is an inclusive work environment, and we are committed to ensuring equal opportunity in employment for qualified persons with disabilities. Please email accessibleinterviewing@samsara.com or click here if you require any reasonable accommodations throughout the recruiting process. Our Commitment to Authenticity We use Tofu, a fraud detection tool, to validate the authenticity of applications and protect against identity fraud. This ensures we are connecting with real people and allows us to prioritize genuine candidates. Please see Samsara’s Candidate Privacy Notice for more information. Fraudulent Employment Offers Samsara is aware of scams involving fake job interviews and offers. Please know we do not charge fees to applicants at any stage of the hiring process. Official communication about your application will only come from emails ending in @samsara.com, @us-greenhouse-mail.io or @mail3.guide.co. For more information regarding fraudulent employment offers, please visit our blog post here.

Canada