Job Closed

This listing is no longer active.

Able

A product strategy and development company

Staff Data Engineer

Data Engineer, Full Time, Remote, Lead, Team 51-200, Since 2008, H1B Sponsor

Location

Mexico

Posted

60 days ago

Salary

Seniority

Lead

Bachelor Degree, 8 yrs exp, English

Amazon Redshift AWS ETL Kafka PySpark Python Spark Terraform Unity

Job Description

• Design, build, and operate a Databricks medallion lakehouse architecture (Bronze/Silver/Gold layers) using Delta Live Tables to support ingestion, transformation, and serving of clinical, behavioral, and operational data across a multi-country digital health platform
• Architect and maintain scalable data pipelines on AWS (S3, Glue, Lambda, Kinesis, MSK/Kafka) that ingest data from diverse sources including FHIR-based clinical systems, remote patient monitoring devices, mobile applications, and third-party vendor APIs, ensuring reliability, idempotency, and observability at scale
• Implement multi-country data isolation and governance leveraging Databricks Unity Catalog, enforcing data residency requirements across different countries (e.g., the US, EU, and the Kingdom of Saudi Arabia) and integrating policy-as-code consent enforcement (e.g., Open Policy Agent) aligned with regulatory requirements and guidelines (e.g., HIPAA, GDPR)
• Partner with platform, compliance, and analytics teams to define and enforce data quality standards, lineage tracking, schema evolution strategies, and tamper-evident audit logging across all tiers of the lakehouse
• Support clinical data interoperability by implementing and maintaining FHIR-to-OMOP mapping pipelines, enabling downstream analytics, population health reporting, and AI/ML feature engineering on harmonized datasets
• Optimize data platform performance, cost, and reliability through partitioning strategies, compaction, caching, cluster sizing, and monitoring, targeting SLAs appropriate for a patient-facing healthcare platform operating at scale (e.g., 1M+ patients across a dozen markets)
• Contribute to certification and compliance readiness (e.g., ISO 27001, SOC 2 Type 2) by maintaining documentation, change control processes, and validation artifacts for all data infrastructure components
• Collaborate on real-time and event-driven architectures integrating Kafka-based streaming with the medallion layers and workflow orchestration, supporting adaptive patient journey logic and near-real-time analytics

Job Requirements

Requires 8+ years of data engineering experience, with deep hands-on expertise in Databricks (Delta Lake, Unity Catalog, DLT), AWS data services, Python/Spark, and streaming frameworks — preferably within healthcare, life sciences, or other highly regulated industries
Strong proficiency with AWS data services such as S3, Glue, Lambda, Kinesis, Redshift, Athena, and IAM — with experience architecting end-to-end data pipelines in AWS-native or hybrid environments
Advanced Python and PySpark/Spark development skills for batch and streaming ETL/ELT pipeline development, data transformation, and data quality enforcement
Experience with streaming and event-driven architectures using Kafka (Amazon MSK or Confluent), including integration with lakehouse ingestion layers
Proven ability to implement data governance frameworks including data lineage, schema evolution, access controls, cataloging, and audit logging at enterprise scale
Strong understanding of data modeling for both analytical and operational use cases, including dimensional modeling, slowly changing dimensions, and schema-on-read patterns
Experience with infrastructure-as-code (Terraform, CloudFormation, or CDK) and CI/CD pipelines for data platform deployments
Familiarity with regulatory and compliance requirements in data management, including data residency, encryption at rest and in transit, and role-based access controls aligned with frameworks such as HIPAA, SOC 2, or ISO 27001
Excellent collaboration and communication skills, with the ability to work cross-functionally with platform engineering, analytics, clinical, and compliance teams
Bachelor's degree in Computer Science, Data Science, Engineering, or a related field (or equivalent practical experience)

Benefits

18 days of PTO per year, observance of local holidays, and an annual break between Christmas and New Year's
A monthly wellness stipend and snack boxes delivered to your home

More Data Engineer Jobs

Data Engineer

Orderly Wellness Corporation

Building next-gen health & wellness brands—tech-driven, marketing-led, high impact, and quick to market.

Data Engineer, 60 days ago

Full Time, Remote, Team 51-200, Since 2023, H1B No Sponsor

• Own and extend ELT pipelines connecting five business systems into BigQuery via Heroku Connect, Fivetran, and custom ingestion
• Design and maintain dbt models following star schema patterns (staging → intermediate → marts), including incremental strategies and testing
• Solve cross-system identity resolution across three CRMs with different identifier schemes
• Build and maintain data quality frameworks: automated testing, schema drift detection, and pipeline observability
• Define and implement reverse ETL workflows to push warehouse-derived segments back into Salesforce and Ontraport for marketing and operations teams
• Build and maintain Looker dashboards to surface key metrics; provide ad hoc analysis for strategic decisions
• Collaborate with engineering to ensure reliable data replication from production systems
• Evaluate orchestration tooling and scale the platform as the team grows

AWS Azure BigQuery Cloud ETL Google Cloud Platform Heroku Python SQL

United States

$145K - $185K / year

Job Closed

Senior Data Engineer

Interos

Data Engineer, 60 days ago

Full Time, Remote, Team 51-200

Role: Senior Data Engineer

About interos.ai

interos.ai is defining the category of supply chain risk intelligence, building the world’s most trusted and transparent supply chains. Our pioneering platform automates the discovery and continuous monitoring of supply chain risk across every tier, enabling faster, data-driven threat mitigation and resilience at global scale. As the first and only automated supplier intelligence platform, interos.ai gives organizations real-time visibility into extended supply chains to protect against regulatory risk, unethical labor, cyber threats, and systemic vulnerabilities, all through a single pane of glass. With customers spanning commercial, Federal, A&D, and public sectors, including Global Fortune 500 companies and members of the Five Eyes nations, interos.ai offers a rare opportunity to help shape an early, fast-growing market that’s transforming global security and economic resilience.

At interos.ai, you’ll join a team that’s not afraid to challenge conventions, explore new ideas, and iterate quickly in pursuit of the best solutions. We operate in a space that’s evolving fast, and so are we. Our work is meaningful, the problems we tackle are complex, and the pace requires focus, curiosity, and adaptability.

We know interos.ai isn’t the right fit for everyone, and that’s okay. We’re looking for people who are humble in how they collaborate, hungry to succeed, and smart in how they work to solve problems (EQ). People who thrive here are energized by broad exposure and a dynamic scope of work, comfortable navigating ambiguity, open to feedback, proactive in taking ownership, and motivated by making a real impact. If you’re seeking a predictable routine or a highly structured environment, you may find our pace challenging. However, if you’re someone who leans into change, enjoys solving hard problems with great teammates, and wants to contribute to technology that truly matters, you’ll fit right in. We’re excited to meet individuals who bring expertise in their craft, a willingness to experiment, and a mindset grounded in continuous learning.

About The Role:

As a Senior Data Engineer, you will play a central role in ensuring interos.ai’s data infrastructure and data pipelines are robust, efficient, and scalable. Your primary responsibility is delivering features, code, and data that are maintainable, performant, and aligned with engineering standards.

What You'll Do

- Build and share knowledge of the data flow throughout the Resilience platform.
- Optimize the storage, processing, and movement of data, such as tuning data models or refactoring data software. Develop, document, and maintain new data platform functionality.
- Contribute to software and data architecture design and reviews.
- Provide support for other engineers or internal customers in the matters of data or software.
- Implement and enforce best practices for code quality, testing, and documentation.
- Improve or develop frameworks, packages, and/or documentation to support engineering standards.
- Conduct code reviews to ensure adherence to coding standards and promote knowledge sharing within the team.

What You Bring

- 5+ years of experience in Software Development.
- 3+ years of full-time professional Python experience including production experience with data pipelines.
- 3+ years of experience with SQL via relational or columnar databases.
- 2+ years of experience working with Snowflake.
- 2+ years of experience developing in AWS.
- 2+ years of experience in data streaming or event-driven systems with Kafka or another stream processing system.
- Bachelor’s degree in Computer Science or equivalent experience.

Bonus Points For

- Experience with job or pipeline orchestration, using tools like Prefect, Databricks, Airflow, Argo, etc.
- Passionate about writing code that is scalable, maintainable, reusable, and well-tested.
- Interested in building reliable, fault-tolerant, production-grade systems.
- Feel comfortable debugging large or complex applications and, if necessary, guiding their refactor.
- Excellent communication skills with the ability to convey technical concepts to both technical and non-technical stakeholders.
- Have experience developing data software in an agile environment.
- Enjoy optimizing complex processes and leaving something better off than where you found it.
- A seasoned engineer who enjoys sharing your experience with the team.

What We Offer

- Comprehensive health, dental & vision insurance
- 401(k) with employer match
- Flexible Time Off (FTO) + 10 paid holidays
- Opportunity to help shape an early, fast-growing market
- A flexible, remote-first work environment that empowers you to perform at your best, wherever you are
- An incredible culture built on trust, accountability, and collaboration across time zones and teams

Our Core Values

At interos.ai, our culture is shaped by three core values that reflect who we are and how we operate as a high-performing, connected team.

Technical Skill

We hold a performance-driven standard across everything we build and deliver. We value depth of expertise, strong judgment, and the ability to turn complex challenges into clear, effective solutions. Excellence is both our expectation and our baseline.

Will

We show up hungry, humble, and smart. We take ownership, seek growth, and elevate one another through curiosity, accountability, and collaboration. We lean into hard problems with resilience and a commitment to continuous improvement.

Thrill

We stay connected and bring energy to the work. We believe meaningful relationships, a sense of fun, and celebrating wins together fuel great outcomes. We create space for engagement, connection, and moments that remind us why we love what we do.

These values guide how we show up for our partners, our colleagues, and our customers every day.

Additional Information

Compensation

- Base Salary Range: $120,000 – $160,000 USD (depending on experience and location)
- Variable Compensation: Eligible for annual performance bonus or commission program
- Equity: Stock options included as part of total compensation package

We believe in rewarding great work with competitive, transparent compensation. Final offer will be based on skills, experience, certifications, and geographic location.

Equal Opportunity Employer

interos.ai is proud to be an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of race, religion, gender identity, sexual orientation, age, disability, or veteran status.

Accessibility & Accommodations

We are committed to providing reasonable accommodations for candidates throughout the hiring process. If you need assistance, please contact us at hr@interos.ai.

AI Observability/Monitoring Python SQL Snowflake AWS Apache Kafka Databricks Airflow Argo

United States

$120K - $160K / year

Senior Data Engineer

Garner Health

A better way to get your employees to high-quality doctors.

Data Engineer, 60 days ago

Full Time, Remote, Team 51-200, Since 2019, H1B No Sponsor

Garner’s mission is to transform the healthcare economy, delivering high-quality and affordable care for all. We are fundamentally reimagining how healthcare works in the U.S. by partnering with employers to redesign healthcare benefits using clear incentives and powerful, data-driven insights. Our approach guides employees to higher-quality, lower-cost care, creating a system that works better for everyone. Patients achieve better health outcomes, employers spend healthcare dollars more effectively, and physicians are rewarded for delivering exceptional care rather than performing more procedures.

Garner is one of the fastest-growing healthcare technology companies in the country. Our products are trusted by the most sophisticated employers and providers in the industry, and we are building a team of talented, mission-driven individuals who are motivated to make a meaningful impact on healthcare at scale.

About the role:

We are seeking an exceptional Senior Data Engineer with a high sense of ownership who is excited about our mission, eager to learn and teach others, and can deliver across the tech stack. This role will report to our VP, Engineering. The ideal candidate should have experience with a modern data stack and implementations of data pipelines using Python and SQL in AWS.

Where you will work:

This role is open to remote candidates across the U.S. For candidates based in New York City, the position follows a hybrid schedule with in-office work required Tuesday, Wednesday, and Thursday each week.

What you will do:

- Build, optimize, and maintain data pipelines that power our business
- Define and build out abstracted reusable data sets to be used for Business Intelligence, Marketing, and Data Science Research
- Design, build, and evangelize a federated data validation framework to be used to monitor potential data inconsistencies
- Protect our users’ privacy and security through best practices

The ideal candidate has:

- 4+ years of software/data engineering experience (distributed data processing, data warehousing, data governance, Big Data, data variance, data privacy, and data quality)
- Expertise in SQL, job experience, and demonstrated strength with Python
- Expertise in building scalable data pipelines, query optimization, data modeling, and defining reusable datasets
- Experience working with orchestration tools (especially Airflow), databases (especially PostgreSQL), datalakes (especially Iceberg), data warehouses (especially Snowflake)
- Expertise with SQL tuning, Spark development, medallion and event-driven architectures
- Familiarity with healthcare or insurance
- Familiarity with data security and HIPAA compliance
- A desire to be a part of a high-performing, mission-driven team that operates with intense urgency, a strong sense of individual accountability, and a commitment to authentic feedback

Technologies we use:

- Postgres/SQL, Snowflake, Python, AWS, Terraform, Argo, Airflow, Spark, DuckDB, Iceberg, Airbyte, DBT, ElasticSearch (aka OpenSearch), OpenTelemetry, Looker

This is a unique opportunity to join a fast-growing company in a transformative role, helping shape the future of healthcare.

Compensation Transparency:

The target salary range for this position is $167,000.00 - $200,000.00. Individual compensation for this role will depend on various factors, including qualifications, skills, and applicable laws. In addition to base compensation, this role is eligible to participate in our equity incentive and competitive benefits plans, including but not limited to: flexible PTO, Medical/Dental/Vision plan options, 401(k) with company match, flexible spending accounts, Teladoc Health and more.

Fraud and Security Notice:

Please be aware of recent job scam attempts. Our recruiters use getgarner.com and garnerhealth.com email domains exclusively. If you have been contacted by someone claiming to be a Garner recruiter or a hiring manager from a different domain about a potential job, please report it to law enforcement and to candidateprotection@garnerhealth.com.

Equal Employment Opportunity:

Garner Health is proud to be an Equal Employment Opportunity employer and values diversity in the workplace. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. Garner Health is committed to providing accommodations for qualified individuals with disabilities in our recruiting process. If you need assistance or an accommodation due to a disability, you may contact us at talent@garnerhealth.com.

Airbyte Airflow Argo AWS Dbt Duckdb Elasticsearch Iceberg Postgres/Sql Python Snowflake Spark Terraform

United States

$167K - $200K / year

Job Closed

Data Engineer - Data Warehouse Architect

ICF

Founded in 1969, ICF is a global advisory and technology services company headquartered in Reston, Virginia. It delivers data-driven solutions across energy, en

Data Engineer, 60 days ago

Full Time, Remote

Description

This position focuses on developing, implementing, and maintaining architecture solutions across a large enterprise data warehouse to support effective and efficient data management and enterprise-wide business intelligence analytics.

Responsibilities:

- Implement and optimize data pipeline architectures for sourcing, ingestion, transformation, and extraction processes, ensuring data integrity and compliance with organizational standards.
- Develop and maintain scalable database schemas, data models, and data warehouse structures; perform data mapping, schema evolution, and integration between source systems, staging areas, and data marts.
- Automate data extraction workflows and create comprehensive technical documentation for ETL/ELT procedures; collaborate with cross-functional teams to translate business requirements into technical specifications.
- Establish and enforce data governance standards, including data quality metrics, validation rules, and best practices for data warehouse design and architecture.
- Develop, test, and deploy ETL/ELT scripts using SQL, Python, Spark, or other relevant languages; optimize code for performance and scalability.
- Tune data warehouse systems for query performance and batch processing efficiency; apply indexing, partitioning, and caching strategies.
- Perform advanced data analysis, validation, and profiling using SQL and scripting languages; develop data models, dashboards, and reports in collaboration with stakeholders.
- Conduct testing and validation of ETL workflows to ensure data loads meet SLAs and quality standards; document testing protocols and remediation steps.
- Troubleshoot production issues, perform root cause analysis, and implement corrective actions; validate data accuracy and consistency across systems.

Basic Qualifications:

- Minimum of 3 years of experience in data analysis.

Additional Qualifications:

- Strong analytical and problem-solving skills with attention to detail.
- Proficiency in SQL and ability to develop complex queries (e.g., multi-join), tune performance, and troubleshoot.
- Experience with Unix/Linux shell scripting for ETL automation.
- Familiarity with database tools and platforms (e.g., Teradata, Oracle, Non-Relational).
- Excellent verbal and written communication skills; ability to collaborate across all levels.
- Ability to prioritize and multi-task in a fast-paced environment.
- Knowledge of Java/J2EE, REST APIs, Web Services, and event-driven microservices.
- Experience with Kafka streaming, schema registry, OAuth authentication.
- Familiarity with Spring Framework, GCP services, Git, CI/CD pipelines, containerization, and data ingestion/data modeling.

Preferred Qualifications:

- Experience with Databricks concepts and terminology (e.g., workspace, catalog).
- Proficiency in Python and Spark.
- Background in architecting real-time data ingestion solutions using microservices and Kafka.

Working at ICF

ICF is a global advisory and technology services provider, but we’re not your typical consultants. We combine unmatched expertise with cutting-edge technology to help clients solve their most complex challenges, navigate change, and shape the future.

We can only solve the world's toughest challenges by building a workplace that allows everyone to thrive. We are an equal opportunity employer. Together, our employees are empowered to share their expertise and collaborate with others to achieve personal and professional goals. For more information, please read our EEO policy.

We will consider for employment qualified applicants with arrest and conviction records.

Reasonable Accommodations are available, including, but not limited to, for disabled veterans, individuals with disabilities, and individuals with sincerely held religious beliefs, in all phases of the application and employment process. To request an accommodation, please email Candidateaccommodation@icf.com and we will be happy to assist. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

Read more about workplace discrimination rights or our benefit offerings which are included in the Transparency in (Benefits) Coverage Act.

Candidate AI Usage Policy

At ICF, we are committed to ensuring a fair interview process for all candidates based on their own skills and knowledge. As part of this commitment, the use of artificial intelligence (AI) tools to generate or assist with responses during interviews (whether in-person or virtual) is not permitted. This policy is in place to maintain the integrity and authenticity of the interview process.

However, we understand that some candidates may require accommodation that involves the use of AI. If such an accommodation is needed, candidates are instructed to contact us in advance at candidateaccommodation@icf.com. We are dedicated to providing the necessary support to ensure that all candidates have an equal opportunity to succeed.

Pay Range - There are multiple factors that are considered in determining final pay for a position, including, but not limited to, relevant work experience, skills, certifications and competencies that align to the specified role, geographic location, education and certifications as well as contract provisions regarding labor categories that are specific to the position. The pay range for this position based on full-time employment is: $74,090.00 - $125,954.00

Nationwide Remote Office (US99)

ETL SQL Python Apache Spark Performance Optimization Unix Linux Shell Oracle Database Java J2EE REST API Microservices Apache Kafka OAuth Spring GCP Git CI/CD Docker/Containers Databricks AI

United States

$74.1K - $125K / year
