InterWell Health is a kidney care management company that is on a mission to reinvent healthcare and better help its patients live healthy and happy lives. As a
Staff Data Engineer
Location
United States
Posted
89 days ago
Salary
0
Seniority
Lead
Job Description
Staff Data Engineer
InterWell Health
• Design and evolve a scalable, secure, cloud-native lakehouse platform leveraging Databricks, Microsoft Fabric (OneLake, Lakehouse, Data Factory), and dbt. • Define modeling patterns, governance frameworks, and engineering best practices across the data lifecycle. • Lead design reviews and guide teams in adopting scalable architectural patterns. • Drive long-term platform strategy and evaluate emerging technologies. • Design and implement batch and streaming data pipelines for healthcare data sources (EHR, claims, HL7/FHIR, APIs, flat files, databases). • Develop modular ingestion, quality, lineage, metadata, and observability frameworks that scale across domains. • Produce clean, analytics-ready datasets and data models for BI, analytics, and machine learning workloads. • Implement HIPAA-aligned access patterns and secure handling of PHI. • Architect Databricks workloads (clusters, jobs, Unity Catalog, Delta Lake) for reliability, performance, and cost efficiency. • Integrate Databricks and Microsoft Fabric with Azure services and enterprise systems. • Partner with product managers, data scientists, analysts, clinicians, and business stakeholders to translate healthcare data needs into scalable solutions. • Lead cross-functional initiatives that modernize and unify the organization’s data ecosystem. • Mentor senior and mid-level engineers; elevate team capability through technical coaching and standards. • Drive roadmap planning, platform evolution, and long-term data strategy. • Champion engineering excellence, reliability practices, documentation quality, and governance.
Job Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 7+ years of experience in data engineering.
- 2+ years operating in a senior or staff level engineering role.
- Deep hands-on proficiency with Databricks, Spark, Delta Lake, dbt, and Python.
- Proven ability to design and operate largescale cloud data platforms (Azure preferred).
- Hands-on experience with Data Engineering, Data Factory, Lakehouse, OneLake.
- Advanced data platform architecture and Lakehouse design expertise.
- Strong command of distributed data processing and cloud native engineering.
- Experience working in HIPAA regulated environments and handling PHI.
- Healthcare data fluency, including regulated data handling and compliance.
- Technical leadership, mentorship, and influence across teams.
- Strong communication skills with both technical and clinical stakeholders.
- Experience with platform reliability, CI/CD for data pipelines, and infrastructure as code.
Benefits
- Health insurance
- Flexible working hours
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Own the technical strategy and execution of migrating large-scale data workloads from GCP to AWS, ensuring continuity, data integrity, and minimal disruption. • Design migration playbooks and serve as the go-to expert for decisions across compute, storage, and orchestration layers during the transition. • Architect and implement scalable batch and streaming data pipelines using Apache Spark, Delta Lake, and the medallion architecture. • Establish standards for pipeline design, data quality, and observability that the broader engineering organization can build on. • Take accountability for the reliability, performance, and cost-efficiency of production ETL jobs running on AWS (EMR, Glue) against terabyte-scale datasets. • Proactively identify and address bottlenecks, technical debt, and opportunities to improve throughput and resilience.
• Design, build, and own scalable data platforms on Google Cloud • Play an architectural role, driving end-to-end data solutions • Define best practices and mentor team members • Work closely with stakeholders, analysts, and data scientists to deliver reliable, high-performance data pipelines and analytics platforms
• Build and maintain data infrastructure that enables the collection, storage, and retrieval of data; • Create new data flows by integrating our data sources and ensuring they are reliable and efficient; • Develop ETL pipelines, data warehousing, and data modeling to support business needs; • Ensure data quality monitoring, reliability, and lineage by developing processes and tools to identify and correct data quality issues; • Collaborate with other members of the Data & Analytics Team to optimize the data infrastructure and improve data governance; • Provide documentation and training to end-users on data sources, pipelines, and data quality procedures; • Stay current with the latest technologies and techniques related to data engineering, and identify opportunities to improve data infrastructure and analysis.
Senior Data Engineer
DreamixBespoke software development company that provides custom end-to-end product development following the highest standards
• Design, develop, and maintain scalable data pipelines for processing and analyzing large volumes of data • Develop and optimize data workflows using Databricks, leveraging Spark for large-scale data processing • Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and ensure data integrity and quality • Utilize your expertise in Python for scripting and coding tasks related to data processing and analysis • Understand and implement business rules in Python for data transformation • Implement ETL processes to integrate data from various sources into data warehouse or data lake solutions • Optimize big data storage and processing • Troubleshoot and resolve data-related issues, ensuring the reliability and performance of our data infrastructure • Follow emerging trends and technologies in the data engineering space and make recommendations for continuous improvement • Optimize and tune data workflows for maximum efficiency and scalability. • Implement data security best practices to protect sensitive information and ensure compliance with data protection regulations. • Develop and maintain API integrations to facilitate seamless data exchange between systems and applications




