Job Closed
This listing is no longer active.
Banner Health is a nonprofit healthcare system based in Phoenix, Arizona. As one of the largest employers in the country, Banner Health utilizes the expertise a
Lead Data Engineer
Location
Arizona + 4 moreAll locations: Arizona | California | Michigan | Tennessee | Texas
Posted
72 days ago
Salary
$53 - $89 / hour
Seniority
Senior
Job Description
Lead Data Engineer
Banner Health
• Design, build, and scale Banner Health’s next-generation data platform on the cloud • Lead the engineering of reliable, secure, and scalable data products and pipelines that power analytics, reporting, AI, and operational insights across clinical and business domains • Work closely with architects, analysts, data scientists, product teams, and business leaders to modernize how data is ingested, governed, transformed, and consumed across the organization • Design, build, and optimize scalable batch and streaming data pipelines for enterprise analytics and operational use cases • Contribute to the evolution of the enterprise data platform to support advanced analytics, self-service consumption, and AI/ML use cases • Lead the development of curated, high-quality data products across lakehouse, warehouse, and domain-oriented data architectures • Build and enhance cloud-native data solutions using modern platforms such as Databricks, Spark, Delta Lake, and AWS services • Establish and enforce engineering standards for code quality, testing, CI/CD, observability, lineage, and documentation • Drive best practices for data quality, schema evolution, performance tuning, reliability, and cost optimization • Partner with architecture, governance, security, and analytics teams to implement trusted and compliant data solutions • Support ingestion and transformation of complex healthcare and enterprise data sources, including structured, semi-structured, and high-volume event data • Mentor engineers, provide technical leadership, and contribute to solution design, estimation, and delivery planning • Translate business and operational requirements into scalable technical designs and production-ready data pipelines
Job Requirements
- Strong knowledge of data engineering and analytics
- Bachelor's degree in Data Science, Computer Science, Information Technology or a related field
- 6+ years of experience in data engineering, big data, analytics engineering, or data platform development
- 3+ years in a senior or lead engineering role driving architecture, standards, and delivery across large-scale data environments
- Hands-on experience with Databricks, Apache Spark, Delta Lake, and cloud-based data platforms on AWS, Azure, or GCP
- Deep expertise in SQL and strong programming skills in Python and/or Java
- Experience building and operating large-scale distributed data systems, including data lakes, lakehouses, warehouses, or mesh-oriented platforms
- Strong understanding of data modeling, partitioning, storage design, metadata management, and performance optimization
- Experience implementing data quality, lineage, observability, and operational monitoring in production environments
- Familiarity with orchestration, DevOps, and CI/CD practices for data platforms
- Strong communication skills and the ability to work effectively with technical and non-technical stakeholders
- Proven ability to balance multiple priorities in a fast-paced environment while maintaining high engineering standards.
Benefits
- Health and financial security options
- A variety of benefit plans to help you and your family
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Growth Data Engineer
Embedding VCEmbedding VC is an early-stage venture fund backing Generative AI startups.
• Build and maintain marketing attribution systems (UTM tracking, user journey, conversion mapping) • Implement and manage 1st-party data passback to ad platforms (e.g., conversions, revenue, LTV signals) • Configure and maintain tracking infrastructure: - GTM (client + server-side) - Google Ads (Enhanced Conversions) - Meta Pixel + Conversions API - LinkedIn, TikTok, and other ad platforms • Set up and maintain conversion events and schemas across platforms • Ensure consistent event definitions and naming across product, CRM, and ad platforms • Build and manage audience sync pipelines • Integrate CRM and warehouse data into ad platforms for optimization (offline conversions, lead stages, revenue) • Operationalize new marketing experiments (UTMs, tracking parameters, campaign setup) • Debug and resolve tracking and attribution discrepancies across systems (e.g., Ads vs CRM vs warehouse) • Build and validate marketing and pipeline dashboards, ensuring data accuracy and consistency • Implement privacy-safe tracking practices (Consent Mode, CMP integrations) • Partner closely with growth, product, and data teams to ensure measurement supports decision-making
Founding Data Engineer – Core Data Platform
Embedding VCEmbedding VC is an early-stage venture fund backing Generative AI startups.
• Own the entire data foundation of a fast-scaling AI company — from raw data to executive metrics. • Build from 0 → 1 — define the architecture that powers product, finance, and company-wide decision making. • High visibility and impact — your work directly informs leadership, product direction, and company strategy. • Help define how data supports AI systems, agents, and long-term intelligence. • Design and build core data pipelines (e.g., product events, payments, internal systems → BigQuery) • Define and maintain the data warehouse architecture, including schema design, data modeling, and table structure • Establish and own the single source of truth (SOT) for product and business metrics • Build and maintain core data models (user, subscription, revenue, engagement, etc.) • Ensure data consistency across systems (product analytics, billing, internal tools) • Lead data reconciliation efforts (e.g., Stripe vs internal systems vs reporting) • Implement data quality checks, validation, and monitoring systems • Build reliable reporting layers used by leadership and finance (not ad hoc dashboards) • Establish data standards and contracts (event naming, schema governance, tracking consistency) • Partner with engineering to improve instrumentation and data correctness at source • Support downstream teams (analytics, DS) by providing clean, well-documented datasets • Continuously improve data reliability, performance, and cost efficiency
Senior Data Engineer
HumanaLouisville, Kentucky-based Humana is a leading healthcare company that offers a variety of health, wellness, and insurance products and services designed to off
• developing and optimizing scalable data pipelines to support critical analytics and business intelligence • designing, building, and maintaining data models and architectures that enable efficient storage, retrieval, and processing of large-scale datasets • working with key stakeholders to define the business problem, determine requirements, and ensure quality deliverables • managing and prioritizing data engineering roadmaps in partnership with business leaders, translating strategic objectives into actionable work items and clear delivery plans • implementing best‑practice data governance, data quality, and data security standards, ensuring that pipelines and datasets are reliable, well‑documented, and compliant with organizational requirements • facilitating regular stakeholder communication, providing updates on progress, risks, and dependencies while ensuring expectations are aligned across technical and non‑technical audiences • conducting code reviews and enforcing data engineering best practices
Senior Data Engineer, GKT Team
SemrushYour competitors' favorite marketing platform used by 10,000,000 marketers
• Lead the Design and development of scalable data pipelines and infrastructure • Optimize the performance of data systems and troubleshoot complex data issues • Collaborate with data scientists to ensure data is structured and available for advanced analytics • Ensure data security, privacy, and compliance with organizational policies • Manage the integration of new data sources and ensure data quality • Continuously evaluate and adopt new technologies to improve data engineering practices • Contribute to architecture decisions



