Lead Data Engineer
Location
Worldwide
Posted
57 days ago
Salary
0
Seniority
Lead
Job Description
Lead Data Engineer
Ilant Health
Role Description
At Ilant Health, data is the cornerstone of our mission. It drives our clinical precision, shapes our business strategy, and provides the measurable ROI necessary to expand access for employers and health plans. We are looking for a Lead Data Engineer to architect the "source of truth" that powers our value-based care models for obesity and cardiometabolic health. In this role, you will not just build pipelines; you will be the architect of our data platform. You will own the ingestion of complex healthcare datasets (claims, eligibility, clinical labs), the design of our "Single Patient View," and the creation of next-generation internal tools that allow non-technical stakeholders to query our data using natural language.
Key Responsibilities
- Data Architecture and Strategy (The “Blueprint”)
  - Design the "Single Patient View": Architect a unified data model that stitches together fragmented data sources (e.g., linking a pharmacy claim for Wegovy, a clinical lab result for HbA1c, and user engagement metrics from the Ilant app into a cohesive longitudinal record).
  - Scalability Planning: Design a cloud-native infrastructure (likely Snowflake/AWS) capable of handling 100x Member growth without requiring a total refactor.
  - Buy vs. Build Decisions: Evaluate and select the right tooling for ingestion (e.g., Fivetran vs. custom Python) and orchestration (e.g., Airflow vs. Dagster) to maintain low engineering overhead while maximizing output.
  - Conversational Intelligence Layer (GenAI/LLM): Architect and implement a "Text-to-Data" interface (leveraging LLMs/RAG) that allows business decision-makers to interact with our data via prompts (e.g., similar to Gemini/ChatGPT).
- Pipeline Engineering (The “Plumbing”)
  - Data Consumption Layer: Ensure the reliability and low-latency availability of the data assets (dbt models, feature stores) consumed by the Data Science and Analytics teams, guaranteeing they always have fresh, trustworthy data for modeling and reporting.
  - External Data Integration (Primary Mandate): Own the end-to-end reliability of mission-critical external files. You are responsible for the system that ingests, validates, and standardizes these files from payers/employers.
  - Claims Ingestion Engine: Build robust, fault-tolerant pipelines to handle the notoriously messy formats of payer data (EDI 837/835, raw CSVs, JSON) and standardize them into a clean, queryable schema.
  - dbt Model Ownership: Oversee the transformation layer (using dbt), creating a "Gold" layer of data that is business-ready for analysts, product features, and the conversational AI layer.
- Data Quality and Trust (The “Guardrails”)
  - Pipeline Reliability & Operational Uptime: You own the "uptime" of our data platform. Ensure all scheduled ingestion and transformation jobs run successfully and on time. You are the first line of defense when a pipeline fails, leading the root cause analysis (RCA) and resolution to minimize downtime.
  - Automated Testing & Observability: Implement "Data Observability" tools (e.g., Great Expectations, Monte Carlo, or custom equivalents) to catch issues before they hit the dashboard (e.g., configure alerts to trigger if an eligibility file arrives with 50% fewer records than the previous month).
  - Governance & Compliance: Act as the technical custodian of HIPAA compliance. Ensure all PII/PHI is encrypted, masked, and accessed only via strict Role-Based Access Controls (RBAC).
  - Master Data Management (MDM): Implement identity resolution logic to handle conflicts across sources (e.g., ensuring "Jane Doe" in a Cigna claims file is correctly matched to "Jane Doe" in the Ilant app database).
- Leadership and Collaboration
  - Partner with Product: Work directly with the CPO and Product Managers to assess the technical feasibility of new features (e.g., "Can we accurately calculate 'time to goal weight' given the current data latency?").
  - Partner with Data Science: Collaborate to productionize predictive models (e.g., patient risk stratification, weight loss trajectory). You will build the MLOps infrastructure that takes a model from a Jupyter notebook to a scalable, real-time inference API within our product.
Qualifications
- Experience: 7+ years in Data Engineering, with at least 3 years in a Lead or Architectural role.
- Strategic Maturity: Demonstrated ability to make high-stakes "Buy vs. Build" decisions and architect systems for 10x scale, prioritizing long-term stability and maintainability over short-term patches.
- Healthcare Native: Deep familiarity with healthcare data standards (HL7, FHIR, ICD-10, CPT, NDC) and the specific challenges of claims/eligibility ingestion.
- GenAI/LLM Interest: Practical experience or strong interest in building semantic layers for LLM applications (RAG, Vector DBs, or prompt engineering for analytics).
Requirements
- Languages: Python (Advanced), SQL (Expert).
- Cloud: AWS.
- Warehousing: Snowflake, BigQuery, or Databricks.
- Transformation: dbt (Data Build Tool).
- Orchestration: Airflow, Dagster, or Prefect.
Benefits
- Fully remote environment – work from anywhere while maintaining meaningful collaboration with a distributed team.
- Comprehensive health benefits – medical, dental, and vision coverage to support you and your family.
- Paid time off – 2 weeks of PTO to rest, recharge, and take the time you need.
- Flexible floating holiday – one additional day each year to celebrate what matters most to you.
- Paid sick leave – 5 sick days so you can prioritize your health when needed.
- 11 paid company holidays throughout the year.
- 401(k) retirement plan to help you invest in your future.
- Healthcare and Dependent Care FSA options for additional tax-advantaged savings.
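For readers wondering what the eligibility-file alert mentioned under Automated Testing & Observability might look like in practice, here is a rough Python sketch. It is illustrative only and not Ilant Health's implementation: the function name eligibility_drop_alert, its parameters, and the default 0.5 threshold are assumptions made for this example, and a production setup would more likely rely on one of the observability tools the posting names.

```python
def eligibility_drop_alert(current_count: int,
                           previous_count: int,
                           max_drop: float = 0.5) -> bool:
    """Return True when the current file has at least `max_drop`
    (as a fraction) fewer records than the previous month's file.

    All names and the 0.5 default are hypothetical, chosen to mirror
    the "50% fewer records" example in the posting.
    """
    if previous_count <= 0:
        # No usable baseline, so there is nothing to compare against.
        return False
    drop = (previous_count - current_count) / previous_count
    return drop >= max_drop


if __name__ == "__main__":
    # Last month's file had 1,000 records; this month's has 400.
    if eligibility_drop_alert(current_count=400, previous_count=1000):
        print("ALERT: eligibility file record count dropped sharply")
```

A relative threshold is used rather than an absolute count so the same check works for small and large employer files alike.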
More Data Engineer Jobs
Description
About the Role
At NinjaOne, we're looking for a skilled Senior Data Engineer to join our team and help drive the future of our data infrastructure. You'll play a critical role in building, maintaining, and scaling our systems to ensure smooth data flow, accuracy, and security across the organization. This is an exciting opportunity to work on innovative projects, collaborate with cross-functional teams, and help shape how we leverage data to fuel growth, optimize products, and drive business decisions.
Location
We are flexible on remote working from home if you are located in the USA and reside in one of the following states: CA, CO, CT, FL, GA, *IL, KS, MA, MD, ME, NJ, NC, NY, OR, TN, TX, VA, and WA. We have physical offices in Austin, TX and Tampa, FL, if you prefer a hybrid option.
We hire the best software engineers, but experience in our stack can't hurt: NinjaOne is built on Java, Kotlin, C++, Golang and Postgres, supporting millions of user endpoints and running as a scalable cloud service in AWS. Knowing large-scale datastore bottlenecks, asynchronous application design and client-server architecture will help you.
What You'll be Doing
- Data Pipeline Development: Design and implement scalable data pipelines that move and transform large volumes of data from multiple sources to central data warehouses, transforming data to enable business reporting and advanced analytics.
- Database Management: Manage and optimize the performance of relational databases, ensuring data availability, reliability, and consistency.
- Automation & Optimization: Automate and optimize data workflows to reduce manual processes and improve efficiency in data collection, storage, and processing.
- Monitoring & Maintenance: Ensure the integrity and security of data across systems, monitor performance, and troubleshoot any issues that arise within the data pipeline.
- Data Visualization: Build dashboards and reports in Tableau and Databricks to expose key data points and trends to business stakeholders.
- Collaboration: Work closely with data scientists, analysts, and other teams to gather requirements, understand data needs, and provide solutions that support data-driven decision-making.
- Other duties as needed.
About You
- Bachelor's degree in Computer Science, Computer Engineering, Information Technology or equivalent work experience preferred.
- 10+ years of experience in software development, with a strong focus on data engineering and data science.
- Experience in building data pipelines and managing large-scale data systems using technologies like SQL and Python.
- Expertise in Python.
- Experience in cloud platforms like AWS, GCP, or Azure, and experience with tools like Airflow, Kafka or dbt for orchestrating data workflows.
- Mastery of both relational databases, including MySQL and PostgreSQL, and NoSQL databases such as MongoDB and Cassandra.
- Experience with data warehousing concepts and tools such as Redshift, BigQuery, and Snowflake.
- Solid understanding of Microservices Architecture and DevOps principles.
- Experience that will make you a standout candidate:
  - Previous experience working with large-scale data pipelines and machine learning models.
  - Understanding of Generative AI and Deep Learning frameworks.
About Us
NinjaOne automates the hardest parts of IT to deliver visibility, security, and control over all endpoints for more than 30,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. NinjaOne is obsessed with customer success and provides free and unlimited onboarding, training, and support. NinjaOne is #1 on G2 in endpoint management, patch management, remote monitoring and management, and mobile device management.
What You'll Love
We are a collaborative, kind, and curious community. We honor your flexibility needs with full-time work that is hybrid remote. We have you covered with our comprehensive benefits package, which includes medical, dental, and vision insurance. We help you prepare for your financial future with our 401(k) plan. We prioritize your work-life balance with our unlimited PTO. We reward your work with opportunity for growth and advancement.
Additional Information
This position is NOT eligible for Visa sponsorship. Due to federal government security requirements associated with our FedRAMP-authorized environment, candidates must be U.S. citizens or lawful permanent residents.
*Due to operational policies, NinjaOne is unable to hire for this role within the city limits of Chicago. We will consider all qualified candidates who reside outside of the city proper or are willing to self-relocate.
Starting pay for the successful applicant depends on a variety of job-related factors, including but not limited to location, market demands, experience, job-related knowledge, and skills. The benefits available for this position include medical, dental, vision, 401(k) plan, life insurance coverage and PTO. For roles based in California, Colorado, Maryland, New Jersey, or Washington, the base salary hiring range for this position is $110,000 to $200,000 per year. For roles based in New York, the base salary hiring range for this position is $110,000 to $200,000 per year.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, genetic information, marital status, veteran status, or any other status protected by applicable law. We are committed to providing an inclusive and diverse work environment.
Senior Software Engineer, Data Platform Team
Sysdig
Confidently secure containers, Kubernetes and cloud services with #SecureDevOps.
• Own the design and development of features and components for the data platform, focusing on high-throughput data ingestion, transformation, and storage. You will report to the Director, Engineering.
• Architect and implement robust, distributed, and scalable data processing pipelines in Go to ensure data quality and reliability.
• Contribute to the technical strategy and roadmap for the data platform, anticipating future data needs for product features and internal analytics.
• Mentor junior and mid-level engineers on the team, and conduct thorough code reviews to ensure quality and best practices.
• Participate in an on-call rotation to address urgent operational issues impacting data services.
At PointClickCare our mission is simple: to help providers deliver exceptional care. And that starts with our people. As a leading health tech company that’s founder-led and privately held, we empower our employees to push boundaries, innovate, and shape the future of healthcare. With the largest long-term and post-acute care dataset and a Marketplace of 400+ integrated partners, our platform serves over 30,000 provider organizations, making a real difference in millions of lives. We also reinvest a significant percentage of our revenue back into research and development, ensuring our employees have the resources to innovate and make a lasting impact. Recognized by Forbes as a top private cloud company and honored as one of Canada’s Most Admired Corporate Cultures, we offer flexibility, growth opportunities, and meaningful work.
At PointClickCare, we empower our people to be the architects of a smarter healthcare future; one that is human-first and accelerated by AI to create meaningful and lasting change. Employees harness AI as a catalyst for creativity, productivity, and thoughtful decision-making. By integrating AI tools into our daily workflows, collaboration is enhanced, outcomes are improved, and every team member has the proficiency to maximize their impact. It all starts with our hiring practices where we uncover AI expertise that complements our mission, and we continue to invest in training and development to nurture innovation throughout the employee journey.
Join us in redefining healthcare so it doesn’t just survive, it thrives. To learn more about PointClickCare, check out Life at PointClickCare and connect with us on Glassdoor and LinkedIn.
**Travel to Office expectations**
For Remote Roles: If this role is remote, there will be in-office events that will require travel to and from the Mississauga and/or Salt Lake City office. These will include, but are not limited to, onboarding, team events, semi-annual and annual team meetings.
For Hybrid Roles: If this role is Hybrid, there will be an expectation to reside within commutable distance to the office/location specified in the job listing. This will include, but is not limited to, weekly/bi-weekly/monthly events in the office with your specific team. This is a requirement for this role.
In this role, you contribute to PointClickCare’s data platform vision and strategy, define and own roadmaps, and drive execution and delivery of products to ensure overall success across the entire product portfolio. You represent the needs of customers, healthcare providers, industry partners and other related network members internally and externally. To succeed in this role, you draw upon your extensive knowledge and experience of health data and its uses, data pipelines, and governance to deliver clinical and operational data at scale for analytics and AI. You have a track record of building sophisticated solutions, from discovery to implementation to delivery. You bring a collaborative approach to this cross-functional leadership role to ensure successful outcomes for our customers, while optimizing time to value.
The Principal PM will partner closely with architecture, engineering, data science, clinical, compliance, and infrastructure teams to build a secure, interoperable foundation that supports analytics, AI/ML, care delivery, and new product innovation across the enterprise. This role demands deep expertise in health data domains and a strong ability to rapidly turn complex technical challenges into actionable platform strategy.
Key Responsibilities
- Define and drive the long-term strategy for a unified health data platform supporting clinical, operational, and product use cases.
- Collaborate to define architecture for ingesting and processing billions of patient data points across diverse systems (EHRs, claims, labs, devices, third-party APIs).
- Lead platform initiatives that improve data completeness, lineage, governance, and quality.
- Partner with product teams to enable them to define data models and canonical representations to support downstream applications, including AI/ML pipelines.
- Partner with engineering to evaluate and adopt technologies for rapid ingestion, streaming, ETL/ELT frameworks, data lakehouse systems, and distributed compute.
- Influence infrastructure strategy to improve resiliency, observability, and cost efficiency across the data stack.
- Collaborate with clinicians, data scientists, and compliance leaders to ensure platform capabilities align with real-world healthcare needs and regulatory expectations.
- Facilitate cross-team decision-making, often balancing accuracy, technical feasibility, security, and cost.
Qualifications
- 12-15+ years of product management experience, with at least 5 years focused on health data platforms or clinical data systems.
- 3+ years of experience working with data pipelines for AI and data science use, including enrichment pipelines and the use of MCP servers with agents.
- Experience managing healthcare data and its use across the range of PointClickCare’s customer segments and users. Work experience with Health Plans, EHRs, and Long Term Post-Acute providers is preferred.
- Deep understanding of healthcare data formats, ontologies, and regulatory contexts.
- Strong technical fluency across data pipelines, APIs, cloud architectures, and distributed systems.
- Experience working with data governance, data catalogs, and stewardship.
- Experience working with clinical teams and navigating healthcare workflows.
- Technical degree preferred (Computer Science, Health Informatics, Information Systems, or related fields).
Key Attributes
- Systems thinker with the ability to connect clinical data structure, platform architecture, and AI/analytics needs.
- Exceptional communicator adept at simplifying complex technical and regulatory concepts for diverse audiences.
- Highly autonomous leader comfortable driving multi-team initiatives without direct authority.
- Obsessively focused on data quality, reliability, integrity, and end-to-end lineage.
- Passionate about improving care outcomes, patient experience, and clinician efficiency through high-quality health data infrastructure.
$154,000 - $172,000 a year
At PointClickCare, base salary is one of the many components that make up our total rewards package. The CAD base salary range for this position is $154,000 - $172,000 + bonus or commission + equity + benefits. Our salary ranges are determined by job and level. The range displayed on each job posting reflects the target for new hire salaries for the position across all CAD locations. Within the range, individual compensation is determined by job-related skills and knowledge, relevant experience including professional and lived experience, and/or work location. Your recruiter can share more information about our total rewards package during the hiring process.
Why PointClickCare
- Market leader: #1 EHR platform for long-term and post-acute care in North America
- Massive data advantage: Decades of real-world insights from millions of resident journeys
- Mission-driven: Our technology helps caregivers deliver better outcomes for seniors
- Growth stage: Senior Living is a strategic priority with significant investment and executive sponsorship
- Culture: Collaborative, customer-obsessed, and committed to continuous improvement
PointClickCare Benefits & Perks:
- Benefits starting from Day 1!
- Retirement Plan Matching
- Flexible Paid Time Off
- Wellness Support Programs and Resources
- Parental & Caregiver Leaves
- Fertility & Adoption Support
- Continuous Development Support Program
- Employee Assistance Program
- Allyship and Inclusion Communities
- Employee Recognition
- … and more!
It is the policy of PointClickCare to ensure equal employment opportunity without discrimination or harassment on the basis of race, religion, national origin, status, age, sex, sexual orientation, gender identity or expression, marital or domestic/civil partnership status, disability, veteran status, genetic information, or any other basis protected by law. PointClickCare welcomes and encourages applications from people with disabilities. Accommodations are available upon request for candidates taking part in all aspects of the selection process. Please contact recruitment@pointclickcare.com should you require any accommodations.
As part of our commitment to a streamlined and equitable hiring experience, PointClickCare uses AI tools to assist with candidate screening and assessment.
When you apply for a position, your information is processed and stored with Lever, in accordance with Lever’s Privacy Policy. We use this information to evaluate your candidacy for the posted position. We also store this information, and may use it in relation to future positions to which you apply, or which we believe may be relevant to you given your background. When we have no ongoing legitimate business need to process your information, we will either delete or anonymize it. If you have any questions about how PointClickCare uses or processes your information, or if you would like to ask to access, correct, or delete your information, please contact PointClickCare’s human resources team: recruitment@pointclickcare.com.
PointClickCare is committed to Information Security. By applying to this position, if hired, you commit to following our information security policies and procedures and making every effort to secure confidential and/or sensitive information.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.


