Arbital Health logo
Arbital Health

We are a neutral third-party adjudication utility that is accelerating the $1 trillion shift to value-based care

Senior Product Manager, Data Platform

Data EngineerData EngineerFull TimeRemoteSeniorTeam 1-10Since 2023H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

3 days ago

Salary

$170K - $200K / year

Seniority

Senior

Bachelor Degree4 yrs expEnglishAirflowAWSPythonSQL

Job Description

Senior Product Manager, Data Platform

Arbital Health

• Own the product roadmap for Arbital’s data pipeline platform, including ingestion, transformation, calculation, validation, audit trail, and AI consumption layers • Define and prioritize pipeline capabilities based on client needs, implementation learnings, engineering constraints, and long-term platform scalability goals • Translate complex healthcare data requirements from claims processing to VBC contract logic into structured, buildable product specs • Partner with leadership to align pipeline investments with Arbital’s broader product and go-to-market strategy • Write detailed PRDs, user stories, and technical specifications for platform features, configurations, and automation tooling • Work directly with engineering to scope, sequence, and ship pipeline improvements — balancing speed, quality, and flexibility • Define acceptance criteria and lead QA processes for new pipeline & platform capabilities, ensuring outputs meet accuracy and performance standards • Drive platform delivery end-to-end, owning prioritization, cross-team dependencies, and release coordination • Develop deep fluency in Arbital’s data models, pipeline architecture, and healthcare data standards (claims, eligibility, attribution, CMS/ACO files), and actuarial concepts (IBNR, rebates, contract terms) • Work hands-on with data — running SQL queries, reviewing pipeline outputs, and validating logic — to inform product decisions and support debugging • Define standards for data quality, deduplication, business rule configuration, lineage, and pipeline observability across all client environments • Evaluate and recommend tooling improvements across the platform stack (e.g., Airflow, Databricks, AWS) in partnership with engineering • Serve as the primary product owner for data capabilities across implementation, engineering, actuarial, and data science teams • Partner closely with the Implementation team to surface recurring client configuration needs and turn them into scalable platform features • Collaborate with actuarial and data science teams to ensure pipeline logic correctly supports attribution, aggregation, and actuarial use cases • Communicate roadmap priorities, tradeoffs, and delivery status clearly to both technical teams and non-technical stakeholders

Job Requirements

  • 4–7 years of experience in product management, with at least 2 years owning data platform, data infrastructure, data pipelines, or platform/API products
  • Strong technical foundation — comfortable reading data schemas, writing SQL, and engaging meaningfully with engineering on architecture decisions
  • Experience working with healthcare data (claims, eligibility, value-based care) strongly preferred
  • Proven ability to translate ambiguous, complex requirements into clear, actionable product specifications
  • Excellent cross-functional collaboration skills — experience working across engineering, data science, and client-facing teams
  • Strong written and verbal communication skills, with an ability to tailor messaging to both technical and business audiences
  • High attention to detail and a strong bias toward quality in data products
  • Comfortable operating with autonomy in a fast-moving, early-stage environment
  • Hands-on experience with Airflow, Databricks, Python, dbt, or AWS data services (Nice to Have)
  • Background in value-based care, actuarial modeling, or population health analytics (Nice to Have)
  • Experience building configurable, multi-tenant data pipelines at scale (Nice to Have)
  • Experience with data lineage, audit trail, or data governance products (Nice to Have)
  • Prior work at a health tech startup or data-driven healthcare company (Nice to Have)
  • Familiarity with BI tooling such as Sigma or Looker (Nice to Have)

Benefits

  • Generous equity grants of ISO stock options
  • We offer an exceptional benefits package with high employer-paid contributions for health, dental, and vision insurance
  • 4% 401(k) match
  • Flexible PTO, a weeklong winter shutdown, and 10 holidays each year
  • Occasional travel required - Quarterly team offsites

Related Categories

Related Job Pages

More Data Engineer Jobs

UR Ventures logo

Senior Data Warehouse Developer

UR Ventures

Ensuring University of Rochester innovations enrich our community, improve our society, and make the world ever better

Data Engineer3 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

• Architecting, designing, developing, implementing and supporting various components for the enterprise data warehouse and associated data sources. • Works independently and provides support for reporting projects, as well as process and architectural recommendations. • Mentoring staff, demonstrating best practices, planning for business continuity and sustainability and working on projects that support Business Intelligence and Data Warehousing functions as assigned. • Works with the user community and reporting team to understand business data and data transformation requirements. • Converts those requirements to functional/technical specifications for database and integration design. • Manages requests and/or components of larger projects with appropriate documentation and status reports. • Architects data integration and transformation processes across on premise and cloud data sources. • Provides leadership and is a subject matter expert depending on area of focus. • Designs and develops data integration and transformation processes across on-premises and cloud data sources using enterprise tools such as Informatica, Denodo, SQL Server, SSIS, Oracle Data Integrator, and Boomi AtomSphere, selected based on environment and requirements. • Develops data models and database designs that leverage their knowledge of data warehouse modeling best practices. • Collaborates with other team members using scrum processes or other project management disciplines depending on area of focus. • Follows best practices for change and code management. • Provides supporting technical documentation that is written, organized and maintained. • Participates in ongoing stabilization, support and maintenance. • Provides on call support as needed for business continuity and operations. • Attends webinars, reads case studies and white papers, and participates in opportunities to further refine skills and stay apprised on functional area topics.

New York
$86.5K - $129.7K / year
IntegriChain logo

Senior Data Engineer

IntegriChain

Data-Driven Commercialization

Data Engineer3 days ago
Full TimeRemoteTeam 501-1,000H1B Sponsor

• Help define and mature data integration, data consolidation, MDM integration, and data platform design patterns across Integrichain. • Design, build, optimize, and operate Snowflake data models, pipelines, stored procedures, and high-volume data processing patterns. • Partner with MDM and Product teams to support HCO Master data ingestion, outbound extracts, cross-reference data, golden record consumption, survivorship outputs, and downstream publishing patterns. • Work with Product, Engineering, MDM, Data Science, DevOps, Security, and business stakeholders to align data solutions to enterprise priorities. • Use dbt or similar ELT tooling to develop reliable, maintainable, testable, and observable data pipelines. • Drive Snowflake performance tuning, warehouse sizing, workload management, cost tracking, and cost optimization practices. • Partner with Data Science leadership to rationalize and consolidate the enterprise data landscape across products, platforms, and acquired capabilities. • Define reusable data integration patterns for batch, micro-batch, near-real-time, and application-to-application data exchange. • Collaborate with cross-functional teams to understand business data needs, source-system realities, and enterprise application integration requirements. • Design scalable patterns for ingesting, transforming, mastering, and publishing data across operational and analytical use cases. • Help establish standards for data contracts, schema evolution, data quality, lineage, and data ownership. • Design and build data pipelines that load source data into Reltio MDM and extract mastered outputs from Reltio for downstream Snowflake, analytics, AI, and operational use cases. • Partner with MDM configuration and Product Management teams to translate HCO mastering requirements into data pipeline, mapping, validation, reconciliation, and publishing patterns. • Work with Reltio APIs, exports, crosswalks/XREFs, event-based integration patterns, and bulk load/extract mechanisms as needed to support inbound and outbound data flows. • Engineer integration patterns for HCO Master data, including party/entity, address, identifier, hierarchy, relationship, match/merge, survivorship, and golden record outputs. • Support source ingestion and reference data integration involving datasets such as HIN, DEA, NPI, NCPDP, 340B/PHS, channel outlet data, customer/account data, and other life sciences master/reference sources. • Develop validation and reconciliation processes to compare source data, Reltio mastered data, Snowflake curated data, and downstream consumption layers. • Help operationalize MDM outputs for business-facing data products, semantic models, reporting tables, APIs, and AI-ready datasets. • Design Snowflake database, schema, table, view, and semantic-layer patterns that support performance, governance, and maintainability. • Optimize Snowflake workloads using clustering, micro-partition awareness, warehouse sizing, query profiling, caching behavior, and workload isolation. • Implement Snowflake cost tracking and optimization practices, including warehouse utilization monitoring, inefficient query identification, and cost allocation by workload, team, or use case. • Build scalable SQL and Snowflake stored procedure logic for large-volume data processing and analytical workloads. • Apply secure Snowflake design patterns including RBAC, masking, access isolation, auditing, and environment separation. • Design, build, and maintain reliable ELT pipelines using dbt or comparable modern data transformation tooling. • Develop Python-based automation for API integration, file processing, metadata management, validation, orchestration support, and operational tooling. • Develop modular, tested, and reusable transformation models for raw, curated, mastered, and business-ready data layers. • Implement automated data quality checks, source freshness checks, reconciliation, logging, and exception-handling patterns. • Build orchestration-ready pipelines that support dependency management, restartability, incremental loads, and operational monitoring. • Collaborate with DevOps/SRE teams on CI/CD, deployment automation, environment promotion, and operational runbooks for data pipelines. • Spearhead logical and physical data modeling efforts for enterprise analytical, operational, MDM, and AI-ready datasets. • Design models that balance normalization, dimensional modeling, medallion/lakehouse concepts, and application-specific consumption needs. • Create denormalized reporting and semantic-model-ready structures that simplify business consumption and reduce ambiguity for AI/LLM use cases. • Process and optimize large data volumes in Snowflake using efficient SQL, PL/SQL-style procedural logic, Snowflake Scripting, and performance-aware design. • Create reusable patterns for historical tracking, snapshots, audit columns, data versioning, and lifecycle management. • Ensure data models support downstream BI, AI/ML, semantic models, data apps, MDM Explorer/Entity 360 use cases, and enterprise reporting.

Pennsylvania
Full TimeRemoteTeam 1,001-5,000Since 1999H1B No Sponsor

• Co-define and evolve the data platform roadmap in collaboration with architecture and engineering teams • Translate technical and business requirements into epics, user stories, and technical backlog items • Work closely with Data Engineers and Architects on: data models and architectures (batch/streaming) data pipelines and ingestion frameworks storage (e.g. data lake / data warehouse) and processing layers • Support design and implementation of platform components for: machine learning workflows (MLOps, feature stores, model lifecycle) data observability, lineage, and quality monitoring • Ensure datasets are reliable, well-structured, and ready for analytics and ML use cases • Participate in technical discussions and architecture decisions (not just coordination) • Drive delivery in an agile setup (planning, backlog refinement, prioritisation) • Communicate technical concepts in a clear way to non-technical stakeholders

Poland
The Hanover Insurance Group logo

Associate Data Engineer

The Hanover Insurance Group

For more than 170 years, The Hanover has been committed to delivering on our promises and being there when it matters the most. We live our values every day, demonstrating we CARE through our values, Sustainability initiatives and inclusive corporate culture.

Data Engineer3 days ago
Full TimeRemoteTeam 5,001-10,000

Role Description This role will primarily become proficient with all internal & external data produced and consumed by Hanover’s Core Commercial and Specialty businesses. This role will drive complex projects independently as well as under the mentorship of experienced engineers. The Associate Data Engineer will collaborate with business partners and cross-functional teams on data and reporting requests. The assignments will become broader in nature, usually requiring originality and ingenuity. This role will be accountable for successful outcomes related to designing, developing, implementing, optimizing, and maintaining data integrations, pipelines and solutions. IN THIS ROLE, YOU WILL: - Complete data integration and build scalable ETL pipelines - Learn data engineering best practices for data storage, integration, transformation, and access within cloud and on-premises platforms - Gain a strong understanding of Medallion Architecture and Data Lake best practices - Ensure data security and compliance within industry standards - Develop and maintain documentation for data integration and ETL solutions - Build solutions within Azure Data Services (Azure Synapse Analytics, Azure SQL Database, Azure Data Lake Storage) - Create solutions using pySpark/SparkSQL, and Delta Tables - Collaborate with data scientists, analysts, and stakeholders - Work within agile development methodologies - Occasional Power BI report creation for data delivery - Utilize best practices for CI/CD Qualifications - Bachelor’s degree or higher in Data Science, Mathematics, Statistics, Computer Science, Information Systems, Business Information Technology, or other related fields - 0-2 years professional experience in data engineering, data architecture, and/or data analytics - Beginner proficiency in SQL and Python - Strong problem-solving, communication, and collaboration skills - Critical thinking and attention to detail - Must be eligible to work in the United States without requiring sponsorship now or in the future (i.e. US Citizen or green card holder) Requirements - Some experience working with Azure, AWS, GCP, or other cloud - CI/CD experience with ADO or Github - Azure Synapse Analytics and/or Microsoft Fabric experience - PowerBI experience - Insurance industry experience Benefits - Medical, dental, vision, life, and disability insurance - 401K with a company match - Tuition reimbursement - PTO - Company paid holidays - Flexible work arrangements - Cultural Awareness Day in support of IDE - On-site medical/wellness center (Worcester only)

United States
$58K - $75K / year