Job Closed

This listing is no longer active.

Arine optimizes medication to ensure each patient is on the safest, most effective therapy for their unique health needs

Staff Data Engineer

Data Engineer | Other Remote | Lead | Team 11-50 | H1B No Sponsor

Location

United States

Posted

155 days ago

Salary

$170K - $185K / year

Seniority

Lead

Python AWS ETL dbt Docker Kubernetes SQL Git

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

As a key technical leader and team architect working in a fast-paced environment, you will drive the design, development, and optimization of scalable data ingestion pipelines within the Arine platform. Leveraging expert-level proficiency in Python and AWS, you will architect solutions that handle diverse file types and large-scale healthcare datasets. You will have a direct impact on building reusable, configurable toolsets that handle data needs for the entire company.

What You'll Be Doing:

- Act as the team architect by leading system design reviews, offering recommendations, conducting comprehensive peer reviews, and demonstrating expert-level proficiency in Python and AWS services.
- Architect and implement scalable data ingestion pipelines that handle different file types into the Arine platform.
- Develop reusable components that integrate into data pipelines to increase efficiency and reduce future implementation time.
- Create configuration-driven, containerized toolsets that are easy to use and maintain across diverse engineering profiles.
- Work collaboratively with cross-functional teams to meet data requirements through ETL components.
- Design and maintain data transformation pipelines using dbt, including macros, incremental models, and dbt tests.
- Implement incremental data ingestion strategies for large-scale healthcare datasets.
- Build monitoring and alerting systems for data ingestion processes and overall pipeline health.
- Apply software engineering best practices, including test-driven development and modular design, to data infrastructure.
- Refactor and rebuild existing data ingestion processes to improve scalability and operational efficiency.
- Work with containerization technologies (Docker, Kubernetes) to create portable and maintainable data solutions.
- Identify and escalate inefficiencies within and across teams.
- Provide technical guidance and mentorship to junior engineers, and promote best practices and coding standards.
- Author and maintain high-quality technical documentation, and support junior engineers in doing the same.
- Collaborate with the DE Manager to report on DE contractor performance issues.

Qualifications

- 10+ years working in data engineering, with a focus on large-scale data ingestion and infrastructure.
- Deep expertise in Python and modern data engineering tools.
- A track record of building automated, production-grade ETL processes using Python and dbt SQL.
- Strong understanding of ETL/ELT frameworks and distributed data processing.
- Hands-on proficiency with modern data technologies and comfort leveraging AI coding assistants to accelerate development, improve code quality, and enhance productivity.
- Skilled in data processing, validation, cleaning, and debugging.
- Strong capability integrating APIs for seamless data exchange between systems.
- Proven ability to handle and process varied file types and formats, including healthcare standards such as HL7, 834, 837, and NCPDP.
- Demonstrated success integrating and consolidating data from diverse source systems into a unified repository, including EHR and claims systems, via both file-based and API integrations.
- Comfort working with large-scale datasets (10GB+).
- Strong capability implementing incremental processing and change data capture (CDC) methodologies.
- Extensive background designing scalable data architectures in AWS environments.
- Solid grounding in software engineering principles, including test-driven development, loose coupling, single responsibility, and modular design.
- Hands-on familiarity with containerization (Docker, Kubernetes) and building configuration-driven, maintainable systems.
- Proven ability to build tools and systems that diverse engineering profiles can operate through configuration rather than code changes.
- A passion for building new data infrastructure and continuously improving existing systems with robustness, maintainability, and operational excellence.
- Familiarity with healthcare data and regulatory environments (HIPAA) is a plus.
- Strong collaboration skills, with comfort partnering across technical and non-technical stakeholders.
- Excellent written and verbal communication, with the ability to explain technical infrastructure concepts to diverse audiences.

Requirements

- Ability to pass a background check.
- Must live in and be eligible to work in the United States.

Benefits

- Dynamic role with the opportunity to contribute to the company's growth and shape its future.
- Unparalleled learning and growth prospects, collaborating closely with experienced Clinicians, Engineers, Software Architects, and Digital Health Entrepreneurs.
- Salary range for this position is: $170,000-185,000/year.

Remote Work Requirements

- An established private work area that ensures information privacy.
- A stable high-speed internet connection for remote work.
- This role is remote, but you will be required to come to on-site meetings multiple times per year.

More Data Engineer Jobs

Data Manager

Health Services Advisory Group

HSAG is an EEO Employer of Veterans protected under Section 4212. If you have special needs and require assistance completing our employment application process, please feel free to contact us. EOE M/F/Veteran/Disability. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Data Engineer | 155 days ago

Other Remote

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

HSAG is seeking a Data Manager I to join the Data Science & Advanced Analytics (DSAA) division, focusing on managing claims and encounter data within a Microsoft SQL Server environment. This role emphasizes data integrity, compliance, and quality improvement in healthcare analytics. The Data Manager I position is a key contributor to healthcare data management projects, with a focus on claims and encounter data.

Responsibilities include:

- Supporting extraction, transformation, and loading (ETL) processes
- Data validation and reporting in a Microsoft SQL Server environment
- Developing and maintaining data dictionaries
- Conducting audits
- Collaborating with internal teams to ensure accurate data collection and reporting
- Training team members on data management best practices
- Ensuring compliance with health data regulations

The Data Manager I will work with a wide array of healthcare data types, including but not limited to:

- Survey
- Case review
- Medical and prescription drug claims and encounters
- Eligibility
- Demographic
- Clinical
- Electronic health record
- Registry
- Vital statistics
- Operational

Qualifications

- Bachelor’s degree in computer science, computer information systems, or a quantitative discipline
- At least three (3) years of experience with ETL processes in a Microsoft SQL Server environment
- At least three (3) years of data warehousing experience
- Excellent SQL programming skills
- Experience gathering and refining requirements, interviewing business users to understand and document data requirements
- Strong attention to detail, documenting business and technical requirements based on user interviews
- Database testing experience
- Experience with Microsoft SQL Server

Requirements

- Experience working with healthcare data
- Proficient in Microsoft Word, Excel, and Access
- Excellent verbal and written communication skills
- Ability to handle several projects simultaneously and work with multiple teams

Benefits

- Formal internal training on an assortment of healthcare-related topics during the first year

SQL Microsoft SQL Server ETL

United States + 171 more

$75K - $90K / year

Job Closed

Senior Data Engineer

TRACTIAN

Artificial Intelligence Quarterbacking Your Maintenance

Data Engineer | 155 days ago

Full Time Remote | Team 51-200 | H1B No Sponsor

• Develop and maintain scalable data pipelines and ETL processes.
• Design, implement, and optimize existing data extraction and loading processes with adequate data engineering design patterns.
• Lead data engineering reliability and observability, increasing analytics team awareness of the data flow processes before it becomes an issue.
• Collaborate with backend and analytics engineers in a holistic data engineering process, loading data accordingly with the technical requirements.
• Ensure data quality and consistency across various sources by implementing data validation and cleansing techniques.
• Work with cloud-based data warehouses and analytics platforms to manage and store large datasets.
• Monitor and troubleshoot data pipelines to ensure reliable and timely delivery of data.
• Document data processes, workflows, and best practices to enhance team knowledge and efficiency.
• Create dashboards as data products as internal

Airflow Amazon Redshift AWS ETL Grafana Apache Kafka PostgreSQL Python Rust SQL

Brazil

Principal Software Data Engineer

PointClickCare

Founded in 2000 and based in Mississauga, Ontario, Canada, PointClickCare offers comprehensive services to assist long-term healthcare providers. One of the fir

Data Engineer | 155 days ago

Other Remote

• Lead and guide the design and implementation of scalable distributed systems based on Java microservices
• Engineer and optimize data pipelines using solutions like Apache Hudi, Apache Trino, Azure ADLS
• Collaborate cross-functionally with product, analytics, and AI teams to ensure data is a strategic asset
• Advance ongoing modernization efforts, deepening adoption of event-driven architectures and cloud-native technologies
• Drive adoption of best practices in data governance, observability, and performance tuning for data workloads
• Embed data quality in processing pipelines by defining schema contracts, implementing transformation tests and data assertions, enforcing backward-compatible schema evolution, and automating checks for freshness, completeness, and accuracy across batch and streaming paths before production deployment
• Establish robust observability for data pipelines by implementing metrics, logging, and distributed tracing for streaming jobs, defining SLAs and SLOs for latency and throughput, and integrating alerting and dashboards to enable proactive monitoring and rapid incident response
• Foster a culture of quality through peer reviews, providing constructive feedback and seeking input on your own work

Apache HTTP Server AWS Azure Distributed Systems GCP HDFS Java Microservices Apache Spark

United States

$183K - $203K / year

Principal Data Platform Engineer

Deputy

Make a difference to the lives of shift workers around the world

Data Engineer | 155 days ago

Other Remote | Team 201-500 | Since 2008 | H1B Sponsor

Role Description

At Deputy, our mission is to simplify shift work and create thriving workplaces. Data is at the heart of how we achieve this, powering everything from product innovation to exceptional customer experiences. We are looking for a Principal Data Engineer to be the technical cornerstone of our data team, leading the charge in building a scalable and reliable data platform that will support our next phase of growth.

This is a foundational, strategic role. You will have the autonomy to shape our data architecture, champion engineering best practices, and set the technical vision for how we use data across the company. This role is a perfect blend of hands-on technical leadership and strategic influence, where you will not only build powerful data systems but also mentor a talented team and partner with leaders across the business.

Responsibilities

- Architect and Evolve Our Core Data Platform
  - Own the technical vision and roadmap for our data platform.
  - Design, implement, and refine a robust data lakehouse architecture (e.g., Medallion) using Databricks and Delta Lake.
  - Develop and maintain resilient, reusable patterns for ingesting data from diverse sources.
  - Lead the implementation of core data modelling principles (e.g., Kimball dimensional modelling).
  - Use tools like Unity Catalog to establish a comprehensive data governance framework.
  - Develop strategies for monitoring, optimising, and forecasting Databricks and cloud expenditure.
- Champion Engineering Excellence and Best Practice
  - Implement and advocate for automated CI/CD pipelines (e.g., using GitHub Actions).
  - Champion a Git-first culture for all data transformation code.
  - Implement comprehensive, automated data quality testing at every stage of our pipelines.
  - Establish thorough monitoring, logging, and alerting for all data pipelines.
- Be a Strategic Partner Across the Business
  - Collaborate with leaders in Product, Engineering, Sales, Finance, and Customer Success.
  - Advise analysts, data scientists, and stakeholders on leveraging the data platform.
  - Serve as the go-to expert on our data architecture.
- Lead, Mentor, and Elevate Our Data Team
  - Actively mentor data analysts and engineers through pair programming and code reviews.
  - Lead initiatives like a 'data guild' for knowledge sharing.
  - Partner with data leadership to define career progression pathways.

Qualifications

- Mastery of data architecture principles and data modelling frameworks.
- Strong software engineering mindset with experience in CI/CD for data.
- Exceptional communication and stakeholder management skills.
- Genuine passion for leadership and mentorship.

Requirements

- Tech Stack: dbt, Databricks, Unity Catalog, Terraform, AWS (Redshift, DynamoDB, API Gateway, CloudWatch, Lambda, Streaming with Kinesis/Firehose, Glue, Bedrock).
- Languages required: advanced SQL, Python.

Benefits

- Flexible remote-first work policy with a work-from-home stipend.
- Employee Share Ownership Plan (ESOP).
- Paid parental leave.
- Group Salary Continuance Insurance.
- Employee Assistance Program.
- Additional leave days including study assistance, celebration days, and volunteering.
- Global working groups focused on collaboration, belonging, and connection.
- Annual Hackathons.
- Novated leasing for electric vehicles, internet reimbursement, and more.

Databricks Unity dbt Terraform AWS Amazon Redshift DynamoDB Amazon CloudWatch Amazon Lambda Amazon Kinesis Amazon Bedrock SQL Python GitHub Actions Git

United States
