Job Closed
This listing is no longer active.
Stop disease through technology.
Data Engineer
Location
United States
Posted
127 days ago
Salary
0
Seniority
Senior
Job Description
Data Engineer
InductiveHealth Informatics
• Lead the continued transition of legacy SAS-based ETL processes to SQL Server, completing remaining migrations and validating results through parallel processing and data reconciliation. • Translate undocumented or minimally documented legacy ETL logic into maintainable, fault tolerant SQL Server and SSIS workflows. • Improve and standardize incremental data processing patterns, reducing reliance on full data refreshes and destructive reload processes. • Own the reliability and performance of ETL pipelines by identifying and resolving bottlenecks, particularly in high-volume and performance-sensitive workflows. • Investigate and correct data flow issues that prevent records from consistently reaching downstream systems across environments. • Support production data operations by partnering with product, engineering, and support teams to triage and resolve data-related issues and support tickets. • Participate in regular operational check-ins and serve as a primary escalation point for ETL and data pipeline concerns. • Document ETL logic, dependencies, and operational processes to reduce institutional knowledge risk and improve long-term maintainability. • Introduce improved logging, monitoring, automation, and repeatability across data integration workflows. • Collaborate with engineering peers and domain experts to establish clearer ownership and standards for ETL and data pipeline practices.
Job Requirements
- Strong hands-on experience with SQL Server development, including advanced T-SQL, query optimization, and performance tuning in production environments.
- Demonstrated experience designing, maintaining, and modernizing data pipelines and ETL processes, particularly in environments transitioning from legacy architectures to more scalable, maintainable data platforms.
- Ability to analyze, interpret, and translate legacy data transformation logic (including SAS-based workflows) into modern, SQL-based implementations, with an emphasis on clarity, performance, and long-term maintainability.
- Experience with SQL Server–based data integration tooling or comparable modern data orchestration frameworks, including support for incremental processing, dependency management, and multi-step pipelines.
- Familiarity with modern data engineering concepts such as idempotent pipelines, incremental ingestion patterns, schema evolution, and environment-aware deployments.
- Comfort working within complex, partially undocumented systems and progressively improving them through refactoring, documentation, and automation.
- Experience supporting and operating production data pipelines, including diagnosing failures, resolving data quality issues, and partnering cross-functionally to restore and improve system reliability.
- Experience managing data workflows across multiple environments (development, scale, production) with attention to consistency, validation, and release coordination.
- Strong problem-solving skills with a systems-level mindset, particularly when identifying root causes of performance, scalability, or data integrity issues.
- Ability to work collaboratively with engineering, product, and support teams while maintaining clear ownership of data platform outcomes.
- Clear written and verbal communication skills, especially when documenting technical systems and explaining data flows to both technical and non-technical stakeholders.
Benefits
- Virtual first, remote organization and culture
- Flexible Paid Time Off (PTO)
- 401(k) retirement plan with corporate matching
- Medical, prescription, vision, and dental coverage (multiple plans based on your needs)
- Short Term and Long Term Disability (for employee)
- Life Insurance (for employee)
- New Team Member support for home office setup
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Azure Databricks Data Engineer
OZA leading consulting company whose Intelligent Automation expertise accelerates the way you do business.
• Design and implement end-to-end data solutions on the Azure platform, including data ingestion, data processing, data storage, and data visualization. • Develop and maintain data pipelines using Azure Data Factory, Azure Databricks, Azure Data Lake Storage, and other relevant tools and technologies. • Collaborate with data architects and data scientists to understand data requirements and design scalable and optimized data models and schemas. • Implement data integration solutions to extract, transform, and load (ETL) data from various sources into Azure data platforms. • Ensure the reliability, availability, and performance of data solutions by monitoring and optimizing data pipelines and storage systems. • Troubleshoot and resolve data-related issues, including data quality, performance, and security concerns. • Collaborate with cross-functional teams to gather business requirements and translate them into technical solutions. • Stay updated with the latest trends and advancements in Azure data technologies and provide recommendations for adopting new tools and techniques. • Perform data profiling, data validation, and data cleansing activities to ensure data accuracy and consistency. • Document technical specifications, data flows, and processes for reference and knowledge sharing.
• Design and implement scalable, modern data architectures (Lakehouse, Delta Lake, Medallion) • Collaborate with business and technology teams to align solutions with strategic objectives • Lead data modernization and decentralization initiatives with a Data Mesh approach • Ensure governance, security and compliance (LGPD, IAM, RBAC) • Participate actively in strategic data planning, aligning solutions with corporate goals • Support the definition of data roadmaps and contribute to long-term architectural decisions • Conduct regular alignments with technical and business stakeholders to ensure adherence to organizational needs
• Analyse and maintain RDF/TTL data models and vocabularies; • Develop, optimise, and maintain SPARQL queries; • Support data ingestion, transformation, and validation workflows; • Ensure consistency and correctness of semantic data across the platform; • Collaborate with backend engineers to integrate semantic logic into application flows; • Assist in documenting semantic models, assumptions, and constraints; • Participate in troubleshooting data quality and reasoning issues.
• Analyse and maintain RDF/TTL data models and vocabularies; • Develop, optimise, and maintain SPARQL queries; • Support data ingestion, transformation, and validation workflows; • Ensure consistency and correctness of semantic data across the platform; • Collaborate with backend engineers to integrate semantic logic into application flows; • Assist in documenting semantic models, assumptions, and constraints; • Participate in troubleshooting data quality and reasoning issues.



