Igniting what's possible
Data Engineer
Location
Alabama + 34 moreAll locations: Alabama | Arizona | California | Colorado | Connecticut | Florida | Hawaii | Idaho | Illinois | Iowa | Kansas | Kentucky | Nevada | New Hampshire | New Jersey | New Mexico | New York | North Carolina | Ohio | Oklahoma | Oregon | Maryland | Massachusetts | Michigan | Minnesota | Missouri | Pennsylvania | Rhode Island | South Carolina | Tennessee | Texas | Vermont | Virginia | Washington | Wisconsin
Posted
17 days ago
Salary
$140K - $175K / year
Seniority
Senior
Job Description
Data Engineer
Fuze Health
• Deploy new data pipelines • Design & build data observability platforms and metrics • Build metadata driven pipeline solutions • Monitor and optimize data transformations • Build third party data integrations
Job Requirements
- BA/BS or Master's degree in a quantitative field such as Statistics, Economics, Engineering, Mathematics, or Data Science
- At least 4 years of work experience in software engineering
- Experience with one or more of the major cloud vendors: AWS, GCP, Azure
- Experience with infrastructure as code tools and methodologies
- Experienced in SQL-based and python data manipulation with large datasets
- Thrive in a dynamic fast-paced entrepreneurial environment. You're unafraid to dive into an unfamiliar problem but humble enough to make mistakes and iterate
Benefits
- dental, vision, and multiple group medical plans to choose from
- a 401(k) retirement savings plan
- group life insurance
- accidental death and dismemberment (AD&D) insurance
- flexible spending account (FSA) and health savings account (HSA)
- commuter benefits
- employer-paid short-term (STD) and long-term disability (LTD) insurance
- additional supplemental insurance plans (spouse life insurance, legal insurance, an employee assistance program, home health testing kits, and a fertility medication discount program)
- flexible vacation time
- accrued paid sick time
- 10 paid holidays
- 2 floating holidays for full time non-exempt employees
- eight weeks of paid parental leave for eligible employees
- additional paid weeks for the birthing parent
- 4 weeks paid caregiver leave
- a Lifestyle Spending Account allowance each month
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Perform migration activities of Alteryx workflows to AWS; • Develop and refactor data pipelines using AWS Glue; • Adapt processes and business rules to a cloud-native architecture; • Ensure the quality, performance, and scalability of pipelines; • Provide technical support to the team with specialized Alteryx expertise; • Participate in the analysis, documentation, and optimization of ETL processes.
• Develop and maintain batch and streaming data pipelines in Databricks using PySpark and Spark SQL; • Implement pipelines following the established Medallion architecture (Bronze, Silver, and Gold); • Assist in migrating legacy pipelines and jobs from tools such as IBM DataStage, Azure Data Factory, and Azure Synapse Analytics to Databricks Workflows; • Help migrate routines and notebooks from Databricks on Azure to AWS; • Develop and version notebooks and code using Databricks Repos and Git; • Implement basic tests and data quality routines in pipelines; • Support dimensional modeling of the Data Warehouse (Star Schema) with the architecture team; • Collaborate with the DevOps team to automate deployments and CI/CD; • Participate in implementing data governance with Unity Catalog; • Monitor and provide support for production data pipelines.
• Build the new corporate Lakehouse using Databricks on AWS; • Develop batch and streaming data pipelines with PySpark and Spark SQL; • Create and maintain the Medallion architecture (Bronze, Silver, Gold) in Delta Lake; • Participate in designing the new Data Warehouse data model; • Migrate data, pipelines, and routines from the current Databricks on Azure environment to AWS; • Migrate jobs and integrations from IBM DataStage, Azure Data Factory, and Azure Synapse Analytics; • Migrate Databricks workspaces between Azure and AWS environments; • Configure and operate data governance using Unity Catalog and Odin; • Implement monitoring, testing, and data quality; • Work alongside DevOps on deployment automation and CI/CD; • Version notebooks, pipelines, and code using Git and Databricks Repos.
• Design, build, and maintain scalable data pipelines that ingest, transform, and validate large volumes of data across multiple sources and channels. • Improve the scalability, reliability, and performance of our data pipelines to support rapidly growing workloads and new data streams. • Contribute to the design and implementation of our Data Lake architecture, enabling reliable data storage and reuse across teams. • Manage and optimize data ingestion workflows, including data collected from web scrapers, third-party vendors, and internal systems. • Monitor pipeline health, investigate incidents, and implement improvements to increase system reliability and observability. • Support the onboarding and integration of new AI channels and data sources into the platform. • Collaborate with teams across the organization to ensure data generated by different systems can be reused effectively for analytics and business intelligence. • Identify and resolve performance bottlenecks in distributed systems, including rate limiting, concurrency, and throughput constraints. • Advise engineering and product teams on data architecture, data quality, and best practices for managing scalable data workflows. • Continuously evaluate and improve our data platform to support the company’s rapid growth and evolving product needs.


