Job Closed
This listing is no longer active.
Senior Data Engineer
Location
Illinois
Posted
30 days ago
Salary
$130K - $180K / year
Seniority
Senior
Job Description
Intelligent Medical Objects (IMO)
• Build and operate production-grade data platforms that support IMO’s terminology-driven products, analytics, and machine learning use cases
• Design, develop, and maintain data pipelines for batch and incremental processing using modern lakehouse and cloud-native patterns
• Work extensively with cloud data platforms (AWS + Databricks) to ingest, transform, and serve structured and semi-structured data at scale
• Model data intentionally, developing well-documented, analytics- and product-ready data models that balance usability, performance, and correctness
• Apply strong software engineering practices to data work, including version control, testing, CI/CD, and infrastructure-as-code
• Collaborate directly with product, analytics, and AI teams to translate requirements into scalable technical solutions
• Improve reliability, performance, and cost-efficiency of data systems through monitoring, observability, and continuous optimization
• Design for data quality and trust, implementing automated checks, validation frameworks, and lineage-aware workflows
• Contribute to platform evolution, helping shape standards around orchestration, data modeling, environments, and deployment
• Operate in an Agile environment, taking ownership of deliverables and proactively identifying risks and opportunities
• Mentor and support other engineers, leading by example in code quality, problem decomposition, and technical decision-making
• Continuously learn and apply industry best practices in data engineering, analytics engineering, and AI data foundations
Job Requirements
- Bachelor’s degree in a relevant technical field and 5+ years of professional experience, or 7+ years of equivalent hands-on experience
- Demonstrated experience building and supporting end-to-end data platforms in a production environment
- Strong programming experience in Python and SQL, with an engineering mindset toward maintainability and testing
- Deep experience with cloud-based data platforms, especially:
  - AWS (e.g., S3, EC2, RDS, IAM)
  - Databricks / Spark-based processing
- Strong SQL skills, including complex transformations and performance-aware query design
- Hands-on experience with data orchestration frameworks (e.g., Airflow or equivalent)
- Experience designing and optimizing data models for analytics, reporting, and downstream applications
- Familiarity with CI/CD practices and infrastructure-as-code (e.g., Git, Terraform)
- Comfort working with large, complex, and evolving datasets, including managing schema change and metadata
- Strong analytical, debugging, and root-cause analysis skills
- Clear written and verbal communication skills, including documenting designs and tradeoffs
- A proactive, ownership-oriented mindset and the ability to work effectively across teams.
More Data Engineer Jobs
Data Engineer (Remote)
Department: IT
Location: Canton, MA
The Phia Group is a service-oriented organization assisting employee health plans nationwide. We provide our clients with innovative cost-cutting solutions and constantly expanding service offerings. We continue to enjoy growth thanks to our most valuable resource – our talented and committed team.
At The Phia Group, whose mission is to provide high quality yet affordable healthcare to American employees and their families, you can look forward to not only unparalleled benefits for yourself but also being immersed in a company that was named one of USA Today’s Top Workplaces for 2026. Regionally, both The Boston Globe and Louisville Business First also recognized our commitment to an internal culture of inclusivity, enjoyment, and empathy for our employees by naming The Phia Group to their respective Top Places to Work lists for 2026.
The Data Engineer is responsible for supporting the development, maintenance, and optimization of data pipelines and analytics-ready datasets. You will collaborate across multiple teams and stakeholders to solve complex problems and support data-driven initiatives.
Essential duties and responsibilities include the following; other duties may be assigned:
- Build, maintain, and optimize data pipelines utilizing Azure Data Factory, ensuring data is ingested, transformed, and delivered to Snowflake reliably for analytics
- Implement monitoring, alerts, and testing of data pipeline performance, data quality metrics, and lineage to ensure trustworthy data delivery
- Troubleshoot data issues and perform root cause analysis to proactively resolve operational issues
- Document data structures, processes, architectural decisions, and best practices for knowledge sharing
- Develop, maintain, and optimize Snowflake objects (schemas, tables, views) and SQL transformations to produce curated, analytics-ready datasets
- Collaborate with analysts, stakeholders, and product owners to translate business needs into data requirements and stable technical implementations
- Enable data for AI/ML use cases by preparing feature-rich datasets, supporting feature engineering, and ensuring data consistency for model training and inference
- Support deployment and operationalization of machine learning models by integrating pipelines with ML workflows (e.g., batch/real-time scoring)
- Continually improve ongoing reporting and analytics, automating or simplifying self-service or manual processes
- Implement version control practices for all data engineering code and documentation
Experience and Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, Information Technology, or a related field; or equivalent experience
- 5+ years of experience in data engineering or business intelligence roles working with ETL, data modeling, data architecture, and developing pipelines and applications for analytics (e.g., BI, reporting, machine learning, deep learning)
- Solid programming skills in advanced SQL, Python, or other programming languages for data processing and automation
- Experience supporting or working with AI/ML workflows, including:
  - Data preparation and feature engineering for machine learning models
  - Integration of data pipelines with ML frameworks (e.g., scikit-learn, TensorFlow, PyTorch, or similar)
  - Understanding of model lifecycle concepts (training, validation, deployment, monitoring)
- Expertise working with Snowflake for data warehousing, including experience with schema design, performance tuning, and optimization
- Proficiency with Git, Azure DevOps, and collaborative development best practices
- Experience designing, developing, and deploying end-to-end pipelines using Azure Data Factory
Working Conditions / Physical Demands
Sitting at a workstation for prolonged periods of time. Extensive computer work. Workstation may be exposed to overhead fluorescent lighting and air conditioning. Fast-paced work environment. Operates office equipment including personal computer, copiers, and fax machines.
This job description is not intended to be and should not be construed as an all-inclusive list of all the responsibilities, skills, or working conditions associated with the position. While it is intended to accurately reflect the position activities and requirements, the company reserves the right to modify, add, or remove duties and assign other duties as necessary. External and internal applicants, as well as position incumbents who become disabled as defined under the Americans with Disabilities Act, must be able to perform the essential job functions (as listed here) either unaided or with the assistance of a reasonable accommodation to be determined by management on a case-by-case basis.
Full Stack Data Engineer
Anbre Interiors | Top Interior Designers in Chennai & Luxury Interiors
Interiors That Inspire – Chennai's Best Home Interior Design Experts.
• Design, build, and maintain data pipelines using GCP tools (BigQuery, Cloud Functions, Cloud Composer, Cloud Scheduler, Apache Beam, Airflow).
• Clean, transform, and organize data from multiple sources.
• Automate ETL/ELT workflows for reliability and scalability.
• Support ingestion from APIs, spreadsheets, and internal systems.
• Write Python and Bash scripts to process and automate data tasks.
• Develop lightweight backend services and utilities to streamline internal processes.
• Build and update dashboards in Looker Studio and D3.js.
• Deliver clean, intuitive KPI reports for operations and leadership.
• Support visualization needs across the Wellness Division.
• Contribute to simple predictive modeling and forecasting tasks.
• Prepare structured datasets for future machine learning initiatives.
Data Engineer – Mid-Senior Level
SOFTGAMES
We are an instant gaming company. We develop casual, truly social games that can be played instantly across all devices.
• Design, build, and maintain scalable data pipelines using Databricks and Apache Airflow
• Support data analysts by ensuring they have clean, structured, and well-documented data
• Manage and optimize Tableau dashboards, ensuring seamless data exports and reporting
• Develop ETL workflows and data integrations using Python
• Monitor data quality, optimize performance and cost efficiency
• Take responsibility for the smooth operation and reliability of the data platform
• Work with cloud-based data solutions (AWS)
Data Management Internship
American Forests
We lead the movement to reforest America, from cities to large, forested landscapes.
Role Description
American Forests is seeking a part-time, paid undergraduate or graduate student intern for June–July 2026 to support data management, metadata development, and GIS-related tasks associated with the launch of the Forest Innovation Platform.
- Organizing datasets from the Climate Risk Viewer.
- Integrating datasets into the Forest Innovation Platform Data Hub.
- Developing clear, standardized metadata for platform data layers.
- General data processing and GIS support as needed.
Qualifications
- Current enrollment at Colorado State University.
- Experience with GIS (e.g., ArcGIS, QGIS).
- Strong data management and organizational skills.
- Clear and effective technical writing ability.
Requirements
- This position reports to the Director of Climate Science.
- No direct reports.
Benefits
- Compensation: $20.00 per hour.
- This position is part-time and non-exempt.
- Not eligible for benefits.
Working Conditions
- This is a remote position with a preference for candidates based in Fort Collins, Colorado.
- Up to 10% travel throughout the U.S. and/or to American Forests’ headquarters in Washington D.C. may be required.
- Ability to remain in a stationary position (sitting or standing) for extended periods.
- Ability to operate a computer and other office equipment.
- Reasonable accommodation may be provided to support individuals across the full range of abilities and experiences.



