A software development and consulting company, Emergent Software is headquartered in Minneapolis, Minnesota, and serves clients from a broad spectrum of industr
Senior Data Engineer
Location
United States
Posted
88 days ago
Salary
$115K - $150K / year
Seniority
Senior
Job Description
Senior Data Engineer
Emergent Software
• Architect end-to-end data platforms on Azure, ensuring scalability, resilience, and future-proof design • Lead design reviews and architectural decisions for large or complex client data projects • Contribute to defining data engineering best practices, coding standards, and reusable frameworks across the team • Mentor and coach junior engineers on advanced Azure services, performance tuning, and cloud-native patterns • Collaborate with Data Architects to define data models, schemas, and integration patterns • Drive cloud cost optimization strategies, ensuring clients use the right services at the right scale • Evaluate and recommend new Azure tools and technologies for inclusion in Emergent Software’s offerings • Lead client advisory on data platform modernization, migration strategies (e.g., on-prem to Azure), and data engineering maturity • Contribute to thought leadership, such as blog posts, webinars, or speaking engagements on Azure data topics • Design, develop, and maintain data pipelines using Azure Data Factory, Synapse Pipelines, or Azure Batch • Build and optimize data storage layers (Azure OneLake, Azure SQL, Synapse Analytics) for structured, semi-structured, and unstructured data • Implement data transformation logic (ETL/ELT) using Spark, Python, SQL, or Azure Data Factory • Ensure data quality, validation, and lineage using tools like Azure Purview and integrate with governance frameworks • Collaborate with BI teams to prepare curated data models and tables for reporting in Power BI or downstream systems • Manage data security, privacy, and compliance (RBAC, encryption, sensitive data masking) across Azure services • Monitor and optimize pipeline performance and cost efficiency using Azure Monitor and cost management tools • Develop and maintain CI/CD pipelines for data projects using Azure DevOps • Document data engineering solutions and adhere to Emergent Software’s delivery standards and Agile practices
Job Requirements
- 5-10 years experience working in data-specific roles
- Advanced understanding of data modeling design patterns such as 3NF and star-schema
- Data warehouse design experience using medallion architecture
- Experience with Microsoft Fabric, Synapse or Snowflake
- ETL tools such as Azure Data Factory
- Experience using a scripting language for data transformation
- Performance tuning databases and long running queries
- Stored procedure development
- Experience working with unstructured/structured large-scale data.
- Advanced SQL syntax
- Advanced Python scripting
- Experience with real-time data processing
- Holding Microsoft certification(s) in the Data & AI solutions field are a plus
- Microsoft Certified: Fabric Analytics Engineer Associate (DP-600)
- Microsoft Certified: Fabric Data Engineer Associate (DP-700)
- Microsoft Certified: Power BI Data Analyst Associate (PL-300)
Benefits
- Medical Insurance: up to 84% of your monthly medical premium (HSA options available)
- HSA Contribution: up to $200/month
- Dental & Vision Insurance: up to 50% of your monthly dental and vision premium costs
- 401(k) plan: company match up to 4% of salary
- Profit sharing bonus: up to 15% of salary paid quarterly
- Extra compensation: get paid extra for work over 40 hours/week
- Employee referral & customer referral bonuses
- Flex Spending Account (FSA) for Dependent Care & Healthcare Costs
- Short Term Disability: $500/week for 12 weeks
- Long Term Disability: up to $6,000/month
- Group term life and AD&D insurance: $50k
- PTO, standard holidays, 2 floating holidays
- Paid parental leave
- Staff development program: 100 hours/year plus training costs
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Staff Data Engineer
InterWell HealthInterWell Health is a kidney care management company that is on a mission to reinvent healthcare and better help its patients live healthy and happy lives. As a
• Design and evolve a scalable, secure, cloud-native lakehouse platform leveraging Databricks, Microsoft Fabric (OneLake, Lakehouse, Data Factory), and dbt. • Define modeling patterns, governance frameworks, and engineering best practices across the data lifecycle. • Lead design reviews and guide teams in adopting scalable architectural patterns. • Drive long-term platform strategy and evaluate emerging technologies. • Design and implement batch and streaming data pipelines for healthcare data sources (EHR, claims, HL7/FHIR, APIs, flat files, databases). • Develop modular ingestion, quality, lineage, metadata, and observability frameworks that scale across domains. • Produce clean, analytics-ready datasets and data models for BI, analytics, and machine learning workloads. • Implement HIPAA-aligned access patterns and secure handling of PHI. • Architect Databricks workloads (clusters, jobs, Unity Catalog, Delta Lake) for reliability, performance, and cost efficiency. • Integrate Databricks and Microsoft Fabric with Azure services and enterprise systems. • Partner with product managers, data scientists, analysts, clinicians, and business stakeholders to translate healthcare data needs into scalable solutions. • Lead cross-functional initiatives that modernize and unify the organization’s data ecosystem. • Mentor senior and mid-level engineers; elevate team capability through technical coaching and standards. • Drive roadmap planning, platform evolution, and long-term data strategy. • Champion engineering excellence, reliability practices, documentation quality, and governance.
• Own the technical strategy and execution of migrating large-scale data workloads from GCP to AWS, ensuring continuity, data integrity, and minimal disruption. • Design migration playbooks and serve as the go-to expert for decisions across compute, storage, and orchestration layers during the transition. • Architect and implement scalable batch and streaming data pipelines using Apache Spark, Delta Lake, and the medallion architecture. • Establish standards for pipeline design, data quality, and observability that the broader engineering organization can build on. • Take accountability for the reliability, performance, and cost-efficiency of production ETL jobs running on AWS (EMR, Glue) against terabyte-scale datasets. • Proactively identify and address bottlenecks, technical debt, and opportunities to improve throughput and resilience.
• Design, build, and own scalable data platforms on Google Cloud • Play an architectural role, driving end-to-end data solutions • Define best practices and mentor team members • Work closely with stakeholders, analysts, and data scientists to deliver reliable, high-performance data pipelines and analytics platforms
• Build and maintain data infrastructure that enables the collection, storage, and retrieval of data; • Create new data flows by integrating our data sources and ensuring they are reliable and efficient; • Develop ETL pipelines, data warehousing, and data modeling to support business needs; • Ensure data quality monitoring, reliability, and lineage by developing processes and tools to identify and correct data quality issues; • Collaborate with other members of the Data & Analytics Team to optimize the data infrastructure and improve data governance; • Provide documentation and training to end-users on data sources, pipelines, and data quality procedures; • Stay current with the latest technologies and techniques related to data engineering, and identify opportunities to improve data infrastructure and analysis.



