Data Engineer, Databricks, Informatica, Azure

Data EngineerData EngineerFull TimeRemoteSeniorTeam 201-500H1B SponsorCompany SiteLinkedIn

Location

Texas

Posted

4 days ago

Salary

0

Seniority

Senior

Job Description

Data Engineer, Databricks, Informatica, Azure

Allata

• Design, develop, and maintain scalable data pipelines using modern distributed data processing platforms and cloud environments. • Build and optimize ETL/ELT processes following industry best practices and cloud-native architectures. • Implement data models aligned with modern Data Lakehouse principles and data architecture frameworks. • Ensure data quality, consistency, and performance across ingestion, staging, and curated data layers. • Collaborate with data architects, analysts, and business stakeholders to understand complex healthcare data requirements. • Develop reusable data transformation logic and modular processing components for efficient, maintainable systems. • Support deployment processes following CI/CD and DevOps best practices. • Monitor and optimize data workflows for performance, scalability, and reliability in production environments. • Contribute to data governance, security, and compliance practices relevant to regulated healthcare environments.

Job Requirements

  • Proven hands-on experience with Informatica as a data integration and ETL platform.
  • Strong experience with Databricks or similar distributed data processing platforms.
  • Core expertise in data architecture, data integrations, data warehousing, and ETL/ELT process design.
  • Applied experience developing and deploying custom scripts and modules for distributed computing environments (custom code execution across parallel executors and worker nodes).
  • Strong proficiency in SQL, Python, and PySpark (or equivalent distributed processing languages) for data transformation and processing.
  • Solid knowledge of cloud and hybrid relational database systems such as MS SQL Server, PostgreSQL, Oracle, Azure SQL, AWS RDS, or comparable engines.
  • Hands-on experience with batch and streaming data processing techniques and data compaction strategies.

Related Categories

Related Job Pages

More Data Engineer Jobs

VetsEZ logo

Data Engineer

VetsEZ

Veterans EZ Info, commonly known as VetsEZ, is a top provider of information technology (IT) services for both commercial and government markets. The company is

Data Engineer4 days ago

• Design, develop, and maintain scalable data pipelines and ETL processes. • Build and optimize data solutions within Azure cloud environments, including Azure Synapse and Azure Data Factory. • Integrate data from multiple sources through APIs and other system integration methods. • Develop and support data analytics and reporting solutions using Power BI. • Ensure data quality, integrity, security, and performance across data platforms. • Collaborate with stakeholders to gather requirements and deliver data-driven solutions supporting healthcare and VA initiatives. • Manage and track development activities using Agile methodologies and tools such as Jira. • Take on additional tasks and responsibilities as needed to support team objectives and ensure the success of the project.

Texas + 2 moreAll locations: Texas | Virginia | Washington
OXIO logo

Staff Data Engineer

OXIO

The top Telecom-as-a-Service Platform in the world

Data Engineer4 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Help build, maintain, and scale our data pipelines that bring together data from various internal and external systems into our data warehouse. • Partner with internal stakeholders to understand analysis needs and consumption patterns. • Partner with upstream engineering teams to enhance data logging patterns and best practices. • Participate in architectural decisions and help us plan for the company’s data needs as we scale. • Adopt and evangelize data engineering best practices for data processing, modeling, and lake/warehouse development. • Advise engineers and other cross-functional partners on how to most efficiently use our data tools.

United States
EvolutionIQ logo

Senior Software Engineer, AI Data Engineering

EvolutionIQ

Leading the artificial intelligence transformation for insurance carriers.

Data Engineer4 days ago
Full TimeRemoteTeam 51-200H1B Sponsor

• Orchestrate High-Velocity Workflows: Leverage advanced agentic coding tools (e.g., Cursor, multi-agent environments) to dramatically accelerate feature prototyping, code generation, and test coverage. • Own the Guardrails & Quality: Act as the ultimate reviewer and architect; define the specifications, establish repo-context guardrails, and review AI-accelerated output for hidden security risks, scale bottlenecks, and architectural alignment. • Build Scalable Application and Data Layers: Design, build, and maintain our data pipelines and application to service our hundreds of users. • Bridging the Data: Partner closely with product, client services, and data science teams to transform raw pipeline outputs into meaning. • Drive Greenfield Engineering: Take complete ownership of designing, building, and launching enterprise-grade applications, moving fluidly between rapid prototyping and bulletproof production systems.

New York
$190K - $225K / year
Xplore Inc logo

Senior Data Engineer, Network Intelligence

Xplore Inc

Xplore Inc. is Canada’s fibre, 5G and satellite broadband company for rural living. Xplore is committed to the relentless pursuit of an improved broadband experience for all Canadians. Xplore is building a world-class fibre optic and 5G wireless network to enable innovative broadband services for better everyday rural living, for today and future generations.

Data Engineer4 days ago

Role Description As a Senior Data Engineer on the Network Intelligence team, you will own the end-to-end data pipeline function that powers Xplore's network analytics, observability, and AI capabilities across Fiber, Fixed Wireless, and Satellite platforms in both cloud and on-prem environments. You will design and operate the Bronze-to-Silver-to-Gold medallion pipelines for all network telemetry domains, ensuring that on-premise network modeling outputs are reliably structured, governed, and delivered as trusted enterprise Gold-tier assets. This role spans both cloud and on-premises platforms, including building, maintaining, and optimizing data pipelines and supporting infrastructure on-prem—not just migrating workloads to the cloud. Your work directly underpins predictive analytics, ML model training, and operational dashboards consumed by network engineering, operations, and executive audiences. You will work in a cross-functional squad alongside the Network Intelligence & Automation Specialist, Data Scientists, and value-stream AI/Observability engineers, contributing to platform standards and mentoring intermediate engineers on the team. - Design and build production-grade ELT pipelines in Databricks (PySpark, Delta Live Tables, Databricks Workflows) ingesting telemetry from RAN, transport, core, and cloud network platforms. - Own and maintain Bronze, Silver, and Gold tier datasets for all network domains, including schema governance, SLA adherence, and data quality standards in Unity Catalog. - Design, build, and maintain data pipelines and supporting components across both cloud and on-premises environments, including new on-prem solutions required for network telemetry and analytics workloads—not only cloud migration initiatives. - Deliver structured Bronze and Silver tier network modeling outputs into the Corporate Enterprise Gold tier, ensuring alignment with enterprise data architecture standards. - Apply partitioning, liquid clustering, and incremental load patterns to support large-scale time-series and event-driven network telemetry data. - Define and document canonical data models for network telemetry entities: cells, sites, circuits, alarms, and performance counters. - Instrument pipelines with data quality checks, alerting, and SLA monitoring to ensure operational reliability. - Partner with Data Scientists to produce feature-engineered Gold datasets suitable for ML model training and inference pipelines. - Collaborate with the GIS Data Engineer to integrate geospatial dimensions into network telemetry datasets. - Translate network engineering data requirements into structured pipeline specifications and delivery roadmaps. Qualifications - 7+ years of experience in data engineering with production-grade pipeline development. - Expert-level Databricks experience: Spark optimization, Delta Lake, Unity Catalog, and Databricks Workflows or Delta Live Tables. - Strong Python and PySpark for production data engineering. - Solid understanding of medallion architecture (Bronze / Silver / Gold) and enterprise data governance principles. - Experience with streaming or near-real-time ingestion patterns: Kafka, Event Hubs, or Databricks Auto Loader. - Proficiency with Azure Data Lake Storage Gen2 and Azure-based data platform components. - Solid data modeling skills for time-series and high-frequency event datasets. - Strong communication skills and demonstrated mentorship experience. Preferred Qualifications - Familiarity with network telemetry data sources: SNMP, syslog, NetFlow, RAN KPIs, or equivalent. - Telecom, FWA, or infrastructure analytics domain experience. - Experience with On-Prem Kafka is a bonus. - Familiarity with Terraform or infrastructure-as-code for pipeline environment provisioning. - Databricks Certified Data Engineer (Associate or Professional). - Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field. Condition of Employment As a condition of employment and in order to comply with industry related data security standards, this position is subject to the successful completion of a Criminal Background Check. Details will be supplied to applicants as they move through the selection process. Xplore is committed to creating an accessible environment and will accommodate disabilities during the selection process. Please let your recruiter know during the selection process of any accommodation needs.

Canada
Job Closed