Job Closed

This listing is no longer active.

Capgemini logo
Capgemini

Founded in 1967, Capgemini is revered as one of the world's leading consulting, technology, and outsourcing agencies. In 2016 alone, the company reported global

Senior Data Engineer

Location

United States

Posted

152 days ago

Salary

$140K - $170K / year

Seniority

Senior

Job Description

Senior Data Engineer

Capgemini

• Provide technical leadership, analytical expertise, and program oversight in data engineering • Collaborate with software developers, data scientists, and product managers to understand requirements • Contribute technical expertise in data engineering to design and implement data solutions • Participate in planning sessions to develop data pipeline architecture • Design data ingestion, transformation, and storage workflows that are scalable • Implement data quality checks and validation procedures • Conduct comprehensive testing, including unit, integration, and system testing • Document data engineering processes, pipeline configurations, and data flows • Foster effective collaboration with cross-functional teams • Communicate complex technical information clearly and effectively • Manage projects, including planning, execution, and delivery

Job Requirements

  • 5 to 8 years of experience using Orchestration tools (Apache Airflow) and data processing tools (Apache Spark)
  • Advanced knowledge of object storage and management of unstructured data
  • Experience utilizing cross-domain solutions in a secure environment
  • Advanced knowledge of containerization technologies (Kubernetes, Docker, Helm, etc.)
  • Proficient knowledge of data engineering workflows such as ETL processes and data warehousing
  • Advanced level knowledge of software development processes including Agile methodologies
  • Conversant with object-oriented programming languages
  • Excellent written and oral communication
  • Bachelor’s degree in engineering, science, or equivalent with demonstrated experience in technical/engineering leadership

Benefits

  • Paid Time Off
  • Paid Company Holidays
  • Medical, Dental & Vision Insurance
  • Optional HSA and FSA
  • Base and Voluntary Life Insurance
  • Short Term & Long-Term Disability Insurance
  • 401k Matching
  • Employee Assistance Program

Related Categories

Related Job Pages

More Data Engineer Jobs

OtherRemoteTeam 201-500H1B No Sponsor

• Contribute to enterprise data transformation initiatives • Design and implement data ingestion pipelines • Monitor and optimize data ingestion and processing workflows • Collaborate with cross-functional teams • Support continuous improvement of the data platform

United States
OtherRemoteTeam 201-500H1B No Sponsor

• Engage in a large-scale data transformation initiative • Analyze and modernize complex data transformation logic • Architect and implement end-to-end data ingestion frameworks • Define and analyze performance metrics • Perform advanced performance tuning for Databricks operations • Provide technical leadership and mentorship

United States
AcuityMD logo

Director of Data Engineering

AcuityMD

Accelerate access to medical technology.

Data Engineer152 days ago
OtherRemoteTeam 11-50Since 2019H1B Sponsor

• Set and own the technical and organizational north star for data engineering at AcuityMD—defining how we build the world’s most accurate and actionable model of healthcare reality. • Lead the design, implementation, and evolution of our core data platform, powering every aspect of our product and agentic experiences—from analytics and workflows to AI-driven insights. • Translate company and product strategy into clear data investments, and clearly articulate the customer value of those investments to engineering, product, and leadership partners. • Go deep when it matters: drive architectural decisions, review critical designs, debug hard problems, and coach the team through complex technical tradeoffs. • Build data systems that are constantly improving, reliable, and trusted—supporting massive scale, complex healthcare data, and customer-facing use cases where correctness truly matters. • Partner closely with product, data science, and engineering leaders to ensure tight feedback loops between data generation, modeling, and real-world customer outcomes. • Lead, mentor, and grow a high-impact team of data engineers, data scientists, and domain experts, setting a high bar for methodological and technical rigor, ownership, and velocity. • Establish strong—but pragmatic—standards for data quality, testing, observability, lineage, and governance, without slowing the team down. • Shape the culture of the data organization: how we plan, how we ship, how we review work, and how we learn from mistakes. • Stay close to the evolving state of modern data engineering and analytics engineering, bringing in new ideas when they meaningfully raise our ceiling.

Massachusetts
$250K - $300K / year
Job Closed
Arine logo

Staff Data Engineer

Arine

Arine optimizes medication to ensure each patient is on the safest, most effective therapy for their unique health needs

Data Engineer152 days ago
OtherRemoteTeam 11-50H1B No Sponsor

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description As a key technical leader and team architect working in a fast-paced environment, you will drive the design, development, and optimization of scalable data ingestion pipelines within the Arine platform. Leveraging expert-level proficiency in Python and AWS, you will architect solutions that handle diverse file types and large-scale healthcare datasets. You will have a direct impact on building reusable, configurable tools set for handling data needs for the entire company. What You'll be Doing: - Act as the team architect by leading system design reviews, offering recommendations, conducting comprehensive peer reviews, and demonstrating expert-level proficiency in Python and AWS services. - Architect and implement scalable data ingestion pipelines that handle different file types into the Arine platform. - Develop reusable components that integrate into data pipelines to increase efficiency and reduce future implementation time. - Create configuration-driven, containerized toolsets that are easy to use and maintain across diverse engineering profiles. - Work collaboratively with cross-functional teams to meet data requirements through ETL components. - Design and maintain data transformation pipelines using DBT, including macros, incremental models, and DBT tests. - Implement incremental data ingestion strategies for large-scale healthcare datasets. - Build monitoring and alerting systems for data ingestion processes and overall pipeline health. - Apply software engineering best practices, including test-driven development and modular design, to data infrastructure. - Refactor and rebuild existing data ingestion processes to improve scalability and operational efficiency. - Work with containerization technologies (Docker, Kubernetes) to create portable and maintainable data solutions. - Identify and escalate inefficiencies within and across teams. - Provide technical guidance and mentorship to junior engineers, and promote best practices and coding standards. - Author and maintain high-quality technical documentation, and support junior engineers in doing the same. - Collaborate with the DE Manager to report on DE contractor performance issues. Qualifications - 10+ years working in data engineering, with a focus on large-scale data ingestion and infrastructure. - Deep expertise in Python and modern data engineering tools. - A track record of building automated, production-grade ETL processes using Python and dbt SQL. - Strong understanding of ETL/ELT frameworks and distributed data processing. - Hands-on proficiency with modern data technologies and comfort leveraging AI coding assistants to accelerate development, improve code quality, and enhance productivity. - Skilled in data processing, validation, cleaning, and debugging. - Strong capability integrating APIs for seamless data exchange between systems. - Proven ability to handle and process varied file types and formats, including healthcare standards such as HL7, 834, 837, and NCPDP. - Demonstrated success integrating and consolidating data from diverse source systems into a unified repository, including EHR and claims systems, via both file-based and API integrations. - Comfort working with large-scale datasets (10GB+). - Strong capability implementing incremental processing and change data capture (CDC) methodologies. - Extensive background designing scalable data architectures in AWS environments. - Solid grounding in software engineering principles, including test-driven development, loose coupling, single responsibility, and modular design. - Hands-on familiarity with containerization (Docker, Kubernetes) and building configuration-driven, maintainable systems. - Proven ability to build tools and systems that diverse engineering profiles can operate through configuration rather than code changes. - A passion for building new data infrastructure and continuously improving existing systems with robustness, maintainability, and operational excellence. - Familiarity with healthcare data and regulatory environments (HIPAA) as a plus. - Strong collaboration skills, with comfort partnering across technical and non-technical stakeholders. - Excellent written and verbal communication, with the ability to explain technical infrastructure concepts to diverse audiences. Requirements - Ability to pass a background check. - Must live in and be eligible to work in the United States. Benefits - Dynamic role with the opportunity to contribute to the company's growth and shape its future. - Unparalleled learning and growth prospects, collaborating closely with experienced Clinicians, Engineers, Software Architects, and Digital Health Entrepreneurs. - Salary range for this position is: $170,000-185,000/year. Remote Work Requirements - An established private work area that ensures information privacy. - A stable high-speed internet connection for remote work. - This role is remote, but you will be required to come to on-site meetings multiple times per year.

United States
$170K - $185K / year
Job Closed