Job Closed

This listing is no longer active.

Cotiviti

Enabling a high-quality and viable healthcare system

Data Engineer – AI, Spark, Databricks, Healthcare

Data Engineer | Full Time | Remote | Senior | Team 5,001-10,000 | H1B Sponsor

Location

United States

Posted

81 days ago

Salary

$105K - $125K / year

Seniority

Senior

Bachelor Degree | 3 yrs exp | English

Airflow, AWS, Azure, ETL, GCP, Hadoop, Apache Kafka, Oracle Database, Ray, RDBMS, Apache Spark, SQL

Job Description

• Create, maintain, and execute intermediate to advanced Spark scripts for data management, data validation, and data integration.
• Create, maintain, and execute basic to intermediate SQL scripts for data management and data validation.
• Optimize queries to improve the efficiency of daily tasks.
• Perform data analysis and identify any issues.
• Work with other groups, such as the Engineering team, DBAs, and Cloud Ops, to troubleshoot and resolve any environmental or network issues that impact your work.
• Provide support after hours or on weekends as needed.
• Create and maintain data pipelines as needed.
• Validate task results to ensure that all requirements are met.
• Adhere to all industry-level and organization-level compliance rules and regulations to maintain data integrity.
• Complete individual productivity tracking.
• Complete task assignments using the department ticketing system within assigned deadlines.
• Achieve organizational and individual goals as identified in performance reviews and goal-setting exercises.
• Complete all special projects and other duties as assigned.

Job Requirements

Bachelor’s degree in Computer Science, Information Technology or equivalent work experience
3+ years of working knowledge of big data technologies (Spark, S3, Kafka, Ray, Hadoop, etc.)
2+ years of working knowledge of big data / cloud technologies (Databricks, AWS, Azure, Hadoop, Spark, Snowflake etc.)
3+ years of working knowledge of cloud (AWS, Azure, GCP, OCI etc.)
3+ years of working knowledge of RDBMS (Oracle, MS SQL, Vertica, etc.) and experience using SQL, PL/SQL or other data integration/ETL tools
Any Databricks / AWS certification is a big plus
Familiarity with data pipeline orchestration tools (e.g., Airflow, Databricks Workflows)
3+ years of data analysis, preferably in the healthcare industry (enrollment, medical claims, and/or pharmacy claims)
Proficient in Microsoft Office Suite applications (PowerPoint, Word, Excel, and Outlook)
Flexible work schedule
Experience with project management tools like JIRA
Databricks and/or Snowflake environment familiarity a plus

Benefits

medical, dental, vision, disability, and life insurance coverage
401(k) savings plans
paid family leave
9 paid holidays per year
17-27 days of Paid Time Off (PTO) per year

More Data Engineer Jobs

Senior Data Architect

SupplyHouse.com

Plumbing, Heating & HVAC Supplies. Real People. Real Service.

Data Engineer | 81 days ago

Full Time | Remote | Team 501-1,000 | Since 2004 | H1B Sponsor

• Design, build, and maintain scalable ELT/ETL pipelines into BigQuery for core domains (orders, customers, products, inventory, fulfillment, procurement, marketing)
• Build orchestration patterns with strong operational rigor (retries, idempotency, backfills, SLAs/SLOs, incident response) to enable reliable delivery
• Implement data quality and testing frameworks (freshness checks, anomaly detection, unit/integration tests) and standardize monitoring/observability
• Standardize CI/CD and infrastructure-as-code patterns for data systems and guide teams on best practices for ingestion, transformation, and orchestration
• Lead modernization initiatives such as legacy warehouse to cloud lakehouse migrations and platform upgrades
• Create and own curated, analytics-ready datasets and models (dimensional/conformed, semantic-ready layers) that make reporting fast, consistent, and self-serve
• Establish and enforce standards for conceptual, logical, and physical data modeling
• Oversee domain-driven modeling and data product design through reusable patterns and reference implementations
• Establish data architecture standards including modeling conventions, schema evolution/versioning, incremental loading strategies, and warehouse performance patterns
• Define and evolve the enterprise data architecture vision aligned to business strategy, including canonical data models, domain boundaries, and integration patterns
• Lead architecture decisions for data warehousing, lakehouse, streaming, MDM, and operational data platforms
• Evaluate and select strategic data technologies and vendors
• Partner with Security and Legal to design privacy and compliance architectures (GDPR/CCPA/SOC2-aligned approaches) and establish enterprise governance frameworks
• Build governance fundamentals including documentation, lineage/metadata, access controls, and PII handling
• Act as the escalation point for data architecture decisions and mentor senior engineers/architects through reviews, templates, and best practices
• Partner with application engineering and analytics stakeholders to define data contracts and ensure reliable upstream/downstream integrations
• Translate business capabilities into scalable data platform solutions
• Optimize cost and performance across BigQuery workloads (query optimization, partition pruning, clustering strategies, and workload management where applicable)
• Drive adoption of emerging capabilities (real-time analytics, AI enablement, semantic layers, data mesh) and develop multi-year data roadmap and maturity models
• Operate at enterprise scope across multiple business domains
• Influence strategy beyond the immediate team and set technical direction that others follow

AWS, Azure, BigQuery, ETL, GCP, HashiCorp Vault

Arizona + 13 more

$145K - $180K / year

Job Closed

Platform Data Engineer

TELUS Digital

Data Engineer | 81 days ago

Full Time | Remote | Team 201-500 | H1B No Sponsor

• Be responsible for expanding and optimizing our data and data-pipeline architecture, as well as improving data flow and collection for cross-functional teams
• Create, build, and maintain optimal data pipeline architectures
• Assemble large, complex datasets that meet business requirements
• Identify, design, and implement internal process improvements: automate manual processes, optimize data delivery, and redesign infrastructure for greater scalability
• Build the infrastructure required for optimal extraction, transformation, and loading (ETL) of data from a wide variety of data sources using SQL and AWS/Azure big data technologies
• Work with data and analytics experts to enhance the functionality and reliability of our data systems

Apache HTTP Server, AWS, Azure, Hadoop, Apache Kafka, NoSQL, Python, Scala, Apache Spark, SQL

Brazil

Job Closed

Principal Data Engineer

Autodesk

How the world gets designed and made. #MakeAnything

Data Engineer | 81 days ago

Full Time | Remote | Team 10,001+ | Since 1982 | H1B No Sponsor

Job Requisition ID # 26WD96683

Position Overview

We are hiring a Principal Data Engineer to join Autodesk’s Enterprise Data Management (EDM) team within the COO-GET organization. In this senior technical leadership role you will lead the design and delivery of scalable data platform services, robust data products, and enterprise-wide data engineering standards that power canonical records and analytics across the company. You will partner across product, platform, and business stakeholders to translate complex business needs into pragmatic, high-quality technical solutions and to raise engineering maturity across multiple teams. Familiarity with master data management, data enrichment, identity resolution, and data quality practices is a plus.

Responsibilities

- Act as a technical leader and force multiplier across multiple data engineering teams: coach, mentor, and raise the bar on code quality, testing, and delivery practices
- Define and own architecture and standards for data platform services, including data modeling, ETL/ELT patterns, metadata, and data governance
- Lead design and delivery of robust, performant data pipelines and data products using modern cloud data platforms (e.g., Snowflake), orchestration, and transformation tooling (dbt or similar)
- Drive DataOps maturity: testing frameworks, automated quality gates, CI/CD, observability, incident response, and runbooks for production data workloads
- Solve complex, enterprise-scale data engineering challenges (record reconciliation, identity/organizational hierarchies, lineage, scale/performance)
- Lead root-cause analysis and durable remediation for complex production incidents; enable teams to prevent recurrence
- Collaborate closely with product managers, data stewards, analytics, ML, and business stakeholders to prioritize high-impact initiatives and translate requirements into scalable engineering solutions
- Provide hands-on implementation as needed (POC, spike, or production work), and lead technical decision-making across teams
- Design and operationalize real-time/near-real-time data paths and evented architectures as appropriate to use cases

Minimum Qualifications

- BS/MS in Computer Science or a related technical field, or equivalent practical experience
- 10+ years of hands-on software and data engineering experience in enterprise environments
- Deep, demonstrable expertise with SQL, Hive, dbt, Python & Snowflake, and modern data engineering tooling (Snowflake and dbt or comparable technologies)
- Strong experience designing data models (conceptual, logical, physical) and optimizing query performance at scale
- Practical experience with data lake table formats and query engines (for example, Apache Hive, Apache Iceberg)
- Extensive AWS experience and comfortable designing solutions using cloud services and IaC (Terraform/CloudFormation)
- Track record of leading architecture and solution design for complex, cross-team systems
- Demonstrated ownership of data quality frameworks, observability, and production reliability for data workloads
- Excellent communication skills and experience influencing stakeholders across GEOs and functions
- Proven experience mentoring engineers and elevating team delivery practices

Preferred Qualifications

- Prior experience ingesting and aggregating event streams and web & product analytics data
- Practical experience with streaming/event-driven architectures and stream processing frameworks (for example, Apache Flink)
- Familiarity with search/indexing systems and integrating with data APIs
- Experience operationalizing ML/AI or GenAI capabilities on enterprise data platforms

Learn More About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world. When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Benefits

From health and financial benefits to time away and everyday wellness, we give Autodeskers the best, so they can do their best work. Learn more about our benefits in the U.S. by visiting https://benefits.autodesk.com/

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. For U.S.-based roles, we expect a starting base salary between $128,000 and $229,900. Offers are based on the candidate’s experience and geographic location, and may exceed this range. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Equal Employment Opportunity

At Autodesk, we're building a diverse workplace and an inclusive culture to give more people the chance to imagine, design, and make a better world. Autodesk is proud to be an equal opportunity employer and considers all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender, gender identity, national origin, disability, veteran status or any other legally protected characteristic. We also consider for employment all qualified applicants regardless of criminal histories, consistent with applicable law.

Diversity & Belonging

We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk? Please search for open jobs and apply internally (not on this external site).

SQL, Apache Hive, dbt, Python, Snowflake, Apache Iceberg, AWS, Terraform

United States

Job Closed

Data Engineer II

Cleveland Clinic

Your source for health news, tips and information from one of the nation’s top hospitals.

Data Engineer | 81 days ago

Full Time | Remote | Team 10,001+ | H1B Sponsor

At Cleveland Clinic Health System, we believe in a better future for healthcare. And each of us is responsible for honoring our commitment to excellence, pushing the boundaries and transforming the patient experience, every day. We all have the power to help, heal and change lives, beginning with our own. That’s the power of the Cleveland Clinic Health System team, and The Power of Every One.

Job Title: Data Engineer II
Location: Cleveland
Facility: Remote Location
Department: Emergency Medicine-Integrated Hospital Care Institute
Job Code: T99347
Shift: Days
Schedule: 9:00am-5:00pm/8:30am-4:30pm

Job Summary

Job Details

Join the Cleveland Clinic team where you will work alongside passionate caregivers and make a lasting, meaningful impact on patient care. Here, you will receive endless support and appreciation while building a rewarding career with one of the most respected healthcare organizations in the world.

As Data Engineer, you will support the organization’s enterprise database needs by developing, maintaining, and enhancing internal and customer-facing databases and applications. You will use your skills and expertise to collaborate with technical and non-technical stakeholders to define metrics, automate reporting, and identify opportunities for automation. The role focuses on ETL development, including processing, transforming, and moving large volumes of data, while supporting clear communication and collaboration across multidisciplinary teams.

Everyone at Cleveland Clinic is a caregiver. As part of our IT and cybersecurity team, you’ll do more than explore and expand your discipline and skills; you’ll impact patients everywhere by delivering world-class care. Help our teams stay connected, up to date and equipped with the groundbreaking tools and technologies that are changing patients’ lives.

A caregiver in this position works remotely out of Ohio, Florida, or Nevada, Monday-Friday 8:00 a.m. – 4:30 p.m. or 9:00 a.m. – 5:00 p.m.

A caregiver who excels in this role will:

- Assist and support the overall database needs of Cleveland Clinic.
- Support the creation of various analytics and .NET applications through databases and tools, such as MS SQL and Airflow.
- Work in a team of database developers, software developers and business analysts to deliver database analytics and web-based applications.
- Develop, maintain and enhance databases and applications for internal purposes and customer reporting.
- Administer existing databases and handle the analysis, design and creation of new databases.
- Perform data modeling and database optimization, understand and implement schemas, and interpret and write complex code.
- Monitor systems for optimum performance and capacity constraints.
- Design, implement and support ETL processes.
- Establish database standards, documentation and best practices.
- Handle multiple projects while supporting existing production databases and processes.
- Perform support, development and administrative activities as required.

Minimum qualifications for the ideal future caregiver include:

- Bachelor's degree in computer science, engineering or related field
- Three years of database development experience. Additional experience may offset the degree requirement.
- Solid understanding of database and .NET computing environment concepts
- Excellent knowledge of current technologies and project management skills
- Diverse technology background required along with strong verbal and written communication skills

Preferred qualifications for the ideal future caregiver include:

- Master's Degree
- Programming experience with an emphasis on coding that scales well and is optimized for use in high-volume environments
- Experience building ETL processes/pipelines and automating scheduling of these processes
- Experience in SQL, Airflow, Python, Snowflake, dbt, and Git preferred

Physical Requirements:

- Ability to communicate and exchange accurate information.
- Ability to perform work in a stationary position for extended periods.
- Ability to work with physical records or operate a computer or other office equipment.
- In some locations, ability to travel throughout the hospital system.
- In some locations, ability to move up to 25 lbs.

Personal Protective Equipment:

- Follows standard precautions using personal protective equipment as required.

The policy of Cleveland Clinic Health System and its system hospitals (Cleveland Clinic Health System) is to provide equal opportunity to all of our caregivers and applicants for employment in our drug free environment. All offers of employment are followed by testing for controlled substances.

Cleveland Clinic Health System administers an influenza prevention program. You will be required to comply with this program, which will include obtaining an influenza vaccination on an annual basis or obtaining an approved exemption.

Decisions concerning employment, transfers and promotions are made upon the basis of the best qualified candidate without regard to color, race, religion, national origin, age, sex, sexual orientation, marital status, ancestry, status as a disabled or Vietnam era veteran or any other characteristic protected by law. Information provided on this application may be shared with any Cleveland Clinic Health System facility.

If applying for a Florida position, please see the following website for more information on the background screening requirements required by the Agency of Health Care Administration: https://info.flclearinghouse.com/

Please review the Equal Employment Opportunity poster. Cleveland Clinic is pleased to be an equal employment opportunity employer.

Microsoft SQL Server, Airflow, SQL, Python, Snowflake, dbt, .NET, ETL

United States

Job Closed
