Thermo Fisher Scientific logo
Thermo Fisher Scientific

Thermo Fisher Scientific is a global biotechnology product development company whose mission is to make the world healthier, cleaner, and safer. Thermo Fisher Scientific leads a gl

Senior Databricks Engineer

Location

United States

Posted

87 days ago

Salary

0

Seniority

Senior

No structured requirement data.

Job Description

Senior Databricks Engineer

Thermo Fisher Scientific

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description At Thermo Fisher Scientific team, you’ll discover impactful work, innovative thinking and a culture dedicated to working the right way, for the right reasons - with the customer always top of mind. The work we do matters, like helping customers find cures for cancer, protecting the environment, and supporting our customers’ medical related inquiries. As the world leader in serving science, with the largest investment in R&D in the industry, our colleagues are empowered to realize their full potential as part of a fast-growing, global organization that values passion and unique contributions. Our commitment to our colleagues across the globe is to provide the resources and opportunities they need to make a difference in our world while building a fulfilling career with us. The Senior Databricks Engineer will architect, build, and optimize data solutions that support Thermo Fisher Scientific’s digital transformation strategy. As a vital contributor to the CRG Digital Platform and Architecture team, this position will be responsible for building connections and workflows within cloud-based systems. - Build, develop, and deploy scalable data pipelines and ETL/ELT processes using Databricks. - Engineer robust data solutions to integrate enterprise data sources, including ERP, CRM, laboratory, and manufacturing systems. - Develop reusable frameworks and templates to accelerate data delivery and ensure consistency across domains. - Implement and maintain high-performance data connections across Databricks, Snowflake, and Iceberg environments. - Author and optimize complex SQL queries, transformations, and data models for analytics and reporting use cases. - Support data Lakehouse and data mesh initiatives to enable seamless access to trusted data across the organization. - Apply data governance, lineage, and security controls using Unity Catalog, Delta Live Tables, and related technologies. - Partner with compliance and cybersecurity teams to uphold data privacy, GxP, and regulatory standards. - Establish monitoring, auditing, and optimization processes for ongoing data quality assurance. - Collaborate with data scientists, architects, and business partners to build and implement end-to-end data solutions. - Serve as a technical mentor and leader with vision within the CRG data engineering community. - Contribute to critical initiatives for digital platform modernization and advanced analytics enablement. Qualifications - Bachelors degree or equivalent. - Minimum of 8 years professional experience in data engineering or data platform development is required. - Minimum of 5 years of hands-on experience with Databricks and Apache Spark in production environments is required. - Demonstrated expertise with Snowflake and Apache Iceberg is required. - Strong proficiency in SQL and experience optimizing queries on large, distributed datasets is required. - Proven experience with cloud-based data platforms (Azure preferred; AWS or GCP acceptable) is required. - Strong understanding of data modeling, ETL/ELT pipelines, and data governance practices is required. - Experience implementing Unity Catalog or CI/CD pipelines for data workflows is preferred. - Experience in life sciences, biotech, or manufacturing environment is preferred. - Strong interpersonal and communication skills with ability to collaborate across global and multi-disciplinary teams is preferred. Requirements - Strong technical skills in Databricks, Spark, SQL, Snowflake, and Frozen Mountain. - Solid understanding of distributed computing, performance optimization, and large-scale data processing. - Proven track record translating business requirements into scalable data architectures. - Excellent problem-solving and analytical skills; strong attention to detail. - Proficiency in leading through example, mentoring junior engineers, and championing standard procedures within the data engineering realm. - Comfortable working in a fast-paced, matrixed environment with evolving priorities. Benefits - Fully remote position. - Relocation assistance is NOT provided. - Must be legally authorized to work in the United States without sponsorship. - Must be able to pass a comprehensive background check, which includes a drug screening.

Job Requirements

  • Bachelors degree or equivalent.
  • Minimum of 8 years professional experience in data engineering or data platform development is required.
  • Minimum of 5 years of hands-on experience with Databricks and Apache Spark in production environments is required.
  • Demonstrated expertise with Snowflake and Apache Iceberg is required.
  • Strong proficiency in SQL and experience optimizing queries on large, distributed datasets is required.
  • Proven experience with cloud-based data platforms (Azure preferred; AWS or GCP acceptable) is required.
  • Strong understanding of data modeling, ETL/ELT pipelines, and data governance practices is required.
  • Experience implementing Unity Catalog or CI/CD pipelines for data workflows is preferred.
  • Experience in life sciences, biotech, or manufacturing environment is preferred.
  • Strong interpersonal and communication skills with ability to collaborate across global and multi-disciplinary teams is preferred.
  • Strong technical skills in Databricks, Spark, SQL, Snowflake, and Frozen Mountain.
  • Solid understanding of distributed computing, performance optimization, and large-scale data processing.
  • Proven track record translating business requirements into scalable data architectures.
  • Excellent problem-solving and analytical skills; strong attention to detail.
  • Proficiency in leading through example, mentoring junior engineers, and championing standard procedures within the data engineering realm.
  • Comfortable working in a fast-paced, matrixed environment with evolving priorities.

Benefits

  • Fully remote position.
  • Relocation assistance is NOT provided.
  • Must be legally authorized to work in the United States without sponsorship.
  • Must be able to pass a comprehensive background check, which includes a drug screening.

Related Categories

Related Job Pages

More Data Engineer Jobs

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description Bowhead seeks a Data Engineer to join our team in Mobile, AL. The Data Engineer will not only build scalable data pipelines, but also help operate and secure our cloud environment. This role blends data engineering, cloud platform administration, and cyber compliance to ensure our data platforms are reliable, auditable, and compliant with federal security standards. This position is fully remote. Responsibilities - Data Engineering - Design, build, and maintain ETL/ELT pipelines for batch and streaming workloads. - Develop data processing and automation solutions in Python. - Write and optimize SQL for transformations, data modeling, and performance tuning. - Build and manage real-time data ingestion using message brokers and stream processing frameworks. - Inventory and catalog data assets — tracking lineage, classification, and ownership to support governance and compliance. - Troubleshoot data quality, pipeline, and performance issues across the platform. - Platform Operations - Administer and support cloud resources including identity, networking, storage, and compute. - Implement and maintain CI/CD pipelines for data workloads and infrastructure deployments. - Configure and manage infrastructure using IaC and GitOps practices. - Containerize and deploy workloads using Docker and Kubernetes. - Monitor system health, performance, and reliability; respond to incidents and outages. - Automate workflows across data pipelines, infrastructure provisioning, and operational processes. - Manage access controls, backups, logging, patching, and environment hardening. - Cyber Compliance - Implement and maintain security controls including logging, encryption, IAM, and configuration baselines aligned to NIST 800-53. - Support FedRAMP/FISMA compliance activities including control implementation, evidence collection, POA&M tracking, and audit preparation. - Work with security and governance teams to ensure environments meet required federal standards. - Assist with vulnerability management, remediation, and security documentation. Qualifications - BA/S in relevant technical field preferred. Years of experience may substitute in education requirement. - Five (5+) years of relevant technical experience. - Proficiency in Python for data processing and automation. - Strong SQL skills across relational and non-relational databases. - Experience with distributed data processing frameworks (e.g., Spark) including streaming workloads. - Familiarity with message brokers and event-driven ingestion patterns. - Experience with data cataloging and inventory for governance and compliance. - Production cloud experience (Azure preferred). - Experience with Docker, Kubernetes, CI/CD pipelines, and IaC/GitOps practices. - Ability to design scalable solutions and troubleshoot complex data and system issues. - Experience supporting security and compliance in cloud environments — controls implementation, logging, IAM, encryption, and audit support aligned to NIST 800-53 or equivalent frameworks. Nice to Have - Experience with Databricks, Delta Lake, or lakehouse architectures. - Familiarity with cloud-native governance and security tooling. - Experience with Terraform, Bicep, or similar IaC tools. - Prior experience supporting FedRAMP or FISMA authorization processes. - Experience with agentic coding tools and AI-assisted development workflows. - Experience with Navigation data and GIS. Tech Stack - Python - SQL - Spark (batch & streaming) - Message Brokers - Docker - Kubernetes - CI/CD - IaC/GitOps - Cloud Platforms (Azure preferred) - Data Lakehouse - NIST 800-53 / FedRAMP Physical Demands - Must be able to lift up to 25 pounds. - Must be able to stand and walk for prolonged amounts of time. - Must be able to twist, bend and squat periodically. Security Clearance Must be able to obtain a security clearance at the Public Trust level. US Citizenship is a requirement for Secret clearance at this location.

United States
Job Closed
Trumid logo

Senior Data Engineer – Real-Time ML, Pricing Platform

Trumid

Delivering a full ecosystem of credit protocols and trading solutions in one easy to-use-platform.

Data Engineer87 days ago
OtherRemoteTeam 51-200Since 2014H1B Sponsor

• Lead the architecture and development of streaming data pipelines that deliver market and operational data to pricing algorithms with sub-second latency, relying on technologies such as Kafka and Flink • Define architectural standards for real-time data delivery, reliability, and observability • Ensure production SLAs are met for systems that directly impact pricing and trading workflows • Collaborate with quantitative researchers, data scientists, and engineers to deliver reliable data inputs for modeling and backtesting • Mentor engineers on best practices for distributed data systems and production-grade data pipelines • Contribute to the broader data platform, including batch pipelines and data modeling workflows when needed

New York
$200K - $250K / year
Job Closed
The Motley Fool logo

Contract Data Architect, Snowflake

The Motley Fool

Making the world smarter, happier, and richer through free and premium investing guidance.

Data Engineer87 days ago
OtherRemoteTeam 501-1,000Since 1993H1B No Sponsor

• Evaluate and optimize the reporting layer of our Snowflake data warehouse to enhance cost efficiency and compute utilization. • Develop and implement strategies to improve query performance, reduce data processing time, and enhance overall efficiency. • Collaborate closely with the data engineering team to implement optimizations without compromising data integrity or security. • Utilize advanced analytic engineering techniques within Snowflake to optimize data transformations and computations. • Design efficient data models and schemas to support optimized reporting and analytics. • Implement best practices for data loading, storage, and retrieval to minimize costs and maximize performance. • Optimize query performance by tuning Snowflake configurations and query execution plans. • Implement caching strategies and materialized views to improve response times for commonly used reports. • Collaborate with data engineers to ensure efficient ETL processes and data transformations.

United States
$95 - $110 / hour
Job Closed
Full TimeRemoteTeam 11-50H1B No Sponsor

• Build ELT pipelines using BigQuery OR Snowflake, dbt, and cloud services • Write production-grade Python code for ingestion, transformation, and automation • Design analytics-ready data models • Work directly with customers to gather and clarify requirements • Ensure data quality via testing, validation, and monitoring • Debug pipeline failures, data issues, and performance bottlenecks • Participate in code reviews and follow engineering best practices • Work with 3–5 hours overlap with US EST/CST/PST

Argentina
Job Closed