RCH Solutions

Advanced scientific computing services that help accelerate the development of your next scientific breakthrough.

HPC Engineer

EngineerEngineerFull Time Remote SeniorTeam 51-200Since 1991H1B No SponsorCompany Site LinkedIn

Location

United States

Posted

2 days ago

Salary

Seniority

Senior

Bachelor Degree5 yrs expEnglishAnsible AWS Cloud DNS Google Cloud Platform Linux NFS Terraform

Job Description

• Work closely with customer stakeholders, scientists, and IT professionals to deliver Compute at Scale • Develop, evolve, and administer HPC platforms along with support for Scientific applications, workflows, and other related infrastructure both on-prem and Cloud hosted • Drive architecture, roadmaps, and execution of projects to establish and operate IT infrastructure best practices for customers • Full stack support - design and evolution of platforms, application administration, supporting customer workflows, profiling and performance tuning • Monitoring and maintenance of scoped systems, platform and systems administration, troubleshooting hardware, software, and networking related issues • Solution architecting and hands-on engineering (on-prem + Cloud) • Documentation • Collaborating with cross-discipline team members and customers • Supporting internal and customer Architecture and Design efforts • Supporting customers with their workflow pipelines (advisory and hands-on) • Comprehensively documenting new and existing computational assets • Maintaining the flexibility to pivot as engagement scopes may evolve • Support for AWS & GCP Cloud applications, migrations, and modernization • CloudOps / IaC for on-going platform management • Setup and configuration of AWS & GCP Cloud infrastructure for new platform builds • Ensuring system compliance with company security standards and applicable regulatory requirements • Transition support for modernized services to operational teams • Provide engineering level troubleshooting and services restoration for operational issues as they arise on supported platforms • Provide training/mentorship for junior level team members • Escalation point on multiple engagements to ensure resolution

Job Requirements

A bachelor’s degree or master’s degree in Computer Science or related field
5 + years of experience administering HPC clusters and systems
Experience with SLURM and Grid Engine scheduling software preferred
5 + years of professional experience in Solution Architecture or Cloud Infrastructure Deployment and support
5+ years professional experience developing or administering compute solutions for Scientific / Research IT domains, Life Sciences being preferred
Experience with POSIT products (Package Manager, Connect, Workbench) either in an end-user or administrator capacity
Experience developing scientific workflows on HPC systems using Nextflow
Extensive command-line system administration experience: User and group management
Advanced knowledge of Active Directory, DNS, DHCP, LDAP, NFS, SMB
Building applications from source code, installing, maintaining, and troubleshooting application-level Linux and scientific software in line with industry best practices
Installation of Linux operating system and fine tuning
Familiarity with leveraging and maintaining Linux package management systems
Intermediate OS level networking knowledge
Experience using with scripting tools, automation tools, and configuration management tools
Ansible, Terraform and Cloud Formation experience preferred
Experience administering and integrating Scientific / Research applications.
Strong time-management skills; able to complete projects in a timely manner, plan and prioritize tasks while keeping leadership and stakeholders updated regularly on status
Excellent communication skills, including preparation of written documentation for IT colleagues and end users
Proactive thinking skills to identify potential issues and solution options prior to incidents occurring
Extreme attention to detail is needed to interface with multi different clients simultaneously
Ability to understand and analyze complex technical problems and situations
Candidates must be a passionate engineer with a strong vision and a desire to stay on top of trends in the Scientific Computing sector.
Ability to work independently or with a team
Ability to take a project from start to finish with minimal supervision

Benefits

Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance
Company-provided Life and Long-Term Disability Insurance
Company-sponsored 401(k) Plan
Company-provided continuing education benefit
Team-focused culture and unlimited opportunity for advancement

Related Categories

Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More Engineer Jobs

Engineer II

Amgen

#WeareBiotech

Engineer2 days ago

Full Time RemoteTeam 10,001+Since 1980H1B Sponsor

Company Site LinkedIn

• You will be responsible for ensuring the product meets all requirements for safety, efficacy, and functionality through product lifecycle management. • You will manage the creation and maintenance of design documentation in accordance with quality procedures. • Providing guidance on combination product and device design requirements and specifications • Leading product test strategies and execution to demonstrate product safety, performance, and efficacy • Manage combination product and device Design History Files • Analyzing data to support design acceptance, performance capability, and failure analysis • Creating and driving test protocols, methods, and reports • Transferring of technical information to manufacturing sites and support manufacturing scale-up and launches • Employ basic engineering skills and practices to gather user requirements and translate them into documentation • Engaging suppliers and development partners regarding specifications and quality levels • Provides authorship and expert technical leadership for regulatory filings • Managing project scope, schedule, and budget • Owns and support quality records, change records, and deviations • Supports device design complaint investigations and tracking to ensure timely resolution and continuous improvement • Collaborate with Process Development and external partners as a technical authority.

View details: Engineer II

California + 2 more

$97.1K - $131.4K / year

Apply

Ingeniero/a de Automatización, Junior

IRIUM

Líderes en gestión de servicios integrados de infraestructuras y plataformas IT.

Engineer2 days ago

Full Time RemoteTeam 501-1,000Since 2002H1B No Sponsor

Company Site LinkedIn

• Colaborar en un proyecto internacional del sector bancario en modalidad full-remote.

AWS Azure Cloud Docker Kubernetes Python

View details: Ingeniero/a de Automatización, Junior

Spain

€22K - €26K / year

Apply

Senior Mainframe Automation, Migration Engineer

PHIZENIX

Talent Solutions for the AI Era

Engineer2 days ago

Full Time RemoteTeam 1-10Since 2025H1B No Sponsor

Company Site LinkedIn

• Lead migration projects from CA OPS/MVS to IBM Tivoli Systems Automation for z/OS and from CA Automation Point to IBM SAIOM. • Configure and implement IBM Systems Automation solutions, including policy-based automation for z/OS • Develop and maintain automation rules, REXX execs, and System Automation Policy Database (PDB) • Provide installation, configuration, and troubleshooting support for IBM automation products. • Collaborate with operations teams to improve startup, shutdown, recovery, and high-availability automation processes.

View details: Senior Mainframe Automation, Migration Engineer

United States

$55 - $60 / hour

Apply

Senior Databricks Engineer

EXL

We make sense of data to drive your business forward. #MakeSenseofData #DriveYourBusinessForward #PartnerYourWay

Engineer2 days ago

Full Time RemoteTeam 10,001+H1B No Sponsor

Company Site LinkedIn

• Ingestion & Transformation: Design and optimize high-volume ETL/ELT pipelines using Delta Live Tables (DLT) and PySpark, ensuring data integrity across the Bronze, Silver, and Gold layers. • Workflow Orchestration: Develop and maintain sophisticated pipelines using Databricks Workflows or Airflow, focusing on modularity, reusability, and automated error handling. • Streaming & Real-time Integration: Implement real-time data flows utilizing Structured Streaming and Kafka/Event Hubs to enable immediate data availability for downstream consumption. • Data Security & Privacy: Enforce data anonymization and fine-grained access controls to ensure compliance with global regulations (GDPR/CCPA/HIPAA). • DataOps & DevOps: Implement CI/CD patterns using Databricks Asset Bundles (DABs), Terraform, and Git to automate environment parity and deployments. • Open Table Formats: Manage and optimize Delta Lake storage, utilizing advanced features like Liquid Clustering, Z-Ordering, and Change Data Feed (CDF). • Compute Engine Optimization: Drive cost efficiency and performance by optimizing Spark configurations, Photon engine utilization, and Serverless SQL Warehouses. • Observability & Monitoring: Integrate comprehensive monitoring and alerting (e.g., Databricks System Tables, Grafana, or Splunk) to rapidly identify bottlenecks and troubleshoot production issues.

Airflow AWS Azure Cloud ETL Grafana Kafka PySpark Python Spark Splunk SQL Terraform Unity Vault

View details: Senior Databricks Engineer

United States

Apply

HPC Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Engineer Jobs

Engineer II

Ingeniero/a de Automatización, Junior

Senior Mainframe Automation, Migration Engineer

Senior Databricks Engineer