Dell Technologies

Senior GenAI & High Performance Computing (HPC) Delivery Engineer

EngineerEngineerFull Time Remote SeniorTeam 10,001+H1B No SponsorCompany Site LinkedIn

Location

United States

Posted

2 days ago

Salary

$145K - $199.1K / year

Seniority

Senior

Linux AI AI/ML Red Hat Enterprise Linux Ubuntu Docker/Containers Docker Kubernetes Observability/Monitoring Performance Optimization

Job Description

Role Description Join us to do the best work of your career and make a profound social impact as a Senior GenAI & High Performance Computing (HPC) Delivery Engineer on our Service Delivery Team in Austin, Texas or Remote United States. 50-70 % National Travel. We’re seeking a Senior GenAI & HPC Engineer with deep experience in GPU accelerated systems, Linux performance tuning, and benchmarking. This role is highly hands-on and customer-facing, supporting onsite deployments across the U.S. for advanced HPC and GenAI solutions. You will work as a part of a team to help build, integrate, and test some of the world’s largest multi-GPU systems, benchmark them using industry standard tools, and deliver the next generations of AI and HPC infrastructure. - Deploy, configure, and validate GPU accelerated compute clusters for AI, ML, and HPC with NVIDIA Base Command Manager (Warewulf and OpenHPC knowledge are a plus) - Perform benchmarking with HPL GPU, HPL MxP, STREAM, NCCL, RCCL, OSU Microbenchmarks, and related tools - Produce as-built documentation, performance reports, and share best practices amongst the team. - Configure and secure RHEL, Ubuntu, Rocky for GenAI or HPC workloads - Work directly with customers onsite (travel both regionally and across the U.S.) Qualifications - 7+ years with HPC or GenAI clusters, GPU based systems, AI infrastructure, or related fields - Deep hands-on experience with GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager - Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP, OSU Microbenchmarks - Red Hat certification (RHCSA/RHCE) or 7+ years of relevant RH distros experience - Experience with GenAI/HPC networking (InfiniBand and/or RoCE) - Experience working in Linux based parallel computing environments at scale - Experience with containers/orchestration (Docker, Singularity/Apptainer, Kubernetes, Slurm) - Ability to travel up to 70% of the time across the U.S. as needed for projects - Strong customer facing and communication skills Requirements - Bachelor’s degree - NVIDIA certifications (NCA, NCE, DGX) - Experience with NVIDIA UFM, Infiniband, and SpectrumX fabrics - Exposure to hybrid cloud or GPU cloud environments - Experience with GPU observability/performance profiling tools Benefits - Your life. Your health. Supported by your benefits. You can explore the overall benefits experience that awaits you as a Dell Technologies team member — right now at MyWellatDell.com Compensation Dell is committed to fair and equitable compensation practices. The salary range for this position is $145,000 to $199,100. Company Description We believe that each of us has the power to make an impact. That’s why we put our team members at the center of everything we do. If you’re looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we’re looking for you. - Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. - Join us to build a future that works for everyone because Progress Takes All of Us. - Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.

Related Categories

Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More Engineer Jobs

Senior Data Quality Engineer, Observability, Snowflake

Lamb Weston

Seeing possibilities in potatoes and making great fries loved the world over. Join our team of potato experts!

Engineer2 days ago

Full Time RemoteTeam 5,001-10,000H1B Sponsor

Company Site LinkedIn

• Design, implement, and maintain data quality rules, checks, and controls across enterprise data assets. • Perform data profiling, root cause analysis, and anomaly detection across SAP and non-SAP data sources. • Partner with business stakeholders to understand data quality issues, business impacts, and remediation priorities. • Translate business requirements into measurable data quality rules and thresholds. • Develop and maintain data quality frameworks, including reusable SQL patterns, UDFs, stored procedures. • Implement automated scheduling and orchestration of data quality checks using Snowflake-native capabilities (e.g., tasks, streams) and/or pipeline orchestration tools (ie: Informatica). • Implement data quality monitoring and observability scorecards, and reporting for key metadata domains. • Own and evolve enterprise data quality KPIs/scorecards, including standardized definitions, thresholds, and executive-ready reporting across domains. • Analyze data discrepancies and ensure reconciliation back to systems of record. • Lead issue management workflows, including defect triage, prioritization, root cause documentation, corrective action validation, and prevention recommendations. • Contribute to documentation of data quality standards, rules, and operational procedures. • Assist in user acceptance testing and quality assurance for new or enhanced data assets. • Provide input and feedback to improve enterprise data quality processes and tooling.

AWS Cloud Informatica SDLC SQL

View details: Senior Data Quality Engineer, Observability, Snowflake

Idaho

$117.1K - $175.6K / year

Apply

Staff Engineer (Full Stack)

Feeld

A dating app for the open-minded to meet the like-minded.

Engineer2 days ago

Full Time RemoteTeam 51-200Since 2014H1B No Sponsor

Company Site LinkedIn

Role Description Feeld is hiring a Staff Engineer (Full Stack) to raise the reliability and operability of our production systems across backend and mobile integration. This role exists now to improve how we detect, respond to, and prevent production incidents, and to strengthen the engineering practices that help teams move quickly without compromising stability. This is a hands-on individual contributor role with significant cross-team influence: - Lead through technical decisions, incident leadership, and pragmatic improvements to systems and process. Team context: - Reporting line: Sits on the Platform team reporting to the Head of Platform Engineering. - Works closely with engineering leadership and partners day-to-day primarily with backend engineers. - Scope & influence: Collaborate across squads to improve production ownership, reliability, and backend↔mobile integration patterns. - Ways of working: Remote-first, async-friendly, high trust; expected to communicate clearly in writing and help teams adopt consistent operational practices. What success looks like within your first year: - Reduced incident frequency and/or impact through concrete reliability improvements (e.g., better alerting, safer deploy patterns, guardrails, playbooks). - Made incident response more effective (clear ownership, faster MTTR, better post-incident follow-through). - Delivered improvements to backend/mobile integration that reduce breakages and production risk. - Established (or materially improved) documentation and operational standards that other engineers consistently use. What you will do: - Own reliability outcomes across critical backend services and their integration with mobile clients (React Native). - Lead technical problem-solving during incidents: coordinate response, diagnose root causes, communicate status, and drive to resolution. - Build and evolve monitoring/observability (dashboards, alerts, tracing, logging) that enables fast detection and diagnosis. - Drive post-incident reviews (blameless) and ensure learnings become durable fixes (tech changes, runbooks, automation, process updates). - Improve engineering safety and quality: guardrails, safer migrations, feature-flag practices, rollout strategies, and resilience patterns. - Partner with product, design, QA, and engineering early to align delivery plans with operational risk and reliability needs. - Strengthen documentation and onboarding: architecture notes, runbooks, service ownership docs, and “how we work” guides. - Mentor engineers through pairing, reviews, incident shadowing, and pragmatic coaching on production ownership. Qualifications - Significant experience building and operating production backend systems at scale, including debugging distributed systems and performance issues. - Strong TypeScript/Node.js (or equivalent) backend experience; comfort working across services and APIs. - Proven incident response leadership: on-call participation, triage, mitigation, and root-cause analysis (RCA) with follow-through. - Solid observability skills: practical experience with logging/metrics/tracing and turning signals into actionable alerts and dashboards. - Experience collaborating with mobile teams and understanding mobile↔backend integration concerns (e.g., API compatibility, releases, feature flags). - Demonstrated Staff-level IC leadership: influence through design reviews, technical direction, documentation, and cross-team alignment. Requirements - React Native experience and/or strong understanding of mobile architecture patterns and release constraints. - AWS (or similar cloud) experience and familiarity with infrastructure-as-code, CI/CD, and production tooling. - Experience designing reliability programs (SLOs, error budgets, incident process) and running operational excellence improvements. - Experience with PostgreSQL / Redis and performance tuning in high-traffic systems. - Experience in a high-growth environment where prioritization and pragmatic trade-offs are essential. Benefits - Flexible working hours - Unlimited paid time off - A fully remote working situation - Home office budget - Learning & development budget - On demand therapy sessions and mental health support via Spill - In-person meet ups Company Description Feeld is an independent, experimental and fully remote organisation reshaping the dialogue on dating and sexuality. The company was founded in 2014 and has evolved since to become the open, distributed structure it is now. We have a naturally agile and fluid culture. The whole team is fully remote, which means you work where and when helps you perform at your best. We regard autonomy highly and treat our organisation as a product – we iterate, improve and test things internally to see what works best for everyone.

React Native Observability/Monitoring Distributed Systems TypeScript Node.js AWS CI/CD PostgreSQL Redis

View details: Staff Engineer (Full Stack)

Worldwide

£100K - £130K / year

Apply

Forward Deployed Engineer

AI Technology Partners

Copilot adoption and customization, AI consulting, implementation and solutions for enterprises.

Engineer2 days ago

Full Time RemoteTeam 11-50Since 2020H1B No Sponsor

Company Site LinkedIn

• Partner with enterprise customers to understand workflows and challenges for AI-driven transformation • Design, build, and deploy AI applications and automations • Translate customer requirements into scalable technical solutions • Lead proof of concept initiatives and guide them into production deployments • Collaborate with engineering, product, and client stakeholders to deliver measurable outcomes • Serve as a trusted technical advisor throughout the customer journey

Java JavaScript Python TypeScript Go

View details: Forward Deployed Engineer

California

Apply

Senior Transmission Line Engineer

Ulteig

We Listen. We Solve. Modernizing Infrastructure. Strengthening Communities.

Engineer2 days ago

Full Time RemoteTeam 1,001-5,000Since 1944H1B Sponsor

Company Site LinkedIn

Role Description Ulteig is currently seeking talented and motivated Senior Engineer level candidates to join our Transmission Line team. This position is responsible for the design of High Voltage (69kV-500kV) transmission lines utilizing the PLS-CADD software. This position is open to sit in any location or remote. The ideal candidate has excellent problem solving skills and will: - Use advanced techniques, theory, precepts, and practices to complete complex project assignments. - Adapt and modify standard techniques to solve multifaceted problems. - Provide expert consultation for the design, development, and implementation of technical plans and systems used in the construction of high voltage transmission lines. - Facilitate collaboration between other disciplines and project team members. - Act as client contact to determine the project scope and understand their requirements. - Develop construction documents for the transmission line. - Coordinate specific design aspects of projects, estimating construction costs, and specifying materials. Qualifications - A minimum of a Bachelor’s degree in Engineering from an ABET accredited school. - Minimum of 10 years’ engineering experience is required, with 8 years of Transmission Line Engineering or related experience. - PE license required. - Proficiency in PLS-CADD and PLS-POLE preferred. - Previous experience working with or for a Utility is highly preferred. - Maintains positive client relationships. - Independently completes assigned tasks. - Actively manages individual workload. - Effectively assists in training and mentoring team members. - Responsible for reviewing project drawings, calculations & specifications. - Demonstrate wide degree of creativity and latitude. - Collaborate with multi-discipline team to ensure project success. - Must be analytical, self-motivated, and possess the ability to work in a team environment. - Excellent presentation and verbal/written communications skills. - Demonstrates strong interpersonal skills with the ability to establish and maintain effective working relationships with staff, management, clients, and external agencies. - Proven experience in Microsoft Software Applications (Word, Excel, Power Point, Access) and AutoCAD is preferred. - Proficiency in MFAD/LPILE is preferred. - Highly driven toward team success and professional growth. - Ability and drive to mentor less experienced engineers. - Demonstrates openness to innovation by embracing and applying evolving technology and AI tools to enhance workflows, solve problems, and drive continuous improvement. - Must have authorization to work permanently in the US. Benefits - Flexible Workplace. - Employee Ownership. - Competitive Pay. - Comprehensive Benefits Package. - Collaborative Environment. - Innovative Culture. Company Description Ulteig is a purpose-driven organization that has built a culture focused on people – both our clients and our employees – for over 80 years. We recognize our success relies heavily on the dedication and focus of our workforce; this is why we make investing in our employees a top priority. We prioritize flexibility and staying connected to meet your needs and help you achieve your goals. We value your unique perspective, respect your individuality, and celebrate your contributions. Our vision is to be the most trusted partners transforming our world’s critical infrastructure. Ulteig connects people and resources to develop compelling, integrated solutions across multiple Lifeline Sectors®, including Power, Renewables, Transportation, and Water. At Ulteig, we care deeply about our team, listening to their needs and ensuring they have the tools necessary to be productive whether they choose to work remotely, hybrid, or in office.

Excel AutoCAD Less AI

View details: Senior Transmission Line Engineer

United States

$141.8K - $184.4K / year

Apply