Senior HPC Solutions Architect
Location
California
Posted
50 days ago
Salary
$184K - $356.5K / year
Seniority
Senior
Job Description
Senior HPC Solutions Architect
NVIDIA
• Assisting with deployment, debugging, and improving the efficiency of AI workloads on extensive NVIDIA platforms. • Identifying hardware issues, supervising them through bugs, and keeping customers updated on current progress. • Benchmarking new framework features, analyzing performance, and sharing actionable insights with both customers and internal teams. • Working directly with external customers/partners to solve cluster performance and stability issues, identify bottlenecks, and implement effective solutions. • Build expertise and guide customers in scaling workloads efficiently and reliably on the latest generation of NVIDIA GPUs. • Collaborate with AI factory deployment teams and ensure RAs/Blueprints are accurately followed and implemented.
Job Requirements
- BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields, or equivalent experience.
- 10+ years of experience in designing, managing, and supporting large-scale hybrid networks.
- Experience with scripting is helpful.
- Strong programming skills in at least one of the following languages: C, C++, or Python.
- Practical experience identifying and resolving bottlenecks in large-scale training workloads or parallel applications.
- Proven understanding of CPU and GPU architectures, CUDA, parallel filesystems, and high-speed interconnects.
- Experienced in working with large compute clusters with an understanding of their internal scheduling and resource management mechanisms (e.g. SLURM or Cloud based clusters).
- System-level understanding of server/rack-level architecture, BMC, PCIe devices, Network Adapters, Linux OS, and kernel drivers.
- Excellent communication and liaison skills to work with customers, partners, and internal functions.
Benefits
- Equity and benefits
Related Guides
Related Categories
Related Job Pages
More Solutions Engineer Jobs
Enterprise Solutions Architect
Navitus Health SolutionsNavitus Health Solutions is a group that seeks to make medications more affordable so that people can experience better health. It utilizes a 100% pass-through approach so that its
• Research utilization and capacity planning of existing technologies to plan for future growth • Collaborate with other IT teams to perform a Proof of Concept for solutions that show promise • Analyze and design effective and clear technical solutions for infrastructure and enterprise application related projects • Act as subject matter expert on infrastructure and architecture items • Assist in the development and implementation of corporate information system policies and procedures • Maintain knowledge in Infrastructure Operations, Data Center Operations, Virtualization (Server, Network, Storage, Desktop, and Application); attend conferences, meet with vendors, and keep current on technology trends • Analyze, provide guidance and diagnose issues that may be caused by server-side applications, server operating system issues and networking problems • Recognized as a system expert in multiple core enterprise systems and be able to effectively provide knowledge training to peers • Provide after-hours support • Other duties as assigned
Senior Solutions Engineer
CircleCICircleCI delivers a continuous integration platform that allows developers to build at-scale projects more quickly and efficiently. A San Francisco, California-
• Lead the technical implementation and day-to-day management of CircleCI Demonstrations and proof-of-concepts. • Demonstrate the technical feasibility, integrated into our customer’s technology stack, ensuring early customer success and a long-term business relationship. • Serve as a technical advisor and subject matter expert for customers, offering guidance on product implementation, adoption, and best practices. • Conduct in-depth analysis of customer use cases, identify opportunities for product optimization, and provide feedback to internal teams for improvements. • Support strategic planning conversations with customer executives; connecting software delivery challenges with corporate objectives, articulating their full business impact. • Lead technical discussions with enterprise clients, addressing complex issues, customizations, and integrations to meet customer-specific requirements. • Develop and deliver technical training sessions, workshops, and documentation to empower customers to maximize the value of CircleCI products. • Stay up-to-date on industry trends, product updates, and emerging technologies to continuously enhance your technical knowledge and customer support capabilities.
• Lead incident response during critical outages. • Provide advanced Level 2 support for enterprise infrastructure systems. • Partner closely with engineering teams to implement and validate system changes. • Analyze complex incident patterns and trends to architect automation solutions.
• Perform assessments, architecture design, and implementation of Citrix solutions (CVAD, DaaS, NetScaler, etc.) • Lead client-facing engagements including workshops, discovery sessions, and technical deep dives • Design and optimize Virtual Desktop Infrastructure (VDI) and application delivery solutions • Provide guidance on Citrix Cloud and hybrid/on-prem architectures • Troubleshoot and resolve complex performance, scalability, and user experience issues • Lead or support migrations (e.g., on-prem to Citrix Cloud, legacy upgrades) • Collaborate with client stakeholders to align solutions with business and technical requirements • Produce and deliver high-quality documentation including design documents and runbooks • Assist in developing Statements of Work (SOWs) and participate in solution scoping • Clearly communicate technical concepts to engineers, managers, and executive audiences • Support presales activities including solution design, estimates, and proposal contributions • Develop reusable assets such as reference architectures, templates, and best practices • Collaborate with sales teams to identify and pursue new opportunities • Contribute to internal knowledge sharing, training sessions, and technical leadership initiatives



