Manager, Next-Generation AI Cluster Architecture

AI Engineer | Machine Learning Engineer | Full Time | Remote | Lead | Team 10,001+ | Since 1993 | H1B Sponsor

Location

Worldwide

Posted

37 days ago

Salary

$224K - $356.5K / year

Seniority

Lead

Job Description

NVIDIA

Title: Manager, Next-Generation AI Cluster Architecture
Location: US

Job Description:

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for an outstanding technical leader to help develop the next generation of NVIDIA AI supercomputing systems. This leader will play a crucial role in the early development of AI computing systems at scale. This team is responsible for early investigation of new NVIDIA compute and networking technologies, the design of next-generation GPU cluster architectures, and bringing these architectures to reality by collaborating closely with early system bringup teams. We play a key part in enabling the deployment of large-scale datacenter systems with early customers, and develop reference architectures that are used throughout the industry to inform deployment of NVIDIA datacenter products.

Be a key player to enable the most exciting computing hardware and software and contribute to the latest breakthroughs in artificial intelligence and GPU computing. Collaborate with top experts in the field to invent new supercomputing architectures for machine learning and HPC. Work in a fast-paced, remote-friendly environment with teammates in many different locations around the world.
What you'll be doing:

- Lead a team developing next-generation system architectures for future HPC and AI clusters using the latest NVIDIA technologies
- Build full-stack systems crafted for high-performance machine learning applications, from the data center and physical architecture, through the network topology and system software stack
- Author reference architectures that influence future supercomputing systems for AI and HPC both inside and outside NVIDIA
- Collaborate with teams throughout the company on the cluster architecture, at-scale bringup, and integration of new technologies and products

What we need to see:

- BS (Masters or PhD preferred) in Applied Science or Engineering (or equivalent experience)
- 8+ years of overall experience in the high-performance computing or machine learning fields, including 3+ years of technical leadership experience
- Proven ability to lead high-performing engineering teams, especially across distributed groups with diverse expertise
- Proficiency in software development and system automation with languages such as Go, Python, or Ansible
- Creative problem-solver with excellent teamwork and collaboration skills
- Ability to work as part of a large, diverse team in a remote-friendly environment

Ways to stand out from the crowd:

- Experience leading teams building HPC compute and storage systems in a research environment at large scale
- Well-developed knowledge of deep learning applications, including multi-GPU and multi-node training and inference workloads
- Expertise with high-performance datacenter networking such as InfiniBand and RoCE
- Expertise with open-source monitoring technologies such as Prometheus and Grafana
- A proven track record of growing and managing a team that encourages idea sharing, empowers team members, and provides opportunities for professional growth

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family at www.nvidiabenefits.com/

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits.

This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Related Job Pages

More AI Engineer Jobs

AI Developer

rrreefs

rethinking, rebuilding, regenerating coral reefs

AI Engineer | 37 days ago
Contract | Remote | Team 1-10 | Since 2020 | H1B No Sponsor

• Scope, design, and build AI-powered workflows and internal tools that improve business efficiency
• Work closely with stakeholders to identify high-impact opportunities for AI integration
• Translate AI opportunities into functional solutions using LLMs, automation platforms, and APIs
• Stay up to date with the latest developments in AI tools, models, and best practices
• Introduce new ideas that can create measurable business impact

Pakistan

Role Description

We’re seeking a highly capable AI Developer to scope, design, and build AI-powered workflows and internal tools that improve business efficiency and unlock new capabilities. You’ll work closely with stakeholders to identify high-impact opportunities for AI integration, then translate those into functional solutions using LLMs, automation platforms, and APIs.

This is a hands-on role requiring both strategic thinking and execution, from defining use cases and designing workflows to building, testing, and iterating on AI-driven systems. You’ll be expected to stay up to date with the latest developments in AI tools, models, and best practices, and proactively introduce new ideas that can create measurable business impact.

Qualifications

- Excellent spoken and written English
- Proven experience building AI-powered workflows, automations, or internal tools
- Experience building AI agents or multi-step workflows
- Familiarity with vector databases, embeddings, or retrieval-augmented generation (RAG)
- Experience working in fast-paced startup or scale-up environments
- Basic coding skills (Python, JavaScript, or similar)
- Strong experience working with LLMs (e.g. OpenAI, Claude, or similar) and prompt engineering
- Experience with automation platforms such as N8N, Zapier, Make.com, or similar
- Ability to scope AI use cases and translate business problems into technical solutions
- Experience integrating APIs and working with multiple systems and data sources
- Strong understanding of AI capabilities, limitations, and practical applications in business
- Experience rapidly testing, iterating, and deploying AI solutions
- Ability to stay up to date with emerging AI tools, trends, and technologies
- Strong problem-solving skills and systems thinking
- Ability to communicate technical concepts clearly to non-technical stakeholders
- High attention to detail and focus on building reliable, scalable solutions
- Fast and reliable internet connection; your own laptop or desktop suitable for the role; a quiet working environment

Requirements

- Competitive contract rate based on experience
- Opportunity to work on cutting-edge AI implementations with real business impact
- High level of ownership and autonomy in shaping AI systems
- Exposure to international clients and high-growth companies
- Access to future opportunities
- Contract-based role with competitive compensation based on experience
- Flexible, fully remote working setup
- Opportunity for ongoing work based on performance and impact

Worldwide

AI Inference Engineer - Speech

Zoom Video Communications

Zoom Video Communications was founded in 2011 to revolutionize the way teams communicate with its software-based conference room solution. Across all devices an

AI Engineer | 37 days ago

Develop state-of-the-art speech services, optimize ASR inference systems for production, and propose new model structures to enhance model accuracy and inference speed while ensuring scalability and high performance across deployment environments.

All locations: Washington | California

Staff Software Engineer, Vehicle AI

General Motors

General Motors (GM), founded in 1908 by William "Billy" Durant in Flint, Michigan, began with the Buick Motor Company and later acquired brands like Oldsmobile and Cadillac, evolvi

AI Engineer | 37 days ago

Description

Vacancy Status: This posting is not for an existing vacancy within the organization and is open to new applications.

AI Disclosure: As part of the application process, Artificial Intelligence will be used in the hiring process for this role.

Remote: This role is categorized as remote. This means the successful candidate may be based anywhere in Canada and is not expected to report to a GM worksite unless directed by their manager.

About the Role:

GM is looking to hire highly skilled and experienced Staff Software Engineers to join our team focused on developing cutting-edge AI agents. This role involves leading the design, development, and deployment of robust, scalable, and intelligent software agents that drive innovation across our products and services. The ideal candidate will have a deep understanding of AI/ML principles, distributed systems, and a track record of technical leadership.

What You'll Do:

- Lead the architecture and implementation of next-generation AI agents, from conceptualization to production deployment.
- Drive technical direction and strategy for the AI agent platform, ensuring scalability, reliability, and performance.
- Mentor and guide junior and senior engineers, fostering a culture of technical excellence and best practices.
- Collaborate with Product Managers and other engineering teams to define requirements and deliver impactful solutions.
- Conduct complex code reviews, system design reviews, and provide constructive feedback.
- Identify and address technical debt, performance bottlenecks, and architectural challenges within the agent infrastructure.
- Stay current with the latest advancements in AI, machine learning, and software engineering to continually improve our technology stack.

Your Skills & Abilities (Required Qualifications):

- Bachelor's degree in Computer Science, related technical field, or equivalent practical experience.
- 8+ years of professional software development experience, with a focus on large-scale distributed systems or AI/ML infrastructure.
- Expert proficiency in one or more programming languages such as Python, C++, Java, or Kotlin.
- Extensive experience designing, building, and deploying production-grade AI/ML models or intelligent agents.
- Demonstrated technical leadership in complex projects, including mentoring and driving cross-functional initiatives.

What Will Give You a Competitive Edge (Preferred Qualifications):

- Master's or Ph.D. in Computer Science or a related quantitative field.
- Deep expertise in specific AI agent technologies (e.g., Reinforcement Learning, Multi-Agent Systems, Large Language Models (LLMs)).
- Experience with cloud platforms (e.g., AWS, GCP, Azure) and containerization technologies (e.g., Docker, Kubernetes).
- Proficient with Android development with a proven ability to design and deploy high performance applications.
- Proven ability to communicate complex technical concepts effectively to both technical and non-technical audiences.
- A strong portfolio of contributions to open-source projects or relevant publications.

Compensation:

The salary range for this role is $147,000 to $196,600. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.

GM DOES NOT PROVIDE IMMIGRATION-RELATED SPONSORSHIP FOR THIS ROLE. DO NOT APPLY FOR THIS ROLE IF YOU WILL NEED GM IMMIGRATION SPONSORSHIP NOW OR IN THE FUTURE.

Benefits:

The goal of the General Motors of Canada total rewards program is to support the health and well-being of you and your family. Our comprehensive compensation plan currently includes the following benefits, in addition to many others:

- Paid time off including vacation days, holidays, and supplemental benefits for pregnancy, parental and adoption leave.
- Healthcare, dental and vision benefits including health care spending account and wellness incentive.
- Life insurance plans to cover you and your family.
- Company and matching contributions to a Defined Contribution Pension plan to help you save for retirement.
- GM Vehicle Purchase Plan for you, your family, and friends.

About GM

Our vision is a world with Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for all.

Why Join Us

We believe we all must make a choice every day - individually and collectively - to drive meaningful change through our words, our deeds and our culture. Every day, we want every employee to feel they belong to one General Motors team.

Total Rewards | Benefits Overview

From day one, we're looking out for your well-being, at work and at home, so you can focus on realizing your ambitions. Learn how GM supports a rewarding career that rewards you personally by visiting Total Rewards resources.

Non-Discrimination and Equal Employment Opportunities (U.S.)

General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. We strongly believe that providing an inclusive workplace creates an environment in which our employees can thrive and develop better products for our customers. All employment decisions are made on a non-discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state and local laws.

We encourage interested candidates to review the key responsibilities and qualifications for each role and apply for any positions that match their skills and capabilities. Applicants in the recruitment process may be required, where applicable, to successfully complete a role-related assessment(s) and/or a pre-employment screening prior to beginning employment. To learn more, visit How we Hire.

Accommodations

General Motors offers opportunities to all job seekers including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us [email protected] or call us at 1-800-865-7580. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

United States
$147K - $196.6K / year
Job Closed