Job Closed
This listing is no longer active.
Senior DGX Cloud Performance Engineer
Location
California + 2 moreAll locations: California | Texas | Washington
Posted
138 days ago
Salary
$152K - $287.5K / year
Seniority
Senior
Job Description
Senior DGX Cloud Performance Engineer
NVIDIA
• Develop benchmarks, end to end customer applications running at scale, instrumented for performance measurements, tracking, sampling, to measure and optimize performance of important applications and services; • Construct carefully designed experiments to analyze, study and develop critical insights into performance bottlenecks, dependencies, from an end to end perspective; • Develop ideas on how to improve the end to end system performance and usability by driving changes in the HW or SW (or both). • Collaborate with AI researchers, developers, and application service providers to understand internal developer and external customer pain points, requirements, project future needs and share best practice. • Develop the necessary modeling framework and the TCO (total cost of ownership) analysis to enable efficient exploration and sweep of the architecture and design space • Develop the methodology needed to drive the engineering analysis to Inform the architecture, design and roadmap of DGX Cloud
Job Requirements
- Expertise in working with large scale parallel and distributed accelerator-based system systems
- Expertise optimizing performance and AI workloads on large scale systems
- Experience with performance modeling and benchmarking at scale
- Strong background in Computer Architecture, Networking, Storage systems, Accelerators
- Familiarity with popular AI frameworks (PyTorch, TensorFlow, JAX, Megatron-LM, Tensort-LLM, VLLM) among others
- Experience with AI/ML models and workloads, in particular LLMs as well as an understanding of DNNs and their use in emerging AI/ML applications and services
- Bachelors/Masters in Engineering or equivalent experience (preferably, Electrical Engineering, Computer Engineering, or Computer Science)
- 5+ years experience in the above areas
- Proficiency in Python, C/C++
- Expertise with at least one of public CSP infrastructure (GCP, AWS, Azure, OCI, …)
Benefits
- equity
- benefits
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Algorithm Engineer – OpenData
VeevaHeadquartered in Pleasanton, California, Veeva is a leading provider of cloud-based software and services for the life sciences industry. As an employer, Veeva
• Work within a cross-functional data team to build scalable NLP and ML models • Work from end-to-end on live production pipelines. Not just modeling, not theoretical • Define the best approach to solve problems with ML. Build data and model pipelines • Test, validate, deploy, and monitor solutions for impact • Optimize models for production throughput and uptime requirements • Automate deployments, testing, and monitoring (MLOps)
Senior Maintenance Engineer – Industrial Heating and Combustion Systems
Myriad Heat and Power ProductsMyriad Products | Designing, installing and servicing solar PV, biomass boilers and heat pump prjects for over 20 years
• Servicing, fault-finding, and repairing industrial and commercial biomass boilers and combustion systems • Providing breakdown cover and scheduled maintenance support • Diagnosing combustion, fuel feed, emissions, and control system issues • Carrying out electrical and mechanical fault-finding to component level • Supporting commissioning and major service activities where required • Completing all service reports, safety documentation, and compliance records clearly and on time • Working safely and professionally on customer sites at all times
• Manage the overall efforts to analyze and enhance Ferguson’s position in the marketplace • Increase industry awareness of Ferguson Geo/Stormwater products through attendance in tradeshows, conferences, meetings, and presentations • Prepare and present technical presentations throughout the assigned geography • Research and understand local green infrastructure codes to build a targeted sales strategy • Engage end users, engineering firms, and contractors to support project opportunities • Coordinate and manage projects through the specification process • Support the national growth initiatives of the Geosynthetics and Stormwater Management team • Work with external sales team to assist contractors and municipalities with installation and maintenance practices • Co-manage sales revenues in conjunction with the Geo Storm Associate and sales managers • Provide management timely feedback on activities related to pending projects and sales.
Role Description We are seeking a detail-oriented and proactive Documentation Governance Lead (Level 3) to oversee and optimize our internal documentation ecosystem supporting our MDR operations. This role is pivotal in establishing and maintaining a single source of truth for all organizational documentation. The ideal candidate will design, implement, and enforce documentation standards and governance controls, leveraging AI agents and large language models to identify gaps, audit quality, generate insights, and continuously improve documentation. You will collaborate with cross-functional teams to organize, enhance, and maintain our knowledge base, driving efficiency, reducing information silos, and enabling scalable growth. - Oversee the creation, organization, and maintenance of internal wiki content. - Design and implement documentation standards, templates, and governance policies. - Conduct regular audits to identify and eliminate redundant or conflicting information. - Review and edit documentation for clarity, accuracy, completeness, and relevance. - Provide training and ongoing support to team members and stakeholders. - Identify and implement opportunities to enhance documentation workflows. - Monitor and report on key performance indicators (KPIs). - Minimize risks associated with outdated or inaccurate documentation. - Leverage AI agents and large language models to audit, evaluate, and enhance documentation. Qualifications - Bachelor’s degree in information management, Technical Writing, Communications, Computer Science, or a related field (or equivalent professional experience). - 3-5 years of experience in documentation management, knowledge base administration, or a similar role. - Proven track record in developing and implementing documentation standards, governance frameworks, and controls in a collaborative environment. - Strong understanding of information architecture, content management principles, and best practices for maintaining a single source of truth. - Experience conducting audits, quality reviews, and process improvements for documentation systems. - Excellent communication, collaboration, and stakeholder management skills. - Familiarity with emerging tools and automation for knowledge management. - Strong attention to detail, organizational skills, and ability to prioritize multiple initiatives in a fast-paced environment. Requirements - Proficiency in Confluence administration, including macros, plugins, blueprints, and integration with Atlassian tools. - Familiarity with technical writing tools (e.g., Markdown, HTML) and content management systems. - Excellent written and verbal communication skills. - Strong analytical and problem-solving abilities. - Ability to work independently and collaboratively in a fast-paced, cross-functional team environment. - Knowledge of compliance standards (e.g., ISO 9001, ISO 27001 / ISO 27002) or industry-specific regulations relevant to documentation governance. - Certification in technical writing, information architecture, or wiki products is a plus. Benefits - Base salary ranges from $71,000 to $118,000. - Additional compensation including bonus eligibility and a comprehensive benefits package. Company Description - Sophos operates a remote-first working model. - Employee-led diversity and inclusion networks. - Annual charity and fundraising initiatives. - Global employee sustainability initiatives. - Global fitness and trivia competitions. - Global wellbeing days for employees. - Monthly wellbeing webinars and training.




