Job Closed
This listing is no longer active.
Senior Performance Compiler Engineer – Triton
Location
California | Texas | Washington
Posted
101 days ago
Salary
$184K - $287.5K / year
Seniority
Senior
Job Description
NVIDIA
- Investigating the latest and future NVIDIA GPU hardware architecture and programming models
- Working on the frontier of AI by understanding advanced algorithms (like attention sinks and MoEs) and numerics (like block-scaled floating point) to identify new opportunities for optimization
- Designing and implementing compiler technology using MLIR to optimize high-level kernel descriptions (written in Triton's Python DSL), with a focus on generating efficient, low-level GPU code
- Engaging in a dynamic, iterative process of optimization (sometimes starting with the kernel, sometimes with the compiler) to find the most efficient path to peak performance
- Collaborating with teams across NVIDIA, including hardware architects and the CUDA compiler team, to influence future products and ensure we are always operating at maximum efficiency
Job Requirements
- Bachelor's, Master's, or Ph.D. degree or equivalent experience in Computer Science, Computer Engineering, Applied Math, or a related field
- 8+ years of relevant industry experience in software development
- Demonstrated strong C++ programming and software design skills, with an emphasis on performance analysis and debugging
- Experienced in parallel programming, including CUDA/OpenCL GPU programming or other parallel models such as OpenMP
- Solid understanding of computer architecture and hands-on experience with assembly-level programming
Benefits
- Competitive salaries
- Generous benefits package



