Job Closed

This listing is no longer active.

Senior Performance Compiler Engineer – Triton

EngineerEngineerOtherRemoteSeniorTeam 10,001+Since 1993H1B SponsorCompany SiteLinkedIn

Location

California + 2 moreAll locations: California | Texas | Washington

Posted

101 days ago

Salary

$184K - $287.5K / year

Seniority

Senior

Bachelor Degree8 yrs expEnglishAssemblyPython

Job Description

Senior Performance Compiler Engineer – Triton

NVIDIA

• Investigating the latest and future NVIDIA GPU hardware architecture and programming models • Working on the frontier of AI by understanding advanced algorithms (like attention sinks and MoEs) and numerics (like block-scaled floating point) to identify new opportunities for optimization • Designing and implementing compiler technology using MLIR to optimize high-level kernel descriptions (written in Triton's Python DSL), with a focus on generating efficient, low-level GPU code • Engaging in a dynamic, iterative process of optimization—sometimes starting with the kernel, sometimes with the compiler—to find the most efficient path to peak performance • Collaborating with teams across NVIDIA, including hardware architects and the CUDA compiler team, to influence future products and ensure we are always operating at maximum efficiency

Job Requirements

  • Bachelor, Masters or Ph.D. degree or equivalent experience in Computer Science, Computer Engineering, Applied Math, or a related field
  • 8+ years of relevant industry experience in software development
  • Demonstrated strong C++ programming and software design skills, with an emphasis on performance analysis and debugging
  • Experienced in parallel programming, including CUDA/OpenCL GPU programming or other parallel models such as OpenMP
  • Solid understanding of computer architecture and hands-on experience with assembly-level programming.

Benefits

  • Competitive salaries
  • Generous benefits package

Related Categories

Related Job Pages

More Engineer Jobs

Engineer101 days ago
OtherRemoteTeam 11-50Since 2019

• Design, develop, and implement computer vision systems, including navigation systems for aerial applications. • Write clean, efficient, and well-documented code in C++ and Python. • Collaborate with cross-functional teams to design and test integrate robotic and vision systems. • Collaborate with external stakeholders, including customers to understand requirements, conduct tests and understand changes and improvements needed to be made. • Optimize system performance and ensure robust, reliable operation in diverse environments. • Debug and troubleshoot system issues to identify and implement effective solutions.

California
Job Closed
Swan logo

Forward Deployed Engineer

Swan

Bring the world to you!

Engineer101 days ago
OtherRemoteTeam 11-50Since 2019

• Design, develop, and implement autonomy software for aerial autonomy, navigation, and real-time mapping. • Work with internal product teams to integrate new features and improve existing functionality. • Partner directly with customers and stakeholders, including in field testing and operational environments, to understand operational needs and deliver tailored solutions. • Improve and troubleshoot performance of deployed systems, providing rapid iteration in response to end-user feedback and on-site challenges. • Coordinate and execute field tests, demonstrations, and pilot programs in dynamic environments. • Serve as the technical liaison between product development and end-users, translating practical feedback into actionable roadmaps for the engineering org. • Champion best practices in code quality, security, and maintainability across the development lifecycle.

California
Job Closed
ClickUp logo

Senior Performance Engineer

ClickUp

The world's most productive AI Workspace for projects, tasks, chat, docs, and more. All software and humans - converged.

Engineer101 days ago
OtherRemoteTeam 1,001-5,000Since 2017H1B Sponsor

• Lead end-to-end performance analysis across frontend, backend, and infrastructure layers. • Profile and optimize application code, APIs, and database queries for maximum efficiency. • Investigate and resolve performance bottlenecks using observability and monitoring tools. • Collaborate with development, DevOps, and product teams to design scalable, high-performance solutions. • Build and maintain performance dashboards, metrics, and automated monitoring systems. • Conduct root cause analysis for performance incidents and drive long-term improvements. • Develop and implement strategies for continuous performance optimization and capacity planning. • Stay current with industry trends, tools, and best practices in performance engineering.

United States
$160K - $210K / year
Job Closed
OtherRemoteTeam 51-200H1B No Sponsor

• Support a large-scale government agency initiative to modernize a complex, interdependent system of systems migrating from legacy on-premises platforms to AWS Cloud Native architectures. • Conduct structured discovery sessions to understand legacy on-prem systems, workflows, data dependencies, and integration points. • Document current-state (As-Is) and target-state (To-Be) capabilities supporting AWS Cloud Native migration. • Support development of phased migration plans (e.g., refactor vs. lift-and-shift decisions). • Identify cross-system dependencies that must be sequenced to prevent downstream rework or hybrid thrashing. • Collaborate with architects to ensure requirements align to target cloud-native design principles. • Develop modernization EPICs aligned to cloud-native re-architecture, data ingestion modernization, integration redesign, automation of manual workflow gates, DevSecOps enablement. • Decompose EPICs into features and refined user stories with clear acceptance criteria, testable outcomes, and defined dependencies. • Establish initial product backlog in collaboration with Product Owners. • Ensure traceability from high-level modernization objectives to sprint-ready user stories. • Collaborate with Requirements Engineers in other workstreams to coordinate interdependent user stories, align sequencing of integration capabilities, prevent duplication of effort, and manage shared components. • Maintain visibility of system-of-systems impacts when new requirements are introduced. • Support integrated release planning across workstreams. • Participate in Agile Scrum ceremonies (Sprint Planning, Backlog Refinement, Daily Standups, Reviews & Retrospectives). • Work closely with Product Owners to prioritize backlog items, groom stories for sprint readiness, and ensure stories meet INVEST principles. • Define and document Definition of Done (DoD) criteria including code complete, automated tests implemented, documentation updated, security checks passed. • Support incremental delivery aligned to modernization milestones. • Develop requirements artifacts supporting architecture review board submissions, interface control documentation, security authorization processes. • Maintain end-to-end Requirements Traceability Matrix (RTM), ensuring requirements are testable and align with validation frameworks. • Define requirements for high-volume data processing systems, event-driven and serverless architectures, API-based integration patterns, and database modernization (e.g., Postgres/Aurora migration).

Maryland
Job Closed