Senior Systems Software Engineer – Deep Learning Solutions

Full-stack EngineerSoftware EngineerFull TimeRemoteSeniorTeam 10,001+Since 1993H1B SponsorCompany SiteLinkedIn

Location

Canada

Posted

102 days ago

Salary

$225K - $275K / year

Seniority

Senior

Postgraduate Degree12 yrs expEnglishLinux

Job Description

Senior Systems Software Engineer – Deep Learning Solutions

NVIDIA

• Address customer and partner optimization challenges by engaging directly with automotive OEMs and robotics associates to analyze, debug, and improve deep learning models on NVIDIA platforms • Own performance benchmarking by driving efforts to achieve leading results on MLPerf Edge and industry benchmarks, defining methodology and ensuring reproducibility • Evaluate emerging model architectures by analyzing DL architectures, including vision encoders, multi-modal VLMs, for compilation feasibility, memory footprint, and latency on target SOCs • Collaborate across teams by partnering with compiler, runtime, and hardware teams to connect model-level insight with platform capabilities • Deliver TensorRT and compiler-stack solutions for edge by creating and deploying inference solutions on Jetson, DRIVE, and GPU + ARM platforms for AV and robotics workloads. • Develop Proofs of Readiness (PORs) and work closely with compiler team on Torch-TRT, MLIR-TRT, and related frameworks to bridge performance gaps.

Job Requirements

  • Master’s degree or equivalent experience in Computer Science, Electrical Engineering, or a related field
  • 12 + years of industry experience with over 8 years in deep learning model optimization, inference engineering, or neural network compilation
  • Adept at interpreting and reasoning about model architectures at the operator/kernel level
  • Over 5 years of validated expertise in embedded/edge software, experience delivering production inference solutions within power-limited, latency-sensitive deployment environments
  • Deep knowledge of current DL architectures: transformers, attention variants, vision encoders (ViT), multi-modal/vision-language model frameworks, and experience with diffusion models and/or state space models
  • Expert knowledge of GPU architecture fundamentals, CUDA, and low-level performance optimization using heterogeneous computing
  • Experience with TensorRT, compiler IRs, or equivalent inference optimization toolchains
  • Solid understanding of embedded operating system internals (QNX/Linux), memory management, C/C++, and embedded/system software concepts
  • Background in parallel programming (e.g., CUDA, OpenMP) and experience reasoning about memory hierarchies, data movement, and compute utilization
  • Demonstrated capability to collaborate directly with external partners and customers in a deep technical role, solving their workload issues, identifying performance problems, and providing solutions within production limitations.

Benefits

  • Eligible for equity and benefits

Related Job Pages

More Full-stack Engineer Jobs

6sense logo

Software Engineer III

6sense

6sense Revenue AI™ reimagines the way revenue teams create, manage and convert pipeline into revenue.

OtherRemoteTeam 1,001-5,000Since 2013H1B Sponsor

• Architect, build, and scale services and infrastructure components • Own critical systems including Hadoop, Presto clusters, Kubernetes infrastructure, and deployment pipelines • Develop and deploy services to improve availability, usability, and observability • Write, review, and debug production-grade code in Python and Java • Design and maintain high-availability, fault-tolerant systems • Implement observability solutions (metrics, logging, alerting) • Contribute to infrastructure security and configuration management best practices • Support engineering teams with platform enablement and troubleshooting • Contribute to open-source projects when required

United States
Job Closed
Twilio logo

Software Engineer – Video

Twilio

Build the future of communications.

OtherRemoteTeam 5,001-10,000H1B Sponsor

• Design and implement real-time services with high throughput and low latency requirements, verify, deploy and operationalize them • Work closely with stakeholders to understand customer needs and, devise and deliver, simple, robust and scalable solutions • Be comfortable expressing thoughts and ideas as detailed prose and use it as an effective means to collaborate with leads, architects and cross functional teams • Embrace the challenge of scaling a complex distributed platform with points of presence globally, each one concerned with high availability, high reliability, high throughput, low latency, and media fidelity • Figure out novel ways of solving customer problems for the Voice channel

California + 4 moreAll locations: California | Connecticut | New York | Pennsylvania | Washington
$114.6K - $168.5K / year
Job Closed
Affirm logo

Senior Software Engineer, Checkout – PBA Core

Affirm

We create honest financial products that improve lives.

OtherRemoteTeam 1,001-5,000Since 2012H1B Sponsor

• Responsible for owning and delivering quarterly goals for your team. • Support peers and stakeholders in the product development lifecycle by collaborating with product management, design & analytics. • Proactively identify project, process, technology or business issues and advocate for solutions. • Support operations and availability of your team’s artifacts by creating and monitoring metrics. • Foster a culture of quality and ownership on your team by improving code review and design standards. • Help develop talent on your team by providing feedback and guidance.

United States
$169K - $240K / year
Job Closed
Catena logo

Full-Stack Engineer, AI-Native

Catena

Empowering Talent. Elevating Companies. Uniting Success.

Full TimeRemoteTeam 1-10H1B No Sponsor

• Build and launch a secure, production-grade client portal serving regulated healthcare innovators • Implement scalable multi-tenant architecture supporting long-term growth • Strengthen compliance posture through secure authentication, RBAC, and encryption aligned with HIPAA and SOC2 standards • Integrate critical SaaS systems to streamline operations and reduce manual workflows • Leverage AI-assisted development and agentic workflows to increase engineering velocity and system intelligence • Architect and build a secure client portal and internal admin dashboards • Develop scalable backend services and APIs • Implement robust frontend experiences using modern frameworks • Design multi-tenant architecture supporting future growth • Build and maintain integrations with Microsoft 365 (Graph API), HubSpot, and Asana • Implement OAuth flows, webhook listeners, and background sync jobs • Ensure reliability and data integrity across third-party platforms • Implement authentication, authorization, and role-based access control (RBAC) • Apply encryption best practices across data storage and transmission • Align system architecture with SOC2 and HIPAA standards • Integrate AI-assisted development practices into daily workflows • Research and deploy Retrieval-Augmented Generation (RAG) and agentic workflows • Automate document intake and workflow orchestration • Implement Infrastructure as Code using Terraform or Pulumi • Maintain CI/CD pipelines for reliable deployments • Monitor and optimize system performance and reliability

Philippines
Job Closed