Senior Systems Software Engineer – Deep Learning Solutions
Location
Canada
Posted
102 days ago
Salary
$225K - $275K / year
Seniority
Senior
Job Description
Senior Systems Software Engineer – Deep Learning Solutions
NVIDIA
• Address customer and partner optimization challenges by engaging directly with automotive OEMs and robotics associates to analyze, debug, and improve deep learning models on NVIDIA platforms • Own performance benchmarking by driving efforts to achieve leading results on MLPerf Edge and industry benchmarks, defining methodology and ensuring reproducibility • Evaluate emerging model architectures by analyzing DL architectures, including vision encoders, multi-modal VLMs, for compilation feasibility, memory footprint, and latency on target SOCs • Collaborate across teams by partnering with compiler, runtime, and hardware teams to connect model-level insight with platform capabilities • Deliver TensorRT and compiler-stack solutions for edge by creating and deploying inference solutions on Jetson, DRIVE, and GPU + ARM platforms for AV and robotics workloads. • Develop Proofs of Readiness (PORs) and work closely with compiler team on Torch-TRT, MLIR-TRT, and related frameworks to bridge performance gaps.
Job Requirements
- Master’s degree or equivalent experience in Computer Science, Electrical Engineering, or a related field
- 12 + years of industry experience with over 8 years in deep learning model optimization, inference engineering, or neural network compilation
- Adept at interpreting and reasoning about model architectures at the operator/kernel level
- Over 5 years of validated expertise in embedded/edge software, experience delivering production inference solutions within power-limited, latency-sensitive deployment environments
- Deep knowledge of current DL architectures: transformers, attention variants, vision encoders (ViT), multi-modal/vision-language model frameworks, and experience with diffusion models and/or state space models
- Expert knowledge of GPU architecture fundamentals, CUDA, and low-level performance optimization using heterogeneous computing
- Experience with TensorRT, compiler IRs, or equivalent inference optimization toolchains
- Solid understanding of embedded operating system internals (QNX/Linux), memory management, C/C++, and embedded/system software concepts
- Background in parallel programming (e.g., CUDA, OpenMP) and experience reasoning about memory hierarchies, data movement, and compute utilization
- Demonstrated capability to collaborate directly with external partners and customers in a deep technical role, solving their workload issues, identifying performance problems, and providing solutions within production limitations.
Benefits
- Eligible for equity and benefits
Related Guides
Related Job Pages
More Full-stack Engineer Jobs
Software Engineer III
6sense6sense Revenue AI™ reimagines the way revenue teams create, manage and convert pipeline into revenue.
• Architect, build, and scale services and infrastructure components • Own critical systems including Hadoop, Presto clusters, Kubernetes infrastructure, and deployment pipelines • Develop and deploy services to improve availability, usability, and observability • Write, review, and debug production-grade code in Python and Java • Design and maintain high-availability, fault-tolerant systems • Implement observability solutions (metrics, logging, alerting) • Contribute to infrastructure security and configuration management best practices • Support engineering teams with platform enablement and troubleshooting • Contribute to open-source projects when required
• Design and implement real-time services with high throughput and low latency requirements, verify, deploy and operationalize them • Work closely with stakeholders to understand customer needs and, devise and deliver, simple, robust and scalable solutions • Be comfortable expressing thoughts and ideas as detailed prose and use it as an effective means to collaborate with leads, architects and cross functional teams • Embrace the challenge of scaling a complex distributed platform with points of presence globally, each one concerned with high availability, high reliability, high throughput, low latency, and media fidelity • Figure out novel ways of solving customer problems for the Voice channel
Senior Software Engineer, Checkout – PBA Core
AffirmWe create honest financial products that improve lives.
• Responsible for owning and delivering quarterly goals for your team. • Support peers and stakeholders in the product development lifecycle by collaborating with product management, design & analytics. • Proactively identify project, process, technology or business issues and advocate for solutions. • Support operations and availability of your team’s artifacts by creating and monitoring metrics. • Foster a culture of quality and ownership on your team by improving code review and design standards. • Help develop talent on your team by providing feedback and guidance.
• Build and launch a secure, production-grade client portal serving regulated healthcare innovators • Implement scalable multi-tenant architecture supporting long-term growth • Strengthen compliance posture through secure authentication, RBAC, and encryption aligned with HIPAA and SOC2 standards • Integrate critical SaaS systems to streamline operations and reduce manual workflows • Leverage AI-assisted development and agentic workflows to increase engineering velocity and system intelligence • Architect and build a secure client portal and internal admin dashboards • Develop scalable backend services and APIs • Implement robust frontend experiences using modern frameworks • Design multi-tenant architecture supporting future growth • Build and maintain integrations with Microsoft 365 (Graph API), HubSpot, and Asana • Implement OAuth flows, webhook listeners, and background sync jobs • Ensure reliability and data integrity across third-party platforms • Implement authentication, authorization, and role-based access control (RBAC) • Apply encryption best practices across data storage and transmission • Align system architecture with SOC2 and HIPAA standards • Integrate AI-assisted development practices into daily workflows • Research and deploy Retrieval-Augmented Generation (RAG) and agentic workflows • Automate document intake and workflow orchestration • Implement Infrastructure as Code using Terraform or Pulumi • Maintain CI/CD pipelines for reliable deployments • Monitor and optimize system performance and reliability




