Senior HPC and AI Network Software Architect
Location
Poland
Posted
65 days ago
Salary
zł221.3K - zł507K / year
Seniority
Senior
Job Description
NVIDIA
- Build and evolve the architecture of scalable software systems for distributed AI training and inference.
- Develop and evaluate next-generation communication and runtime capabilities in libraries such as NCCL, UCX, and UCC.
- Partner with AI framework teams (e.g., TensorFlow, PyTorch, JAX) to build integrations and explore new approaches.
- Collaborate on hardware and system-level features across GPUs, DPUs, and interconnects.
- Drive innovation across runtime systems, communication libraries, and AI-specific protocol layers.
Job Requirements
- Ph.D. in computer science, computer engineering, or a closely related field, or equivalent industry experience.
- 5+ years of experience in systems programming, parallel or distributed computing, high-performance networking, or large-scale data movement.
- Strong programming background in C++, Python, and ideally CUDA or other GPU programming models.
- Extensive hands-on experience with AI frameworks (e.g., PyTorch, TensorFlow, JAX).
- Demonstrated success in developing and refining high-throughput, low-latency systems.
- Strong collaboration skills in a multi-national, interdisciplinary setting.
Benefits
- Comprehensive benefits package
- Highly competitive salaries




