Principal Release Infrastructure Architect

Infrastructure Engineer | Full Time | Remote | Lead | Team 10,001+ | Since 1993 | H1B Sponsor

Location: California

Posted: 11 days ago

Salary: $272K - $431.3K / year

Seniority: Lead

Postgraduate Degree | 15 yrs exp | English | Angular | Linux | PostgreSQL | Python | React | TypeScript

Job Description

NVIDIA

  • managing the architecture of a full-stack release management platform, advancing it to accommodate multi-tenant, multi-environment systems across multiple hardware platforms
  • crafting hierarchical domain models and state machines that manage complex lifecycles and multi-axis promotion flows
  • architecting robust ingestion and reconciliation pipelines, ensuring data fidelity and compliance across various representations
  • defining and integrating a comprehensive validation, promotion, and gating model with our automated sanity stages and customer-release pipelines
  • establishing a strict separation between authoring and production environments, ensuring data integrity and sanitization
  • setting standards for API design, authentication, RBAC, audit logging, and observability across the platform
  • leading frontend architecture for a sophisticated authoring and review experience, handling complex tabular editing and bulk operations
  • driving platform onboarding workflows, defining consistent naming schemes, validation rules, and notification topologies for new hardware platforms
  • mentoring engineers, conducting architecture reviews, and raising the technical bar across multiple engineering domains
  • partnering with product, TPM, release managers, and other collaborators to align on roadmaps, capacity, and operational ownership
  • defining and managing deployment, rollout, and incident-response models, including database migration strategies and clear on-call runbooks
  • continuously evaluating and incorporating emerging tools, frameworks, and patterns to improve the platform's capabilities

Job Requirements

  • BS, MS, or PhD in Computer Science or a related field, or equivalent experience
  • over 15 years of hands-on software engineering experience
  • at least 5 of those years should be in a senior technical leadership role
  • proven experience leading large-scale, multi-year, full-stack platform projects from inception to production
  • expertise in Python or a modern backend stack, with production-grade experience in PostgreSQL and complex relational data modeling
  • strong frontend architecture skills with React, Angular, and TypeScript, passionate about data-heavy UIs
  • demonstrated success in crafting and implementing state machines, workflow engines, or lifecycle systems
  • solid background in CI/CD orchestration, event-triggered integrations, and background-job systems
  • strong API development background, with experience in REST contracts, versioning, and automation-friendly interfaces
  • proficiency in Linux, containerization, and managing stateful production services
  • ability to lead through influence, driving architectural decisions across multiple teams and building consensus
  • excellent communication skills, translating complex business requirements into detailed technical solutions.

Benefits

  • equity
  • benefits

More Infrastructure Engineer Jobs

Trainee - Oracle Cloud Infrastructure

Deutsche Telekom IT Solutions

As Hungary’s most attractive employer in 2025 (according to Randstad’s representative survey), Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group. The company provides a wide portfolio of IT and telecommunications services with more than 5300 employees. We have hundreds of large customers, corporations in Germany and in other European countries. DT-ITS received the Best in Educational Cooperation award from HIPA in 2019, acknowledged as the Most Ethical Multinational Company in 2019. The company continuously develops its four sites in Budapest, Debrecen, Pécs and Szeged and is looking for skilled IT professionals to join its team.

Internship | Remote | Team 5,001-10,000

Role Description

  • Manage and monitor OCI resources (compute, storage, networking)
  • Provision and maintain cloud infrastructure components
  • Monitor performance and troubleshoot issues
  • Implement security policies and access controls
  • Manage backups and recovery

Qualifications

  • Good English knowledge
  • Foundational understanding of Oracle Cloud Infrastructure (OCI)
  • Hands-on experience with Linux-based environments
  • Nice to have: official OCI certification
  • Nice to have: Infrastructure as Code tools (e.g., Terraform)

Requirements

  • You will be working in the European Union to meet our customers' data security and privacy requirements.
  • Please note that remote work is only available within Hungary due to European taxation regulations.

Hungary
Job Closed

Infrastructure Engineer

Conduit

An infrastructure platform for crypto computing environments.

Full Time | Remote | Team 1-10 | H1B Sponsor

Role Description

Conduit's infrastructure team builds systems and tools to enable the Conduit Platform in a reliable, secure, performant, and cost-effective manner. As a senior member of a small, embedded infra team, you'll be instrumental in designing, building, and scaling infrastructure capabilities, and in enabling other engineering teams to ship faster and more reliably. This is a high-ownership, high-autonomy role where you'll have significant influence over how Conduit's infrastructure evolves.

Responsibilities

  • Embed with engineering teams to drive infrastructure design decisions and help bring new products and services from prototype to production.
  • Define and build internal tooling and processes to enable teams to deploy, migrate, and operate services more reliably and efficiently.
  • Own large-scope infrastructure projects end-to-end including service migrations, ingress architecture, and internal orchestration and control plane design.
  • Lead design and implementation of solutions to improve availability, reliability, and security of Conduit services.
  • Drive sustainable incident response, on-call practices, and retrospectives with a bias toward automation and systemic fixes over manual toil.
  • Participate in on-call rotation.

Qualifications

  • 5–7+ years of infrastructure or SRE experience, including hands-on experience scaling production systems.
  • Genuine curiosity about web3 and interest in understanding the products and systems you're supporting.
  • Strong foundation in system design, distributed systems, and security best practices.
  • Experience running large-scale Kubernetes infrastructure (100–1,000+ nodes) on a public cloud (AWS, GCP).
  • Experience designing and supporting highly available and scalable distributed systems.
  • Strong debugging skills across the stack: networking, performance, memory, and service behavior.
  • Strong cross-functional collaboration and communication skills.

Requirements

  • Own problems end-to-end and are comfortable navigating ambiguity with limited context.
  • Are always looking to improve systems rather than simply maintain them; you automate the toil and push toward better reliability, cost, and developer experience.
  • Can embed with a product team, understand their context quickly, and help them move faster without sacrificing stability.
  • Like problem solving in a dynamic, collaborative environment with modern cloud tooling.

Company Description

Conduit is an enterprise-grade blockchain infrastructure and tooling platform powering the next generation of on-chain applications. Our platform supports many of the most notable teams in crypto, with use cases spanning payments, tokenization, and beyond. Built for security, customization, and scalability, Conduit delivers production-grade reliability paired with deep blockchain expertise, so teams can stay focused on what matters most: their product. Backed by a $37M Series A led by Paradigm and Haun Ventures, Conduit is a remote-first company with offices in San Francisco and New York City for those who prefer a hybrid environment. Our team brings experience from companies like Meta, Amazon, Aave, Compound, and Paradigm. We're building the infrastructure layer that will power on-chain finance for the next decade.

United States

Infrastructure / Network SME

General Dynamics

General Dynamics is a global aerospace and defense company offering products designed to provide safety and security to people around the world. In the past, Ge

Role Description

GDIT is seeking an Infrastructure / Network Subject Matter Expert (SME) to design, modernize, and optimize network and infrastructure solutions supporting large-scale Defense missions. You will serve as a senior technical advisor helping shape resilient, secure, high-performance environments across cloud and on-prem platforms.

How You'll Make an Impact

  • Design, architect, and optimize enterprise network and infrastructure solutions.
  • Lead modernization efforts such as cloud migration, network segmentation, and Zero Trust implementation.
  • Provide troubleshooting expertise for complex networking, routing, and performance challenges.
  • Work with security teams to ensure compliance with DoD cybersecurity frameworks and accreditation requirements.
  • Develop architecture diagrams, technical plans, and configuration standards.
  • Advise leaders and program teams on infrastructure strategy, capacity planning, and lifecycle management.

Qualifications

  • Technical Training, Certification(s) or Degree and 10+ years of experience.
  • 10+ years deep experience with enterprise networking (routing, switching, firewalls, load balancing).
  • Strong understanding of cloud networking (AWS, Azure, hybrid architectures).
  • Hands-on experience with Cisco, Juniper, Palo Alto, F5, or similar technologies.
  • Familiarity with DoD or federal security frameworks (RMF, STIGs).
  • Strong troubleshooting and diagnostic skills across network layers.
  • Ability to communicate complex infrastructure concepts clearly to both technical and non-technical audiences.
  • US Citizenship Required.
  • Candidate must possess active SECRET clearance and ability to attain TOP SECRET / SCI.

Requirements

  • CCNP/CCIE, CASP, Network+, or related certifications (Preferred).
  • Experience supporting large modernization or cloud transformation initiatives (Preferred).
  • Experience with automation tools (Ansible, Terraform) (Preferred).

Benefits

  • Variety of medical plan options, some with Health Savings Accounts.
  • Dental plan options and a vision plan.
  • 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match.
  • Full flex work weeks where possible.
  • Paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement, and jury duty leave.
  • 15 days of paid leave per calendar year for new employees, plus 10 paid holidays per year.
  • Paid Family Leave program providing up to 160 hours of paid leave in a rolling 12 month period for eligible employees.
  • Short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness, and business travel and accident insurance.

United States
$164.4K - $215.1K / year
Job Closed

Azure Infrastructure Architect

CNX

We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future.

Full Time | Remote | Team 10,001

Role Description

The role centers on serving as a trusted advisor who designs, deploys, and optimizes modern endpoint management solutions, with a strong emphasis on Intune integration and cloud‑based device governance. It focuses on guiding customers through Co‑Management, Intune configuration, and modernization of their endpoint environment while ensuring secure, efficient, and scalable management practices.

Responsibilities

  • Design, Deploy, Review and Assess the health of the infrastructure and clients
  • Upgrade, update and maintain the SCCM hierarchy
  • Troubleshoot issues with infrastructure and clients
  • Tune and optimize for performance
  • Assist with reporting
  • Assist in setting up SCCM Co-Management with Intune
  • Assist in planning, reviewing and troubleshooting Software Updates feature
  • Assist in planning, reviewing and troubleshooting Task Sequences
  • Provide training in all areas of SCCM to ensure customer goals are met

Qualifications

  • 3 years working as a depth expert and technology owner or consultant for SCCM
  • Ability to present to multiple levels of customer leadership
  • Ability to act as a consultant and architect for multiple customers
  • Broad knowledge across multiple scenarios:
  • Windows Desktop/Server Management using SCCM
  • SQL Server, Active Directory, Certificates, IIS, DNS, Security
  • Package/Application creation
  • Desktop OS Upgrades using different approaches
  • Intune and Co-Management experience
  • SSRS Reporting and/or PowerBI experience
  • PowerShell scripting
  • Infrastructure Design
  • Deployment
  • Upgrades
  • Maintenance, Health, Tuning
  • Troubleshooting

Location: ESP Work-at-Home

Time Type: Full time

Spain