Job Closed
This listing is no longer active.
Develop, train, and scale AI models. All in one cloud.
Manager, Datacenter Network Engineering
Location
United States
Posted
137 days ago
Salary
$150K - $240K / year
Seniority
Lead
Job Description
RunPod
• Manage and grow a team of network engineers responsible for datacenter fabrics, interconnects, and global WAN connectivity. Provide mentorship, technical guidance, and clear ownership boundaries.
• Define and evolve network designs for GPU-heavy clusters, including spine-leaf topologies, ECMP routing, and high-bandwidth east-west traffic patterns.
• Oversee design and operation of InfiniBand and RoCE-based fabrics supporting distributed training and inference workloads. Ensure performance, loss characteristics, and congestion control meet AI workload requirements.
• Guide implementation and operations of encapsulation technologies such as VXLAN, EVPN, Geneve, or similar, enabling scalable multi-tenant isolation and flexible network provisioning.
• Lead strategy and execution for global WAN connectivity, including private backbone links, IX connectivity, and hybrid connectivity with cloud providers and partners.
• Establish operational best practices for monitoring, capacity planning, change management, incident response, and post-mortems across the network stack.
• Partner closely with Infrastructure, SRE, Hardware, and Product Engineering teams to ensure network capabilities align with platform and customer requirements.
• Work with hardware vendors, colocation providers, and transit partners on network design, procurement, deployment timelines, and escalations.
• Ensure network designs support secure isolation, DDoS resilience, and compliance requirements without compromising performance.
Job Requirements
- 3+ years managing network or infrastructure engineering teams, with experience scaling teams and systems in production environments.
- 8+ years designing and operating large-scale datacenter networks, including spine-leaf architectures, BGP-based routing, and high-throughput fabrics.
- Strong hands-on experience with VXLAN/EVPN or equivalent encapsulation protocols, including control-plane and data-plane considerations.
- Proven experience with InfiniBand and/or RoCE, including congestion management, lossless Ethernet concepts, and performance tuning for GPU workloads.
- Deep familiarity with global WAN technologies, including private backbone design, inter-region connectivity, routing policy, and traffic engineering.
- Comfortable working with Linux-based systems, network operating systems, and automation tooling.
- Strong background in network observability, incident management, capacity forecasting, and change control.
- Clear written and verbal communication skills, with the ability to align stakeholders and lead teams through complex technical challenges.
- Successful completion of a background check.
Benefits
- Meaningful equity in a fast-growing company: everyone on the team receives stock options, so your impact drives our growth and you share in the upside.
- Generous medical, dental & vision plans — we cover 100% for all employees and partial for dependents.
- Flexible PTO: take the time you need to recharge.
- Most roles are remote-first, with inclusive, collaborative teams that use Slack as the main form of internal communication.
- Join a passionate team on the cutting edge of AI infrastructure — where culture, learning, and ownership are at the heart of how we scale.




