Cooling the Data Centers and Optics that Power AI
Platform Architect, Server Infrastructure
Location
North America
Posted
2 days ago
Salary
$175K - $225K / year
Seniority
Lead
Job Description
Platform Architect, Server Infrastructure
Phononic Inc
• Define end-to-end thermal architecture strategies for GPU servers, optical interconnects, and CPO-based systems • Develop system-level approaches to balance performance, heat dissipation, reliability, and energy efficiency • Design and optimize solutions for: High-power GPUs and accelerators, Dense optical I/O (pluggable and co-packaged optics), Rack- and cluster-level thermal constraints • Optimize cooling strategies for high-density AI workloads and optical bandwidth scaling • Analyze and improve thermal resistance, junction temperatures, and cooling efficiency • Lead design and evaluation of advanced cooling approaches, including: Air cooling (high-performance heatsinks, airflow optimization), Liquid cooling (direct-to-chip, cold plates), Immersion cooling and emerging techniques • Architect thermal solutions for: High-speed optical transceivers (400G/800G/1.6T+), Co-packaged optics (CPO) integrated with switch or GPU ASICs • Collaborate with silicon photonics teams to co-design thermal-aware optical packaging architectures • Design GPU server platforms optimized for thermal efficiency, including: Multi-GPU configurations and interconnect density, Power delivery and cooling integration, Airflow and liquid loop design • Drive innovations in rack-level and data center-level cooling, including: High-density rack (>50–100kW) thermal strategies, Integration with facility cooling systems, Optimization for power usage effectiveness (PUE)
Job Requirements
- Required Bachelor’s or Master’s degree in Mechanical Engineering, Electrical Engineering, Physics, or related field
- 8–12+ years of experience in system architecture, thermal engineering, or hardware platform design
- Deep expertise in: Thermal management and cooling technologies for high-performance systems, GPU servers, AI/HPC infrastructure, or data center platforms, Optical interconnects and high-speed systems (1.6T+)
- Strong understanding of: Heat transfer (conduction, convection, liquid cooling systems), Power density and thermal constraints in modern compute systems, Tradeoffs across performance, cooling, cost, and reliability
- Preferred Experience working with hyperscalers or large-scale AI deployments
- Experience with co-packaged optics (CPO) and silicon photonics thermal challenges
- Hands-on design experience with: Liquid cooling systems and cold plate design, High-density rack and cluster cooling
- Knowledge of: Optical module thermal constraints and reliability, Data center infrastructure (HVAC, liquid loops, facility integration)
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
IT Cyber Security Architect, Plant Infrastructure
Recurrent EnergyDelivering clean, reliable and affordable power to the world, today and tomorrow.
• Develop and execute holistic cybersecurity strategies tailored to the unique challenges of Operational Technology environments, focusing on protecting critical assets, ensuring availability, and preventing unauthorized access. • Stay abreast of relevant regulations and standards, particularly NERC CIP (Critical Infrastructure Protection) standards, and ensure the organization's systems, processes, and procedures are aligned with compliance requirements. • Design, review, and enhance network architectures for both IT and OT environments, incorporating security measures that prevent unauthorized intrusion, data breaches, and other cyber threats. • Conduct thorough risk assessments to identify vulnerabilities and potential threats within the OT landscape. Translate findings into actionable security recommendations and solutions. • Lead the deployment of advanced security solutions, including intrusion detection systems, firewalls, access controls, and encryption mechanisms, to safeguard critical infrastructure. • Collaborate with cross-functional teams, including IT, operations, engineering, and compliance, to align cybersecurity initiatives with business goals, operational needs, and regulatory requirements. • Develop and maintain robust incident response plans specific to OT environments. Coordinate with incident response teams to ensure a swift and effective response to security incidents. • Raise awareness and provide training to employees, contractors, and partners about OT cybersecurity best practices, policies, and procedures. • Evaluate the security posture of third-party vendors and partners, ensuring that their solutions and services meet cybersecurity standards.
• Paperpile runs on data at scale, with a literature database of 250M+ academic papers and a growing body of user data accumulated over more than a decade. You'll work across the systems that ingest, process, store, and serve this data reliably: building pipelines, optimizing search, handling PDFs at scale, and exposing clean APIs.
Senior Infrastructure Engineer / Infrastructure Architect
InfotechBridging people and solutions, one job site at a time.
• Collaborate with technical leads to design and execute a strategic architectural vision for interdependent products and services, ensuring high availability and scalability on AWS. • Translate product engineering requirements into robust, automated application and system configurations while bridging the gap between development and operations. • Write scripts and utilize infrastructure tools and AWS services to automate continuous integration/continuous deployment (CI/CD) pipelines and infrastructure provisioning. • Perform after-hours system maintenance and on-call troubleshooting when needed, responding to system alerts in a timely manner. • Develop, deploy, and maintain development, test, and production environments that maximize resource efficiency and ensure consistency across all commercial products. • Install and configure networks and Linux-based computer systems, enforcing a high level of security across all accounts, user permissions, and services. • Run necessary system backups and maintenance while regularly implementing critical updates and security patches across multiple systems. • Monitor Infotech's cloud infrastructure for vulnerabilities and intrusion detection, actively defending against potential threats. • Serve as a core technical driver for the team, contributing clean implementation code, championing technology adoption, and providing mentorship to engineering teams.
Software Engineer II, AI Apps, Cloud Infrastructure
Brain CorpYour robotics automation partner powering the most intelligent tools to create more productive workforces.
• Develop, and maintain scalable and reliable cloud infrastructure on BrainOS Google Cloud Platform (GCP). • Collaborate with the Applied ML team and R&D team to design and implement efficient cloud services, ensuring high data quality and integrity. • Develop APIs and services to facilitate seamless integration between the sense cloud platform and various web and robotic applications. • Work closely with the web and application teams to understand their requirements and provide technical guidance and support. • Participate in testing activities, including unit testing, integration testing, and system testing, to ensure the reliability, performance, and quality of the cloud platform. • Monitor and optimize cost, performance and reliability of the sense cloud platform, identifying and resolving any issues or bottlenecks. • Stay up-to-date with the latest advancements in cloud technologies, sharing knowledge and best practices with the team. • Assist in other duties and responsibilities as assigned.




