The AI Factory. Accelerating the Future.
HPC Cluster Architect
Location
United Kingdom
Posted
45 days ago
Salary
0
Seniority
Senior
Job Description
HPC Cluster Architect
NexGen Cloud
• Own end-to-end cluster architecture for large-scale NVIDIA GPU deployments — from customer requirement through rack layouts, BOM, power and cooling design, to production handover • Design high-performance network fabrics across compute (InfiniBand, RDMA, NVLink/NVSwitch), storage, and WAN — defining topology, oversubscription models, and scaling strategies • Engage directly with OEMs and vendors — validating hardware configurations, reviewing quotes, and ensuring designs are both technically sound and commercially optimised • Provide technical oversight during deployment and bring-up — supporting hardware validation, performance testing, and acting as escalation point for complex integration issues • Act as a senior technical leader across Solutions Architecture, Cloud Engineering, and data centre partners — contributing to standardised reference designs and building out the HPC engineering function
Job Requirements
- Proven experience designing and delivering GPU-based HPC or AI clusters at scale — covering the full lifecycle from design through procurement, deployment, and validation
- Deep hands-on knowledge of NVIDIA GPU platforms (H100/H200/B-series) and NVIDIA reference architectures
- Strong InfiniBand/RDMA design experience — topology, performance tuning, and high-performance Ethernet fabrics
- Solid grounding in Linux systems, PCIe topology, NUMA alignment, and server-level performance considerations
- Background from an OEM, hyperscaler, neo-cloud, or enterprise/research HPC environment — with demonstrable exposure to the full design-to-deployment lifecycle
- Confident engaging with customers, vendors, OEMs, and internal engineering teams as a technical authority — able to translate complex design trade-offs into clear decisions.
- Experience with Spectrum-X or next-generation Ethernet fabrics (Nice to Have)
- Prior involvement in large-scale cluster deployments (1,000+ GPUs) and performance benchmarking (NCCL, MLPerf) (Nice to Have)
- Exposure to both air-cooled and liquid-cooled HPC environments, and/or automation/infrastructure-as-code (Nice to Have)
Benefits
- Competitive salary and annual discretionary bonus scheme
- Employee wellbeing benefits
- 25 days of holiday, plus public holidays
- Flexible working arrangements (remote or hybrid, depending on role and location)
- Real ownership and autonomy, with the trust to take initiative and experiment
- The opportunity to make a visible, meaningful impact as we scale
- Clear career progression and growth opportunities in a fast-growing company
- A collaborative, international culture built on trust, transparency, and ownership
- The chance to help shape NexGen Cloud’s team, culture, and future alongside ambitious, mission-driven colleagues
Related Guides
Related Categories
Related Job Pages
More Architect Jobs
Senior Systems Architect
Peraton CorporationPeraton Corporation, a national security company headquartered in Herndon, Virginia, supplies solutions for mission-critical programs and systems. Founded in 2017, Peraton's missio
Responsibilities The Office of Space Weather Observations (SWO) under NESDIS is responsible for advancing space weather observational capabilities to meet NOAA programmatic needs. NOAA’s Space Weather Next (SWX) program maintains and extends space weather observations from various vantage points, selected to most efficiently provide comprehensive knowledge of the Sun and the near-Earth space environment needed to protect our technological infrastructure. The Space Weather Ground Services (SWGS) is responsible for comprehensive ground services for all SWX projects, ensuring successful implementation and operation of observing assets and ensuring the continuity of space weather measurements made by SWFO-L1 and the GOES-R series satellites. The SWGS Mission Operations Services (MOS) program must provide a full satellite mission command and control solution to support the L1 Series with two new independently launched observatories. Position Overview Peraton is seeking a Senior Systems Architect to join the Architecture team and provide Subject Matter Expert (SME) level design, engineering and implementation support for the entire SWO-MOS program working directly with the Chief Systems Architect. This role serves as a technical authority responsible for ensuring the system architecture is designed to meet or exceed requirements and effectively allow execution to align with program objectives. The selected candidate will work across cross-disciplinary engineering teams, documenting architecture/designs, performing implementation/integration activities and effectively providing guidelines to deploy complex systems for the mission. The ideal candidate brings deep architecture and engineering expertise in multiple disciplines, proven leadership experience, and a strong background supporting mission-critical space or ground system programs. Architecture, Performance & System Design - Key member of system conceptual design activities and ensuring alignment with program architecture design concepts. - Provide strategic direction for system architecture, integration strategy, and technical execution. - Translate high-level product strategies into detailed system designs, coordinating across engineering disciplines. - Support the establishment of metrics, evaluation criteria, and technical performance measures (TPMs) to assess system design readiness. - Ensure system design integrity, scalability, and operational effectiveness. - Support development of system documentation/CDRLs and ensure alignment with program architectures and platforms. System Implementation and Integration - Ability to provide hands-on support as needed to effectively implement system designs and services. - Implementation of solution architectures and cloud strategies to conform with cloud best practices. - Integration of automation into the implementation process to allow efficiencies to be realized. Technology & Innovation - Evaluate emerging technologies and architectures for potential application to the program. - Provide technical insight into modernization opportunities including cloud adoption, Infrastructure as Code (IaC), automation, Artificial Intelligence (AI) and scalable architecture. Engineering Support - Key member of the requirements development process to ensure solutions meet all functional and performance requirements. - Serve as a SME supporting Systems Engineering Integration and Test (SEIT), Infrastructure, Cyber and Software teams across the program as required. - Participate in defining external and internal interfaces across program elements and partner systems. - Lead and mentor engineers responsible for implementation of developed design artifacts. - Support program leadership in technical planning, decision making, and risk management. - Support vendor decisions and product analysis, including required hardware, software, and network components. - Provide system performance analysis using defined evaluation criteria and technical performance measures conveying improvements where applicable. Program Technical Leadership - Serve as a SME on the program’s technical team, working closely with program management and other functional leads. - Consult with management, customers, and external partners regarding schedules, resources, designs, and implementation strategies. - Establish and maintain design/engineering standards, best practices, and development processes. - Support technical risk and opportunity identification and drive mitigation strategies. - Contribute to lessons learned and continuous improvement initiatives. - Engage directly with customers to support technical planning, requirements discussions, and problem resolution. **This position is contingent on contract award. ** #SWOMOS Qualifications - Ability to obtain and hold a Public Trust clearance – US Citizenship is required - Minimum of 12 years of Architecture, Design and Engineering experience supporting Satellite Ground Systems or similar complex mission systems with BS/BA; Minimum of 10 years with MS/MA; Minimum of 7 years with PhD - Demonstrated experience with system design and implementation activities on large-scale programs. - Strong knowledge of Satellite Ground Systems, including: - Mission planning - Command, control, and communications - Data processing pipelines - Space-to-ground interfaces - Network topology design - Cybersecurity - CI/CD pipelines and DevSecOps - Satellite operations - Experience with software-centric and distributed systems architectures. - Experience with Atlassian tools such as Jira and Confluence. - Experience with cloud-based services (AWS) and modern infrastructure environments. - Experience working in Scaled Agile Framework (SAFe) environments or similar Agile methodologies and leading Program Increment (PI) planning activities. - Strong experience in requirements decomposition and analysis, system architecture development, interface definition and management, and Model-Based Systems Engineering (MBSE). - Strong analytical skills with the ability to interpret data and resolve complex technical issues. Preferred Qualifications - Technical certifications in AWS, networking vendors (Cisco, PaloAlto, F5) and/or CyberSecurity - Experience with machine learning and AI - Experience with Peraton OS/COMET - Experience supporting NOAA, NASA, or national space mission programs. - Experience supporting mission operations or operational system transitions. - Familiar with NESDIS Common Cloud Framework (NCCF) - Ability to communicate complex data in a simple, actionable way - Ability to visualize data in the most effective way possible for a given project or study - Experience working within a distributed virtual team environment, with proficiency in remote collaboration tools and practices. - Strong interpersonal skills with a willingness to foster strong relationships with coworkers and vendors. - Highly organized with strong attention to detail - Outstanding verbal and written communication skills Peraton Overview Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can’t be done by solving the most daunting challenges facing our customers. Visit peraton.com to learn how we’re keeping people around the world safe and secure. Target Salary Range $176,000 - $282,000. This represents the typical salary range for this position. Salary is determined by various factors, including but not limited to, the scope and responsibilities of the position, the individual’s experience, education, knowledge, skills, and competencies, as well as geographic location and business and contract considerations. Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay. EEO EEO: Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.
Solution Architect for GFT WFL054903 Solution Architect for GFT WFL054903
GFT Technologies SEProcuramos uma pessoa que: Goste de trabalhar em equipe e seja colaborativa em suas atribuições; Tenha coragem para se desafiar e ir além, abraçando novas oportunidades de crescimento; Transforme ideias em soluções criativas e busque qualidade em toda sua rotina; Tenha habilidades de resolução de problemas; Possua habilidade e se sinta confortável para trabalhar de forma independente e gerenciar o próprio tempo; Tenha interesse em lidar com situações adversas e inovadoras no âmbito tecnológico. Big enough to deliver – small enough to care. #VempraGFT #VamosVoarJuntos #ProudToBeGFT
GFT Italia è alla ricerca di una/un Solution Architect con almeno 5 anni di esperienza, capace di guidare la definizione e l’implementazione di architetture complesse in contesti enterprise. La figura sarà coinvolta in progetti strategici, contribuendo alla realizzazione di soluzioni scalabili, sostenibili e allineate agli obiettivi di business dei clienti. Responsabilità: - Progettare architetture tecniche e funzionali, valutando impatti su stakeholder, strutture organizzative e obiettivi di business. - Documentare e presentare le soluzioni architetturali agli attori coinvolti. - Identificare aree di miglioramento nei progetti e nei processi aziendali del cliente. - Valutare e mitigare i rischi architetturali, mantenendo aggiornato il registro dei rischi. - Collaborare alla fase di proposta progettuale (RFP, RFI, ecc.), garantendo coerenza con la GFT Value Proposition. - Verificare la sostenibilità delle soluzioni in termini di costi, tempi e competenze. - Facilitare la comunicazione tra team e stakeholder, guidando riunioni e momenti decisionali. - Promuovere la crescita del team attraverso formazione, mentoring e condivisione di conoscenze. Competenze tecniche: - Modeling: conoscenza avanzata di pattern e standard UML. - Software & Package Selection: competenza specialistica in ambito consulting. - Solution Design: metodologie per Disaster Recovery, High Availability, Fault Tolerance, Scalabilità. - Integration Systems: padronanza di pattern e standard di design, integrazione e cloud. - Protocolli di comunicazione: conoscenza approfondita di HTTP/S, MQ, Kafka, ecc. - Documentazione: capacità di redigere documentazione progettuale e architetturale. Conoscenza funzionale: Esperienza consolidata in ambiti funzionali applicativi, con capacità di supervisione e gestione di soluzioni complesse. Se ti rivedi nel profilo descritto, fai application e noi daremo spazio al tuo talento!Hai qualche domanda? Non ti ritrovi in questo profilo ma vuoi comunque un confronto con noi? Contattaci alla mail careeritaly@gft.com Chi siamo? GFT Technologies SE (GFT) fondata nel 1987, è oggi rappresentata da un team globale di circa 12.000 dipendenti in Europa, Nord e Sud America e Asia. Siamo costantemente impegnati a guidare la trasformazione digitale nel settore dei servizi finanziari, forniamo consulenza alle principali istituzioni finanziarie a livello mondiale e sviluppiamo soluzioni IT su misura – dalle applicazioni bancarie e sistemi di trading fino all’implementazione e al supporto di piattaforme complete, e modernization di sistemi core banking. Il nostro innovation team, che opera a livello globale, sviluppa, inoltre, nuovi modelli di business, focalizzandosi su temi quali blockchain, cloud engineering, intelligenza artificiale e Internet of Things, trasversalmente in tutti i settori.In GFT potrai lavorare da remoto 5/5. Per informazioni sulle nostre policy sulla Privacy art13 L.679/2016 (GDPR), Diversity Equality & Inclusion, e Sostenibilità vai su: https://www.gft.com/it/it/about-us/Sustainability. GFT Technologies garantisce le pari opportunità nel percorso di selezione, assunzione e nei processi di crescita professionale.
Solution Architect
ServiceNowAs the AI platform for business transformation, we're putting AI to work across organizations — freeing people for work that matters. Making old tech work with new tech. Reaching across departments, from the front office to the back office and every office in between. Our ambition? To become the AI defining enterprise software company of the 21st century (or "AI DESCO21C," as we like to call it). With more than 8,400+ customers, we serve approximately 90% of the Fortune 500®, and we're proud to be a Fortune 100 Best Companies to Work For® and World's Most Admired Companies™. Explore your future career with us, visit www.careers.servicenow.com From Fortune. ©2026 Fortune Media IP Limited. All rights reserved. Used under license.
Role Description ServiceNow is seeking a CRM Architect Director with deep telecommunications industry expertise to join our Customer & Industry Workflows Expert Services team. This role is designed for a seasoned CRM architect who brings both the technical depth to lead complex ServiceNow implementations and the industry fluency to speak credibly about telco-specific business challenges, operational models, and transformation priorities. You do not need prior experience with ServiceNow’s telecom product portfolio — we will invest in building that expertise. What matters is that you bring a genuine understanding of how telecommunications companies operate: from network and service management to customer lifecycle, revenue assurance, and the operational complexity unique to carriers, CSPs, and infrastructure providers. This industry knowledge, paired with your CRM architecture skills, will enable you to drive exceptional outcomes for some of the world’s most complex telco customers. What You Will Do - Lead ServiceNow CRM implementations for telecommunications customers, applying industry knowledge to accelerate adoption and drive measurable business outcomes. - Translate telco-specific business challenges — such as customer churn, order fallout, service assurance, and B2B/B2C complexity — into effective CRM solution designs. - Serve as a trusted advisor to telco customers, connecting their operational priorities to ServiceNow platform capabilities. - Demonstrate empathy for the customer and genuine passion in helping them succeed in a highly competitive industry. - Engage and collaborate with ServiceNow R&D teams on escalated technical issues, including telco-specific product gaps and enhancements. - Provide thought leadership to telco sponsors and stakeholders in solving business process and technology problems. - Review customer architecture, design processes, and system integrations, with sensitivity to legacy telco environments and OSS/BSS landscapes. - Configure solution environments to address customer requirements and business issues. - Contribute to pre-sales campaigns by sharing telco-specific implementation strategies and best practices. - Mentor field resources on telco industry context and delivery best practices for CRM applications in telecommunications. - Collaborate with Product Management to surface telco customer needs and inform the ServiceNow product roadmap. - Share industry insights and lessons learned with internal teams and the broader ServiceNow community. Qualifications - 10+ years of experience in customer-facing implementation and delivery roles such as Solution Architect, Technical Consultant, or developer — ideally in professional services or consulting. - 10+ years in the CRM technology industry. - Significant experience working with or within telecommunications companies — including carriers, CSPs, cable/broadband providers, or telco infrastructure firms. - Deep understanding of telco business models, customer lifecycle management, and operational challenges (e.g., churn, order management, B2B/enterprise sales, network-driven service delivery). - Familiarity with OSS/BSS architecture and how CRM systems interact with broader telco technology stacks. - Comfort learning new technology platforms quickly — prior ServiceNow experience helpful but not required. - Ideally ServiceNow CSA and CSM certified, or willingness to certify upon hire. - Ability to perform deep architectural advisory work. - Excellent verbal and written communication skills, including ability to present to executives, chair workshops, and facilitate complex stakeholder sessions. - Highly data-driven with commitment to driving customer engagement toward business outcomes and value realization. - Fanatical about customer success and tenacious at driving long-term customer value. - Must be able to travel up to 25% annually, when applicable. Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.
• Design the target operating model, including the overarching ITSM framework, ticket routing logic, and incident management workflows for the consolidated enterprise. • Build the CAB quality gate to enforce strict ITIL intake requirements and mandatory global regulatory holds (e.g., e911, UK IPA) before any site moves to execution. • Map raw observability alerts into the target ITSM platform (e.g., ServiceNow, Jira Service Management) to build the operational logic pipeline. • Architect automated incident-creation rules within the ITSM platform. • Ensure that validated alerts from the observability platform are automatically routed to the correct NOC queue, assigned the appropriate SLA, and deduplicated into a single parent ticket. • Complete the core ITSM design and transition finalized runbook templates and operating rules to the Service Delivery Manager prior to volume factory execution. • Collaborate directly with the Network Observability Architect to integrate telemetry and monitoring thresholds, translating validated alerts into actionable ITSM tickets via API webhooks.



