Job Closed
This listing is no longer active.
Senior IT Infrastructure Engineer – Cloud Focus
Location
United Kingdom
Posted
8 days ago
Salary
0
Seniority
Senior
Job Description
Senior IT Infrastructure Engineer – Cloud Focus
Signode
• Serve as subject matter expert in at least two areas: Cloud & infrastructure orchestration (automation scripting, integration) and O365/Exchange management • Lead deployment and support of hybrid cloud environments (Azure/on-prem) • Perform advanced support (Tier 3), including root-cause analysis and performance tuning • Contribute to automation efforts using scripting (PowerShell, Python, etc.) • Ensure security, backups, monitoring, patching, and documentation are rock solid • Support SOX-compliant operations and participate in audits as needed • Partner with global teams to align on standards and drive toward 'One Signode'
Job Requirements
- Significant hands-on experience in system engineering and cloud environments
- Deep experience with Azure, virtualization, server & application management
- Strong scripting and automation background (PowerShell, Python, VBScript, etc.)
- Fluent in English; French desirable
- Self-starter, detail-focused, and passionate about building better systems
- Comfortable working cross-functionally and influencing without authority
- Willingness to travel occasionally across EMEA
Benefits
- Health insurance
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Role Description As a Cloud Systems Engineer, you will join our team on a journey to help eliminate barriers for patients, increase their access to medications, and help them receive lifesaving treatments while working in an environment that nurtures you. As a part of the infrastructure group, you will help shape the core technologies used at Valeris. The role will have a primary focus on project work, with escalation from other teams as needed. This is a fast-paced environment with new challenges and technologies added to the mix daily. Teamwork and communication are key to be successful in this position. Responsibilities - Architect, design, and lead the implementation of enterprise-scale, Azure-first cloud solutions, with supporting workloads across AWS and hybrid on-prem environments. - Own, define, and drive cloud platform strategy and architecture, including governance, security, cost optimization, and operational best practices. - Lead the consolidation, standardization, and optimization of multiple cloud environments to improve scalability, efficiency, and overall operational maturity. - Serve as a senior technical escalation point for complex infrastructure, cloud, and platform-level issues. - Design, implement, and maintain highly available, secure, and resilient cloud infrastructure aligned with enterprise standards. - Lead system and cloud architecture design efforts, translating business requirements into scalable technical solutions. - Oversee capacity planning, workload forecasting, and performance optimization across cloud and hybrid environments. - Ensure consistent execution of change management, patching, and lifecycle management processes. - Participate in on-call rotation and perform off-hours maintenance as required to support platform reliability. - Utilize Valeris’ values as the driving force behind the team’s success. - On time adherence to training deadlines for all corporate policies and procedures. - Ensure all SOPs are followed with consistency. - Perform additional tasks or projects as assigned. Qualifications - Bachelor’s degree in computer science or related field; equivalent combination of education and work experience is acceptable. - 7+ years of experience in enterprise infrastructure, systems, or cloud engineering. - 5+ years of deep, hands-on Microsoft Azure experience, including architecture, deployment, and operations. - 3+ years of practical AWS experience supporting enterprise production workloads. - Proven ability to independently own and lead initiatives end-to-end. - Strong communication skills with the ability to translate complex technical concepts to technical and non-technical stakeholders. - Strong security-first mindset across design, implementation, and operations. - Extensive experience managing Windows Server environments (2012–2025). - 5+ years of experience managing VMware vSphere and enterprise hybrid virtualization environments. - Hands-on experience with containerization and orchestration technologies such as Docker and Kubernetes (preferred). - Linux administration experience in production or hybrid environments preferred. - Advanced scripting and automation experience (PowerShell, MS Graph, Bash, Terraform, or equivalent). - Strong knowledge of enterprise monitoring, backup, and disaster recovery solutions. - Demonstrated ability to work with PowerPoint and Visio for infrastructure and workflow architectural diagrams. Requirements - Prefer candidates who can type at least 35 words per minute with 97% accuracy. - Although very minimal, flexibility to travel as needed is preferred. - This job operates in a professional office environment. This role routinely uses standard office equipment such as computers, phones, photocopiers, etc. Benefits - Medical, dental, and vision plans, including HSA- and FSA-eligible options, with Valeris contributing toward premium costs. - Additional health support, including telehealth and Employee Assistance Program (EAP) services. - Company match on Health Savings Account contributions. - Free Basic Life and AD&D coverage equal to your annual earnings, with a minimum of $50,000 and a maximum of $300,000. - Company-paid Short-Term Disability coverage, with the option to purchase Long-Term Disability. - 401(k) Retirement Savings Plan with 100% match on the first 5% you contribute, with immediate vesting. - Paid Time Off (PTO) and Sick Leave to support work-life balance. - Team members receive nine paid holidays plus two floating holidays. - Opportunities for advancement in a company that supports personal and professional growth. - A challenging, stimulating work environment that encourages new ideas. - Work for a company that values diversity and makes deliberate efforts to create an inclusive workplace. - A mission-driven, inclusive culture where your work makes a meaningful impact.
Founding ML Infrastructure Engineer
uRunWe build the stage, not the show. We're an infrastructure company, a developer-tools company, and a production partner for model labs, and focus is a deliberate choice we've made and hold to. Day-to-day, that means a small team, a high bar, and real ownership. You won't wait for permission or inherit a backlog of someone else's decisions. In a founding security role, the function is what you make it. It also means ambiguity: priorities shift, not everything is documented. You'll often be the person who decides what "secure enough, for now" means.
Role Description We are building the next generation of AI inference infrastructure. As our ML Infrastructure and Platform Engineer, you will own the architecture and scaling of our GPU compute platform from the ground up. This is a founding technical hire with end-to-end ownership across the full infrastructure stack, from bare metal to model serving. You will work directly with the founding team and define how we build. What you'll actually be doing day-to-day: - Design and scale our GPU compute platform to support 1,000+ GPU clusters, ensuring high availability and low-latency inference across the fleet. - Build and maintain the infrastructure layer for our compute marketplace, including multi-tenant scheduling, isolation, and billing-aware resource allocation. - Own production reliability for ML systems end-to-end: observability, incident response, and SLA achievement across model serving and infrastructure. - Architect feature stores and model registry systems that support rapid iteration and reproducibility at scale. - Design an experiment tracking infrastructure capable of handling thousands of concurrent runs with full auditability. - Build resource orchestration and scheduling systems that optimise for throughput, cost, and latency across heterogeneous hardware. - Set engineering standards for infrastructure reliability, capacity planning, and operational excellence as an early technical leader. Qualifications - Proven experience designing and operating large-scale distributed infrastructure at 1,000+ nodes or equivalent complexity, in any domain. - Deep expertise in distributed systems, cluster orchestration (Kubernetes, Slurm, or custom schedulers), and large-scale resource scheduling. - Strong production reliability instincts: observability, incident response, capacity planning, and SLA ownership across complex systems. - Experience building infrastructure that other engineers build on top of, not just operating it. - Ability to operate as a technical lead: set direction, make tradeoffs under uncertainty, and raise the bar for the team around you. - Startup orientation. You are energised by ambiguity, move fast, and build for scale from day one. Requirements - Exposure to ML infrastructure concepts: GPU networking (NCCL, InfiniBand, RoCE), model serving frameworks (vLLM, SGLang, TensorRT-LLM), or hardware-aware performance tuning (CuTe, Triton, TileLang). - Experience with multi-cloud GPU procurement and capacity management across AWS, GCP, Azure, and bare metal providers. - Familiarity with inference marketplace architectures, dynamic routing, or spot/preemptible workload management. - Prior experience at a Series A or earlier stage company scaling from early infrastructure to production. Benefits - Competitive salary and meaningful equity in an early-stage AI infrastructure company. The band above is our target; for an exceptional candidate we'll go higher. Equity is real — you're early, and the grant reflects that. - Health, dental, and vision — full coverage. - 401(k) — company-supported retirement savings. - FSA/HSA — flexible spending accounts for healthcare costs. - Paid time off — we trust you to manage your time. - Top-tier tooling — access to the best AI tools available: Claude, Codex, Kimi, and whatever else helps you move faster. - MacBook Pro and AirPods — the hardware you need, on us.
Senior Software Engineer, Infrastructure Engineering
NVIDIANVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
• Build, develop, deploy, optimize, and document backend infrastructure features. • Collaborate with engineering and product teams across the stack • Drive adoption and frictionless integration of the stack • Ensure a high level of architectural quality, performance in the developed code and features • Debugging and fixing deficiencies • Support members of other teams / clients as they adopt the technology developed • Support the technology on multiple hardware platforms • Partnering and collaborating with peers and managers
NOSC Infrastructure Tier 2 Engineer
CACI International IncExpertise and Technology for National Security
Role Description CACI is seeking an experienced Tier II NOC Engineer within a 24x7x365 enterprise network operations security center (NOSC) supporting a federal government customer. The Tier II NOC Engineer will be responsible for providing network administration and operational support for a large and dynamic enterprise network, as well as performing analysis and troubleshooting of network incidents with the objective of minimizing interruptions and outages for end users to mission critical communications and resources. Shift: Mon – Fri, 8am – 4pm Must be flexible and able to be on-call when scheduled Location: Nebraska Ave Complex (3801 Nebraska Ave NW, Washington, DC 20528), but currently remote. Responsibilities: - Leverage industry experience to provide operational support to maintain overall health and performance of enterprise network components, to include LAN, WAN, firewalls, VPNs, and other network platforms. - Monitoring of network performance dashboards to proactively detect potential network degradation events and outages. - Investigate and diagnose incidents to restore network services as quickly as possible, ensuring all incident details and restoration steps are thoroughly documented in the ServiceNow ticketing platform. - Lead root cause analysis efforts and draft After Action Reports as requested. - Implementation of approved network enhancements, firewall and whitelisting requests, and other network updates in accordance with the Change Management process. - Routinely collaborate with Government Leads, Watch Officers, and other Operational Teams in the communication and investigation of high priority troubleshooting efforts using the appropriate escalation procedures. - Installation and support for remote access platforms such as VPN, Terminal Services, and Citrix. - Work with vendor engineering teams in the investigation of complex hardware and software issues and initiate RMAs for failed hardware components as necessary. - Provide guidance and assist in the development of junior NOC engineers with over-the-shoulder training and the creation of knowledge articles. - Perform all tasks in accordance with established team Service Level Agreements (SLAs) and Standard Operating Procedures (SOPs). Qualifications - Ability to obtain DHS Entry on Duty (EOD). - Bachelor’s degree + 8 years of experience. - Relevant network/NOC work experience. - CCNA certification. - In-depth understanding of network infrastructure topologies, security policies, firewalls, and L2/L3 switch and routing infrastructure. - Experience with network security, including firewalls, IDS/IPS, and vulnerability assessment & remediation. - Experience using network management and analysis tools such as SNMP probes, network taps, and packet analyzers. - Working knowledge of following the network tools/platforms: Broadcom Spectrum and CA PAM platforms, ServiceNow, SolarWinds, Splunk, NetScout, Grafana, HP Network Automation (HPNA) or similar tools/platforms. - Experience with the configuration and support of Palo Alto firewalls. - Experience with the configuration of BlueCoat proxies. - Experience using ServiceNow or similar ticketing systems. - Ability to work independently as well as part of a team. Requirements - Previous experience supporting Zscaler (configuring, troubleshooting, etc.). - Previous experience with SDWAN. - Previous experience with Site-to-Site VPNs. - Desired Certifications: - CCNP. - ITIL 4 Foundations. - Security +. Benefits - A culture of integrity. - An environment of trust. - A focus on continuous growth. - Competitive compensation, benefits, and learning and development opportunities. - Comprehensive benefits such as healthcare, wellness, financial, retirement, family support, continuing education, and time off benefits. Company Description CACI is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any other protected characteristic.


