The AI Factory. Accelerating the Future.
Senior Infrastructure Engineer
Location
Australia
Posted
52 days ago
Salary
0
Seniority
Senior
Job Description
Senior Infrastructure Engineer
NexGen Cloud
• Own the design, deployment, and operation of OpenStack and Kubernetes environments — ensuring platform performance, scalability, and resilience for GPU workloads • Build and improve infrastructure using infrastructure-as-code and GitOps practices, driving automation across provisioning, deployment, and operational workflows • Optimise GPU workload scheduling using Kubernetes and NVIDIA tooling, and implement monitoring, logging, and alerting to ensure platform stability • Lead incident response and drive continuous improvement of reliability across the platform • Maintain strong security controls across infrastructure and container layers — RBAC, network policies, and tenant isolation • Work closely with Platform, DevOps, AI, Product, and Support teams to align infrastructure capabilities with customer and platform requirements
Job Requirements
- Strong hands-on experience running OpenStack in production environments
- Proven experience operating Kubernetes at scale — ideally bare-metal or private cloud
- Solid understanding of Linux, networking, and storage systems
- Experience with infrastructure automation, CI/CD, and Git-based workflows
- Strong ownership mindset — comfortable operating without heavy oversight and able to simplify and scale systems in a fast-moving environment
- Experience integrating Kubernetes with OpenStack (Nice to Have)
- Exposure to GPU infrastructure, HPC, or large-scale compute environments (Nice to Have)
- Familiarity with advanced networking or cloud-native ecosystems (Nice to Have)
- Contributions to open-source projects (Nice to Have)
Benefits
- Competitive salary and annual discretionary bonus scheme
- Employee wellbeing benefits
- 25 days of holiday, plus public holidays
- Flexible working arrangements (remote or hybrid, depending on role and location)
- Real ownership and autonomy, with the trust to take initiative and experiment
- The opportunity to make a visible, meaningful impact as we scale
- Clear career progression and growth opportunities in a fast-growing company
- A collaborative, international culture built on trust, transparency, and ownership
- The chance to help shape NexGen Cloud's team, culture, and future alongside ambitious, mission-driven colleagues
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Infrastructure Systems Engineer
Peraton CorporationPeraton Corporation, a national security company headquartered in Herndon, Virginia, supplies solutions for mission-critical programs and systems. Founded in 2017, Peraton's missio
Responsibilities The Office of Space Weather Observations (SWO) under NESDIS is responsible for advancing space weather observational capabilities to meet NOAA programmatic needs. NOAA’s Space Weather Next (SWX) program maintains and extends space weather observations from various vantage points, selected to most efficiently provide comprehensive knowledge of the Sun and the near-Earth space environment needed to protect our technological infrastructure. The Space Weather Ground Services (SWGS) is responsible for comprehensive ground services for all SWX projects, ensuring successful implementation and operation of observing assets and ensuring the continuity of space weather measurements made by SWFO-L1 and the GOES-R series satellites. The SWGS Mission Operations Services (MOS) program must provide a full mission satellite command and control solution to support the L1 Series with two new independently launched observatories. Overview: Peraton is seeking a Systems Engineer to support the infrastructure design, configuration and deployment for a new satellite ground system development program supporting the National Oceanic and Atmospheric Administration (NOAA). This position will support all Infrastructure activities throughout the full system lifecycle—from architecture and design through integration, assessment, authorization, and operational deployment. The selected candidate will be responsible for supporting the program’s Infrastructure functions. This role requires close collaboration with other program functional elements including Cyber, Software Engineering, Networks, Systems Engineering, Architecture, Operations, Quality and program leadership. - Provide support for system design, deployment, and implementation of Information Technology (IT) systems within Linux, Windows and cloud infrastructures. - Develop and design hardware systems architecture based on project requirements. - Collaborate with cross-functional teams to define system specifications and requirements. - Research, evaluate, and recommend new hardware and software solutions. - Conduct comparative analyses of hardware and software options for specific projects. - Collaborate with stakeholders to understand business drivers and requirements for cloud migration. - Identify systems and applications suitable for migration to the cloud. - Create and maintain detailed technical drawings, diagrams, and schematics. - Create and maintain detailed design documentation. - Ensure accuracy and completeness of program technical documentation. - Monitor and analyze physical and virtualized environments to track resource allocation. - Support integration efforts for hardware and software components into larger systems. - Support procurement efforts for new material and maintenance renewals. - Support ongoing obsolescence analysis of existing hardware and software. **This position is contingent on contract award.** #SWOMOS Qualifications - 5 years with BS/BA, 3 years with MS/MA or 0 years with PhD. 9 years of experience with no degree - Demonstrated proficiency in system design, design principles, implementation, and troubleshooting across various platforms. - Ability to obtain and hold a Public Trust clearance – US Citizenship is required - Hands-on experience with cloud computing platforms (e.g., AWS, Azure, Google Cloud), including designing, deploying, and managing cloud-based infrastructures. - Proficiency in designing, deploying, and managing virtualized environments to optimize resource utilization and performance. - Expertise in VMware virtualization technologies, including vSphere, vCenter Server, and Horizon View - Strong understanding of Windows and Linux operating systems, including installation, configuration, and administration. - Hands-on experience with hardware including servers, networking equipment, and storage solutions Knowledge of product specifications, configurations, and best practices for deployment and maintenance. - Experience in managing changes to hardware designs, requirements, and project plans. Ability to assess the impact of changes, obtain approval, and implement changes effectively while minimizing disruption. - Experience in troubleshooting system issues and optimizing performance for mission-critical applications. - Proficient in creating formal drawings and schematics for system architecture, system diagrams, network topologies, and other visual representations of hardware and infrastructure designs. (Visio experience required, AutoCAD recommended). - Knowledge of electronic components and their specifications. Ability to select appropriate hardware components based on performance requirements, cost considerations, and availability. - Experience in collaborating with hardware vendors, software providers, and service providers to evaluate products and services. Desired Qualifications: - Experience working within a distributed virtual team environment, with proficiency in remote collaboration tools and practices. - Experience with Infrastructure as Code (IaC) to include tools like Terraform, CloudFormation and Ansible - AWS Certified Cloud Practioner or similar cloud certifications - Familiar with the Atlassian tool suite, including Jira, Asset Manager, Confluence, Risk Register, Crucible, Bitbucket, Git, etc. - Strong interpersonal skills with a willingness to foster strong relationships with coworkers and vendors. - Highly organized with strong attention to detail - Outstanding verbal and written communication skills - Experience leading projects and process improvement activities to completion with successful outcomes and delivery of desired results. - Active Public Trust clearance Peraton Overview Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can’t be done by solving the most daunting challenges facing our customers. Visit peraton.com to learn how we’re keeping people around the world safe and secure. Target Salary Range $104,000 - $166,000. This represents the typical salary range for this position. Salary is determined by various factors, including but not limited to, the scope and responsibilities of the position, the individual’s experience, education, knowledge, skills, and competencies, as well as geographic location and business and contract considerations. Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay. EEO EEO: Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.
Senior Cloud Infrastructure Architect
AxwayAxway is a software company that helps enterprises realize their digital transformation. According to the team, organizations are living in a time of rapidly ch
• Design and implement scalable cloud and hybrid infrastructure solutions leveraging Microsoft Azure and on-premises technologies. • Architect and support highly available, resilient, and disaster recovery-enabled infrastructure environments. • Design and implement secure Azure networking solutions including: VNets, ExpressRoute, VPN Gateway, Azure Firewall, NSGs, Load Balancers, Application Gateway / WAF. • Deploy and support mission-critical applications using Azure infrastructure services. • Act as technical lead for customer onboarding, migration, and cloud infrastructure projects. • Design, deploy, administer, and troubleshoot enterprise Linux and Windows server environments. • Manage Active Directory, Entra ID (Azure AD), and hybrid identity integrations. • Deploy, administer, and support Kubernetes environments, including Azure Kubernetes Service (AKS). • Support containerized environments using Docker and Kubernetes orchestration technologies. • Manage Kubernetes cluster operations including scaling, upgrades, patching, monitoring, ingress, storage, and security. • Partner with DevOps and engineering teams to support CI/CD pipelines and cloud-native application deployments. • Automate infrastructure provisioning and operational activities using Terraform, scripting, and Infrastructure-as-Code methodologies. • Administer infrastructure monitoring and observability platforms such as Azure Monitor, Prometheus, Grafana, and Nagios. • Support enterprise hosting technologies including backup, replication, storage, proxy, and load balancing solutions. • Provide advanced L3 support across cloud infrastructure, Kubernetes, Linux, networking, and storage platforms. • Perform infrastructure lifecycle management, patching, capacity planning, and operational optimization. • Participate in audits, disaster recovery testing, operational governance, and security compliance initiatives. • Conduct root cause analysis and drive long-term operational improvements. • Participate in after-hours support rotations and scheduled maintenance activities as required.
Senior Data Infrastructure Engineer
BackblazeBackblaze is the cloud storage innovator delivering a modern alternative to traditional cloud providers.
• Design, configure and maintain data solutions such as Vitess, Apache Cassandra, NATS, Redis/ValKey • Work closely with Operations and Application Engineers to develop new solutions and optimize existing ones • Improve production system scalability, performance, and availability • Deliver enhancements to observability, operability and reliability to lower TCO
Staff Observability Data Infrastructure Engineer
CVS HealthBringing our heart to every moment of your health.
• Design, build, and operate high-volume log, metric, and trace pipelines using Databricks, cloud data lakes, and distributed processing engines • Architect and evolve an Observability Lakehouse aligned with OpenTelemetry (OTEL) data models and standards • Implement ingestion and transformation workflows using technologies such as Cribl, Vector, Jenkins, GitHub Actions, or equivalent tools • Normalize, model, and enrich telemetry data to support detection engineering, forensics, and operational analytics • Develop scalable ETL/ELT frameworks, Delta Lake architectures, and automated data quality validation for unstructured and semi-structured data • Partner with Security Engineering, SRE, Cloud, and SOC teams to improve enterprise visibility and detection accuracy • Build and maintain CI/CD pipelines and reusable Infrastructure-as-Code (IaC) patterns for observability platform deployment • Identify and resolve performance, latency, cost, and reliability issues across telemetry pipelines • Contribute to engineering standards, documentation, and knowledge sharing across observability and security platforms




