Job Closed

This listing is no longer active.

vCluster Labs

vCluster Labs is a venture-backed tech startup headquartered in San Francisco, California, with a distributed, remote-first team spanning eight time zones. Foun

AI Infrastructure Specialist

Location

California + 4 moreAll locations: California | New York | Massachusetts | Missouri | Washington

Posted

67 days ago

Salary

$150K - $200K / year

Seniority

Senior

Job Description

AI Infrastructure Specialist

vCluster Labs

• Drive end-to-end technical deployments for GPU neocloud and AI Factory customers, from initial bare metal configuration to a validated vCluster environment. • Configure and troubleshoot bare metal GPU node infrastructure, including CNI configuration, GPU Operator setup, distributed storage backends, and RDMA/InfiniBand. • Deploy and validate Kubernetes and vCluster to provide GPU-powered managed K8s. • Work alongside customer teams to build self-sufficiency, ensuring they can operate and grow the platform independently. • Document reusable playbooks and deployment architectures so your learnings become the next customer's head start. • Collaborate with Engineering and Product to surface recurring infrastructure challenges, acting as a direct feedback loop from the field into the roadmap. • Join Sales in the pre-sales process where deep infrastructure work is required to achieve a meaningful proof of value.

Job Requirements

  • 5+ years of experience deploying and operating Kubernetes in production, ideally on bare metal or in high-complexity environments.
  • Practical knowledge of NVIDIA GPU Operators, CUDA tooling, and systems-level configuration for GPU nodes.
  • Deep understanding of CNI plugins, overlay networks, load balancing, and connectivity diagnosis in layered environments.
  • Experience with persistent volume configuration, CSI drivers, and distributed systems like Ceph, Rook, Weka, or Longhorn.
  • Comfort operating in ambiguous, fast-moving environments where you are often writing the playbook in real time.
  • You thrive in environments that reject legacy tech and prefer a modern stack where you can solve a variety of problems from pipelines to internal services.

Benefits

  • Competitive Salary: We offer a competitive compensation package, including equity.
  • Platinum-Level Insurance: Health, dental, vision, and life Insurance, including plans for you and eligible dependents (benefits vary depending on country).
  • Flexible Working Schedule: You have a doctor’s appointment or need to head to the supermarket to get groceries at 2pm? We won’t have an issue with that. To us, results matter more than clocking in and out at the same time every day.
  • Workplace Flexibility: We’re very flexible about where you work. We know things can change in life and we’re happy to adjust the work environment for you along the way.

Related Categories

Related Job Pages

More Infrastructure Engineer Jobs

CNX logo

Customer Engineer – Infrastructure – Azure Monitor

CNX

We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future.

Full TimeRemoteTeam 10,001

Job Title: Customer Engineer – Infrastructure – Azure Monitor Job Description We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future. We’re solution-focused, tech-powered, intelligence-fueled. With unique data and insights, deep industry expertise, and advanced technology solutions, we’re the intelligent transformation partner that powers a world that works, helping companies become refreshingly simple to work, interact, and transact with. We shape new game-changing careers in over 70 countries, attracting the best talent. The Concentrix Technical Products and Services team is the driving force behind Concentrix’s transformation, data, and technology services. We integrate world-class digital engineering, creativity, and a deep understanding of human behavior to find and unlock value through tech-powered and intelligence-fueled experiences. We combine human-centered design, powerful data, and strong tech to accelerate transformation at scale. You will be surrounded by the best in the world providing market leading technology and insights to modernize and simplify the customer experience. Within our professional services team, you will deliver strategic consulting, design, advisory services, market research, and contact center analytics that deliver insights to improve outcomes and value for our clients. Hence achieving our vision. Our game-changers around the world have devoted their careers to ensuring every relationship is exceptional. And we’re proud to be recognized with awards such as "World's Best Workplaces," “Best Companies for Career Growth,” and “Best Company Culture,” year after year. Join us and be part of this journey towards greater opportunities and brighter futures. The Azure Monitor Customer Engineer will work directly with customers, as a consultant and technical advisor to: • Design, Deploy, Review and Assess the health of the infrastructure • Upgrade and maintain deployments • Troubleshoot issues with infrastructure and agents • Tune and optimize for performance • Assist with reporting and visualizations • Implement new management packs • Assist in the development of custom management packs • Provide training in all areas of Azure Monitor to ensure customer goals are met Ideal candidate experience: - 15+ years working as a depth expert and technology owner or consultant for Azure monitor - Ability to present to multiple levels of customer leadership. - Ability to act as a consultant and architect for multiple customers. - Broad knowledge across multiple monitoring scenarios: · Windows and Linux Operating Systems · Azure Monitor · KQL Kusto Query language advanced level · URL, Network monitoring · Connecting to ITSM systems · Dashboards, Reporting, and Visualizations · PowerShell scripting - Deep level knowledge in at least 3 of the above categories - Advanced level of Dutch and English   Technical Skills Requirements: Azure Monitor: Broad knowledge of ALL the below areas, with deep understanding of (at least) 4 of the following: Deep understanding of Azure Monitor architecture (metrics vs logs, data flow, ingestion, retention) Strong knowledge of: • Log Analytics workspaces • Azure Monitor Metrics • Diagnostic settings • Resource level vs platform level telemetry - Ability to explain when to use Azure Monitor vs Azure Data Explorer / Grafana / third party tools. Additionally, be able to ; • Write complex KQL queries across multiple tables • Use: o parse, extend, mv-expand o joins, time series, summarize patterns o performance optimized queries • Build: o reusable queries o functions o summary rules for cost & performance optimization • Debug slow or expensive queries Tools • Visual Studio, Silect, MPViewer, Alert Update Connector, PowerShell Linux OS and Linux Monitoring Report Development Network Monitoring URL Monitoring Related Skills: • System Center Orchestrator • System Center Data Protection Manager • System Center Virtual Machine Manager • System Center Service Manager Location: NLD Work-at-Home Language Requirements: Time Type: Full time

Netherlands
CNX logo

Customer Engineer – Infrastructure – Azure Virtual Desktop / W365 (M/F.D)

CNX

We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future.

Full TimeRemoteTeam 10,001

Job Title: Customer Engineer – Infrastructure – Azure Virtual Desktop / W365 (M/F.D) Job Description We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future. We’re solution-focused, tech-powered, intelligence-fueled. With unique data and insights, deep industry expertise, and advanced technology solutions, we’re the intelligent transformation partner that powers a world that works, helping companies become refreshingly simple to work, interact, and transact with. We shape new game-changing careers in over 70 countries, attracting the best talent. The Concentrix Technical Products and Services team is the driving force behind Concentrix’s transformation, data, and technology services. We integrate world-class digital engineering, creativity, and a deep understanding of human behavior to find and unlock value through tech-powered and intelligence-fueled experiences. We combine human-centered design, powerful data, and strong tech to accelerate transformation at scale. You will be surrounded by the best in the world providing market leading technology and insights to modernize and simplify the customer experience. Within our professional services team, you will deliver strategic consulting, design, advisory services, market research, and contact center analytics that deliver insights to improve outcomes and value for our clients. Hence achieving our vision. Our game-changers around the world have devoted their careers to ensuring every relationship is exceptional. And we’re proud to be recognized with awards such as "World's Best Workplaces," “Best Companies for Career Growth,” and “Best Company Culture,” year after year. Join us and be part of this journey towards greater opportunities and brighter futures. The AVD / W365 Customer Engineer will work directly with customers, as a consultant and technical advisor to: Architectural Design & Strategy - Design for Resilience: Lead architectural design sessions to build scalable, secure, and resilient virtual desktop solutions with strong focus on BCDR strategies for mission-critical environments. - Modernization: Guide customers from legacy on-premises VDI (Citrix/VMware) to cloud-native solutions like AVD and Windows 365. - Trusted Advisor: Act as the primary technical point of contact for customer IT executives and architects, bridging the gap between business goals and technical implementation. Technical Implementation & Engineering - Image & Profile Management: Design and implement automated image creation solutions (i.e. Azure Image Builder) and robust profile management strategies using FSLogix containers. - Endpoint Management: Drive the integration of Microsoft Intune for managing physical and virtual endpoints. - Application Strategy: Advise on application delivery and packaging, specifically modern formats like MSIX and App Attach to decouple applications from base images. - Automation: Utilize PowerShell, Azure CLI, ARM or Biceps to automate deployment, scaling, and monitoring tasks, reducing manual operational overhead. Operational Excellence & Troubleshooting - Deep Dive Troubleshooting: Apply a methodical, analytical approach to resolve complex performance issues (latency, login times, resource contention) in large-scale environments. - Monitoring: Implement Azure Monitor and Log Analytics to provide proactive insights into host pool health and user experience. Ideal candidate experience: Minimum of 5 years working as a depth expert and technology owner or consultant for AVD / W365. Minimum of 10-15 years of experience of working with Windows Client Environments, ideally also Azure environments Required Hard Skills - Core Virtualization: Deep, hands-on expertise in Azure Virtual Desktop and/or Windows 365. Strong background in Hyper-V and RDS. - Identity & Security: Solid understanding of Azure Entra ID, Hybrid Identity, Conditional Access, and RBAC models. - Infrastructure: Proficiency in Azure Infrastructure (Networking, Storage, Compute). - Automation: Confident in PowerShell scripting for automation and system management. - OS Proficiency: Deep knowledge of Windows 10/11. Professional Experience - Public Sector Focus: Passion for and willingness to work with public sector customers, understanding their unique compliance and security requirements. - Experience: Degree in Computer Science, IT, or equivalent practical experience. Long-term experience with large enterprise customers and complex IT landscapes. - Languages: Excellent command of German and English (spoken and written) is mandatory for this role. - Mobility: Valid driver’s license and willingness to travel frequently to customer sites across Germany. Preferred (Nice to Have) - Certifications: Microsoft Certified: Azure Virtual Desktop Specialty (AZ-140) is highly preferred. Other relevant certs: Azure Administrator (AZ-104) or Azure Solutions Architect (AZ-305). - Legacy Knowledge: Experience with Citrix DaaS or VMware Horizon is helpful for migration conversations but not strictly required. - Network Security: Understanding of hub-and-spoke topology, ExpressRoute, and firewall configuration for VDI. Location: DEU Work-at-Home Language Requirements: Time Type: Full time

Germany
CNX logo

Customer Engineer – Infrastructure – Azure Monitor (m/f/d)

CNX

We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future.

Full TimeRemoteTeam 10,001

Job Title: Customer Engineer – Infrastructure – Azure Monitor (m/f/d) Job Description We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled. The global technology and services leader that powers the world’s best brands, today and into the future. We’re solution-focused, tech-powered, intelligence-fueled. With unique data and insights, deep industry expertise, and advanced technology solutions, we’re the intelligent transformation partner that powers a world that works, helping companies become refreshingly simple to work, interact, and transact with. We shape new game-changing careers in over 70 countries, attracting the best talent. The Concentrix Technical Products and Services team is the driving force behind Concentrix’s transformation, data, and technology services. We integrate world-class digital engineering, creativity, and a deep understanding of human behavior to find and unlock value through tech-powered and intelligence-fueled experiences. We combine human-centered design, powerful data, and strong tech to accelerate transformation at scale. You will be surrounded by the best in the world providing market leading technology and insights to modernize and simplify the customer experience. Within our professional services team, you will deliver strategic consulting, design, advisory services, market research, and contact center analytics that deliver insights to improve outcomes and value for our clients. Hence achieving our vision. Our game-changers around the world have devoted their careers to ensuring every relationship is exceptional. And we’re proud to be recognized with awards such as "World's Best Workplaces," “Best Companies for Career Growth,” and “Best Company Culture,” year after year. Join us and be part of this journey towards greater opportunities and brighter futures.The Azure Monitor Customer Engineer will work directly with customers, as a consultant and technical advisor to: - Design, Deploy, Review and Assess the health of the infrastructure - Upgrade and maintain deployments - Troubleshoot issues with infrastructure and agents - Tune and optimize for performance - Assist with reporting and visualizations - Implement new management packs - Assist in the development of custom management packs - Provide training in all areas of Azure Monitor to ensure customer goals are met Ideal candidate experience: 15+ years working as a depth expert and technology owner or consultant for Azure monitor Ability to present to multiple levels of customer leadership. Ability to act as a consultant and architect for multiple customers. Broad knowledge across multiple monitoring scenarios: - Windows and Linux Operating Systems - Azure Monitor - KQL Kusto Query language advanced level - URL, Network monitoring - Connecting to ITSM systems - Dashboards, Reporting, and Visualizations - PowerShell scripting Deep level knowledge in at least 3 of the above categories Technical Skills Requirements: Azure Monitor: Broad knowledge of ALL the below areas, with deep understanding of (at least) 4 of the following: Deep understanding of Azure Monitor architecture (metrics vs logs, data flow, ingestion, retention) Strong knowledge of: - Log Analytics workspaces - Azure Monitor Metrics - Diagnostic settings - Resource‑level vs platform‑level telemetry Ability to explain when to use Azure Monitor vs Azure Data Explorer / Grafana / third‑party tools. Additionally, be able to ; - Write complex KQL queries across multiple tables - Use: - parse, extend, mv-expand - joins, time series, summarize patterns - performance‑optimized queries - Build: - reusable queries - functions - summary rules for cost & performance optimization - Debug slow or expensive queries Tools - Visual Studio, Silect, MPViewer, Alert Update Connector, PowerShell Linux OS and Linux Monitoring Report Development Network Monitoring URL Monitoring Related Skills: - System Center Orchestrator - System Center Data Protection Manager - System Center Virtual Machine Manager - System Center Service Manager #WAH #LI-Remote Location: DEU Work-at-Home Language Requirements: Time Type: Full time

Germany
Cashea logo

Cloud Infrastructure Engineer

Cashea

Compra ahora y paga después, en cuotas sin interés. El impulso que mereces.

Full TimeRemoteTeam 501-1,000Since 2022H1B No Sponsor

• Propiedad de GCP • Diseñar, implementar y operar servicios en Cloud Run, Cloud SQL (PostgreSQL), Application Load Balancer, Cloud IAP, Cloud NAT, Cloud Armor y Cloud DNS • Gestionar la organización GCP: IAM, grupos, proyectos, carpetas y políticas de seguridad • Mantener y evolucionar la arquitectura de red: VPCs, subredes, mapas de URL, enrutamiento basado en rutas y certificados SSL multidominio • Diseñar y mantener pipelines CI/CD en GitHub Actions y Cloud Build • Implementar pipelines de migración de bases de datos desacoplados de implementaciones de aplicaciones, con controles de lock_timeout y statement_timeout • Configurar y mantener Datadog: métricas, registros, APM, paneles y alertas • Participar en rotaciones de guardia y contribuir a la mejora continua de postmortems • Definir SLIs y SLOs para servicios críticos de la plataforma • Gestionar Secret Manager, políticas IAM de menor privilegio y remediación de vulnerabilidades • Documentar procesos clave: implementaciones, migraciones de bases de datos, Apigee, BFF, monitoreo

Argentina
Job Closed