True Anomaly logo
True Anomaly

Space was once the quietest place in the universe. Now, it's crowded, contested, and confrontational. We are True Anomaly: the only defense company focused exclusively on space defense. Founded in 2022 by ex-U.S. Space Force members, True Anomaly designs and builds advanced systems for space superiority: agile and powerful spacecraft platforms, mission software engineered for unmatched command and control, and payloads tailored for precision sensing and effects. True Anomaly is headquartered in Centennial, CO, with regional offices in Colorado Springs, CO, Long Beach, CA, and Washington, D.C. We are hiring and seeking exceptional talent to join True Anomaly, from any technical industry or background, to bring unique talents, perspective, and solutions. If you embrace complexity, lead instead of follow, showcase integrity over ego, take ownership for outcomes, and measure success by impact, we want to hear from you.

Senior Manager, Infrastructure Engineering

Infrastructure EngineerInfrastructure EngineerFull TimeRemoteSeniorTeam 250Since 2022Company Site

Location

California + 1 moreAll locations: California | Colorado

Posted

7 days ago

Salary

$200K - $290K / year

Seniority

Senior

English

Job Description

Senior Manager, Infrastructure Engineering

True Anomaly

Space is a warfighting domain. True Anomaly seeks those with the talent and ambition to build the technology that secures it. OUR MISSION True Anomaly delivers decisive capabilities for space superiority. We build autonomous spacecraft, advanced payloads, mission software, and space-based interceptors — enabling the U.S. and its Allies to secure the space environment and counter threats from the ultimate high ground. OUR VALUES - Be the offset. We create asymmetric advantages with creativity and ingenuity. - What would it take? We challenge assumptions to deliver ambitious results. - It’s the people. Our team is our competitive advantage and we are better together. OUR MISSION The peaceful use of space is essential for continued prosperity on Earth—from communications and finance to navigation and logistics. True Anomaly builds innovative technology at the intersection of spacecraft, software, and AI to enhance the capabilities of the U.S., its allies, and commercial partners. We safeguard global security by ensuring space access and sustainability for all. OUR VALUES Be the Offset. We create asymmetric advantages with creativity and ingenuity What would it take? We challenge assumptions to deliver ambitious results It’s the People. Our team is our competitive advantage, and we are better together YOUR MISSION True Anomaly operates across Azure, AWS, their GovCloud variants, on-premises facilities, classified environments, and customer-deployed infrastructure. As Senior Manager of Infrastructure Engineering, you will own the compute platform that ties all of it together: Kubernetes, networking, CI/CD, cloud infrastructure, and the government-accredited environments that the company deploys into. Your team will grow initially to 6 then further as the company breaks out dedicated subteams for SRE, network engineering, developer productivity, and incident management. The infrastructure you build will serve every program in the company, directly displace subcontractor engagements that currently cost millions per program, and form the foundation of True Anomaly’s accredited compute platform. This role requires someone who can operate across the full breadth of infrastructure disciplines, make sound architectural calls in classified and unclassified environments, and build a team that scales with an aggressive growth trajectory. RESPONSIBILITIES - Build and lead the Infrastructure Engineering team: initially 3 Senior Engineers (Kubernetes/platform reliability, networking, CI/CD/cloud), growing into dedicated subteams as the organization scales - Own the compute platform strategy across commercial, CUI, and classified environments, delivering standardized infrastructure that any program can deploy onto instead of building from scratch - Deliver government-accredited compute environments (IL-5, IL-6) using internal engineering capacity, replacing the current model of outsourcing each new accredited environment to subcontractors - Drive Kubernetes standardization, multi-tenant cluster design, service mesh, and workload orchestration across the company’s multi-cloud and on-premises footprint - Partner closely with the Staff/Principal Platform Engineer and Staff/Principal Network Engineer (who report to the VP) as the senior technical anchors for compute and networking architecture - Collaborate with Cybersecurity on hardened infrastructure (STIGs, security scanning, network segmentation) and with AI and Data Engineering on the compute and networking layer their platforms run on QUALIFICATIONS - 10+ years of experience in infrastructure, platform, or systems engineering, with 3+ years leading infrastructure teams across multiple technical disciplines - Active U.S. security clearance, such as Secret, Top Secret or TS/SCI clearance - Deep hands-on expertise in Kubernetes, container orchestration, and cloud infrastructure (AWS, Azure, or GovCloud variants) at production scale - Working knowledge of networking across complex environments: multi-cloud, on-premises, VPNs, firewalls, and network segmentation across classification boundaries - Direct experience building or operating DoD accredited environments (IL-4/IL-5 or higher), with familiarity with NIST 800-53, STIGs, RMF, and the ATO process - Experience with CI/CD platform design and infrastructure-as-code practices at scale (Terraform, Ansible, or similar) - Track record of building and leading engineering teams in high-growth environments where the infrastructure had to scale faster than the headcount PREFERRED SKILLS AND EXPERIENCE - Experience in aerospace, defense, or national security technology companies where infrastructure spans classification levels - Background managing teams that operate across both unclassified and classified environments with mixed clearance requirements - Experience with service mesh (Istio, Linkerd), GitOps workflows, and developer platform tooling - Prior experience at a high-growth company where a single infrastructure team covered multiple disciplines before specializing into subteams - Familiarity with platform engineering “paved roads” operating models that balance standardization with engineering team autonomy COMPENSATION - Base Salary: Long Beach - $165,000 to $230,000, Denver - $160,000 to $220,000, SF Bay Area - $185,000 to $250,000 - Equity + Benefits including Health, Dental, Vision, HRA/HSA options, PTO and paid holidays, 401K, Parental Leave Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education and experience. ADDITIONAL REQUIREMENTS - Active U.S. Secret clearance required; Top Secret preferred; must be able to obtain TS/SCI - Work Location: this role will be fully onsite at our facilities in Centennial, CO, Long Beach, CA, or SF Bay Area, CA - Work environment is in a standard office, working at a desk or in a production factory. - Physical demands may include frequent standing, sitting, walking, bending, and lifting or carrying items up to 20lbs. This position will be open until it is successfully filled. To submit your application, please follow the directions below. #LI-Onsite To conform to U.S. Government space technology export regulations, including the International Traffic in Arms Regulations (IT - AR) you must be a U.S. citizen, lawful permanent resident of the U.S., protected individual as defined by 8 U.S.C. 1324b(a)(3), or eligible to obtain the required authorizations from the U.S. Department of State. We value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. True Anomaly is committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, maternity or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know. To conform to U.S. Government space technology export regulations, including the International Traffic in Arms Regulations (ITAR) you must be a U.S. citizen, lawful permanent resident of the U.S., protected individual as defined by 8 U.S.C. 1324b(a)(3), or eligible to obtain the required authorizations from the U.S. Department of State. True Anomaly is committed to equal employment opportunity on any basis protected by applicable state and federal laws. If you have a disability or additional need that requires accommodation, please do not hesitate to let us.

Related Categories

Related Job Pages

More Infrastructure Engineer Jobs

Lightning AI logo

Infrastructure Engineer - Observability

Lightning AI

The platform to build ML models & build/publish Lightning Apps that “glue” together your favorite ML lifecycle tools.

Full TimeRemoteTeam 11-50H1B Sponsor

Title: Infrastructure Engineer Observability Location: United States Job Description: New York, New York, United States; Remote; San Francisco, California, United States; Seattle, Washington, United States Who We Are Lightning AI is the company behind PyTorch Lightning. Founded in 2019, we build an end-to-end platform for developing, training, and deploying AI systems—designed to take ideas from research to production with less friction. Through our merger with Voltage Park, a neocloud and AI Factory, Lightning AI combines developer-first software with cost-efficient, large-scale compute. Teams get the tools they need for experimentation, training, and production inference, with security, observability, and control built in. We serve solo researchers, startups, and large enterprises. Lightning AI operates globally with offices in New York City, San Francisco, Seattle, and London, and is backed by Coatue, Index Ventures, Bain Capital Ventures, and Firstminute. What We’re Looking For Lightning AI is seeking an Observability Infrastructure Engineer to join our Infrastructure Engineering team. In this role, you will own and evolve observability systems across large-scale, GPU-enabled bare-metal infrastructure. You’ll operate at the intersection of infrastructure, data, and product, building platforms for metrics, logs, traces, and alerting that power both internal operations and customer-facing visibility. You will play a key role in productizing observability, enabling scalable, multi-tenant monitoring experiences while keeping pace with rapid infrastructure buildouts. This includes designing telemetry pipelines, improving signal quality, and delivering actionable insights that ensure reliability and transparency across our platform. We’re flexible on location for this team. This role can work hybrid out of one of our US-based hubs (Seattle, NYC, or SF) or fully remote within the U.S., with occasional company and team offsites. We are not able to provide visa sponsorship for this position at this time. What You’ll Do Observability Platform & Productization - Own and evolve a scalable observability platform spanning metrics, logs, traces, and events - Drive the productization of observability capabilities for both internal teams and external customers - Design multi-tenant observability systems with scoped access, RBAC, and customer-facing visibility - Continuously improve observability systems to keep pace with rapid infrastructure buildouts Telemetry & Data Pipelines - Design and operate telemetry pipelines ingesting data from GPUs, CPUs, networking (Ethernet & InfiniBand), containers, APIs, and BMC/Redfish - Build systems to correlate signals across infrastructure layers to enable faster debugging and root cause analysis - Implement streaming and real-time data pipelines using tools such as Kafka, OTEL, Promtail, or similar Alerting, Reliability & Insights - Design and implement noise-resistant alerting systems to improve signal quality and reduce operational load - Create dashboards and alerting for InfraOps, Engineering, and Customer Success teams - Build automated insights and enable proactive detection, forecasting, and system health visibility at scale Systems & Infrastructure Engineering - Contribute to broader infrastructure engineering projects beyond observability - Partner with infrastructure and platform teams to embed observability into core systems and workflows - Support large-scale, distributed systems across compute, networking, and storage environments Cross-Functional Collaboration - Work closely with customer-facing teams to deliver external observability experiences - Collaborate with engineering, operations, and support teams to improve system transparency and reliability - Help define best practices for observability across the organization What You’ll Need Required Qualifications - 5+ years of experience in infrastructure engineering, SRE, or observability-focused roles - Strong experience with monitoring systems such as Prometheus, Grafana, ELK, or VictoriaMetrics - Experience building and operating observability platforms at scale - Proficiency in Python, Go, or bash for automation and data integration - Familiarity with containerized environments and Kubernetes observability - Experience with streaming telemetry pipelines (Kafka, OTEL, Promtail, or equivalent) - Experience with multi-tenant monitoring architectures - Strong written and verbal communication skills Ideal Experience - Experience with GPU observability, particularly NVIDIA DCGM - Experience monitoring large-scale GPU or HPC clusters - Familiarity with InfiniBand fabric observability - Experience building customer-facing or productized infrastructure systems - Experience with correlation engines, RCA workflows, or predictive alerting systems - Broad exposure to infrastructure domains including networking, storage, and provisioning Compensation We are committed to offering competitive compensation that reflects the value each team member brings to our mission. Final offers are based on factors such as experience, skills, geographic location, and role expectations. In addition to base salary, our total rewards package for eligible roles includes a discretionary bonus, a meaningful equity component, and comprehensive benefits. The anticipated annual base salary range for this role is: $180,000 - $200,000 USD Benefits and Perks We offer a comprehensive and competitive benefits package designed to support our employees’ health, well-being, and long-term success. Benefits may vary by location, team, and role. Benefits include: - Comprehensive medical, dental and vision coverage (U.S.); Private medical and dental insurance (U.K.) - Retirement and financial wellness support (U.S.); Pension contribution (U.K.) - Generous paid time off, plus holidays - Paid parental leave - Professional development support - Wellness and work-from-home stipends - Flexible work environment At Lightning AI, we are committed to fostering an inclusive and diverse workplace. We believe that diverse teams drive innovation and create better products. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other protected characteristic. We are dedicated to building a culture where everyone can thrive and contribute to their fullest potential.

United States
$180K - $200K / year

Infrastructure Automation Engineer

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. We recognize that our people are our strength. We are an equal opportunity employer and place a high value on diversity and inclusion. We do not discriminate on the basis of any protected attribute. We make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs. Bright Vision Technologies is an Equal Opportunity Employer, including Disability/Veterans.

Role Description We are seeking an Infrastructure Automation Engineer with deep Terraform expertise to design, build, and maintain the infrastructure-as-code foundations that power our cloud and hybrid environments. This role focuses on creating reusable Terraform modules, hardening pipelines, enforcing policy-as-code, and standardizing infrastructure delivery across multiple teams and cloud providers. The ideal candidate brings strong software engineering discipline to infrastructure work, has shipped production-grade Terraform at scale, and understands the operational realities of managing thousands of resources across many environments and accounts. Key Responsibilities - Design, develop, and maintain modular, composable Terraform code that codifies the entire infrastructure estate across cloud accounts and environments. - Build a library of well-tested, reusable Terraform modules with clear interfaces, semantic versioning, and comprehensive documentation. - Implement Terraform automation pipelines using GitHub Actions, GitLab CI, Atlantis, Terraform Cloud, or Spacelift, with plan/apply gating, drift detection, and policy enforcement. - Define and enforce policy-as-code using Sentinel, Open Policy Agent (OPA), Conftest, or Checkov to prevent insecure or non-compliant infrastructure changes. - Manage Terraform state at scale with appropriate backend strategies, state locking, workspace organization, and disaster recovery patterns. - Drive multi-account, multi-region, and multi-cloud infrastructure provisioning strategies with clear isolation, naming, and tagging standards. - Implement infrastructure testing including unit tests with terraform-compliance, integration tests with Terratest, and policy tests across pull requests. - Collaborate with security, networking, and platform teams to embed guardrails directly into reusable modules and pipelines. - Standardize patterns for secrets management, identity federation, and least-privilege IAM through reusable Terraform abstractions. - Lead migrations from legacy, ClickOps, or non-IaC infrastructure into managed Terraform footprints with minimal disruption. - Drive cost optimization, tagging hygiene, and lifecycle management across the Terraform-managed estate. - Mentor engineering teams on Terraform best practices, anti-patterns, and pull-request review standards. - Maintain comprehensive runbooks, architecture diagrams, and onboarding materials for the infrastructure platform. - Stay current with Terraform, OpenTofu, and broader IaC ecosystem developments and recommend adoption where beneficial. Qualifications - Bachelor’s degree in Computer Science, Engineering, or a related field. - Five or more years of experience in cloud infrastructure or DevOps engineering, with significant Terraform focus. - Deep, hands-on expertise authoring and maintaining production Terraform across at least one major cloud provider. - Strong experience designing reusable Terraform modules with clean APIs and version discipline. - Hands-on experience with Terraform state management, backends, and large-scale workspace organization. - Strong scripting skills in Python, Go, or Bash. - Experience with CI/CD pipelines for infrastructure code and automated policy enforcement. - Solid understanding of cloud networking, identity, and security primitives. - Strong Git-based workflows including code review, branching, and release management. - Excellent troubleshooting and root-cause analysis skills. Preferred Qualifications - Experience with multi-cloud Terraform (AWS + Azure or AWS + GCP). - Familiarity with Terragrunt, Atlantis, Spacelift, or env0. - Experience with policy-as-code engines (Sentinel, OPA, Checkov). - Contributions to public Terraform modules or providers. - Exposure to FinOps practices and tagging-driven cost governance. How to Apply Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] or contact us at (908) 650-6699. Learn more about Bright Vision Technologies at www.bvteck.com .

United States
$100K - $150K / year
Job Closed
Switzerland Global Enterprise logo

Cloud & Infrastructure Engineer

Switzerland Global Enterprise

We support Swiss SMEs in their international business and help innovative foreign companies to establish in Switzerland.

Full TimeRemoteTeam 51-200Since 1927H1B No Sponsor

Role Description Zur Verstärkung unseres IT-Teams suchen wir per sofort oder nach Vereinbarung an unserem Standort in Zürich eine engagierte und technisch versierte Persönlichkeit als Cloud & Infrastructure Engineer mit Schwerpunkt auf Microsoft Cloud, Infrastruktur und moderner Arbeitsplatztechnologie. - Konzeption, Betrieb, Weiterentwicklung und Lifecycle-Management unserer IT-Infrastruktur in der Cloud, inklusive AVD - Gestaltung, Ausbau und operative Betreuung der Microsoft Cloud-Umgebung mit Fokus auf Azure, Microsoft 365, Entra ID, Teams, SharePoint und Intune - Betrieb und Weiterentwicklung zentraler Infrastruktur- und Plattformservices, insbesondere AD, DNS, DHCP, GPO, Server- und Domain Services - Implementierung, Wartung und Optimierung von Netzwerk- und Sicherheitskomponenten wie Firewall, WAN, LAN, WLAN, VPN, VLAN und SD-WAN - Verantwortung für Themen im Bereich Identity & Access Management, inklusive Conditional Access, Enterprise Applications, App Registrations sowie Identity-, Mail- und AD-Security - Mitarbeit bei der technischen Security sowie Sicherstellung der Einhaltung von Standards, Architekturprinzipien, Security- und Compliance-Vorgaben - Engineering zukunftsgerichteter Lösungen sowie Automatisierung bestehender und neuer Infrastrukturen - Digitalisierung und Standardisierung von Prozessen, unter anderem mit PowerShell - Überwachung und Monitoring von Infrastruktur- und Services mit PRTG - Unterstützung und Mitwirkung in Projekten, Change Requests sowie im Requirements Engineering und bei der Weiterentwicklung von IT-Architekturen - Verantwortung für das Lizenzmanagement im Microsoft-Umfeld, insbesondere im Rahmen von Cloud Solution Provider (CSP) Modellen - Beratung und Coaching interner Stakeholder sowie Sicherstellung des Know-how-Transfers zu Methoden, Prozessen und Technologien - Mitarbeit im 1st-, 2nd- und 3rd-Level-Support sowie technische Koordination mit Applikationsverantwortlichen und externen Partnern Qualifications - Abgeschlossene Ausbildung oder Weiterbildung in Informatik, Wirtschaftsinformatik oder einem vergleichbaren Bereich - Mehrjährige Erfahrung im Microsoft-Umfeld mit Fokus auf Cloud, Serveradministration, Infrastruktur und Plattformentwicklung - Fundierte Kenntnisse in Azure, Microsoft 365, Storage, Backup, Virtualisierung sowie moderner Infrastruktur- und Cloud-Technologien - Sehr gute Kenntnisse in IAM, Entra ID, Conditional Access sowie Security-relevanten Themen im Microsoft-Umfeld - Erfahrung in der Entwicklung, Optimierung und Bewertung von IT-Architekturen und Lösungen unter Berücksichtigung von Security und Compliance - Sicherer Umgang mit PowerShell und Freude an Automatisierung und Standardisierung - Erfahrung als System Engineer, Cloud Engineer oder IT-Architektin beziehungsweise IT-Architekt - Strukturierte, lösungsorientierte und selbstständige Arbeitsweise sowie ausgeprägte Kommunikations- und Präsentationsfähigkeiten - Sehr gute Deutsch- und Englischkenntnisse in Wort und Schrift - Von Vorteil sind Microsoft-Zertifizierungen sowie Erfahrung mit methodischen Frameworks im Microsoft-Umfeld Benefits - S-GE bietet Ihnen eine faszinierende Tätigkeit an der Schnittstelle zwischen Politik und internationaler Wirtschaft in einem modernen Arbeitsumfeld im Herzen von Zürich. - S-GE setzt auf Flexibilität in der Arbeitsgestaltung und fördert ein kollegiales Umfeld sowie die fachliche und persönliche Entwicklung ihrer Mitarbeitenden. Company Description Wir freuen uns auf Ihre Online-Bewerbung. Allfällige Fragen beantworten wir Ihnen gerne unter hr@s-ge.com. Bitte beachten Sie, dass wir Ihre Bewerbung jedoch ausschliesslich über unser Online-Tool entgegennehmen können.

Switzerland
Ascend Technologies logo

Senior Infrastructure Systems Engineer

Ascend Technologies

Innovation & Technology Enabling Business Growth

Full TimeRemoteTeam 201-500Since 2020H1B Sponsor

• Provide escalated support and operational maintenance to client environments. • Work on project-based initiatives for clients, including upgrades to existing infrastructure and deployment of new technology. • Serve as escalation point for the Ascend Technologies Network Operations Center (NOC) and support engineers

United States
$100K / year