Job Closed

This listing is no longer active.

Swarm Aero

DevSecOps Engineer

DevOps EngineerDevOps EngineerOther Remote Senior Company Site

Location

United States

Posted

139 days ago

Salary

$150K - $250K / year

Seniority

Senior

Bachelor Degree4 yrs expEnglishDistributed Systems Docker Firewalls Kubernetes Linux

Job Description

• Architect and own software and dev ops infrastructure for a Command & Control (C2) system designed to control multi-domain unmanned systems • Design and implement secure network architectures across partner and government owned environments • Collaborate with partners on cyber security accreditation • Accelerate development through CI/CD improvements, cloud development environments, and integrations

Job Requirements

Degree in Computer Science or equivalent experience
4+ years building and managing production infrastructure for complex distributed systems
Extensive experience using containerization and orchestration tools both in cloud and on-premise (Docker, Podman, Kubernetes, ECS, etc)
Deep familiarity with configuring networks including Linux network configuration (firewalls, SE Linux, containerized networking), SSH tunnels, VPNs, middleware, and more
Working knowledge of DoD standard practices for cybersecurity/accreditation, encryption, and general system configuration
Deep experience bringing software through the defense / government security requirements (Risk Management Framework, FedRAMP, etc)
Experience improving developer velocity through tooling and automation
Background in GitOps practices
Familiarity with edge compute deployments and embedded systems

Benefits

Meaningful equity stake in a high-growth defense technology company
Competitive base salary commensurate with experience
Comprehensive benefits including medical, dental, vision, and 401k
PTO and Paid Sick Time
Monthly Wellness Stipend
Daily catered lunch to office
Paid Parental leave
Flexible work arrangement - remote/hybrid with regular collaboration in Oxnard & Seattle
Direct impact opportunity - be a key leader in building a critical technology for national security
World-class team - work alongside exceptional engineers and operators solving hard problems

Related Categories

DevOps Engineer

Related Job Pages

More Remote Jobs

More DevOps Engineer Jobs

Cloud Operations Engineer

2innovate

Enabling Tomorrow’s Payments Ecosystem. Deliver the Experiences your Customers and Partners Desire

DevOps Engineer139 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Automatizar tareas y procesos operativos utilizando herramientas de scripting e Infraestructura como Código (IaC). • Resolver incidentes de infraestructura, colaborando con otros equipos para cumplir con los SLAs acordados. • Configurar y gestionar conexiones seguras (VPNs) entre redes locales y la nube. • Realizar despliegues en Frame desde cero y actualizar componentes gestionados (AWS-GCP) • Implementar políticas de seguridad, configurar firewalls, IDS/IPS, y gestionar identidades y accesos. • Definir alertas, construir dashboards y supervisar el rendimiento de la infraestructura para identificar cuellos de botella y asegurar la disponibilidad.

AWS DNS Docker Firewalls GCP Kubernetes Python Terraform

View details: Cloud Operations Engineer

Colombia

Apply

Job Closed

Senior Site Reliability Engineer, Hawaii

Onebrief

Software for rapid military planning: make planning fast enough for today's environment

DevOps Engineer139 days ago

Other RemoteTeam 1-10Since 2019H1B No Sponsor

Company Site LinkedIn

About Onebrief Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. By transforming this work, Onebrief makes the staff as a whole superhuman - meaning faster, smarter, and more efficient. We take ownership, seek excellence, and play to win with the seriousness and camaraderie of an Olympic team. Onebrief operates as an all-remote company, though many of our employees work alongside our customers at military commands around the world. Founded in 2019 by a group of experienced planners, today, Onebrief’s team spans veterans from all forces and global organizations, and technologists from leading-edge software companies. We’ve raised $320m+ from top-tier investors, including Battery Ventures, General Catalyst, Sapphire Ventures, Insight Partners, and Human Capital, and today, Onebrief is valued at $2.15B. With this continued growth, Onebrief is able to make an impact where it matters most. Security Clearance, Location, and Onsite Notice: This role requires regularly working on-site at customer locations on Oahu, Hawaii, specifically Camp H.M. Smith and Joint Base Pearl Harbor-Hickam. If you are not currently within commuting distance, you must be willing to relocate (note that Onebrief will provide relocation assistance). Active Top Secret Clearance required; SCI eligibility is a plus. About The Role We are hiring a Site Reliability Engineer to join our Infrastructure & Security team. You’ll work closely with fellow SREs, security, and customer success. You will be the first line of support for our mission critical deployments, and responsible for ensuring best-in-class service quality and issue resolution. You will work in both on-premise DoD environments and AWS cloud environments. Your lessons from the field will shape how our team works, from policy to implementation. In addition to working at the customer, you will contribute directly to solutions that increase stability, performance, and security of our deployments, and improve the overall experience of deploying and managing Onebrief on premise. About You You care deeply about reliability and treat it as a core feature of any application or platform, with a bias toward “reliability over novelty.” You think about infrastructure and operability as products to be automated, well-documented, and continuously improved, and you aim to leave systems easier to operate than you found them. You are equally comfortable leading a post-incident review, or diving into a kubectl shell to triage a complex production issue. You don't just fix problems; you translate constraints and failure modes into clear, automated guardrails and scalable, resilient architecture. For you, robust monitoring, actionable alerting, and insightful runbooks are core parts of the engineering process, not afterthoughts. You mentor others, fostering a culture of blameless postmortems and proactive reliability. You collaborate naturally with application and platform teams, helping them move quickly but safely by building the tools, processes, and observability that make "fast recovery" a reality. What You'll Do You'll own the reliability, scalability, and security of the production application and/or platform. You will do this by: Implementing a World-Class Observability Platform: Design, implement, and manage our monitoring, logging, and alerting stack (e.g., Prometheus, Loki, Alloy, and Grafana). You won't just track metrics; you'll create the actionable insights and automated alerting that allow teams to identify and resolve issues before they impact users. Defining and Upholding Reliability: Define, measure, and own alerting that feeds into our Service Level Indicators (SLIs) and Service Level Objectives (SLOs), increasing trust internally and externally. You will be the organization's expert on what it means for our systems to be reliable and how to measure it. Leading Incident Response: Act as the incident responder and potentially incident commander during critical incidents who will lead blameless post-mortems / After Action Reviews (AARs) that identify true root causes and drive automated, long-term solutions to prevent recurrence. Automating for Scale and Security: Partner with platform engineers to design, build, and manage secure, resilient Kubernetes clusters and cloud/on-prem environments using Infrastructure-as-Code (Terraform, Ansible). You will embed security and compliance controls (RMF, STIGs) directly into this automation. Eliminating Toil and Scaling the Team: Proactively identify and eliminate operational toil by building automation. You will partner with other teams to share best practices for air-gapped environments and support their readiness for production. What We Look For An active Top Secret clearance 5+ years in Platform, DevOps, or Site Reliability Engineering with an infrastructure and operations focus. Proven partner to DevOps/Platform and application teams; collaborates well across functions and shares context openly. A deep understanding of incident response processes, with experience conducting thorough root cause analyses and driving continuous improvement. Technical expertise Infrastructure as Code: Terraform (or CloudFormation), Ansible. Containers and orchestration: Kubernetes design, deployment, and operations. CI/CD: experience building and maintaining pipelines (GitLab CI/CD, Jenkins, GitHub Actions). Scripting: proficiency with at least one of Python, Go, or Bash. Cloud: Familiarity with AWS or AWS GovCloud. Observability: Grafana stack, ELK stack, or Datadog. Networking fundamentals: core protocols and secure configurations. Bonus points (nice to have) Experience in DoD environments and compliance frameworks (RMF, STIGs, ICD 503). GitOps practices and toolchains. Security‑minded design for sensitive environments. Experience designing and implementing meaningful SLIs/SLOs (including error budgets) for complex, distributed systems. Familiarity with on‑prem virtualization(VMware, Proxmox, Nutanix, Hyper-V, etc). Service mesh exposure (Istio, Linkerd). Relevant certifications (e.g., AWS DevOps Engineer, CKA/CKAD). Active Security+ or another DoD 8570.01-approved security credential, or the ability to obtain the valid credentials within 3 months of employment. Notice to Third Party Recruitment Agencies Please note that Onebrief does not accept unsolicited resumes from recruiters or employment agencies. In the absence of an executed Recruitment Services Agreement, there will be no obligation to any referral compensation or recruiter fee. In the event a recruiter or agency submits a resume or candidate without an agreement Onebrief explicitly reserves the right to pursue and hire those candidate(s) without any financial obligation to the recruiter or agency. Any unsolicited resumes, including those submitted to hiring managers, shall be deemed the property of Onebrief.

Ansible AWS Docker Helm Kubernetes Linux Terraform VMware

View details: Senior Site Reliability Engineer, Hawaii

Hawaii

$180K - $220K / year

Apply

Job Closed

Senior Site Reliability Engineer

OfficeSpace Software

Create a better place for everyone

DevOps Engineer139 days ago

Other RemoteTeam 201-500H1B No Sponsor

Company Site LinkedIn

About OfficeSpace: OfficeSpace is the AI workplace management platform that helps teams plan, connect, and perform in the modern workplace. As a performance-based, PE-backed company, we hire based on merit and a willingness to do what it takes to succeed long-term. You’re a great fit for the role if you’re entrepreneurial, passionate, motivated by building at light speed, and an Agentic AI early adopter. Our world-class teams operate in the US, Canada, and Costa Rica in a culture of trust, respect, growth, and impact. Role Summary: You own the performance, reliability, and cost efficiency of OfficeSpace’s production platform at scale. As a Senior Site Reliability Engineer, you shape how our systems run—fast, resilient, and predictable—while leading the shift from manual operations to AI-assisted reliability engineering. We provide the platform. You make it perform. What You’ll Do: - Drive measurable improvements in latency, throughput, and availability across a large-scale production environment. - Own system performance—from Linux internals to Kubernetes scheduling—and eliminate bottlenecks before customers feel them. - Define and enforce SLIs, SLOs, and error budgets that balance speed, reliability, and growth. - Partner with application engineers to profile code paths, improve execution efficiency, and harden services under real load. - Lead database performance optimization across queries, indexing, replication, and workload isolation. - Design and oversee AI-assisted load testing, stress testing, and capacity planning workflows. - Guide the migration from monolithic deployments to multi-tenant Kubernetes platforms. - Reduce infrastructure spend through architectural decisions, right-sizing, and intelligent scaling strategies. - Build and supervise automation for infrastructure provisioning, configuration management, and observability. - Set clear operational standards for reliability, performance, and incident response—and raise the bar for how we run production. What You Bring: - 7+ years operating and evolving large-scale production systems. Deep Linux systems expertise with hands-on performance tuning across CPU, memory, disk, and networking. - Strong Python skills for automation, tooling, and AI-assisted systems workflows. - Production experience with Ruby/Rails ecosystems, including Puma and Sidekiq. - Proven ability to diagnose and resolve complex database performance issues (MySQL/MariaDB or PostgreSQL). - Advanced Kubernetes experience—workload sizing, scheduling, and multi-tenant operations. - Infrastructure-as-code mastery using Terraform and Terragrunt. - Experience with configuration management tools such as Puppet or Ansible. - Strong observability instincts across metrics, logs, and traces using tools like Prometheus, Grafana, Datadog, or ELK. - AI fluency—comfortable supervising AI agents for analysis, testing, and reporting, and validating their outputs. - A builder mindset. You move fast, take ownership, and raise standards. Preferred Background: - Scaling and refactoring monolithic applications under real production load - Extracting databases or stateful components from monoliths - Apache and Nginx tuning at scale - Redis performance optimization and operational management - CI/CD systems and GitOps workflows, including ArgoCD - Cloud cost optimization and FinOps-aligned operational practices Why OfficeSpace? - High-Performance Culture: At OfficeSpace, we believe in the power of accountability, focus, and drive. Our A-Player team members work together to deliver measurable, meaningful results. We recognize and reward those who push boundaries and achieve excellence. - Ownership and Accountability: We trust our employees to take full ownership of their roles, providing the autonomy to innovate and the support to succeed. We seek individuals who are self-motivated and thrive in an environment where they can drive impactful outcomes. - Technology-Forward: As a company invested in cutting-edge technology, we integrate AI and other advanced solutions across our platform to enhance productivity, customer experience, and process efficiency. Our team members are excited by the potential of AI and proactively explore ways it can drive our success. - Growth Mindset: Continuous learning and improvement are integral to our culture. We encourage our team to embrace challenges, seek knowledge, and develop both personally and professionally. - Innovation and Agility: We foster a dynamic, fast-paced environment where fresh ideas and bold solutions are celebrated. We embrace change and thrive on turning challenges into opportunities, with a team that is agile, proactive, and resilient. - Collaborative, Results-Driven Environment: We value purposeful collaboration that leads to shared success and stronger results. While our team members are independent, they recognize the value of working together to drive our mission forward. - Competitive Benefits and Rewards: OfficeSpace offers comprehensive and competitive benefits packages globally, designed to support our team’s health, well-being, and financial security. We invest in our people so they can excel. OfficeSpace is committed to building and promoting a diverse workforce and celebrates the unique qualities that individuals of various backgrounds and experiences offer. We are committed to basing all employment-related decisions upon valid job-related factors without regard to race, color, sex (including pregnancy, sexual orientation, and gender identity), age, religion, national origin, genetic information, military status, veteran status, physical or mental disability, or any other status protected by law.

Ansible Datadog Grafana Kubernetes Linux Prometheus Python Ruby Terraform

View details: Senior Site Reliability Engineer

United States

Apply

Job Closed

DevOps Engineer

Ziphire HR

We connect talent to companies using our innovative platform.

DevOps Engineer139 days ago

Other RemoteTeam 1-10H1B No Sponsor

Company Site LinkedIn

• Own CI/CD for Salesforce • Design, implement, and maintain CI/CD pipelines using Salesforce DX (sfdx/sf), Git (GitHub/GitLab/Azure Repos), and a release toolchain (e.g., Copado/Gearset/AutoRABIT/Jenkins). • Define branching strategy (e.g., trunk-based or Gitflow), merge policies, code review standards, and automated validation. • Manage sandboxes, scratch orgs, and packaging: org shape, seeding, data masking, and refresh cadence. • Establish source-driven development with unlocked/second-gen packages when appropriate. • Integrate unit and integration tests (Apex tests, Jest for LWC), metadata diffs, and deployment validation (check-only). • Set quality gates (test coverage thresholds, rulesets) and break-glass procedures. • Run release trains, release notes, approvals, and change calendars. • Maintain deployment runbooks and back-out plans. • Enforce least privilege, profile/permission set migration, Health Check, Shield (if licensed), secrets management, and audit trails. • Implement monitoring and alerting on deployment pipelines and platform limits; track MTTR, change failure rate, and deployment frequency. • Orchestrate data migrations (Data Loader, sfdmu, Bulk API 2.0) with idempotent scripts and mapping. • Coordinate with integration platforms (MuleSoft, Boomi, Kafka, Azure) for versioned, backward-compatible changes. • Champion metadata rationalization, dependency management, and legacy remediation. • Coach engineers/admins on DevOps best practices; drive continuous improvement.

Azure Jenkins Jest Apache Kafka

View details: DevOps Engineer

California

$100K - $130K / year

Apply

Job Closed

DevSecOps Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Cloud Operations Engineer

Senior Site Reliability Engineer, Hawaii

Senior Site Reliability Engineer

DevOps Engineer