PlexTrac

Go Beyond Pentest Management and Reporting

Senior DevSecOps Engineer

DevOps EngineerDevOps EngineerFull Time Remote SeniorTeam 51-200Since 2016H1B SponsorCompany Site LinkedIn

Location

Idaho

Posted

109 days ago

Salary

$140K - $170K / year

Seniority

Senior

Bachelor Degree5 yrs expEnglishAnsible Cloud Google Cloud Platform Kubernetes Linux Python SDLC Terraform Go

Job Description

• Cloud & Infrastructure Security - Write and maintain Infrastructure as Code (IaC) with secure defaults, ensuring least privilege access and robust cloud configurations. • Vulnerability Management - Hunt for weaknesses, perform threat modeling, prioritize remediation, and guide engineering teams on how to fix discovered flaws. • Incident Response & Monitoring - Monitor live systems, investigate security anomalies, and respond to breaches. • Develop, deploy, and maintain Infrastructure-as-Code (IaC) in a GCP cloud-based environment. • Lead the development and enforcement of security architecture and operational best practices. • Establish monitoring, alerting, and incident response strategies across environments. • Define and execute on security roadmaps (e.g., threat modeling, vulnerability scanning, IAM policies). • Partner with developers to shift security and reliability left into the SDLC. • Support compliance and audit initiatives (SOC2, ISO27001). • Develop and maintain automated CI/CD pipelines for DBs, Servers, containers, and applications using DevSecOps tools to include Terraform, Ansible, GitHub, ArgoCD. • Develop integration interfaces using Python, Bash and Go. • Deploy and maintain complex modern cloud architectures. • Create automated testing plans for infrastructure and applications. • Create and update technical documentation (e.g. user guides, infrastructure diagrams). • Work across infrastructure that contains both Linux and Windows. • Work and communicate effectively in a group environment with technical and non-technical, management and customer both written and verbally. • Utilize robust troubleshooting skills. • Instill and apply solid engineering rigor, to include configuration management, testing. • Develop/engineer as part of an Agile team.

Job Requirements

5+ years of experience in DevOps, SRE, or DevSecOps roles, with increasing leadership or ownership
Deep knowledge of cloud infrastructure, with a focus on security, scalability, and cost-efficiency
Strong experience with infrastructure-as-code (Terraform, Ansible)
Fluency in CI/CD automation (GitHub Actions, ArgoCD, etc.)
Strong understanding of security fundamentals: identity and access management, secrets management, encryption, container security, etc.
Familiarity with compliance frameworks like SOC2 or ISO27001
Comfortable writing code and automation scripts (e.g., Python, Bash, Go)
A strategic mindset paired with startup scrappiness—you can zoom out and drive systems-level thinking, and also dive in and ship
Experience with Kubernetes, service mesh (e.g., Istio), and zero-trust architecture
History of leading incident response or large-scale reliability improvements
Strong communication skills across engineering and non-technical stakeholders.

Benefits

Competitive wellness benefits including Medical, Dental, Vision, Disability and Life
401(k)
Paid Parental Leave
Flexible work schedule - WFH, WFO
Flexible Time Off
World Class Culture

Related Categories

DevOps Engineer

Related Job Pages

DevOps Engineer Jobs in Idaho Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

DevSecOps Engineer

Swarm Aero

DevOps Engineer109 days ago

Other Remote

Company Site

• Architect and own software and dev ops infrastructure for a Command & Control (C2) system designed to control multi-domain unmanned systems • Design and implement secure network architectures across partner and government owned environments • Collaborate with partners on cyber security accreditation • Accelerate development through CI/CD improvements, cloud development environments, and integrations

Distributed Systems Docker Firewalls Kubernetes Linux

View details: DevSecOps Engineer

United States

$150K - $250K / year

Apply

Job Closed

Cloud Operations Engineer

2innovate

Enabling Tomorrow’s Payments Ecosystem. Deliver the Experiences your Customers and Partners Desire

DevOps Engineer109 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

• Automatizar tareas y procesos operativos utilizando herramientas de scripting e Infraestructura como Código (IaC). • Resolver incidentes de infraestructura, colaborando con otros equipos para cumplir con los SLAs acordados. • Configurar y gestionar conexiones seguras (VPNs) entre redes locales y la nube. • Realizar despliegues en Frame desde cero y actualizar componentes gestionados (AWS-GCP) • Implementar políticas de seguridad, configurar firewalls, IDS/IPS, y gestionar identidades y accesos. • Definir alertas, construir dashboards y supervisar el rendimiento de la infraestructura para identificar cuellos de botella y asegurar la disponibilidad.

AWS DNS Docker Firewalls GCP Kubernetes Python Terraform

View details: Cloud Operations Engineer

Colombia

Apply

Job Closed

Senior Site Reliability Engineer, Hawaii

Onebrief

Software for rapid military planning: make planning fast enough for today's environment

DevOps Engineer109 days ago

Other RemoteTeam 1-10Since 2019H1B No Sponsor

Company Site LinkedIn

About Onebrief Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. By transforming this work, Onebrief makes the staff as a whole superhuman - meaning faster, smarter, and more efficient. We take ownership, seek excellence, and play to win with the seriousness and camaraderie of an Olympic team. Onebrief operates as an all-remote company, though many of our employees work alongside our customers at military commands around the world. Founded in 2019 by a group of experienced planners, today, Onebrief’s team spans veterans from all forces and global organizations, and technologists from leading-edge software companies. We’ve raised $320m+ from top-tier investors, including Battery Ventures, General Catalyst, Sapphire Ventures, Insight Partners, and Human Capital, and today, Onebrief is valued at $2.15B. With this continued growth, Onebrief is able to make an impact where it matters most. Security Clearance, Location, and Onsite Notice: This role requires regularly working on-site at customer locations on Oahu, Hawaii, specifically Camp H.M. Smith and Joint Base Pearl Harbor-Hickam. If you are not currently within commuting distance, you must be willing to relocate (note that Onebrief will provide relocation assistance). Active Top Secret Clearance required; SCI eligibility is a plus. About The Role We are hiring a Site Reliability Engineer to join our Infrastructure & Security team. You’ll work closely with fellow SREs, security, and customer success. You will be the first line of support for our mission critical deployments, and responsible for ensuring best-in-class service quality and issue resolution. You will work in both on-premise DoD environments and AWS cloud environments. Your lessons from the field will shape how our team works, from policy to implementation. In addition to working at the customer, you will contribute directly to solutions that increase stability, performance, and security of our deployments, and improve the overall experience of deploying and managing Onebrief on premise. About You You care deeply about reliability and treat it as a core feature of any application or platform, with a bias toward “reliability over novelty.” You think about infrastructure and operability as products to be automated, well-documented, and continuously improved, and you aim to leave systems easier to operate than you found them. You are equally comfortable leading a post-incident review, or diving into a kubectl shell to triage a complex production issue. You don't just fix problems; you translate constraints and failure modes into clear, automated guardrails and scalable, resilient architecture. For you, robust monitoring, actionable alerting, and insightful runbooks are core parts of the engineering process, not afterthoughts. You mentor others, fostering a culture of blameless postmortems and proactive reliability. You collaborate naturally with application and platform teams, helping them move quickly but safely by building the tools, processes, and observability that make "fast recovery" a reality. What You'll Do You'll own the reliability, scalability, and security of the production application and/or platform. You will do this by: Implementing a World-Class Observability Platform: Design, implement, and manage our monitoring, logging, and alerting stack (e.g., Prometheus, Loki, Alloy, and Grafana). You won't just track metrics; you'll create the actionable insights and automated alerting that allow teams to identify and resolve issues before they impact users. Defining and Upholding Reliability: Define, measure, and own alerting that feeds into our Service Level Indicators (SLIs) and Service Level Objectives (SLOs), increasing trust internally and externally. You will be the organization's expert on what it means for our systems to be reliable and how to measure it. Leading Incident Response: Act as the incident responder and potentially incident commander during critical incidents who will lead blameless post-mortems / After Action Reviews (AARs) that identify true root causes and drive automated, long-term solutions to prevent recurrence. Automating for Scale and Security: Partner with platform engineers to design, build, and manage secure, resilient Kubernetes clusters and cloud/on-prem environments using Infrastructure-as-Code (Terraform, Ansible). You will embed security and compliance controls (RMF, STIGs) directly into this automation. Eliminating Toil and Scaling the Team: Proactively identify and eliminate operational toil by building automation. You will partner with other teams to share best practices for air-gapped environments and support their readiness for production. What We Look For An active Top Secret clearance 5+ years in Platform, DevOps, or Site Reliability Engineering with an infrastructure and operations focus. Proven partner to DevOps/Platform and application teams; collaborates well across functions and shares context openly. A deep understanding of incident response processes, with experience conducting thorough root cause analyses and driving continuous improvement. Technical expertise Infrastructure as Code: Terraform (or CloudFormation), Ansible. Containers and orchestration: Kubernetes design, deployment, and operations. CI/CD: experience building and maintaining pipelines (GitLab CI/CD, Jenkins, GitHub Actions). Scripting: proficiency with at least one of Python, Go, or Bash. Cloud: Familiarity with AWS or AWS GovCloud. Observability: Grafana stack, ELK stack, or Datadog. Networking fundamentals: core protocols and secure configurations. Bonus points (nice to have) Experience in DoD environments and compliance frameworks (RMF, STIGs, ICD 503). GitOps practices and toolchains. Security‑minded design for sensitive environments. Experience designing and implementing meaningful SLIs/SLOs (including error budgets) for complex, distributed systems. Familiarity with on‑prem virtualization(VMware, Proxmox, Nutanix, Hyper-V, etc). Service mesh exposure (Istio, Linkerd). Relevant certifications (e.g., AWS DevOps Engineer, CKA/CKAD). Active Security+ or another DoD 8570.01-approved security credential, or the ability to obtain the valid credentials within 3 months of employment. Notice to Third Party Recruitment Agencies Please note that Onebrief does not accept unsolicited resumes from recruiters or employment agencies. In the absence of an executed Recruitment Services Agreement, there will be no obligation to any referral compensation or recruiter fee. In the event a recruiter or agency submits a resume or candidate without an agreement Onebrief explicitly reserves the right to pursue and hire those candidate(s) without any financial obligation to the recruiter or agency. Any unsolicited resumes, including those submitted to hiring managers, shall be deemed the property of Onebrief.

Ansible AWS Docker Helm Kubernetes Linux Terraform VMware

View details: Senior Site Reliability Engineer, Hawaii

Hawaii

$180K - $220K / year

Apply

Job Closed

Senior Site Reliability Engineer

OfficeSpace Software

Create a better place for everyone

DevOps Engineer109 days ago

Other RemoteTeam 201-500H1B No Sponsor

Company Site LinkedIn

About OfficeSpace: OfficeSpace is the AI workplace management platform that helps teams plan, connect, and perform in the modern workplace. As a performance-based, PE-backed company, we hire based on merit and a willingness to do what it takes to succeed long-term. You’re a great fit for the role if you’re entrepreneurial, passionate, motivated by building at light speed, and an Agentic AI early adopter. Our world-class teams operate in the US, Canada, and Costa Rica in a culture of trust, respect, growth, and impact. Role Summary: You own the performance, reliability, and cost efficiency of OfficeSpace’s production platform at scale. As a Senior Site Reliability Engineer, you shape how our systems run—fast, resilient, and predictable—while leading the shift from manual operations to AI-assisted reliability engineering. We provide the platform. You make it perform. What You’ll Do: - Drive measurable improvements in latency, throughput, and availability across a large-scale production environment. - Own system performance—from Linux internals to Kubernetes scheduling—and eliminate bottlenecks before customers feel them. - Define and enforce SLIs, SLOs, and error budgets that balance speed, reliability, and growth. - Partner with application engineers to profile code paths, improve execution efficiency, and harden services under real load. - Lead database performance optimization across queries, indexing, replication, and workload isolation. - Design and oversee AI-assisted load testing, stress testing, and capacity planning workflows. - Guide the migration from monolithic deployments to multi-tenant Kubernetes platforms. - Reduce infrastructure spend through architectural decisions, right-sizing, and intelligent scaling strategies. - Build and supervise automation for infrastructure provisioning, configuration management, and observability. - Set clear operational standards for reliability, performance, and incident response—and raise the bar for how we run production. What You Bring: - 7+ years operating and evolving large-scale production systems. Deep Linux systems expertise with hands-on performance tuning across CPU, memory, disk, and networking. - Strong Python skills for automation, tooling, and AI-assisted systems workflows. - Production experience with Ruby/Rails ecosystems, including Puma and Sidekiq. - Proven ability to diagnose and resolve complex database performance issues (MySQL/MariaDB or PostgreSQL). - Advanced Kubernetes experience—workload sizing, scheduling, and multi-tenant operations. - Infrastructure-as-code mastery using Terraform and Terragrunt. - Experience with configuration management tools such as Puppet or Ansible. - Strong observability instincts across metrics, logs, and traces using tools like Prometheus, Grafana, Datadog, or ELK. - AI fluency—comfortable supervising AI agents for analysis, testing, and reporting, and validating their outputs. - A builder mindset. You move fast, take ownership, and raise standards. Preferred Background: - Scaling and refactoring monolithic applications under real production load - Extracting databases or stateful components from monoliths - Apache and Nginx tuning at scale - Redis performance optimization and operational management - CI/CD systems and GitOps workflows, including ArgoCD - Cloud cost optimization and FinOps-aligned operational practices Why OfficeSpace? - High-Performance Culture: At OfficeSpace, we believe in the power of accountability, focus, and drive. Our A-Player team members work together to deliver measurable, meaningful results. We recognize and reward those who push boundaries and achieve excellence. - Ownership and Accountability: We trust our employees to take full ownership of their roles, providing the autonomy to innovate and the support to succeed. We seek individuals who are self-motivated and thrive in an environment where they can drive impactful outcomes. - Technology-Forward: As a company invested in cutting-edge technology, we integrate AI and other advanced solutions across our platform to enhance productivity, customer experience, and process efficiency. Our team members are excited by the potential of AI and proactively explore ways it can drive our success. - Growth Mindset: Continuous learning and improvement are integral to our culture. We encourage our team to embrace challenges, seek knowledge, and develop both personally and professionally. - Innovation and Agility: We foster a dynamic, fast-paced environment where fresh ideas and bold solutions are celebrated. We embrace change and thrive on turning challenges into opportunities, with a team that is agile, proactive, and resilient. - Collaborative, Results-Driven Environment: We value purposeful collaboration that leads to shared success and stronger results. While our team members are independent, they recognize the value of working together to drive our mission forward. - Competitive Benefits and Rewards: OfficeSpace offers comprehensive and competitive benefits packages globally, designed to support our team’s health, well-being, and financial security. We invest in our people so they can excel. OfficeSpace is committed to building and promoting a diverse workforce and celebrates the unique qualities that individuals of various backgrounds and experiences offer. We are committed to basing all employment-related decisions upon valid job-related factors without regard to race, color, sex (including pregnancy, sexual orientation, and gender identity), age, religion, national origin, genetic information, military status, veteran status, physical or mental disability, or any other status protected by law.

Ansible Datadog Grafana Kubernetes Linux Prometheus Python Ruby Terraform

View details: Senior Site Reliability Engineer

United States

Apply

Job Closed

Senior DevSecOps Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

DevSecOps Engineer

Cloud Operations Engineer

Senior Site Reliability Engineer, Hawaii

Senior Site Reliability Engineer