Welcome to composable healthcare.
Staff DevSecOps Engineer
Location
United States
Posted
3 days ago
Salary
$190K - $199K / year
Seniority
Lead
Job Description
Redox
- Champion a security-first mindset within Engineering to help set the security posture of our platform infrastructure: supply chain hardening, secrets management, IAM/IRSA, container image integrity, and vulnerability remediation across our AWS/EKS environment
- Design and build automation that makes compliance evidence continuous, not manual, translating HITRUST controls into passing tests and structured outputs that flow into our compliance tooling (Vanta)
- Embed security into the platform by default: make the secure path the easy path for application engineers, through guardrails, policy-as-code, and well-documented patterns
- Partner with our Security team to translate threat assessments and control gaps into engineering proposals with clear scope, tradeoffs, and recommended paths forward
- Lead platform security initiatives from design to operationalization: requirements, technical design, code and code review, deployment, and documentation
- Contribute hands-on to the broader platform (CI/CD pipelines, container orchestration, observability, and developer tooling); this is an IC role, not a governance role
- Participate in on-call rotation and own the systems you build, including production incidents
- Mentor engineers on security practices and raise the security baseline across the team
Job Requirements
- 8+ years in cloud-native infrastructure or platform engineering roles, with demonstrable progression in technical scope and leadership
- Hands-on expertise with AWS and Kubernetes (EKS) — you've operated these in production, not just deployed them
- Security depth: you understand supply chain risk, IAM/zero-trust patterns, secrets management, and vulnerability management at the platform level — not just as concepts
- Experience translating compliance frameworks (HITRUST, SOC 2, or equivalent) into concrete engineering controls — bonus if you've worked with Vanta or similar compliance automation tooling
- Fluency in infrastructure-as-code (Terraform/HCL) and at least one scripting language (Python, Go, or Node.js/TypeScript)
- Experience with modern CI/CD systems and the security surface they introduce — pipeline integrity, artifact signing, registry controls
- Strong written communication and a track record of driving technical decisions in async, remote environments: you write proposals, not just Slack messages, and turn them into impact
Benefits
- 100% remote-first culture (must be based in the US)
- Unlimited Flexible Time Off
- 15+ Observed Holidays
- Rest & R^Charge days (guaranteed a 3-day weekend each month)
- R^Charge (6 weeks paid sabbatical + stipend)
- 401(k) match of 50% on up to 8%, starting on Day 1
- Medical/Dental/Vision Benefits on Day 1
- HSA & FSA, Life, Disability, Medical Travel & Employee Assistance Program
- Paid Parental Leave (16 weeks)
- Productivity Stipend & Wellness Fund
- Redox-issued MacBook
- Virtual and/or in-person Team & Company Events
- Stock Options
- Employee Referral Bonus Program
More DevOps Engineer Jobs
Senior Reliability Operations Engineer
Serve Robotics
Meet the future of sustainable, self-driving delivery.
- Serve as the primary incident lead during your region’s daytime hours, coordinating technical investigations, centralizing communication, and engaging the appropriate engineering and SRE teams when escalation is required.
- Respond to escalations from Tier 1 support, using runbooks, metrics, logs, and system diagnostics to investigate and remediate issues or determine when escalation to Tier 3 is necessary.
- Develop and update runbooks, workflows, and operational documentation to ensure consistent and reliable responses to recurring issues, collaborating with product teams to expand coverage over time.
- Write, maintain, and enhance automation scripts and tools that streamline common remediation steps, improve response times, and reduce manual operational overhead.
- Use metrics, logs, and tracing tools (Grafana/Prometheus, GCP Monitoring, OpenTelemetry) to proactively identify problems, validate system behavior, and support continuous improvement of detection mechanisms.
- Act as the central point of communication during active incidents, ensuring timely updates and clear routing to the correct product engineering and SRE stakeholders.
- Collaborate with reliability and product teams to share insights, recommend improvements, and help refine processes that enhance the stability and operability of our systems.
- Participate in a shared weekend on-call rotation to help maintain operational coverage for production systems, responding to incidents and escalations as needed and coordinating with engineering teams when issues arise.
- Help establish operational best practices, refine workflows, and prepare the foundation for a broader reliability operations function.
- Lead incident investigations during your region’s daytime hours, providing timely updates, escalating appropriately, and supporting senior engineers leading the response.
- Respond to escalations from Tier 1 support using established runbooks, metrics, logs, and diagnostics to remediate issues or escalate to Tier 3 when needed.
- Update runbooks and operational documentation based on new issues, discoveries, and feedback, ensuring clarity and consistency across all procedures.
- Run existing automations and collaborate with senior team members to enhance tooling and scripts that streamline troubleshooting and remediation tasks.
- Use observability tools such as Grafana/Prometheus, GCP Monitoring, and OpenTelemetry to interpret metrics, logs, and traces, helping identify anomalies and validate system performance.
- Provide concise, accurate updates during incidents, ensuring information reaches the correct engineering and SRE contacts and supporting structured incident coordination.
- Participate in discussions around root causes, share operational insights, and contribute to process improvements that enhance system stability and supportability.
- Participate in a shared weekend on-call rotation to help maintain operational coverage for production systems, responding to incidents and escalations as needed and coordinating with engineering teams when issues arise.
- Proactively strengthen workflows, adopt best practices, and build the foundation of the Reliability Operations function as it evolves.
- Own the DevOps roadmap across CI/CD, infrastructure automation, release workflows, and environment management with a clear focus on engineering velocity, reliability, and operational efficiency.
- Lead the DevOps team while partnering closely with Engineering, SRE, Platform, and Product teams to remove production bottlenecks and raise automation standards across the organization.
- Cultivate resilient multi-cloud practices across AWS and GCP, driving infrastructure as code (IaC), Kubernetes-based delivery, and modern operational tooling.
- Strengthen observability, uptime discipline, and incident response while leading cloud cost and capacity optimization efforts as our platform scales.
- Cultivate team capabilities, manage project plans, and build a high-accountability engineering culture that scales fluidly with company needs.
- Evaluate and implement AI-powered DevOps tools to improve deployment, monitoring, and incident response processes.
- Leverage AI and machine learning solutions for predictive analytics, anomaly detection, capacity planning, and root-cause analysis.
- Establish governance, security, and compliance standards for AI-enabled infrastructure and operations.
- Monitor emerging AI technologies and identify opportunities to improve operational efficiency and reduce manual effort.
Senior DevOps – Cloud & Bare Metal Infrastructure
Vivo (Telefônica Brasil)
Through connection, we want you to discover new points of view and enjoy everything that really matters.
- Manage Bare Metal physical environments and provide recommendations for migration or adjustments to Cloud environments
- Deploy and support servers, storage and virtualization
- Plan and execute corporate connectivity projects
- Troubleshoot and optimize networks and links
- Implement redundancy and high-availability solutions
- Manage infrastructure
- Implement and scale Cloud and Multi-cloud infrastructures
- Perform advanced connectivity troubleshooting
- Ensure infrastructure security and network segmentation
- Support integration between physical and cloud environments
- Conduct capacity planning and optimize resource utilization
- Support operational efficiency initiatives and infrastructure cost reduction
- Monitor physical resource consumption and connectivity



