Zscaler logo
Zscaler

Zscaler helps leading organizations in 180+ countries securely transform their networks and applications for a mobile and cloud-first world. Founded in 2008, th

Sr. Staff Site Reliability Engineer-Federal, Security Clearance

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 8,697Since 2007Company Site

Location

Virginia

Posted

5 days ago

Salary

$140K - $200K / year

Seniority

Senior

English

Job Description

Sr. Staff Site Reliability Engineer-Federal, Security Clearance

Zscaler

About Zscaler Zscaler accelerates digital transformation to ensure our customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, we are constantly pushing the envelope, leveraging the world’s largest security data lake to power our cloud-native Zero Trust Exchange platform. This innovation protects our customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location. Here, impact in your role matters more than title and trust is built on results. We say, impact over activity. We seek innovators who actively use AI to amplify their impact and who thrive in an environment where we leverage intelligent systems to stay ahead of evolving threats. We believe in transparency and value constructive, honest debate—we’re focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession, collaboration, ownership, and accountability. We value high-impact, high-accountability with a sense of urgency where you’re enabled to do your best work and embrace your potential. If you’re driven by purpose, thrive on solving complex challenges, and want to be part of the team that’s helping to secure the AI age, we invite you to bring your talents to Zscaler and help shape the future of cybersecurity. Role We are looking for a Sr. Staff Site Reliability Engineer (Federal) to join our Government Cloud team. This is a fully onsite role based in Crystal City, Virginia, reporting to the Manager, Site Reliability Engineering. You will join the team responsible for building the world’s largest cloud security platform, enabling organizations worldwide to harness speed and agility. In this critical role, you will maintain our commitment to security by managing operations within classified environments, ensuring our multitenant architecture remains the leader in cloud security. What you’ll do (Role Expectations) - Manage operational tasks for products in US Government classified environments, including deployments, on-call duties, incident management, and participation in regular deployment syncs - Oversee all cloud infrastructure components such as AWS, private cloud environments, containers, and VMs to ensure stability and scalability - Develop scripts, containerized services, and monitoring mechanisms to automate operations tasks and minimize service disruption - Build new and enhance existing services within classified environments while driving DevOps best practices through documentation and escalation management - Provide 24x7 coverage including night and holiday shifts within a SCIF environment to support critical government missions Who You Are (Success Profile) - You thrive in ambiguity. You’re comfortable building the path as you walk it. You thrive in a dynamic environment, seeing ambiguity not as a hindrance, but as the raw material to build something meaningful. - You act like an owner. Your passion for the mission fuels your bias for action. You operate with integrity because you genuinely care about the outcome. True ownership involves leveraging dynamic range: the ability to navigate seamlessly between high-level strategy and hands-on execution. - You are a problem-solver. You love running towards the challenges because you are laser-focused on finding the solution, knowing that solving the hard problems delivers the biggest impact. - You are a high-trust collaborator. You are ambitious for the team, not just yourself. You embrace our challenge culture by giving and receiving ongoing feedback—knowing that candor delivered with clarity and respect is the truest form of teamwork and the fastest way to earn trust. - You are a learner. You have a true growth mindset and are obsessed with your own development, actively seeking feedback to become a better partner and a stronger teammate. You love what you do and you do it with purpose. What We’re Looking for (Minimum Qualifications) - Active Secret Security Clearance with the ability to maintain it throughout employment - Bachelor’s degree in Computer Science or a related field with 7+ years of Site Reliability Engineering experience in both Operations and Engineering environments - Proficiency in Linux administration, network troubleshooting, and automation tools like Ansible and Terraform - Strong technical skills in Python coding and container-based architectures including AWS ECS and Kubernetes - Experience in monitoring activities such as vulnerability scanning, patch management, and reporting, with expertise in virtualization, web security, and networking protocols What Will Make You Stand Out (Preferred Qualifications) - Experience working within air-gapped and classified environments managing monthly monitoring programs - Familiarity with High/Moderate FedRAMP authorization levels and compliance requirements - Possession of Information Assurance Technician Level 2 certification or Top Secret security clearance #LI-Onsite #LI-YC2 Zscaler’s salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training. The base salary range listed for this full-time position excludes commission/ bonus/ equity (if applicable) + benefits. Base Pay Range $140,000—$200,000 USD At Zscaler, we are committed to building a team that reflects the communities we serve and the customers we work with. We foster an inclusive environment that values all backgrounds and perspectives, emphasizing collaboration and belonging. Join us in our mission to make doing business seamless and secure. Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including: - Various health plans - Time off plans for vacation and sick time - Parental leave options - Retirement options - Education reimbursement - In-office perks, and more! Learn more about Zscaler’s Future of Work strategy, hybrid working model, and benefits here. By applying for this role, you adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines. Zscaler is committed to providing equal employment opportunities to all individuals. We strive to create a workplace where employees are treated with respect and have the chance to succeed. All qualified applicants will be considered for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status, or any other characteristic protected by federal, state, or local laws. See more information by clicking on the Know Your Rights: Workplace Discrimination is Illegal link. Pay Transparency Zscaler complies with all applicable federal, state, and local pay transparency rules. Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Inspired Testing logo

DevOps Engineer, GCP

Inspired Testing

Highly skilled professionals with attention to detail, using latest technologies, to ensure usability and reliability.

DevOps Engineer5 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

• Ensure reliability, uptime, and performance across GCP environments. • Implement SRE and DevOps best practices with strong focus on automation and scalability. • Build and optimize CI/CD pipelines using GCP-native tools. • Lead observability initiatives using Grafana, Prometheus, Stackdriver. • Troubleshoot production incidents and deliver root-cause fixes. • Apply Infrastructure as Code (Terraform, Deployment Manager). • Partner with cross-functional teams to maintain platform stability. • Champion a proactive, blameless incident management culture. • Drive continuous improvement through emerging cloud and automation technologies.

United Kingdom

Role Description In your role as DevOps/ Infrastructure Admin, you will: - Analyse, plan, and develop infrastructures for complex projects - Be responsible for IT administration tasks - Handle maintenance and continuous improvement of monitoring environments for software systems - Design, implement, and maintain CI/CD pipelines to inform testers and developers about deployment status - Analyse and resolve issues across the system landscape - Design IT architectures, including evaluating software solutions and conceptualising new systems - Solve critical problems affecting highly available development, test, and production systems Qualifications - A degree in a technical field (e.g. Computer Science or equivalent experience) - Experience building and automating complex environments using Infrastructure as Code - Strong knowledge of backend technologies (Docker, Kubernetes) - Experience with build tools and continuous delivery systems (e.g. Bitbucket Pipelines, Azure DevOps Server) - Excellent teamwork and communication skills in English and German - An independent, structured working style and interest in dynamic team environments - Valid work permit in the EU Benefits - A motivated and innovative team with flat hierarchies and open communication - Full flexibility through 100% remote work - Flexible working hours - A wide variety of tasks in an innovative, future-oriented industry - Strong development opportunities and the prospect of a permanent position - An inclusive and supportive company culture

Germany
Full TimeRemoteTeam 5,001-10,000Since 1999H1B Sponsor

• Own and evolve Azure infrastructure using Terraform — networking, compute, managed services, and secrets • Manage Kubernetes clusters and Helm-based deployments for a distributed microservices platform • Build and maintain CI/CD pipelines (GitHub Actions / Azure DevOps) that keep deployments fast and safe • Define and improve observability: metrics, logging, alerting, and tracing across services • Lead incident response — drive resolution, write postmortems, and follow through on reliability improvements • Manage PostgreSQL infrastructure: backups, replication, failover, and query performance • Collaborate closely with engineers to improve local development environments and deployment workflows • Enforce security best practices: secrets management, network policies, access controls, and vulnerability scanning

California + 4 moreAll locations: California | Colorado | New York | Oregon | Washington
$131.3K - $175K / year

Role Description About The Role The infra team is small, technically deep, and owns the full stack from cloud provisioning and k8s operators to workflow orchestration and core services. We ship frequently and operate at a scale that makes reliability a first-class engineering problem. As part of the Infra team, you'll work on the reliability and performance of our platform end-to-end. You'll work closely with infra and product engineers to instrument systems, establish SLOs, tune autoscaling, and drive incident process maturity. The work is technical and hands-on: you'll write code, dig into k8s internals, and hold yourself accountable to positive production outcomes. What You'll Own and Drive: - Own production reliability across our Knative, KEDA, and Kubernetes-based document processing platform. - Proactively detect degradation, diagnose root causes, and ship fixes. - Work on observability: end-to-end tracing, latency SLOs, capacity dashboards, and alerting that finds problems before customers do. - Load testing and capacity planning: establish throughput benchmarks, detect performance regressions before they reach production. - Support fleet operations: contribute to the safe, automated upgrade process for our growing fleet of production systems. Qualifications - 4+ years of SRE, platform engineering, or infrastructure engineering in a production Kubernetes environment. - Deep operational knowledge of Kubernetes. - Demonstrated experience diagnosing and resolving real production performance issues: resource saturation, timeout failures, scheduling problems, graceful shutdown gaps. - Enough Python or Go to read service code, trace a bug to root cause, and write a targeted fix. Benefits - Medical, dental, and vision coverage effective the 1st of the month following your start date. - Life and disability insurance. - Unlimited PTO and flexible parental leave. - 401(k) with company match. - Equity. - $500 work from home stipend. - $70/month internet reimbursement. - Team/company offsites throughout the year.

Worldwide