Leidos is an innovation company rapidly addressing the world’s most vexing challenges in national security and health.
DevOps Lead
Location
United States
Posted
79 days ago
Salary
$107.9K - $195.1K / year
Seniority
Lead
No structured requirement data.
Job Description
DevOps Lead
Leidos
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description Leidos is seeking a DevOps Lead to lead and further mature platform engineering, site reliability engineering (SRE), and application security functions within a large-scale software development and IT operations program. This role is responsible for driving operational excellence, security, resiliency, and scalability across enterprise DevOps platforms supporting mission-critical applications for federal government customers. The DevOps Lead will oversee a multidisciplinary team tasked with enabling continuous integration/continuous deployment (CI/CD), optimizing cloud and on-premise platforms, securing development pipelines, and ensuring highly reliable and performant systems. This leader will collaborate with architects, engineering leads, project managers, and customer stakeholders to maintain an automation-first culture and introduce best practices for reliability, security, and DevOps maturity. This opportunity is ideal for an accomplished DevOps leader who thrives on solving complex platform, security, and reliability challenges at scale and is passionate about technical excellence and customer mission achievement. Location: This position may be remote, though periodic visits to customer sites or the Program Office (within Washington, DC region) are required. Primary Responsibilities - DevOps & Platform Management: Direct the design, implementation, and maintenance of CI/CD pipelines, automated provisioning, and monitoring processes across cloud and hybrid environments. Lead efforts to standardize and optimize platform engineering practices, adopting Infrastructure-as-Code (IaC) and microservices deployment models. - Site Reliability Engineering (SRE): Develop and enforce SRE principles, including release management, system reliability, observability, incident management, SLAs/SLOs, and fault tolerance. Implement monitoring and alerting solutions to proactively identify issues, reduce mean time to resolution (MTTR), and drive service uptime objectives. - Application Security: Integrate security throughout the SDLC, ensuring robust code review, vulnerability scanning, and threat modeling within automated pipelines. Collaborate with security teams to remediate vulnerabilities and achieve compliance with industry and federal standards (e.g., FedRAMP, NIST). - Team Leadership & Collaboration: Mentor and lead a multidisciplinary team, promoting an agile, collaborative, and innovative work environment. Partner with development, security, and operations teams to align platform strategies with business and mission requirements. - Platform Engineering: Champion creation and curation of reusable infrastructure patterns, automation scripts, cloud orchestration templates, and developer self-service platforms. Evaluate emerging tools and technologies for enhancing developer productivity and system resilience. - Service Operations & Incident Response: Oversee operational readiness, incident response, root cause analysis, and continuous improvement initiatives to ensure high availability and rapid recovery from service disruptions. - Continuous Improvement: Drive a culture of innovation by assessing and implementing advancements in DevOps, platform engineering, and SRE practices. Regularly review system metrics, operational KPIs, and propose enhancements. - Reporting & Stakeholder Communication: Prepare and present operational dashboards, incident reports, risk assessments, and status updates to program leadership and customers. Ensure transparent communication of operational posture and improvement initiatives. Qualifications - Bachelor’s degree and 10+ years of progressive experience in software development, DevOps, or platform engineering. - 3+ years of technical team leadership or management experience. - Demonstrated expertise in advanced DevOps practices, including CI/CD, configuration management, automation, and cloud-native operations (AWS, Azure, or similar). - Hands-on experience with SRE frameworks, monitoring, logging, alerting, and reliability engineering techniques. - Proven background in securing applications and systems, including integrating security into pipelines and coordinating with security/compliance teams. - Strong technical knowledge of container orchestration (Kubernetes, Docker), IaC (Terraform, CloudFormation), and end-to-end application/platform lifecycle management. - Excellent interpersonal, written, and verbal communication skills. - Strong problem-solving skills and ability to thrive in a fast-paced, dynamic environment. - U.S. citizenship and ability to obtain and maintain required government security clearance. Preferred Qualifications - Certifications in cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps), SRE, or security (e.g., CISSP, CISM). - Experience managing federal or large enterprise DevOps/SRE/Platform Engineering teams. - Familiarity with federal compliance frameworks (FedRAMP, FISMA, NIST 800-53). - ITIL Foundations or equivalent IT service management training. - Prior experience with infrastructure modernization, large-scale migrations, or complex system integrations. - Familiarity with Department of Justice or federal government IT environments. Pay Range Pay Range $107,900.00 - $195,050.00. The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Job Requirements
- Bachelor’s degree and 10+ years of progressive experience in software development, DevOps, or platform engineering.
- 3+ years of technical team leadership or management experience.
- Demonstrated expertise in advanced DevOps practices, including CI/CD, configuration management, automation, and cloud-native operations (AWS, Azure, or similar).
- Hands-on experience with SRE frameworks, monitoring, logging, alerting, and reliability engineering techniques.
- Proven background in securing applications and systems, including integrating security into pipelines and coordinating with security/compliance teams.
- Strong technical knowledge of container orchestration (Kubernetes, Docker), IaC (Terraform, CloudFormation), and end-to-end application/platform lifecycle management.
- Excellent interpersonal, written, and verbal communication skills.
- Strong problem-solving skills and ability to thrive in a fast-paced, dynamic environment.
- U.S. citizenship and ability to obtain and maintain required government security clearance.
- Preferred Qualifications
- Certifications in cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps), SRE, or security (e.g., CISSP, CISM).
- Experience managing federal or large enterprise DevOps/SRE/Platform Engineering teams.
- Familiarity with federal compliance frameworks (FedRAMP, FISMA, NIST 800-53).
- ITIL Foundations or equivalent IT service management training.
- Prior experience with infrastructure modernization, large-scale migrations, or complex system integrations.
- Familiarity with Department of Justice or federal government IT environments.
- Pay Range
- Pay Range $107,900.00 - $195,050.00. The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Deploy and maintain critical applications on cloud-native microservices architecture • Implement automation, effective monitoring, and infrastructure-as-code • Deploy and maintain CI/CD pipelines across multiple environments • Support and work alongside a cross-functional engineering team on the latest technologies • Iterate on best practices to increase the quality & velocity of deployments • Have on call responsibilities in rotation with the engineering team • Increase the sophistication of our alerting and escalation mechanisms • Help increase system performance with a focus on high availability and scalability • Propose, scope, design, and implement various infrastructure architectures • Develop and maintain solutions for operational administration, system/data backup, disaster recovery, and security/performance monitoring • Continuously evaluate existing systems with industry standards, and make recommendations for improvement • Perform root cause analysis for production errors • Continue to keep the lights on (day-to-day administration)
• Provide technical and line-management leadership to your development team. • Take responsibility for the successful delivery of projects. • Identify and resolve blockers before they become issues. • Ensure best practices in DevOps, software development and agile methodologies are upheld within the team. • Work directly with clients, translating requirements into technical briefs. • Shape and define architectural decisions ensuring scalability, security, and maintainability. • Provide updates to client and Nearform leadership to ensure clear understanding of project status and drive good decision-making.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description We are looking for a skilled individual to join our rapidly growing team at Bluelight Consulting. This position is ideal for someone who thrives in a fast-paced, dynamic environment where everyone's opinions and efforts are valued and appreciated. You will have the opportunity to contribute to challenging and meaningful projects, developing high-quality applications that stand out in the market. We value continuous learning, personal growth, and hard work, offering a collaborative environment that promotes professional development. If you are passionate about software development and eager to be part of a growing software consultancy, we invite you to apply and join us on this exciting journey. Qualifications - Cloud Engineering (cloud computing) experience with AWS, GCP, and/or Azure to include load balancing - Infrastructure as a code (Terraform / Pulumi / Cloudformation) - Designed and maintained CI/CD process and tools (CircleCI, GitLab, Jenkins) - In-depth experience with the orchestration tools (Kubernetes) - In-depth experience with the config management tools (Helm, Ansible, Chef Puppet) - Testing, code review, good communication skills Benefits - Competitive salary and bonuses, including performance-based salary increases - Generous paid-time-off policy - Technology / Office stipend - Health Coverage - Flexible working hours - Work remotely - Continuing education, training, conferences - Company-sponsored coursework, exams, and certifications
• Design and evolve reliability architecture for distributed and cloud-hosted systems. • Define and implement SRE best practices, including SLIs, SLOs, error budgets, and capacity planning. • Partner with platform and application teams to design systems for reliability, scalability, and operability. • Identify and mitigate systemic reliability risks across infrastructure and services. • Lead incident response processes including on-call rotations, escalation, and post-incident reviews. • Conduct root cause analysis for complex production incidents and drive long-term improvements. • Improve operational readiness through runbooks, automation, and resilience testing. • Reduce operational toil through tooling, automation, and process improvements. • Design and maintain observability systems for metrics, logging, tracing, and alerting. • Ensure services and data pipelines are observable, debuggable, and performant in production. • Drive performance analysis and tuning across infrastructure and service layers. • Build automation to improve system reliability, deployment safety, and recovery processes. • Partner with DevOps and Cloud Platform teams on CI/CD reliability, rollout strategies, and safe deployment patterns. • Support and improve Kubernetes-based environments and containerized workloads. • Collaborate with security teams to ensure secure and resilient system design. • Participate in disaster recovery planning and testing. • Maintain strong operational practices around access control, secrets management, and change management.



