Expertise and Technology for National Security
DevSecOps AWS Engineer
Location
United States
Posted
2 days ago
Salary
$98.5K - $206.8K / year
Seniority
Mid Level
Job Description
CACI International Inc
Role Description
CACI is seeking a DevSecOps AWS Engineer to join our cloud operations team on a contract supporting cloud solutions at the Department of Homeland Security. The DevSecOps AWS Engineer is primarily focused on securing and managing applications hosted on Amazon Web Services in a 24/7/365 environment. They will collaborate with clients to help design, implement, maintain, and test a variety of technical solutions utilizing the latest technologies. Responsibilities include creating and enhancing Continuous Integration (CI)/Continuous Delivery (CD) pipelines and various automation processes.
- Serve as a Cloud Operations team member responsible for troubleshooting and maintenance.
- Engineer, install, maintain, configure, and optimize server operating systems (Windows and Linux) and hosted applications in the AWS cloud environment.
- Analyze network and system needs in collaboration with other teams, and participate in planning, designing, upgrading, and deploying enterprise datacenter hardware and software.
- Apply appropriate security measures to safeguard government systems by implementing industry-standard best practices and DHS-mandated mitigations.
- Support the distribution of security patches using enterprise-level tools such as AWS Systems Manager Agent (SSM Agent) and Yum server.
- Provide guidance for adhering to best practices for optimal and secure use of cloud technologies.
- Deliver RMF documentation to support accreditation efforts.
- Participate in the design, development, testing, and management of DevSecOps processes and tools.
- Develop and integrate new or enhanced capabilities using a DevSecOps approach.
- Provide and manage test automation and performance testing tool chain capabilities.
- Serve as a technical consultant for application teams and provide mentoring support for DevSecOps adoption.
- Configure and customize tools to enable automation.
- Perform container management.
- Ensure compliance with current security policies and remediations.
- Participate in on-call rotation to handle and resolve high-severity technical escalations.
- Design and implement appropriate environments for applications, engineer suitable release management procedures, and provide production support.
- Assist in migrating products from on-prem infrastructure to AWS.
- Work with other Cloud Solutions Engineers to develop prototypes and enhance or expand fundamental capabilities.
- Participate in daily stand-up meetings to report accomplishments, plans for the day, and any roadblocks encountered.
- Prepare for, participate in, and present at the Change Management process, Technical Review Board, and Change Control Board.
- Contribute to documentation in the Knowledge Base, Standard Operating Procedures (SOP), work instructions, and job aids.
- Conduct knowledge-sharing sessions with junior team members.
- Document all work and updates in ServiceNow, JIRA, and Confluence.
- Perform additional duties as assigned.
Qualifications
- Ability to obtain DoD Security Clearance.
- Ability to obtain Department of Homeland Security (DHS) Entry On Duty (EOD); active EOD preferred.
- Bachelor’s degree or equivalent +7 years of applicable experience.
- At least one Amazon AWS cloud certification.
- Previous Amazon AWS cloud support experience in a production environment, including network, security, deployment, and automation.
- Previous experience with DevSecOps and DevSecOps tools.
- ITIL Foundation certification (not required to start but expected to be attained in the first six months).
- Experienced with coding and scripting languages such as Python, Bash, JavaScript, etc.
- Comfortable in at least one high-level scripting language (PowerShell, Python, Bash, etc.).
- Understanding/knowledge of Cybersecurity RMF Controls Validation Testing.
- Work on-call rotation to respond to incident escalations.
- Ability to communicate clearly and effectively in interpersonal and written formats.
Requirements
- Previous DHS or DoD experience.
- Familiar with Windows and Linux infrastructure hardening to DISA STIG specifications.
- The following certifications are highly desired:
  - AWS Certified DevOps Engineer – Professional.
  - Cybersecurity certification.
  - ITIL Foundation.
  - Other Cloud certifications.
- Knowledgeable in Problem Management best practices and processes.
- Experience using ServiceNow service management software (or a similar tool) to document incidents and service requests.
- Participation in the total software development lifecycle.
- Knowledge of Continuous Integration (CI) / Continuous Deployment (CD) pipelines.
- Knowledge of the Certification and Accreditation (C&A) process.
- Experience with Ansible Tower and Terraform.
Benefits
- Comprehensive benefits such as healthcare, wellness, financial, retirement, family support, continuing education, and time off benefits.
- Competitive compensation and learning and development opportunities.
- Flexible time off benefit to balance quality work and personal lives.
Company Description
CACI places character and innovation at the center of everything we do. As a valued team member, you’ll be part of a high-performing group dedicated to our customers’ missions and driven by a higher purpose: to ensure the safety of our nation. CACI values the unique contributions that every employee brings to our company and our customers every day.
More DevOps Engineer Jobs
Cloud Operations Engineer
Made4net
Made4net is a global leader in supply chain execution and warehouse management systems (WMS), delivering agile, scalable, and unified solutions that help companies streamline opera
Role Description
We’re looking for a hands-on Cloud Operations Engineer to join our global operations team. This is a technical role at the intersection of cloud infrastructure, production support, and security/ops monitoring. We operate a follow-the-sun support model across global regions, with shift coverage expected on weekends and holidays. You’ll support mission-critical systems across AWS, Windows, and Linux, ensuring reliability, performance, and rapid incident response.
Key Responsibilities
- Provide production support as part of a global follow-the-sun operations team: monitor systems, triage alerts, respond to incidents, and ensure service continuity across all environments.
- Monitor infrastructure and application health across security and operations dashboards; detect, triage, and respond to performance, availability, and security alerts; produce and maintain SLA reporting.
- Deploy, manage, and maintain AWS infrastructure with a primary focus on EC2-based workloads across Windows and Linux environments.
- Configure and manage load balancers (ALB/NLB), including URL rewrite rules, routing policies, and SSL termination for web applications.
- Design and maintain high-availability architectures on AWS, ensuring redundancy, multi-AZ deployments, and tested failover procedures.
- Own backup and disaster recovery operations: scheduling, retention policies, monitoring, and regular restoration testing.
- Automate configuration management and application update deployments using Ansible, ensuring consistency and minimal downtime across all environments.
- Manage database infrastructure across environments: provisioning, patching, performance tuning, and backup/recovery operations.
- Troubleshoot and configure networking components including VPC, subnets, routing tables, DNS, security groups, and firewall rules.
- Handle incident escalation, maintain shift handover documentation, and contribute to post-mortems and continuous improvement of support processes.
- Maintain infrastructure runbooks and operational documentation, and contribute to automation and tooling improvements.
- Support the company’s ongoing transition toward microservices and containerized architectures, contributing operational knowledge and helping ensure smooth adoption in production environments.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent experience through certifications, vocational training, or hands-on work.
- 3-5 years of hands-on experience in a cloud infrastructure, systems engineering, or production support role.
- Comfortable with a follow-the-sun support model; availability for occasional weekend coverage is expected as part of the global team rotation.
- Strong hands-on experience with AWS, specifically EC2, ALB/NLB, VPC, S3, and AWS Backup services.
- Solid experience administering Windows and Linux servers in an enterprise IaaS environment.
- Hands-on experience with Ansible for configuration management and automated application deployments, including managing update pipelines.
- Proven experience designing and maintaining high-availability architectures on AWS (multi-AZ, auto-scaling, failover).
- Proficiency in configuring ALB/NLB, including URL rewrite rules, path-based and host-based routing, and listener rules for web applications.
- Solid understanding of AWS networking: VPC design, subnets, routing tables, security groups, DNS, and VPN connectivity.
- Hands-on experience managing relational database infrastructure (MSSQL Server, Postgres, Oracle): provisioning, patching, monitoring, and performance tuning.
- Experience with security and ops monitoring: interpreting alerts, identifying anomalies, triaging incidents, and escalating appropriately.
- Experience with infrastructure and application performance tracking; able to interpret metrics and respond to anomalies.
- Scripting proficiency in Bash, Python, or PowerShell for automation and operational tasks.
Preferred Qualifications
- AWS certifications (Solutions Architect, SysOps Administrator, or equivalent).
- Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
- Experience with ITSM / ticketing systems in a production support context (Zendesk preferred).
- Experience with monitoring and observability platforms (Grafana preferred).
- Familiarity with CI/CD pipelines and DevOps practices.
Benefits
- Health insurance (medical, dental, vision) with a robust wellness program to support your physical and mental well-being.
- Generous paid time off policy.
- Company-matched 401(k) retirement plan to help you secure your future.
- Tuition reimbursement program to support your continued education and career advancement.
- Employee assistance program providing confidential counseling and support services for personal challenges.
- Discretionary employee bonus program.
- Employee discounts and perks through our PEO.
- Pay range: starting from $120,000 per year.
• Serve as the primary technical owner for production reliability across U.S. customer environments.
• Investigate and resolve complex issues spanning web applications, APIs, backend services, data pipelines, cloud infrastructure, and customer integrations.
• Lead production incident response efforts, coordinating cross-functional teams to restore service and minimize customer impact.
• Perform root cause analysis and drive corrective actions that improve long-term system stability and resilience.
• Partner with software engineering and platform teams to identify recurring reliability risks and implement sustainable solutions.
• Design, configure, and validate secure customer connectivity solutions including Site-to-Site VPNs, Transit Gateway integrations, routing configurations, and secure network paths.
• Support customer onboarding initiatives by troubleshooting connectivity challenges and ensuring consistent implementation processes.
• Enhance platform observability through improvements in monitoring, logging, alerting, tracing, and operational dashboards.
• Contribute to CI/CD, infrastructure automation, and deployment processes that improve release safety and operational consistency.
• Develop operational tooling that supports incident response, troubleshooting, onboarding, and system monitoring activities.
• Collaborate with engineering leadership to improve cloud architecture, scalability, security, and operational readiness.
• Partner with customer-facing teams to communicate technical issues, remediation plans, and reliability improvements in a clear and effective manner.
• Support compliance, security, and risk management initiatives within highly regulated healthcare environments.
DevOps Internship
Viasoft Korp | Industry ERP
The management system born in industry that lives and breathes industrial processes and distribution 💙
• Assist with routines involving:
  - microservices;
  - containers;
  - observability;
  - high availability;
  - continuous integration;
  - continuous delivery;
  - monitoring;
  - continuous deployment.
• Assist with infrastructure automation routines in cloud and on-premise environments;
• Assist in deploying and evolving Kubernetes environments;
• Assist in automating processes using Ansible;
• Assist in implementing and evolving monitoring and observability with Grafana;
• Help resolve incidents and troubleshoot across services and environments;
• Help ensure the stability, availability, and performance of the environments;
• Assist in evolving CI/CD pipelines and tools;
• Document procedures, workflows, and configurations;
• Take an active part in the technological evolution of Korp's platform.
• Owning the SRE infrastructure lifecycle from design reviews and pre-rollout readiness assessments through production sign-off and ongoing reliability management
• Designing and implementing frameworks that reflect customer experience for load balancing services and driving action when error budgets are at risk
• Building and maintaining observability pipelines from load-balancing components and system-level sources to dashboards that enable rapid incident triage
• Leading technical incident response for complex NB/NLB failures, acting as the technical commander and driving root cause analysis and preventive follow-through
• Developing and automating safe deployment workflows for phased releases, including bake-period monitoring, feature flag management, and validation across global datacenter rollouts
• Reviewing design documents and product-requirement documents, and producing actionable SRE input on operational risks, capacity implications, Day-2 concerns, and product strategy gaps
• Building automation and tooling using Python or Go that reduces operational toil and improves team-wide operational capability



