Job Closed
This listing is no longer active.
Expertise and Technology for National Security
Senior DevSecOps Engineer, AI Enablement
Location
United States
Posted
92 days ago
Salary
$98.5K - $206.8K / year
Seniority
Senior
Job Description
CACI International Inc
- Join CACI’s AI Enablement team as a Senior DevSecOps Engineer delivering rapid GenAI infrastructure and CI/CD capabilities through 1–2 month program engagements.
- Deploy secure pipelines, containerized platforms, cloud environments, and managed AI services while coaching program teams to operate and evolve systems independently.
- Enhance our solution catalog by refining IaC templates and contributing new infrastructure patterns from field experience.
- Rapidly deploy GenAI infrastructure across AWS, Azure, and on‑prem using catalog templates.
- Implement and operationalize containerized platforms; train teams on deployment and troubleshooting.
- Establish production readiness standards including observability, reliability, and documentation.
- Build and refine GitLab CI/CD pipelines with security scanning and deployment automation.
- Configure identity and access management (Keycloak or similar) with OIDC/SAML.
- Lead workshops, pair‑programming, and reviews to build program team capabilities.
- Develop reusable Terraform modules and IaC patterns for networking, IAM, and GenAI infrastructure.
- Document architecture decisions, lessons learned, and best practices.
- Improve catalog templates and tooling based on recurring field challenges.
Job Requirements
- 7+ years IT experience with 4+ years in DevSecOps, SRE, or Cloud Systems roles; consulting or multi‑project experience preferred.
- Bachelor’s degree in Computer Science or a related major.
- Ability to obtain a U.S. Secret Clearance.
- Practical fluency with AI tools and GenAI concepts.
- Experience deploying web apps and integrating APIs; familiarity with LLM or managed AI services a plus.
- Strong understanding of distributed systems, microservices, and complex system orchestration.
- Deep expertise with Kubernetes, Docker, and cloud‑native services across AWS/Azure/hybrid.
- Advanced Terraform experience, including reusable modules and templates.
- GitLab CI/CD (or similar) and automation skills using Python or Bash.
- Hands‑on IAM experience (Keycloak, Okta, etc.) with OIDC/SAML integration.
- Observability experience (Prometheus, Grafana, CloudWatch) plus familiarity with LLM observability concepts.
- Demonstrated ability to deliver quickly, make pragmatic decisions, and adapt across diverse environments.
- Strong communication, documentation, and enablement skills.
- Proficiency with agile workflows (GitLab, Jira).
Benefits
- healthcare
- wellness
- financial
- retirement
- family support
- continuing education
- time off benefits
More DevOps Engineer Jobs
DevOps Engineer
Bright Vision Technologies
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications.
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly mobile applications. As we continue to grow, we’re looking for a skilled DevOps Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology.

This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential. We are looking for candidates on OPT/CPT/H4 EAD/TN/E3 or any other non-immigrant visa who are seeking H-1B sponsorship for the 2027 quota.

Company: Bright Vision Technologies ( www.bvteck.com )
Job Title: DevOps Engineer
Onsite/Hybrid: Remote
FULL-TIME ROLE WITH BRIGHT VISION

Job Description:
Environment: CI/CD pipelines, Jenkins, GitHub Actions, GitLab CI, Docker, Kubernetes, Helm, AWS / Azure / GCP, Infrastructure as Code (Terraform, CloudFormation), Linux, Bash / Shell scripting, Monitoring & Logging (Prometheus, Grafana, ELK), Git, Agile methodologies, DevOps best practices

STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2, AND NO 3RD PARTY BROKERING PLEASE. For every role, a coding test is required, so apply only if you are confident and technically strong. We prefer at least 3 to 5 years of real-time experience.

If you are a DevOps Engineer with the above skills and are looking for H-1B sponsorship this year, please send your resume immediately to harry@bvteck.com.

We are committed to providing equal employment opportunities and fostering an inclusive work environment. We encourage applications from all qualified individuals regardless of race, ethnicity, religion, gender identity, sexual orientation, age, disability, or any other protected status. If you require accommodations during the recruitment process, please let us know.
Position offered by “No Fee agency.”

Equal Employment Opportunity (EEO) Statement
Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall. BV Teck expressly prohibits any form of workplace harassment or discrimination. Any improper interference with employees' ability to perform their job duties may result in disciplinary action up to and including termination of employment.
Principal Site Reliability Engineer - Remote
DFIN - Donnelley Financial Solutions
A leading provider of risk and compliance solutions, DFIN - Donnelley Financial Solutions offers data insights, industry expertise, and insightful technology to help clients make s
Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions. At DFIN, we are a values-driven organization that empowers you to build a fulfilling career while bringing your authentic self to work every day. Our "Win as One" mentality ensures that our team's success is directly linked to Client, Shareholder and Employee Satisfaction. Recognized as one of AMERICA'S MOST LOVED WORKPLACES® for five consecutive years and a Built In Best Places to Work for six years, we are committed to our employees' total well-being. Enjoy competitive compensation, a flexible workplace, comprehensive benefits, and opportunities for professional growth. Bring your passion and talents to DFIN - because being YOU thrives here.

Summary:
We are looking for technical team members at all levels who want to push themselves to deliver best-in-market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.

The Principal Site Reliability Engineer - Cloud is responsible for designing, building, securing, monitoring and maintaining our SaaS product cloud infrastructure so it is fast, cost effective, stable and optimized for our customers. SREs at DFIN take on availability, performance, managing change, monitoring, and response, and are guardians of non-functional requirements. You either have a SaaS cloud infrastructure background in Azure or AWS with a programmatic, automated mindset, or you come from a software engineering background with SaaS cloud infrastructure experience in Azure or AWS. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally.

We are looking for someone who thrives on collaboration within the team and across other groups and can lead colleagues independently to deliver solutions to complex problems.

Responsibilities:
- Champion and implement a culture to maintain performant, reliable, secure, cost-effective platform cloud infrastructure in DFIN SaaS products based on operationalized processes you define
- Champion security of our cloud infrastructure, collaborating with Security and Governance teams and using static and dynamic tooling
- Champion and implement application and cloud infrastructure monitoring and alerting to prevent client-impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
- Optimize cloud infrastructure and application performance at scale while maintaining effective cost controls
- Automate cloud infrastructure buildout and maintenance, including system operational runbooks
- Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into operationalized work processes
- Perform with broad independence and deliver on project milestones and tasks you define on schedule while communicating progress regularly
- Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
- Learn continuously and apply lessons learned
- Evangelize best practices, eliminate bottlenecks, and improve process
- Participate in on-call duties 24/7/365 and lead the triage and RCA of production incidents

Qualifications:
- 8+ years experience designing, building, securing, monitoring and maintaining cloud infrastructure in Azure or AWS
- 5+ years experience creating, configuring, maintaining and monitoring Kubernetes clusters (AKS or EKS) in cloud infrastructure to optimize application performance and reliability
- 5+ years building and deploying Infrastructure as Code with Terraform or similar technology
- 5+ years experience with common cloud networking, firewall and load balancing configuration
- 5+ years experience writing software in any modern software language such as C# .NET, Java
- 5+ years experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
- 5+ years experience implementing production performance, availability, and scalability monitoring and alerting using a tool such as New Relic, Dynatrace, DataDog or AppDynamics
- 5+ years experience supporting public client-facing revenue-generating systems
- Experience monitoring and preventing issues with databases and database queries (SQL) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
- Experience planning, coordinating, developing and executing all stages of post-deployment verification test scripts
- Experience securing Windows or Linux systems in a 24x7 production environment
- BS in Computer Science or equivalent work experience

It is the policy of Donnelley Financial Solutions to select, place, and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran status, actual or perceived sexual orientation, genetic information or any other protected status. If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access jobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to talentacquisition@dfinsolutions.com .

At DFIN, protecting your identity is a top priority. Please be aware of scammers impersonating DFIN recruiters. DFIN recruiters will never request personal information via email or text. You will only receive a text from us if you've already been in contact. All automated messages will come from talentacquisition@dfinsolutions.com . If you ever have doubts about the legitimacy of any communication from us, please do not hesitate to reach out for verification via talentacquisition@dfinsolutions.com (this email is for general TA questions and is not used for updates on your application status).

#BI-Remote
Site Reliability Engineer
Devsu
Devsu is a technology agency that provides software development services, IT augmentation and staffing.
We are seeking a Site Reliability Engineer (SRE) with deep expertise in monitoring, observability, and reliability engineering to support systems running across on-premises infrastructure and Google Cloud Platform (GCP). This role is primarily responsible for designing, operating, and improving monitoring, alerting, and observability platforms, with a strong focus on Grafana and Kubernetes environments. As a secondary responsibility, this role provides backup coverage for the Application Support team during periods of resource constraints or major incidents, offering L2/L3 technical support when required.

Responsibilities

Monitoring & Observability (Core Focus)
- Own and operate the monitoring and observability stack across on-prem and GCP environments
- Design, build, and maintain Grafana dashboards for infrastructure, Kubernetes, and applications
- Define, tune, and maintain alerts to ensure a high signal-to-noise ratio
- Establish observability standards and best practices across teams
- Improve visibility into system health, performance, and reliability

Site Reliability Engineering
- Apply SRE principles to improve availability, performance, and resilience
- Define and track SLIs, SLOs, and error budgets
- Participate in on-call rotations and SEV incident response
- Lead or contribute to incident investigations and root cause analysis (RCA)
- Drive preventative actions to reduce repeat incidents

Kubernetes & Platform Reliability
- Support and monitor Kubernetes environments (GKE and on-prem clusters)
- Monitor cluster health, capacity, and resource utilization
- Troubleshoot platform-level issues impacting application reliability
- Collaborate with Platform and Engineering teams on reliability improvements

Secondary Responsibilities (Backup Application Support)
These responsibilities are activated as needed and are not part of day-to-day operations.
- Provide L2/L3 application support coverage during:
  - Support team resource shortages
  - High-severity incidents (SEVs)
  - Peak support periods or escalations
- Triage and troubleshoot application issues using existing runbooks and dashboards
- Collaborate with Application Support and Engineering teams during incidents
- Ensure all actions, findings, and resolutions are documented in ServiceNow (SNOW)

Requirements:
- Strong experience as a Site Reliability Engineer or Reliability Engineer
- Deep hands-on expertise with Grafana (dashboards, alerting, troubleshooting)
- Solid experience with monitoring and observability systems
- Production experience operating Kubernetes environments
- Experience supporting systems in GCP and on-prem environments
- Strong Linux systems and troubleshooting skills
- Fluent English (written and spoken)
- Ability to work in the PST time zone
- Ability to participate in an on-call rotation that includes coverage for one weekend day. Time worked during the weekend is compensated with one day off during the week, in accordance with the established work schedule.

Technology Stack:
- Observability: Grafana, Prometheus, logging platforms
- Containers: Kubernetes (GKE and on-prem)
- Cloud: Google Cloud Platform (GCP)
- Operations: Linux, networking, infrastructure monitoring
- Incident Tools: PagerDuty, ServiceNow, Slack (or equivalents)

Nice to have:
- Experience supporting application teams during SEV incidents
- Knowledge of capacity planning and performance tuning
- Scripting skills (Python, Bash, etc.)
- Experience with hybrid infrastructure environments

At Devsu, we believe in creating an environment where you can thrive both personally and professionally. By joining our team, you’ll enjoy:
- A stable, long-term contract with opportunities for career growth
- Private health insurance
- A remote-friendly culture that promotes work-life balance
- Continuous training, mentorship, and learning programs to keep you at the forefront of the industry
- Free access to AI training resources and state-of-the-art AI tools to elevate your daily work
- A flexible Paid Time Off (PTO) policy as well as paid holidays
- Challenging, world-class software projects for clients in the US and LatAm
- Collaboration with some of the most talented software engineers in Latin America and the US, in a diverse work environment

Join Devsu and discover a workplace that values your growth, supports your well-being, and empowers you to make a global impact.
Senior DevOps Engineer
ChowNow
The only fair-for-all food ordering marketplace: no commissions for restaurants and no hidden fees for diners.
- As a Senior DevOps Engineer at ChowNow, you will be specifically responsible for building, improving, and growing our technology infrastructure.
- You will help design and implement reproducible processes in the enterprise environment as well as support the application production environment.
- You will own and support engineering user-facing technology as well as share responsibility for supporting the production operations.



