Digital-first government for the common good.
Senior DevOps Engineer
Location
Virginia
Posted
11 days ago
Salary
$125K - $142K / year
Seniority
Senior
Job Description
Senior DevOps Engineer
Ad Hoc LLC
• Senior DevOps Engineer serves as an experienced individual contributor within a team • Responsible for supporting the goal of meeting scope, schedule and delivery requirements • Impact the long-term goals of the program while contributing to the development of the program's DevOps and software engineering strategy • May serve as the discipline's primary lead when working with stakeholders and utilize strong influential skills to drive improvements • Collaborate across teams to consolidate and simplify DevOps tools and services • Serve as a mentor to individuals within the team • Capable of self-directed cloud infrastructure design and pipeline design, including cloud-specific product offerings • Build libraries, modules, and packages for other infrastructure engineers to integrate with their projects • Develop CI/CD processes to help deliver software of higher quality at greater speed • Integrate tools to satisfy non-functional requirements such as quality thresholds, security vulnerabilities, and static analysis
Job Requirements
- Bachelor's and 7+ years of experience
- Demonstrates expert-level knowledge in at least one infrastructure-as-code tool (e.g. terraform, ansible)
- Expertise in key DevOps concepts: installing software, virtualization, containerization, networking, etc.
Benefits
- Company-subsidized health, dental, and vision insurance
- Flexible PTO
- 401K with employer match
- Paid parental leave after one year of service
- Employee Assistance Program
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Design, implement, and evolve cloud-native infrastructure and CI/CD platforms that power customer-facing digital products. • Collaborate with Engineering, Security, and Infrastructure teams to ensure our cloud environments are scalable, secure, observable, and reliable. • Lead technical projects from design through implementation and operational support. • Promote DevOps best practices and a culture of continuous improvement. • Implement and improve monitoring, alerting, and logging solutions. • Identify opportunities to reduce manual effort through intelligent automation and platform improvements.
Senior DevOps Engineer
Swift SCSwift SC is a global member-owned cooperative and the world’s leading provider of secure financial messaging services. Founded in 1973, the company plays a vi
Title: Senior DevOps Engineer Location: Manassas, United States Culpeper, United States Full time Job Description: ABOUT US We're the world's leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value - across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we're proud to support the global economy. We're unique too. We were established to find a better way for the global financial community to move value - a reliable, safe and secure approach that the community can trust, completely. We're always striving to be better and are constantly evolving in an ever-changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions. We are seeking a skilled DevOps Engineer to design, build, and maintain reliable CI/CD pipelines, automate infrastructure and application deployments, and support Linux‑based systems across development, test, and production environments. This role works closely with development, database, and operations teams to ensure scalable, secure, and highly available platforms. The ideal candidate has strong hands‑on experience with Git‑based workflows, scripting, configuration management, and CI/CD tooling, and is comfortable supporting environments that include Oracle database technologies. Swift is unable to sponsor an employment authorization for this position now or in the future. What to Expect: In this role you will: - Manage source control and workflows using Bitbucket/Git - Automate deployments and operational tasks with Python, Bash, Ansible, and YAML - Administer and support Linux systems, including troubleshooting and performance tuning - Collaborate with development and operations teams to improve deployment reliability and automation - Support and integrate with Oracle database platforms (Data Guard, GoldenGate, RAC) as needed - Use SQL and build tools () to support application delivery and validation - Build, maintain, and optimize CI/CD pipelines using CloudBees/Jenkins/Maven - This position is mission critical and offers flexible work between onsite and remote (2 days working onsite and up to 3 days working remotely) - Willing to perform weekend deployment activities on a rotational basis with peer team members (Estimated: One Saturday out of 4 weekends) What will make you successful: We are seeking professionals with: - Bachelor's Degree in Computer Science or a related field - 5 to 10 years of experience in systems development - Solid Experience with Automation/Scripting (Ansible, Python, bash, Robot framework etc.) - Knowledge of any of these CI/CD technologies: Jenkins, Ansible, Docker, BitBucket, Nexus - Ability to work in a Linux environment - Autonomous, driven, with strong ability to quickly adapt and respond to change - Customer oriented and quality mindset - we continually strive to deliver true customer value - Open-minded, solutions oriented, and a true team player - gaining energy through collaboration with others - Fluent in English (spoken and written) Preferred Qualifications: - Worked on Oracle Database Enterprise Edition 12.1, 12.2 and 19c. Partitioning and multi-tenancy options - Oracle RAC - GoldenGate for data replication - DataGuard for Disaster Recovery (DR) - Good knowledge of IT security, mainly in the DBA area - Knowledge of Terraform or Maven The estimated salary range for a new hire in this position in Virginia is $101,303.00 USD Annual MINIMUM to $188,135.00 USD Annual MAXIMUM. Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. Our compensation packages include a competitive base salary and bonus opportunity for all employee's contingent on personal and company performance. Our generous benefits program includes medical, dental, vision and life insurance with no premium costs for our employees and their families, and retirement plan plus matching 401k. What we offer We give you the freedom to be yourself. We are creating an environment of unique individuals - like you - with different perspectives on the financial industry and the world. A diverse and inclusive environment in which everyone's voice counts and where you can reach your full potential. We are committed to an inclusive and accessible recruitment process. If you require a reasonable accommodation related to accessibility during your application or interview. Please note that this mailbox is not monitored for general recruitment enquiries and should only be used for accessibility or accommodation-related requests (for example related to vision, hearing or neurodiversity). All requests are confidential and will not affect your candidacy. Don't meet every single requirement? At Swift, we are dedicated to building a workplace where people can bring their full selves and ideas to the team, so if you are excited about this role, we encourage you to apply even if you do not meet every single qualification.
• Build, deploy safely and incrementally, and operate critical production systems with focus on scalability, reliability, observability, performance and security. • Build automation to remove toil and proactively monitor, respond to, and enhance alerts with automated handling. • Create and maintain incident response runbooks, triage platform and infrastructural issues, and write postmortem documents to prevent recurring incidents. • Plan and communicate maintenance windows on production systems while engaging with 3rd party vendor support as needed. • Work with Arista's product development teams to identify infrastructural bottlenecks and design solutions to enhance developer experience and workflow efficiency. • Survey and adopt best practices around infrastructure and platform design to maintain secure, scalable and fault-tolerant systems, including studying OSS system implementations for better triage and resolution.
Site Reliability Engineering Manager
ArcoroArcoro is a software company offering an integrated HR and workforce management platform to help organizations with workforce hiring, tracking, and compliance. The company’s serv
Title: Site Reliability Engineering Manager - Remote Location: Phoenix United States Job Description: Why Arcoro? Want to work with a solid company that's transforming HR for the construction industry? Our team of dedicated professionals helps construction, contracting and field services companies hire, manage and grow their workforce with a market-leading SaaS solution. As a member of the A-Team, you'll enjoy a top-notch employee experience where you can embrace your problem-solving skills and innovation, work with a team of great colleagues and see the impact of your contribution each day. Our culture is collaborative, and we believe strongly in training, growth and internal advancement. We offer competitive compensation including comprehensive benefits and a generous time-off policy. We offer both on-site and remote opportunities. At Arcoro, you will help create software products that are cutting edge, easy to use, and that make an appreciated and notable difference in our customers' daily lives. About the Job: The Site Reliability Engineering Manager is responsible for leading the SRE team to ensure the availability, performance, scalability, and operational excellence of Arcoro's production systems. This role combines people leadership with deep technical oversight, ensuring services meet defined reliability targets and that the team is effective, engaged, and aligned with product and business goals. The SRE Manager partners closely with Engineering and Product to drive reliability engineering practices, incident response, observability, and continuous improvement across the production environment. This is a hands-on role. In addition to leading and developing the team, the SRE Manager is expected to contribute as an individual contributor by writing code and automation, building tooling, participating in on-call, and working directly in production systems alongside the team. What You'll Do - Lead and manage a team of Site Reliability Engineers responsible for the reliability, performance, and operational health of production systems - Serve as a hands-on technical contributor by writing code and automation, building reliability tooling, participating in on-call, and working directly in production systems alongside the team - Support career growth and development of team members through coaching, mentoring, and performance management - Define, measure, and drive Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets in partnership with engineering and product teams - Own incident response, including on-call rotations, escalation processes, severity management, and blameless postmortems - Drive continuous improvement in monitoring, observability, alerting, and on-call practices to reduce toil and mean-time-to-recovery - Lead the adoption of AI and automation across SRE practices, including AI-assisted incident response, intelligent alerting, automated remediation, and the use of AI tooling to reduce toil and accelerate operational workflows - Partner with Engineering to refine our products to better support agentic AI development, including improving APIs, telemetry, environments, and platform capabilities that enable AI agents to safely build on and operate against our systems - Drive cloud cost optimization and FinOps practices in partnership with Engineering, including vendor management, cost allocation, rightsizing, and engineering best practices that reduce cloud spend - Partner with Engineering on operational readiness reviews, production change management, and release safety - Champion reliability best practices and ensure they are embedded across the engineering organization - Track and report on key reliability metrics, incident trends, and team health to leadership - Stay current with emerging SRE practices, tooling, and industry standards What We're Looking For: - Proven experience leading SRE, operations, or reliability-focused engineering teams in a production software environment - Willingness and ability to operate as a hands-on individual contributor in addition to managing the team, including writing code, building automation, and participating in on-call - Strong understanding of SRE principles, including SLOs/SLIs, error budgets, and blameless postmortems - Hands-on background in incident response, on-call management, and production troubleshooting - Experience with modern observability practices, including metrics, logging, tracing, and alerting - Demonstrated experience applying AI and automation to reliability work, including using AI-assisted tooling, building automated remediation, and leading the adoption of AI-driven practices on a team - Solid grasp of distributed systems, cloud infrastructure, and the operational characteristics of web-scale applications - Strong leadership, coaching, and team development skills - Excellent communication skills, including the ability to lead through high-pressure incidents and communicate clearly with technical and non-technical stakeholders - Strong analytical and problem-solving abilities - Ability to work across teams and influence at multiple levels of the organization Preferred Qualifications - Bachelor's degree in Computer Science, a related field, or equivalent professional experience - 10+ years of experience in software engineering, systems engineering, DevOps, or site reliability engineering - 3+ years of experience in a technical leadership, team lead, Lead, or Principal role - Previous experience as an SRE Manager, Lead SRE, Principal DevOps/SRE, Operations Manager, or similar leadership role - Strong experience with Microsoft Azure; additional experience with AWS or Google Cloud Platform a plus - Experience with Microsoft technologies (.NET, C#, SQL Server) in a production environment - Experience with container orchestration (Kubernetes, AKS, or EKS) and tools such as Helm or Argo - Experience with observability platforms (e.g., Datadog, ELK, Grafana, OpenTelemetry, Azure Monitor) - Experience with infrastructure-as-code (e.g., Bicep, Terraform, CloudFormation) and modern CI/CD pipelines (e.g., Azure DevOps, GitHub Actions) - Experience with cloud cost optimization and FinOps practices - Familiarity with incident management and ITSM tooling (e.g., PagerDuty, Opsgenie, ServiceNow) - Hands-on experience with AI-assisted engineering tools (e.g., coding copilots, LLM-powered runbooks or agents) and automation platforms used in production operations - Microsoft Azure certifications (e.g., AZ-305 Solutions Architect Expert, AZ-400 DevOps Engineer Expert) a plus Salary Range: $200,000-$220,000 DOE What We Offer - Competitive salary and benefits package. - 401(k) with Company match - Flexible PTO and Company-paid holidays - Remote Work - Opportunities for professional growth and development. - A collaborative and innovative work environment. About the Company A rapidly growing SaaS company, Arcoro offers proven modular HR solutions for the construction and contracting industries. Our product suite and software platform provide end-to-end HR functionality to help drive business outcomes, enabling companies to better manage the entire employee lifecycle through improved candidate quality and flow, shortened time to hire, centralized learning and improved employee productivity. Our HR solutions integrate with top construction ERP systems further positioning Arcoro as a leader in proven modular HR solutions. With Arcoro's flexible solutions, customers select the modules that meet their needs for talent acquisition, talent management, core HR, benefits administration, time and attendance tracking and more. Arcoro has over 7000 customers across North America. Arcoro is a Fair and Equal Opportunity Employer Arcoro is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.


