Job Closed

This listing is no longer active.

ICF logo
ICF

We are not a typical consulting firm and our people are not typical consultants.

Site Reliability Engineer - Remote

DevOps EngineerDevOps EngineerOtherRemoteMid LevelTeam 5,001-10,000Since 1969H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

90 days ago

Salary

$108K - $184K / year

Seniority

Mid Level

No structured requirement data.

Job Description

Site Reliability Engineer - Remote

ICF

Description ICF is a mission-driven company filled with people who care deeply about improving the lives of others and making the world a better place. Our core values include Embracing Difference; we seek candidates who are passionate about building a culture that encourages, embraces, and hires dimensions of difference. Our Health Engineering Solutions (HES) team works side by side with customers to articulate a vision for success, and then make it happen. We know success doesn't happen by accident. It takes the right team of people, working together on the right solutions for the customer. We are looking for a seasoned SRE to establish a culture of improvement in observability and reliability. You will work closely with software engineering teams to ensure that applications, databases, pipelines and APIs run reliably. You will be expected to create, set, and exceed service level objectives as key indicators of application health. You will be working on a mission critical software program whose goal is to support the ecosystem of Centers for Medicare & Medicaid Services (CMS). Our core work hours are 10am - 4pm Eastern Time with the option to start earlier or work later depending on your time zone. Key Responsibilities: - Define and maintain SLIs, SLOs, and SLAs for the Internet-based Quality Improvement and Evaluation System (iQIES) application. - Performance tuning that will model load scenarios, forecasting capacity, and optimize scaling strategies - Design and optimize the observability stack through New Relic, CloudWatch, and Jenkins CI/CD pipelines - Participate in root cause analysis for operational issues and improve incident response process - Participate in creating, monitoring, and optimizing actionable alerts to respond to issues in a timely manner - Develop tools and scripts - Develop and maintain Jenkins CI/CD pipelines, using declarative Jenkinsfiles and foundational Groovy for pipeline logic and enhancements - Deploy services to Fargate, EKS, Lambda, Airflow, Databases - Manage security groups and access controls. Thoroughly understand fundamentals like security groups, IAM, managing RDS - Apply patch management and hardening practices - Align with DevOps and Technical Leads to ensure overall strategy - Actively participate in releases and product launches with expectation of being online during release windows Required Qualifications - 5+ years experience in a software development environment and a Bachelor’s degree; OR 3+ years experience in a software development environment and a Master’s degree - 5+ years supporting a high‑availability production environment (cloud or on‑prem) - 3+ years of working in a SRE role in a large scale cloud implementing high availability and scalability - 3+ years of experience focused on SRE, DevOps, or Platform Engineering - Must be able to obtain and maintain a public trust clearance - Candidate must reside in the US, be authorized to work in the US, and work must be performed in the US - Must have lived in the US 3 full years out of the last 5 years Preferred Qualifications - Previous work in a regulated healthcare or federal agency environment - Full stack web development experience - Expert in deployment techniques to minimize down-time like Blue-Green, Canary, A/B testing approaches, and zero downtime deployments - Understanding of security groups and access controls - Experience with Atlassian tooling such as Jira and Confluence  Professional Skills and Tools: - Cloud platform experience with AWS - Observability: CloudWatch, New Relic or similar - Infrastructure: Kubernetes, Docker - IaC: Terraform - CI/CD: Git, Jenkins or GitHub Actions - Database: SQL relational database - Docker: Thorough understanding of Docker and Docker Compose. Understand best practices, caching, volume mounts, etc - Highly effective analytical, problem-solving, and decision-making capabilities. - Strong written and verbal communication skills  - Ability to clearly articulate and communicate complex technical ideas to non-SRE colleagues.  - Ability to understand project requirements and be innovative in finding solutions in highly regulated government environments.  - Flexibility and the ability to accept a change in priorities as necessary.  - Demonstrated time management skills.  - Strong organizational skills with attention to detail.  Job Location: This position requires that the job be performed in the United States. If you accept this position, you should note that ICF does monitor employee work locations and blocks access from foreign locations/foreign IP addresses, and also prohibits personal VPN connections. Working at ICF ICF is a global advisory and technology services provider, but we’re not your typical consultants. We combine unmatched expertise with cutting-edge technology to help clients solve their most complex challenges, navigate change, and shape the future. We can only solve the world's toughest challenges by building a workplace that allows everyone to thrive. We are an equal opportunity employer. Together, our employees are empowered to share their expertise and collaborate with others to achieve personal and professional goals. For more information, please read our EEO policy. We will consider for employment qualified applicants with arrest and conviction records. Reasonable Accommodations are available, including, but not limited to, for disabled veterans, individuals with disabilities, and individuals with sincerely held religious beliefs, in all phases of the application and employment process. To request an accommodation, please email Candidateaccommodation@icf.com and we will be happy to assist. All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.  Read more about workplace discrimination rights or our benefit offerings which are included in the Transparency in (Benefits) Coverage Act. Candidate AI Usage Policy At ICF, we are committed to ensuring a fair interview process for all candidates based on their own skills and knowledge. As part of this commitment, the use of artificial intelligence (AI) tools to generate or assist with responses during interviews (whether in-person or virtual) is not permitted. This policy is in place to maintain the integrity and authenticity of the interview process.  However, we understand that some candidates may require accommodation that involves the use of AI. If such an accommodation is needed, candidates are instructed to contact us in advance at candidateaccommodation@icf.com. We are dedicated to providing the necessary support to ensure that all candidates have an equal opportunity to succeed.   Pay Range - There are multiple factors that are considered in determining final pay for a position, including, but not limited to, relevant work experience, skills, certifications and competencies that align to the specified role, geographic location, education and certifications as well as contract provisions regarding labor categories that are specific to the position. The pay range for this position based on full-time employment is: $108,476.00 - $184,409.00 Nationwide Remote Office (US99)

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Five9 logo

Senior DevOps Engineer – Compute Platforms

Five9

Helping Companies Bring Joy to CX.

DevOps Engineer90 days ago
OtherRemoteTeam 1,001-5,000Since 2001H1B Sponsor

• Operate and support enterprise compute platforms across hardware, OS, virtualization, and container orchestration layers • Deploy and maintain bare metal server infrastructure for Ubuntu OS with Kubernetes and hypervisors including Openstack & Harvester • Implement and maintain PXE-based provisioning environments leveraging Redfish APIs for large-scale server deployments • Install, patch, and maintain operating systems including Ubuntu and Harvester • Operate and support virtualization and private cloud platforms, including KVM on Ubuntu, OpenStack environments and Harvester HCI • Develop Infrastructure-as-Code using Ansible, Terraform, Helm and Git, with Python/Bash automation. • Implement CI/CD pipelines for infrastructure updates, patching, upgrades, testing, and rollback. • Perform firmware updates, patch management, and hardware health validation • Monitor system performance, capacity, and availability; proactively address reliability risks • Troubleshoot complex cross-stack issues spanning hardware, OS, virtualization, OpenStack, and Kubernetes • Manage to SLAs, KPIs and error budgets • Participate in on-call escalation support for complex platform-related issues • Collaborate globally on change management, documentation, and operational best practices. • Develop and maintain runbooks, operational procedures, and technical documentation

United States
$82.3K - $228.8K / year
Job Closed
Workstate logo

Senior DevOps Engineer (Bazel) - US Remote

Workstate

We believe that every great idea deserves to become reality.

DevOps Engineer90 days ago
OtherRemoteTeam 51-200Since 2003H1B No Sponsor

Workstate is seeking a DevOps Engineer with deep experience designing, implementing, and operating deployment pipelines built around Bazel. This role focuses on Bazel as part of a build, packaging, artifact, and release platform, including reproducibility, remote caching/execution, CI/CD integration, dependency management, environment promotion, and operational reliability. In addition to Bazel expertise, strong experience with AWS, Terraform, and modern DevOps practices is highly valued. Residents of the United States with the right to work without need for visa sponsorship are eligible for this role.

United States
Job Closed
Katalyst Space Technologies logo

Senior Software Engineer, DevOps - SDA

Katalyst Space Technologies

At Katalyst, our work on projects involving the U.S. Department of Defense requires adherence to International Traffic in Arms Regulations (ITAR), 22 C.F.R. Parts 120-130, which requires compliance with U.S. export laws before allowing employees to perform certain positions. Currently, our available roles necessitate access to ITAR-controlled information, and as a result, Katalyst would have to ensure any non-US person is authorized access to ITAR information before the commencement of employment. We are committed to equal employment opportunities and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin. *remote work will require an exception from the hiring team

DevOps Engineer90 days ago
OtherRemoteTeam 11-50

Katalyst Description: We build robotic spacecraft that enable dynamic space operations, creating a future where maneuvering, upgrading, refueling, and exploration are as routine as they are on Earth. Getting to space and overcoming the gravity well is only half the story. What you do there is the future. We are not building the trains. We are building the machines that lay down the tracks. By developing the foundational capabilities that make sustained, responsive operations possible, we enable a new era of space activity that strengthens national security and ensures freedom of action in an increasingly contested domain. Katalyst Culture: Working at Katalyst is intense, hands-on, and deeply rewarding. You’ll take real ownership and see your work move from concept to operations in an environment where the problems are hard and the impact is real. We value humility, craftsmanship, and doing things the right way, even when it’s harder. You’ll work closely with thoughtful, driven teammates who push each other to do their best. What You'll Do - Manage, provision, and help guide architectural decisions to support SDA applications running in AWS - Work with cross-functional teams to help efficiently implement and deploy SDA algorithms - Manage networking infrastructure including VPCs, load balancers, and firewalls in AWS - Lead internal company infrastructure initiatives to improve Developer Experience (DevEx) including efforts like managing common shared resources such as repository managers, secrets managers, etc. - Design and optimize CI/CD pipelines (Gitlab CI) - Monitor spending across different AWS Orgs and provide common tagging strategies to track spending across the company - Establish comprehensive observability solutions and best practices (We use the LGTM stack) - Mentor and help develop other less experienced engineers What We Value: - Strong ownership mentality – you’ll lead technical execution and ensure your features make it from development to production - Accountable – The team knows they’ll be able to depend on you - Curiosity – We are solving the hardest problems in space and want to hear everyone’s voice to ensure we’re always innovating - Leadership – You will be on a small team and will be empowered to help shape the culture and influence technical direction Your Ideal Background: Required: - 3+ years of full-time software engineering experience - 2+ years of DevOps experience - Comfortable working in a small, fast-paced Agile environment - Strong communication skills and ability to execute under uncertainty - Proven track owning features end-to-end from initial requirements, design, to application development and deployment - Strong proficiency in containerization and deployment technologies like Terraform, Docker, Kubernetes, etc. - Strong knowledge of software engineering best practices and CI/CD tooling - Eligible to obtain and maintain an active U.S. security clearance Nice to Have: - Knowledge of security best practices for DevOps and MLOps - Familiarity with MLOps concepts - Familiarity with Kubernetes and deployments into air-grapped environments Additional Requirements: Must be willing to work extended hours and weekends as needed. Compensation and Benefits: Your base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience. The anticipated salary range for this role is $120,000 - $160,000 annually. Base salary is just one part of your total rewards package at Katalyst. You will also be eligible for long-term incentives, in the form of the Employee Stock Option and Equity Plan, as well as a relocation bonus and other discretionary bonuses. You will also receive access to comprehensive medical, vision, and dental coverage, and unlimited Paid Time Off. At Katalyst our work on projects involving the U.S. Department of Defense requires adherence to International Traffic in Arms Regulations (ITAR), 22 C.F.R. Parts 120-130, which requires compliance with U.S. export laws before allowing employees to perform certain positions. Currently, our available roles necessitate access to ITAR-controlled information, and as a result, Katalyst would have to ensure any non-US person is authorized access to ITAR information before the commencement of employment. We are committed to equal employment opportunities and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin. *remote work will require an exception from the hiring team

United States
$120K - $160K / year
Job Closed
Darede logo

Senior DevOps Analyst

Darede

Tecnologia e Inovação para revolucionar seu negócio

DevOps Engineer90 days ago
Full TimeRemoteTeam 201-500Since 2013H1B No Sponsor

• Liderar o desenho, implementação e evolução da infraestrutura como código, com forte foco em Terraform e Terragrunt, definindo padrões, módulos reutilizáveis e boas práticas para todo o time. • Desenhar e implementar arquiteturas de pipelines de CI/CD robustas, escaláveis e seguras, garantindo qualidade, rastreabilidade e governança de deploys em múltiplos ambientes. • Conduzir iniciativas de modernização e migração de pipelines legados, propondo novas abordagens, ferramentas e fluxos que reduzam tempo de entrega e aumentem confiabilidade. • Atuar como referência técnica em containerização e orquestração, utilizando Docker e Kubernetes (preferencialmente em AWS/EKS), incluindo observabilidade, segurança, autoscaling e estratégias de deploy (blue/green, canary, etc.). • Apoiar e orientar times de desenvolvimento na adoção de arquitetura orientada a eventos e APIs, garantindo boas práticas de integração, resiliência e monitoramento. • Propor e implementar melhorias contínuas em monitoramento, logging, métricas e alertas, construindo um ambiente observável e confiável. • Desenvolver e manter scripts e ferramentas internas (principalmente em Bash e/ou Python) para automação de tarefas recorrentes, integrações entre sistemas e verificação de conformidade de infraestrutura. • Atuar como guardião de padrões de DevOps, disseminando práticas de automação, segurança, observabilidade, custo otimizado e confiabilidade ao longo de todo o ciclo de vida das aplicações. • Realizar revisões técnicas (code review de Terraform, pipelines, scripts) e apoiar na resolução de incidentes complexos, atuando em análises de causa raiz e melhoria permanente. • Colaborar com arquitetura, segurança e desenvolvimento na definição de roadmaps de plataforma, trazendo visão técnica de longo prazo para a evolução do ambiente.

Brazil
Job Closed