A business unit of General Dynamics, General Dynamics Information Technology (GDIT) supports some of the United States' most complex government, defense, and in
Reliability Engineer
Location
United States
Posted
6 days ago
Salary
$111.2K - $150.4K / year
Seniority
Senior
Job Description
Reliability Engineer
General Dynamics
• Design, build, and maintain scalable, reliable infrastructure and services that support Hosting Services, Site Reliability Engineering, virtualization, and data center operations for a federal customer. • Collaborate closely with software developers, infrastructure engineers, and IT operations teams to plan and execute deployments, improve system architectures, and enhance service reliability. • Use automation and scripting (e.g., Python, Bash) to reduce manual work, streamline deployments, and improve consistency across environments. • Monitor system performance, availability, and capacity using modern tooling; proactively identify issues and participate in on-call support to restore services quickly when incidents occur. • Implement and support Continuous Integration/Continuous Delivery (CI/CD) pipelines using tools such as Jenkins, Git, and Terraform to enable reliable and repeatable releases. • Leverage containerization and orchestration technologies such as Docker and Kubernetes to build resilient, scalable platforms. • Work with databases (e.g., SQL, MySQL) and application stacks (e.g., Java-based services) to ensure data integrity, performance, and fault tolerance. • Partner with cross-functional teams, using Jira and other collaboration tools, to track work, communicate status, and drive continuous improvement in reliability and operational excellence. • Contribute to a culture of teamwork and collaboration by sharing knowledge, participating in post-incident reviews, and helping define best practices for reliability engineering.
Job Requirements
- 5+ years of related experience in Site Reliability Engineering, DevOps, systems engineering, or software engineering roles
- Experience with deployments and production operations in Linux-based environments
- Proficiency with scripting/coding (e.g., Python, Java, shell scripting)
- Hands-on experience with AWS or other cloud platforms
- Strong Linux administration skills
- Experience with SQL/MySQL and database concepts
- Containerization and orchestration (Docker, Kubernetes)
- CI/CD and automation tools (Jenkins, Git, Terraform, Ansible)
- Experience with Infrastructure as Code ( IaC ) and automated configuration management
- Must have a BA/BS or equivalent
Benefits
- Comprehensive benefits and wellness packages
- 401K with company match
- Paid time off
- Full flex work weeks where possible
- Variety of paid time off plans including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave
- 15 days of paid leave per calendar year
- Additional 10 paid holidays per year
- Short and long-term disability benefits
- Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• The DevOps Engineer works as an integrated part of a software engineering team • Facilitating and building automated pipelines for continuous delivery of the team's deliverables with guaranteed production-level quality and supportability • Defining and developing continuous integration and deployment pipelines • Building Infrastructure as Code • Coordinating build and release activities with other stakeholders • Managing day to day operations of release pipelines, build tools, and source control software and resources • Troubleshooting & responding to downtime, performance degradation and outside attacks • Performing ongoing maintenance and upgrades of DevOps systems • Identifying, researching, and prototyping new technologies to improve DevOps processes
• The DevOps Engineer is a hybrid, senior-level role sitting at the intersection of operational reliability and software delivery automation. You will function as an integrated part of a cross-functional engineering team, combining the proactive service management mindset of an Application Operations Engineer with the automation-first philosophy of a DevOps practitioner. • You will be responsible for keeping production environments healthy and performant, while simultaneously designing and maintaining the CI/CD pipelines, infrastructure-as-code frameworks, and tooling that enable rapid, high-quality software delivery. You are the connective tissue between engineering, platform, and operations — someone who is equally comfortable in an incident bridge call and a sprint planning meeting. • Design, build, and maintain continuous integration and continuous delivery (CI/CD) pipelines for rapid, quality-assured deployment of software deliverables. • Build and manage Infrastructure as Code (IaC) using tools such as CloudFormation, Ansible, Terraform, Chef, or Puppet. • Manage day-to-day operations of release pipelines, build tools, artifact repositories, and source control systems. • Coordinate build and release activities with engineering, QA, product, and other stakeholders across the organisation. • Identify, research, and prototype new technologies and practices to continuously improve DevOps processes and team efficiency.
SRE Specialist
SeazoneSobre a contratação: Contrato no modelo PJ Trabalho remoto Descanso remunerado de 15 dias a cada 6 meses de contrato
Role Description Fazer parte dessa equipe significa que você ajudará a compor o time da maior empresa nacional parceira do Airbnb e uma das Top 20 Startups do LinkedIn, somando ao nosso propósito de trazer inovação e tecnologia para o mercado de aluguel por temporada com o objetivo de maximizar a rentabilidade de clientes proprietários e investidores, além de garantir uma excelente experiência ao hóspede que escolhe os imóveis Seazone como o seu destino fora de casa. O time de SRE/Plataforma é a base de tudo que roda na Seazone. São 3 pessoas responsáveis por dois clusters Kubernetes (EKS na AWS e GKE no GCP), GitOps com ArgoCD, observabilidade completa, FinOps e a infraestrutura de IA da empresa — LiteLLM Hub, MCP Hub e AI Portal. Atendemos todas as verticais de engenharia e operamos com autonomia alta e ritmo intenso. O que você vai fazer: - Operar e evoluir clusters Kubernetes (EKS/GKE) com GitOps via ArgoCD - Responder a incidentes de produção, conduzir troubleshooting e post-mortems - Manter e evoluir a stack de observabilidade: Prometheus, Grafana, Loki, Alertmanager, Uptime Kuma - Executar o roadmap de FinOps: rightsizing, tagging e governança de custos AWS/GCP e consumo de LLM - Conduzir migrações em andamento: Coolify → Amplify/EKS e desligamento do Vault - Automatizar operações com IA (agentes de incidente, Claude Code, pipelines CI/CD no GitHub Actions) - Sustentar a plataforma de IA interna: LiteLLM Hub, MCP Hub, AI Portal - Infraestrutura como código com Terraform/Terragrunt e governança de secrets/segurança Qualifications - Graduação em Ciência da Computação, Engenharia de Software ou áreas correlatas é desejável - Certificações AWS e CKA são diferenciais Requirements Obrigatório: - AWS (EKS, IAM, CloudWatch, billing) e Kubernetes em produção com responsabilidade direta por incidentes - Terraform, ArgoCD/GitOps, Docker, GitHub Actions - Stack de observabilidade: Prometheus, Grafana, Loki - Linux, Python e Bash - Mínimo 5 anos em infraestrutura/cloud, sendo pelo menos 3 anos operando Kubernetes em produção - Uso de IA como ferramenta de trabalho no dia a dia — sem exceção - Disponibilidade para atuação em incidentes fora do horário comercial quando necessário Comportamental: - Autonomia real — aqui não tem mão na mão - Comunicação clara em situações de incidente - Disciplina de documentação Benefits - Auxílio alimentação - Ifood Benefícios no valor de R$ 462,00 - Plano de saúde e Odontológico após 6 meses como PJ na Seazone - Avaliações semestrais com a perspectiva de crescimento - Cultura de Feedback - Day-off de aniversário - Afastamento remunerado de maternidade/paternidade - Bônus por indicação - Descontos em hospedagens pelo nosso site de reservas - Hospedagem em Florianópolis (SC) para o evento de comemoração do aniversário da Seazone - Escritório disponível para coworking em Florianópolis (SC) Parcerias: - Parceria com a Gympass - Parceria com Escolas de Idiomas - Parceria com a ACATE - descontos em mensalidades universitárias, entre outras vantagens - Parceria com a Conexa Saúde e Psicologia Viva - Telemedicina 24h - Parceria com a Guapeco - Plano de Saúde Pet Company Description Contrato no modelo PJ; Trabalho remoto; Descanso remunerado de 15 dias a cada 6 meses de contrato.
Role Description The purpose of the role is to drive the execution, standardisation, and continuous improvement of DevOps and DevSecOps practices across the software development lifecycle. The role balances development, deployment, automation, and operational reliability to enable secure, scalable, and resilient technology platforms. Operating as a senior individual contributor and functional expert, the role defines and embeds DevOps standards, tooling, and automated deployment patterns, while supporting high‑availability, cloud‑native platforms. The role works closely with development, architecture, security, and operations teams to ensure efficient delivery, operational excellence, and platform stability, and plays a key role in enabling modern engineering practices across the Group. Qualifications - Degree or Diploma in Computer Science, Software Engineering, Information Systems, or a related field - (essential). - Kubernetes certifications (CKA, CKAD, or CKS) - (beneficial). - AWS or Azure DevOps / Cloud certifications - (beneficial). - 2-3 years’ experience in a senior DevOps, DevSecOps, or DevOps‑focused programming or similar role with exposure to software development and IT operations - (essential). - Extensive experience with DevOps and SDLC tools, processes, and methodologies - (essential). - Strong experience implementing and managing CI/CD pipelines using enterprise DevOps platforms - (essential). - Experience with infrastructure‑as‑code tools (e.g. Terraform, CloudFormation) - (essential). - Experience working in cloud environments (AWS and/or Azure) - (essential). - Proficiency in scripting languages such as YAML, Bash, PowerShell, or Python - (essential). - Experience with Agile and/or Waterfall delivery methodologies - (essential). - Hands‑on experience with container orchestration technologies (Docker, Kubernetes, EKS, AKS) - (beneficial). - Strong Linux systems administration and networking fundamentals (DNS, TLS, load balancing) - (beneficial). - Experience implementing monitoring, logging, and alerting solutions in production environments - (beneficial). - Exposure to cloud cost optimisation and capacity planning practices - (beneficial). Requirements - Define, implement, and mature DevOps and DevSecOps best practices, standards, and automated deployment patterns across the software delivery lifecycle. - Design, build, and maintain CI/CD pipelines using enterprise DevOps platforms (e.g. Azure DevOps, AWS CodePipeline, GitHub, Bitbucket). - Create and maintain reusable deployment templates and scripts for cloud and on‑premise environments. - Continuously monitor and improve DevOps processes to reduce risk, minimise manual intervention, and improve reliability. - Design, provision, and support scalable and resilient cloud infrastructure using infrastructure‑as‑code tools (e.g. Terraform, CloudFormation, Ansible). - Support containerised and microservices‑based architectures using Docker and Kubernetes, including deployment, scaling, and operational support. - Contribute to cloud cost optimisation, capacity planning, and performance improvements. - Support disaster recovery, business continuity, and platform resilience initiatives. - Integrate automated testing, quality gates, and security controls into CI/CD pipelines. - Collaborate with security teams to embed DevSecOps controls, vulnerability scanning, and compliance requirements into delivery pipelines. - Support development and QA teams with pipeline integrations, test automation enablement, and deployment best practices. - Review and validate code, pipeline configurations, and infrastructure definitions to ensure quality and compliance. - Support operational teams with incident management, troubleshooting, and root‑cause analysis related to DevOps platforms and pipelines. - Contribute to monitoring, logging, and alerting practices to improve system observability and production stability. - Participate in operational readiness, post‑incident reviews, and continuous improvement activities. - Act as a technical reference for DevOps practices, tooling, and deployment approaches. - Coach, mentor, and support developers, DevOps engineers, QA engineers, and platform teams on DevOps and DevSecOps principles. - Develop and maintain DevOps documentation, runbooks, standards, and training materials. - Facilitate workshops and maturity assessments to identify DevOps capability gaps and recommend improvement roadmaps. Benefits - Flexible working environment. - Opportunities for professional development and certifications. - Health and wellness programs. - Competitive salary and performance bonuses.

