Job Closed
This listing is no longer active.
SRE & DevOps Engineer (Kubernetes Specialist)
Location
United States, Canada
Posted
70 days ago
Salary
0
Seniority
Mid Level
Job Description
Nubiral
Role Description
As an SRE at Nubiral, you will not just manage infrastructure; you will be the architect of reliability. Your goal will be to minimize toil through automation and to maximize service availability through data-driven management and continuous improvement.
Reliability and Performance Engineering
- Service Level Management: Define, implement, and measure SLOs, SLAs, and error budgets to ensure that the end-user experience meets the company's quality standards.
- Resilience Design: Design infrastructure patterns that support autoscaling, fault tolerance, and high availability in critical environments.
- Operational Efficiency: Take an active part in optimizing the architecture, managing system capacity and controlling cloud costs (FinOps).
Operations and Observability
- Monitoring and Diagnosis: Monitor system health using advanced observability tools for proactive anomaly detection.
- Incident Management: Lead the incident lifecycle, from initial mitigation through blameless post-mortems, ensuring that every failure becomes a learning opportunity.
- Remediation Automation: Develop scripts and tools for system self-healing and for reducing repetitive manual tasks.
Strategic Collaboration
- DevOps Culture: Support product teams in adopting good practices for reliability, performance, and security (DevSecOps).
- Technical Synergy: Work closely with the Development, Architecture, and Security teams to enable continuous, secure deployments in Kubernetes environments.
Qualifications
- Advanced experience in the orchestration, deployment, and administration of clusters (EKS, AKS, GKE, or on-premises implementations).
- Solid knowledge of at least two of the following providers: AWS, Azure, Google Cloud (Gemini), or Oracle Cloud.
- Proficiency with Infrastructure as Code tools (Terraform, Pulumi) and continuous integration/delivery pipelines (GitLab CI, GitHub Actions, Jenkins).
- Experience with monitoring stacks such as Prometheus, Grafana, ELK, or Datadog.
- Fluency in scripting or programming languages (Python, Go, or Bash).
Benefits
- Time for you: you get Nubiral days to enjoy however you like.
- Birthday off.
- Pet adoption leave.
- Continuous learning: relevant certifications covered.
- Support at important moments for your family: a gift when a child is born, and a day off on your children's birthdays.
- Inclusion, innovation, and teamwork are the foundation of our success.
More DevOps Engineer Jobs
DevOps Engineer – Cloud Migration, Bare Metal
Vesputi GmbH
VESPUTI is one of Europe's leading B2B tech providers for sustainable mobility solutions.
- Support and drive our migration from GKE to self-hosted bare-metal servers.
- Provision and manage infrastructure using Ansible, Kamal, and Docker Swarm.
- Optimize containerized services for performance, scalability, and security.
- Ensure high availability and reliability of our Ruby on Rails microservices.
- Monitor, troubleshoot, and improve our PostgreSQL and Redis setups.
- Automate processes to streamline deployments and system maintenance.
Senior Software Engineer – DevOps, Infrastructure
Cornelis Networks
Cornelis Networks, founded in 2019, is a leader in high-performance networking solutions, specializing in purpose-built fabrics that enhance scalability, effici
- Design, build, and maintain CI/CD pipelines using GitHub Actions
- Integrate HPC/AI workload and Cornelis software build and test stages into CI pipelines
- Interface with SW teams to validate and test new packages
- Collaborate with developers to help debug, interpret, and communicate testing results
- Monitor pipeline health, troubleshoot failures, and continuously improve build reliability and speed
- Administer and maintain onsite Linux-based development and test servers
- Perform OS provisioning, patching, and lifecycle management
- Manage hardware and software stacks for both CPU and GPU systems
- Maintain driver and package compatibility matrices
AWS Cloud Site Reliability Engineer
Peraton Corporation
Peraton Corporation, a national security company headquartered in Herndon, Virginia, supplies solutions for mission-critical programs and systems. Founded in 2017, Peraton's missio
Role Description
We are seeking an experienced and motivated AWS Cloud Site Reliability Engineer (SRE) to join our dynamic team. As an AWS Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure on Amazon Web Services (AWS). The ideal candidate will have a strong background in AWS services, a deep understanding of infrastructure as code, and a passion for implementing best practices in site reliability engineering. The AWS Site Reliability Engineer (SRE) will collaborate closely with cross-functional teams, including development, quality assurance, and operations, to ensure seamless software releases and continuous improvement of our release processes.
Infrastructure Automation:
- Design, implement, and manage infrastructure as code (IaC) solutions using tools like AWS CloudFormation, Terraform or Helm Charts to automate deployment and scaling processes.
- Collaborate with development teams to integrate continuous deployment practices and ensure the reliability of applications.
Monitoring and Alerting:
- Implement robust monitoring and alerting systems to proactively identify and address potential issues before they impact system performance.
- Analyze system metrics, logs, and alerts to troubleshoot and resolve issues promptly.
Performance Optimization:
- Conduct performance analysis and optimization of AWS infrastructure components to enhance system efficiency and reduce latency.
- Identify and implement improvements to enhance system reliability and resilience.
Incident Response:
- Participate in on-call rotations to respond to and resolve incidents promptly.
- Conduct post-incident reviews to identify root causes and implement preventive measures.
Security and Compliance:
- Work closely with security teams to implement and enforce best practices for securing AWS environments.
- Ensure compliance with industry standards and regulations related to cloud infrastructure.
Communication:
- Facilitate clear communication across teams, providing updates on release status, known issues, and any potential impact on stakeholders.
- Coordinate communication of release schedules and changes to all relevant parties.
Release Planning and Coordination:
- Collaborate with development, QA, and operations teams to plan and coordinate software releases.
- Define release scope, schedule, and dependencies to ensure timely and smooth deployments.
- Create and submit change records as required for process and audit compliance.
- Participate in Technical Change Advisory and Review boards as required.
Release Automation:
- Develop and maintain automated deployment pipelines using industry-standard tools such as AWS CI/CD, GitLab CI/CD, Jenkins or similar.
- Automate and streamline release processes to improve efficiency and reduce manual errors.
Continuous Improvement:
- Proactively identify areas for process improvement within the release management lifecycle.
- Implement feedback loops to capture lessons learned from each release and apply improvements iteratively.
- Stay up to date with industry best practices, emerging technologies, and trends related to release management and reliability engineering.
Quality Assurance:
- Collaborate with QA teams to establish and execute release validation procedures.
- Ensure releases are thoroughly tested and meet quality standards before deployment.
- Drive continuous improvement by analyzing release management trends, identifying recurring issues, and working with teams to implement solutions.
Qualifications
- Bachelor's degree and 8 years of experience or 12 years with a HS Diploma.
- Proven experience as a Site Reliability Engineer or similar role.
- In-depth knowledge of AWS services and expertise in managing cloud infrastructure.
- Advanced level programming and/or scripting in 3 or more of the following languages: Python, Ansible, Helm, Playwright, Bash, JavaScript, Terraform, and Java.
- Strong understanding of DevOps principles and continuous integration/continuous deployment (CI/CD) pipelines.
- Proficiency in CI/CD tools such as AWS CI/CD, GitLab CI/CD, or others.
- Familiarity with infrastructure as code (IaC) tools like Terraform, CloudFormation, Helm Charts, or similar technologies.
- Experience creating standards and policies for Configuration Management and leveraging Ansible to automate configuration.
- Hands-on experience with version control systems (GitLab, AWS CodeCommit, SVN) and branching strategies.
- Experience with containerization and orchestration tools (e.g., Amazon Elastic Container Service (ECS), Amazon Elastic Kubernetes Service (EKS), Docker, Kubernetes).
- Familiarity with monitoring tools (e.g., Datadog, CloudWatch, Prometheus, Grafana, DynaTrace) and log analysis.
- Attention to detail, with a focus on maintaining high-quality software releases.
- Solid understanding of Agile methodologies and their application in release management.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
- Must be a US Citizen.
- Must be able to obtain and maintain the required agency clearance.
Preferred Qualifications
- Relevant certifications in DevOps or related fields are a plus.
- High Risk Public Trust or Secret Clearance preferred.
- 3 or more years in SRE or Platform Engineering group for high availability/critical platforms/applications.
- Experience managing a distributed container platform including but not limited to deployment/release management, provisioning, capacity management, workload management.
Company Description
Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can’t be done by solving the most daunting challenges facing our customers.
Target Salary Range
$104,000 - $166,000. This represents the typical salary range for this position. Salary is determined by various factors, including but not limited to, the scope and responsibilities of the position, the individual’s experience, education, knowledge, skills, and competencies, as well as geographic location and business and contract considerations. Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay.
EEO
Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.
Job Summary
At Fuze Health, we put patients first and tirelessly address the most pressing needs in healthcare. We empower millions to digitally connect with care providers, essential health resources and needed treatments – and enable care providers, employers, health plans and life sciences companies to meaningfully enhance quality, outcomes and value. We are dedicated to helping our partners evolve and modernize to meet emerging patient and marketplace needs. Fuze Health’s foundation is built upon the strategic combination of several proven, technology-powered innovators in the digital health, diagnostics, and pharmacy sectors. Our growing portfolio brings together the capabilities of industry leaders including LetsGetChecked, Truepill, and Alto Pharmacy, to create a distinctive, unified force in healthcare. Together, we have the shared vision, advanced capabilities and talented teams to deliver next-generation solutions that patients and healthcare partners need today and into the future.
Job Description
Alto Pharmacy (Fuze Health) is seeking a Staff DevSecOps Engineer to join our Engineering organization. As a full-service pharmacy operating nationally across mail-order and physical pharmacy locations, we build and operate highly reliable, secure, and compliant systems that directly impact patient health and safety. In this role, you will operate as a senior technical leader responsible for embedding security deeply into our engineering lifecycle. You will define DevSecOps strategy, elevate our cloud and application security posture, and partner cross-functionally to ensure Alto’s platform is secure, scalable, compliant, and resilient as we grow nationwide. This is a hands-on technical leadership role for someone who thrives in complex, regulated environments and wants to shape security architecture at scale.
Key Responsibilities
Technical Strategy & Architecture
- Define and lead the DevSecOps vision and roadmap across infrastructure, application, and CI/CD ecosystems.
- Architect secure-by-design cloud-native systems across AWS/GCP environments.
- Establish security patterns, guardrails, and reference architectures for engineering teams.
- Evaluate and implement modern security tooling across SAST, DAST, SCA, container scanning, IaC scanning, and runtime protection.
Secure SDLC & Automation
- Embed security controls into CI/CD pipelines and developer workflows.
- Drive infrastructure-as-code security best practices (Terraform, CloudFormation, etc.).
- Automate security testing and compliance checks to reduce manual overhead.
- Implement policy-as-code and automated governance controls.
Cloud & Infrastructure Security
- Lead identity and access management (IAM) strategy and least-privilege enforcement.
- Strengthen container and Kubernetes security posture.
- Oversee secrets management, encryption standards, and key management processes.
- Partner with infrastructure teams on network segmentation, zero-trust architectures, and environment isolation.
Risk, Compliance & Incident Response
- Support and mature Alto’s security program in alignment with HIPAA, SOC 2, HITRUST, and other healthcare regulatory frameworks.
- Conduct threat modeling, security design reviews, and architecture risk assessments.
- Partner with Security and Compliance teams on audits and remediation efforts.
- Provide senior-level leadership during security incidents, including root cause analysis and long-term mitigation planning.
Technical Leadership
- Mentor senior and mid-level engineers on secure coding and DevSecOps practices.
- Influence engineering leadership and executive stakeholders on security strategy and risk prioritization.
- Drive cross-functional alignment across Engineering, Product, IT, and Compliance.
- Raise the overall security maturity of the organization through scalable frameworks and standards.
Required Experience & Qualifications
Minimum Qualifications:
- 14+ years of experience in software engineering, infrastructure engineering, or security engineering, with significant experience in DevSecOps environments.
- Deep expertise in cloud security architecture (AWS and/or GCP).
- Strong experience securing containerized and Kubernetes-based environments.
- Hands-on experience with CI/CD systems (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.).
- Expertise in infrastructure-as-code (Terraform, CloudFormation) and securing IaC pipelines.
- Strong knowledge of application security principles, OWASP Top 10, and secure coding practices.
- Experience implementing and scaling SAST, DAST, SCA, container scanning, and secrets detection tools.
- Deep understanding of IAM, RBAC, zero-trust models, and encryption best practices.
- Experience operating in regulated environments (HIPAA, SOC 2, HITRUST, PCI, etc.).
- Strong scripting or programming skills (Python, Go, Ruby, or similar).
- Demonstrated ability to influence architectural decisions at a Staff or Principal level.
Preferred Qualifications:
- Experience in healthcare, pharmacy, fintech, or other highly regulated industries.
- Experience building DevSecOps programs from early-stage to scale.
- Background in site reliability engineering (SRE) or platform engineering.
- Security certifications such as CISSP, CISM, CCSP, or cloud security certifications (AWS/GCP).
- Experience implementing threat modeling frameworks (STRIDE, PASTA, etc.).
- Experience with observability platforms and integrating security telemetry into monitoring systems.
Additional Information
Additional Physical Job Requirements
Physical requirements for this role include the ability to work at a computer terminal with monitor, keyboard and mouse for extended periods of time, stoop, bend, and reach for equipment and supplies, make frequent repetitive motions required to operate a computer that include the wrists, hands and fingers, and lift, carry, push, pull, and move light objects up to 20 pounds. The role also requires the ability to effectively communicate through verbal interactions, discern auditory information, and visually perceive details to perform essential job functions. Consistent with the Americans with Disabilities Act (ADA) and similar applicable state laws, it is Fuze Health’s policy to provide reasonable accommodation to enable qualified individuals with disabilities to perform essential job functions, unless such accommodation would cause an undue hardship.
Salary and Benefits
Salary Range: $166,000 - $200,000
Commission Eligible: No
Travel: No - Required up to 0% of the time
Location Requirement: Employment at Alto is limited to individuals residing in the following states: Arizona, Arkansas, California, Colorado, Florida, Kansas, Maryland, Missouri, Nevada, New Jersey, New York, North Carolina, Oregon, Pennsylvania, South Carolina, Tennessee, Texas, Washington (WA), and Wisconsin.
Employment Authorization Requirement: Applicants must be authorized to work for any employer in the U.S.
Benefits: Full-time employee benefits include: dental, vision, and multiple group medical plans to choose from, a 401(k) retirement savings plan, group life insurance, accidental death and dismemberment (AD&D) insurance, flexible spending account (FSA) and health savings account (HSA), commuter benefits, employer-paid short-term (STD) and long-term disability (LTD) insurance, and additional supplemental insurance plans (spouse life insurance, legal insurance, an employee assistance program, home health testing kits, and a fertility medication discount program). Employees are also provided flexible vacation time, accrued paid sick time, 10 paid holidays (2 floating holidays for full-time non-exempt employees), eight weeks of paid parental leave for eligible employees, additional paid weeks for the birthing parent, 4 weeks of paid caregiver leave, and a Lifestyle Spending Account allowance each month.
Application deadline: March 15, 2026
Fuze Health is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, national origin, religion, sex, gender identity, sexual orientation, age, disability, veteran status, or any other legally protected basis. If you have a disability and require reasonable accommodation during any portion of the application or hiring process, please contact us at talent@fuzehealth.com.
Fuze Health considers qualified applicants with arrest or conviction records for employment and conducts background checks consistent with applicable law, including the California, Los Angeles County, San Francisco, Philadelphia, and New York City Fair Chance laws. We are an E-Verify participating company.
Fuze Health recruiters and hiring managers may use automated decision-making tools to assist with identifying candidates who match the stated job requirements, and to what extent. These tools are designed to help ensure fairness in all aspects of the hiring process by providing recruiters and hiring managers with data-backed insights based on information provided in your resume, including work experience, education, and other skills. If you have any questions or would like to request an alternative process, please contact us at talent@fuzehealth.com.


