Job Closed

This listing is no longer active.

Sofka

Technical and personal challenges that will keep you constantly growing. A connected team focused on your physical and mental well-being. A fresh, collaborative culture of continuous improvement, with learning opportunities and people willing to support you. Programs such as Happy Kaizen and WeSofka that look after your physical and emotional well-being.

Site Reliability Engineer

Infrastructure Engineer | Full Time | Remote | Mid Level | Team 1,001-5,000

Location

Worldwide

Posted

75 days ago

Salary

0

Seniority

Mid Level

Job Description

Role Description

We are looking for an SRE with more than 3 years of experience leading technology resilience and observability in highly complex environments. Your mission will be to act as the bridge between innovation and stability, mastering Observability & Reliability, Distributed Systems Architecture, and Automation (IaC). This is your opportunity to design the future of technology availability in a remote environment, where your work will directly impact the experience of thousands of users. If you are looking for a challenge where chaos engineering, self-remediation, and constant innovation are part of your daily work, we want to meet you!

Responsibilities

  • Adapt observability needs to each technical solution to ensure coverage, visibility, and operational efficiency.
  • Configure and maintain dashboards, metrics, alerts, and business-critical controls.
  • Validate the resilience of solutions through chaos tests and scalability evaluations under load.
  • Implement resilient design patterns such as circuit breakers, fallbacks, and retries in distributed architectures.
  • Identify and automate manual processes using infrastructure-as-code tools to reduce MTTR.
  • Lead the implementation of self-remediation flows and promote continuous improvement practices in operations.
  • Collaborate with development and architecture teams to ensure technical quality in critical user journeys.

Job Requirements

  • Professional degree in Systems Engineering, Computer Science, Electronics, or related fields.
  • Extensive experience implementing observability and resilience in microservices, cloud environments, and agile teams; proven experience automating operational tasks and managing incidents under SRE/DevOps methodologies.
  • Technical knowledge:
      • Observability: Dynatrace (primary, hands-on), Grafana, Prometheus, OpenTelemetry, and ELK Stack.
      • Automation and IaC: Ansible, Terraform, Terragrunt, and Monaco (Monitoring as Code).
      • Containerization: Kubernetes (AKS, EKS), OpenShift (advanced level), and Docker.
      • Programming languages: Python (advanced), Bash, YAML, and PowerShell.
      • Cloud & Infrastructure: Azure, AWS, or GCP (networking, security, and compute).
      • Reliability management: Definition of SLIs, SLOs, and SLAs, and management of error budgets.
      • CI/CD: Git, Jenkins, Azure DevOps, and GitHub Actions.
      • Resilience engineering: Chaos Engineering, Circuit Breaker, and Canary/Blue-Green deployments.

Benefits

  • Indefinite-term contract.
  • We want long-term relationships and for you to be part of our family for a long time.
  • At Sofka, we offer you a learning ecosystem with multiple tools to close gaps and boost your skills. You decide how you want to grow!
  • Work mode: remote.

More Infrastructure Engineer Jobs

Hosting Lead (Senior Task Lead/Systems Engineer - Hosting)

Node.Digital

A Digital Automation Company - Enabling Frictionless Transactions with Digital Engagement & Intelligent Automation

Other | Remote | Team 11-50 | H1B No Sponsor

Location: Remote Work
Clearance: Must meet DHS/USCIS background investigation/EOD; support 24x7 after-hours response.

Role Summary

Lead day-to-day Operations & Maintenance (O&M) across hybrid multi-cloud enterprise, DHS data centers, Equinix data centers, and cloud environments (AWS, Azure, GCP). Own infrastructure readiness, patching, capacity, performance, and Tier II-III incident/problem resolution. Drive automation-first operations and ensure compliance with EIOSS/eAUTO SLAs and AQLs.

Key Responsibilities

  • End-to-End O&M: Own O&M for servers, storage, backup, databases, virtualization, and middleware; ensure operational acceptance with As-Built/VDD/Runbooks.
  • Patching & Remediation: Lead patching, image baselines, remediation, and hardening for Windows, Red Hat/CentOS, and Solaris. Run integrated patch IPTs with DBAs, Security, and Field Ops.
  • Infrastructure Management: Manage VMware vSphere, Microsoft Hyper-V, Citrix/VDI, FlexPod, NetApp, and enterprise backup (NetBackup, Backup Exec, Veeam).
  • Modern Hosting: Run CI/CD-enabled hosting using Jenkins, Harness, Git, and Sonatype; operate OpenShift, Docker, Kubernetes, and UiPath.
  • Performance Metrics: Deliver bi-weekly health checks and monthly metrics. Meet SLAs/AQLs (e.g., ≥99.95% availability) and produce capacity/incident reports.
  • Automation & ITSM: Champion automation (Ansible/Chef) and ServiceNow ITSM/ITOM integration; maintain CMDB accuracy.
  • Collaboration: Lead collaboration with customers and partners to break down silos and drive automation adoption across Engineering, Security, and the TOC.

Virginia
Job Closed

Senior Scraping Infrastructure Engineer

OnHires

Global tech recruitment & staffing for fast-growing companies

Full Time | Remote | Team 11-50 | Since 2020 | H1B No Sponsor

  • Architect, build, and maintain the core infrastructure for a large-scale asynchronous data extraction system.
  • Design, implement, and continuously optimize sophisticated anti-blocking strategies, IP rotation, fingerprint management, and anti-bot bypass techniques to ensure high reliability and consistent uptime against modern web blocking.
  • Implement robust monitoring, alerting, and logging systems to proactively debug, troubleshoot, and continuously improve scraper performance, reliability, and data quality across the platform.
  • Develop, test, and deploy robust, fault-tolerant web scraping components using advanced Python tools (Scrapy, Playwright, Selenium, Requests, etc.).
  • Manage and automate high-volume data ingestion pipelines and seamless integrations with internal and external REST APIs.
  • Drive DevOps best practices, including managing infrastructure with Docker and CI/CD pipelines (Nomad knowledge is a plus).
  • Partner with other engineers to set standards, enhance core infrastructure tooling, and mentor junior team members.

Peru
Job Closed

Senior Scraping Infrastructure Engineer

OnHires

Global tech recruitment & staffing for fast-growing companies

Full Time | Remote | Team 11-50 | Since 2020 | H1B No Sponsor

  • Architect, build, and maintain the core infrastructure for a large-scale asynchronous data extraction system.
  • Design, implement, and continuously optimize sophisticated anti-blocking strategies, IP rotation, fingerprint management, and anti-bot bypass techniques to ensure high reliability and consistent uptime against modern web blocking.
  • Implement robust monitoring, alerting, and logging systems to proactively debug, troubleshoot, and continuously improve scraper performance, reliability, and data quality across the platform.
  • Develop, test, and deploy robust, fault-tolerant web scraping components using advanced Python tools (Scrapy, Playwright, Selenium, Requests, etc.).
  • Manage and automate high-volume data ingestion pipelines and seamless integrations with internal and external REST APIs.
  • Drive DevOps best practices, including managing infrastructure with Docker and CI/CD pipelines (Nomad knowledge is a plus).
  • Partner with other engineers to set standards, enhance core infrastructure tooling, and mentor junior team members.

Argentina
Job Closed

Senior Scraping Infrastructure Engineer

OnHires

Global tech recruitment & staffing for fast-growing companies

Full Time | Remote | Team 11-50 | Since 2020 | H1B No Sponsor

  • Architect, build, and maintain the core infrastructure for a large-scale asynchronous data extraction system.
  • Design, implement, and continuously optimize sophisticated anti-blocking strategies, IP rotation, fingerprint management, and anti-bot bypass techniques.
  • Implement robust monitoring, alerting, and logging systems to proactively debug, troubleshoot, and continuously improve scraper performance, reliability, and data quality.
  • Develop, test, and deploy robust, fault-tolerant web scraping components using advanced Python tools (Scrapy, Playwright, Selenium, Requests, etc.).
  • Manage and automate high-volume data ingestion pipelines and seamless integrations with internal and external REST APIs.
  • Drive DevOps best practices, including managing infrastructure with Docker and CI/CD pipelines.
  • Partner with other engineers to set standards, enhance core infrastructure tooling, and mentor junior team members.

Brazil
Job Closed