Job Closed

This listing is no longer active.

Coderio logo
Coderio

Accelerate Your Digital Transformation

DevOps Engineer – SRE, Observability

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 201-500Since 2017H1B No SponsorCompany SiteLinkedIn

Location

Argentina

Posted

94 days ago

Salary

0

Seniority

Senior

Job Description

DevOps Engineer – SRE, Observability

Coderio

• Garantizar la salud proactiva de los sistemas • Enfocarse en la observabilidad end-to-end • Responder eficientemente ante incidentes • Asegurar que la experiencia del usuario final no se vea afectada

Job Requirements

  • 1. Stack de Observabilidad**
  • Experiencia avanzada en monitoreo y métricas con Prometheus, Grafana, Datadog o New Relic
  • Gestión de logs centralizados con ELK Stack (Elasticsearch, Logstash, Kibana), Splunk o Graylog
  • Implementación de trazabilidad distribuida con OpenTelemetry, Jaeger o Honeycomb para identificar cuellos de botella en microservicios
  • 2. Ingeniería de Confiabilidad (SRE Core)**
  • Capacidad para definir y configurar SLIs y SLOs alineados a las expectativas del negocio
  • Conocimiento en gestión de Error Budgets para decidir cuándo priorizar estabilidad sobre nuevas funcionalidades
  • Experiencia liderando procesos de Postmortem sin culpables y análisis de causa raíz (RCA)
  • 3. Automatización y Plataforma**
  • Dominio de Infrastructure as Code con Terraform o CloudFormation para despliegues automatizados de agentes de monitoreo
  • Conocimiento sólido en Kubernetes/OpenShift con recolección de métricas a nivel de clúster (Kube-state-metrics, Node Exporter)
  • Capacidad de automatizar respuestas a alertas y runbooks mediante Python, Go o Bash
  • 4. Gestión de Alertas y Respuesta a Incidentes**
  • Configuración de alertas inteligentes que reduzcan ruido y fatiga operacional con PagerDuty, Opsgenie o VictorOps
  • Dominio de técnicas de diagnóstico rápido en entornos productivos bajo presión, con foco en la reducción de MTTR

Benefits

  • 100% remoto
  • Compromiso a largo plazo, con autonomía e impacto
  • Rol estratégico y de alta visibilidad en una cultura de ingeniería moderna
  • Equipo internacional colaborativo y liderazgo técnico sólido
  • Plan de carrera y crecimiento dentro de Coderio

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Enroute logo

Senior DevOps – Platform Engineer

Enroute

We deliver IT services and solutions provided by a team of passionate problem solving individuals highly skilled.

DevOps Engineer94 days ago
Full TimeRemoteTeam 51-200H1B Sponsor

• Design, build, and maintain scalable and secure cloud infrastructure in AWS and/or Azure • Develop and maintain Infrastructure as Code and automated provisioning workflows • Design and optimize CI/CD pipelines to enable reliable and fast software delivery • Implement deployment strategies such as blue-green, canary, and zero-downtime deployments • Build and manage containerized environments using Docker and Kubernetes • Implement and maintain monitoring, logging, alerting, and performance metrics solutions • Enforce DevSecOps practices, including vulnerability scanning, secret management, and compliance automation • Collaborate closely with development teams to design infrastructure aligned with application architecture • Lead initiatives focused on platform reliability, scalability, and cost optimization • Mentor engineers and document infrastructure standards and best practices • Participate actively in cross-functional Agile teams alongside product, design, QA, and engineering stakeholders

Mexico
Job Closed
Full TimeRemoteTeam 201-500Since 2024H1B No Sponsor

• Design, build, and maintain Power Apps (Canvas and Model-Driven) to support business processes and automation needs • Develop and optimize Power Automate workflows for end-to-end process automation and integrations • Create, configure, and customize SharePoint environments including sites, lists, libraries, and permissions • Integrate Power Platform solutions with enterprise systems using REST APIs, CSOM, JSOM, and PnP • Support migration and modernization of legacy workflows and SharePoint sites • Ensure solutions comply with security, governance, and enterprise compliance standards • Collaborate within Agile/Scrum teams and contribute to sprint planning and delivery • Produce clear technical documentation and deployment artifacts

India
Job Closed
Futurae Technologies AG logo

Security Engineer – DevSecOps

Futurae Technologies AG

Future-proof, end user-centric authentication #2FA #MFA #SCA #Cybersecurity #Privacy #Usability #Transactions

DevOps Engineer94 days ago
Full TimeRemoteTeam 11-50Since 2016H1B No Sponsor

• Ensure the security of platforms and products by implementing best practices and conducting assessments. • Collaborate with cross-functional teams to embed security in the software development lifecycle (SDLC). • Define security best practices and conduct deep-dive security assessments. • Build innovative, AI-driven solutions to enhance product security posture against threats. • Lead incident response and remediation efforts in case of any breaches. • Stay updated on security trends, emerging exploits, and maintain a "threat-first" mindset.

Switzerland
Job Closed
OtherRemoteTeam 5,001-10,000H1B Sponsor

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description Do you enjoy collaborating with teams to solve complex challenges? Do you enjoy solving large scale distributed content delivery challenges? Join our critical Platform and Reliability Engineering Team! The Platform & Reliability Engineering team is responsible for defining, measuring, & optimizing the key performance indicators of delivery customers. Your expertise in software engineering and systems administration will be instrumental in building robust and resilient infrastructure. In this role, you'll play a pivotal role in shaping the future of our products. You'll collaborate with product teams from the earliest stages of development to ensure the reliability, scalability, and performance of our systems. You'll define key performance indicators (KPIs), advance the state of monitoring, alerting and operational responses, and investigate complex performance issues. As a Senior Site Reliability Engineer, you will be responsible for: - Working on Internet technologies to improve the performance, availability, and scalability of large distributed content delivery systems. - Engaging in collaborative efforts with cross-functional teams, including Product & engineering, to define and establish measurable SLIs and SLOs. - Providing technical expertise and feedback to ensure system designs and implementations meet reliability and performance requirements. - Monitoring platform availability and performance, debug issues by leveraging data analysis skills and implement corrective actions to avoid recurrence. - Developing and implementing automation solutions to improve operational efficiency and reduce toil. - Participating in design reviews and providing technical guidance to ensure designs meet requirements for scalability, performance, and robustness. Qualifications - 5 years of relevant experience and a Bachelor's degree in Computer Science or its equivalent. - Familiarity with Internet protocols (DNS/HTTP/TLS/TCP etc.). - Experience utilizing Oracle SQL for data integrity checks, root cause analysis of data anomalies, and the development of data reports. - Proficiency in Scripting languages (Python, bash, JavaScript etc.). - Experience with monitoring and alerting systems (e.g., Prometheus, Grafana, ADBMS, Datadog), including metric collection, alerting, dashboarding, and troubleshooting. - Fluency working in a UNIX/Linux computing environment. Benefits - Flexible working options through FlexBase, allowing 95% of employees to choose to work from home, the office, or both. - Opportunities to grow, flourish, and achieve great things. - Benefits surrounding all aspects of your life, including health, finances, family, work-life balance, and personal endeavors. - Industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation (PTO), sick time, parental leave, and an employee assistance program. Company Description Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences, helping billions of people live, work, and play every day. With the world's most distributed compute platform—from cloud to edge—we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

United States
Job Closed