Bora incentivar a leitura para transformar vidas através do conhecimento?
Senior Site Reliability Engineering (SRE)
Location
Brazil
Posted
59 days ago
Salary
0
Seniority
Senior
No structured requirement data.
Job Description
Senior Site Reliability Engineering (SRE)
Skeelo
O nosso time de Engenharia está precisando de alguém, e queremos você com a gente! 🚀 Sua missão? Você será a pessoa chave para garantir a estabilidade e a performance de nossos sistemas, especialmente em cenários de alta demanda, e nos ajudar a escalar. O seu dia a dia no Skeelo vai ser assim: - Estruturar e Otimizar Operações: Desenhar e implementar as melhores práticas e ferramentas para criar uma área de SRE e monitoramento robusta. - Monitoramento Abrangente: Desenvolver e manter sistemas de monitoramento para toda a nossa stack tecnológica, desde servidores e infraestrutura até o status de serviços, utilização de disco e filas. - Identificação e Resolução de Causa Raiz: Atuar proativamente na investigação de gargalos e falhas, mergulhando fundo para descobrir a causa raiz dos incidentes e garantir a rápida recuperação dos serviços. - Automação e Alarmes: Criar e otimizar pipelines de automação para detecção de anomalias, envio de alarmes e execução de ações corretivas, minimizando a intervenção manual e aumentando a eficiência. - Manutenção da Disponibilidade: Assegurar que nossos serviços estejam sempre funcionando perfeitamente, garantindo a confiabilidade, baixa latência e alta performance em todos os momentos. - Gestão de Ferramentas: Avaliar, selecionar e implementar as ferramentas de mercado que melhor atendam às nossas necessidades, construindo um stack tecnológico otimizado. - Aplicação de IA: Buscar e implementar soluções inovadoras usando Inteligência Artificial para aprimorar o monitoramento, prever falhas e automatizar tarefas. - Mentoria e Crescimento: Em um segundo momento, você terá a oportunidade de orientar e desenvolver talentos em um time de SRE em expansão. - Foco em Alta Demanda: Preparar e otimizar a infraestrutura para lidar com picos de acesso e eventos críticos, identificando pontos de escalabilidade e mitigando gargalos. Para isso você vai precisar ter: - Experiência Comprovada: Sólida vivência na estruturação de áreas de infraestrutura e/ou SRE do zero, com foco em ambientes de alto volume e missão crítica. - Conhecimento Profundo em Monitoramento: Expertise em diversas ferramentas e conceitos de monitoramento, cobrindo desde infraestrutura (servidores, máquinas) até o status de serviços e aplicações. - Automação: Habilidade comprovada em automatizar tarefas, criar alarmes e implementar fluxos de trabalho eficientes. - Resolução de Problemas: Capacidade excepcional para investigar e resolver problemas complexos, identificando a causa raiz em ambientes distribuídos. - Cultura de Alta Demanda: Experiência em empresas com grande volume de acesso. - IA no Dia a Dia: Experiência prática na aplicação de conceitos e ferramentas de Inteligência Artificial para otimização de operações. - Idioma: Leitura e escrita avançadas em Inglês são mandatórias. E seria muito bom se você tivesse: - Experiência com outras ferramentas de monitoramento de mercado. - Conhecimento avançado em LINUX. - Experiência prática com PM2, Node.js, Kibana e Elasticsearch. - Conhecimento em nuvem pública (AWS, GCP, Azure). - Vivência em ambientes que demandam escalabilidade massiva. Neste desafio você contará com alguns apoios: - Caso aconteça algum incidente e para garantir que esteja tudo em conformidade, você contará com o apoio do SulAmérica para Saúde, e Odonto Metlife e Seguro de vida Prudential. - Para recarregar as energias terá o Cartão Caju, que é o nosso famoso VA/VR! mas ele tem o diferencial de ser um cartão de crédito, bandeira visa, que você pode usar para diferentes fins, como: restaurante, mercado, delivery, cultura e etc. - E para manter suas forças e agilidade contará com o apoio da Totalpass. - Além do corpo, a mente também precisa de cuidados, para isso, contamos com ajuda da ZenKlub. - Por desbravar todos os desafios e alcançar todos os objetivos ao longo do ano, receberá também PPR como recompensa. - E para o aprendizado constante, acesso ao Skeelo Premium um plano incrível no nosso aplicativo. Ah! além disso, se você participar do nosso clube do livro, o Skeelê, você ganha o ebook escolhido para a leitura do mês. - E claro, para garantir a segurança de seus filhos (até 71 meses) enquanto você está nessa jornada, temos o Auxílio Creche. Além disso tudo, nossos Skeelers também contam com licença parental estendida para que seu Skeelinho receba muito carinho e atenção nos primeiros dias de vida. Mas como tudo na vida! precisamos de equilíbrio e descanso: - Nossos horários são flexíveis e trabalhamos no modelo 100% remoto. Com dois encontros presenciais em SP por ano! Mas relaxa, o Skeelo sempre organiza tudo pra gente. - Emendamos todos os feriados de São Paulo. - Day-off no mês do aniversário. - Recesso de fim de ano remunerado e sem desconto em saldo de férias. Esse papel principal pode ser seu, está esperando o que para se inscrever? #VemPraToca
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior AWS DevOps Engineer – Kubernetes Expertise
CloudelligentAn AWS Advanced Consulting Partner dedicated to helping businesses innovate on AWS.
• Build and deploy complex cloud environments for customers using automation tools and technologies. • Set up, maintain, and evolve AWS cloud architecture. • Support infrastructure monitoring and alerting, troubleshooting, and automation development. • Work alongside a team of DevOps Engineers to architect, build, and deploy AWS cloud infrastructure. • Promote infrastructure as code and automation. • Design, build, and maintain Kubernetes infrastructure, including clusters, namespaces, and deployments. • Work on projects independently with customers and stakeholders, gathering requirements and informing them of new features. • Create and improve operational practices and procedures. • Ensure implementation and maintenance of compliance and security practices in operational infrastructure. • Train and mentor junior DevOps engineers and new team members.
Sr. AWS DevOps Engineer- Kubernetes Expertise
CloudelligentAn AWS Advanced Consulting Partner dedicated to helping businesses innovate on AWS.
Position Title: Sr. AWS DevOps Engineer- Kubernetes Expertise Job Timings: 8am-5pm Central Time (US) Location: Remote, India About Cloudelligent Cloudelligent is an AWS Premier Consulting Partner helping organizations modernize, migrate, and innovate in the cloud. We work at the intersection of cloud, data, and AI to solve real business problems and not just implement technology. With an international footprint, Cloudelligent is customer-obsessed and focuses on Generative AI and Agentic AI to deliver practical, scalable solutions. Job Objective As an experienced DevOps Engineer your primary role would be to build and deploy complex cloud environments for our customers using innovative automation tools and cutting-edge technologies. You will be responsible for setting up, maintaining, and evolving the AWS cloud architecture and supporting infrastructure monitoring and alerting, issues troubleshooting, and developing required automation for our customers. Responsibilities - Work alongside a team of DevOps Engineers to architect, build, and deploy AWS cloud infrastructure that is redundant, resilient, scalable, self-sustaining, and self-healing. - Promote infrastructure as code and automation as much as possible. To achieve this, you must have extensive knowledge of the DevOps tool sets like Terraform, Cloudformation, CircleCI, Gitlab, Jenkins, Docker, Kubernetes. - Design, build, and maintain our Kubernetes infrastructure, including clusters, namespaces, and deployments. - Work on projects independently with customers and stakeholders, gathering their requirements and informing them of new features and enhancements. - Create and improve current operational practices and procedures. - Ensure the implementation and maintenance of compliance and security practices in operational infrastructure. - Responsible for training and mentoring junior DevOps engineers and new team members. Requirements - Bachelor’s in software/computer engineering, computer science or related field. - Effective communication skills (proficiency in English language). - 5+ years of professional experience in DevOps engineering. - 4+ years of using CI/CD tools (Jenkins, CircleCI, Gitlab, or similar) - Experience in Infrastructure as code (Terraform / Cloudformation/CDK) and using container orchestration tools (Kubernetes/EKS or ECS, Rancher, etc.) - Experience of using configuration management tools (Puppet, Ansible, etc.) and working with containers (Docker or Docker-Compose, etc.) - Experience of using Scripting when necessary (Bash, Python, Ruby, Go or Javascript, etc.) - Self-driven, ability to work independently or as part of a project team with limited supervision. Technical Experience/Expertise - Strong hands-on expertise with solution architecting environments in AWS cloud space. - Strong hands-on experience with code pipeline design, development, and managing CI/CD workflows and tools. - Comprehensive understanding and hands-on experience of core DevOps concepts (CI/CD, Agile & Automation). - Extensive experience in designing, deploying, and managing Kubernetes clusters in production environments. - Expertise in containerization technologies such as Docker, including creating Docker images, managing Docker registries, and configuring Docker runtime environments. - Familiarity with monitoring and logging tools such as Prometheus, Opentelemetry, Grafana, ELK stack (Elasticsearch, Logstash, Kibana). - Extensive knowledge and hands-on experience of IP networking, VPN's, DNS, load balancing, and firewall. - Extensive knowledge and hands-on experience with designing a secure infrastructure using current cloud-native security standards. Good to have - AWS Certified Solutions Architect Associate/Professional and/or AWS Certified DevOps Professional - Certified Kubernetes Administrator (CKA)
Sr. AWS DevOps Engineer- Kubernetes Expertise
CloudelligentAn AWS Advanced Consulting Partner dedicated to helping businesses innovate on AWS.
Position Title: Sr. AWS DevOps Engineer- Kubernetes Expertise Job Timings: 8am-5pm Central Time (US) Location: Remote, India About Cloudelligent Cloudelligent is an AWS Premier Consulting Partner helping organizations modernize, migrate, and innovate in the cloud. We work at the intersection of cloud, data, and AI to solve real business problems and not just implement technology. With an international footprint, Cloudelligent is customer-obsessed and focuses on Generative AI and Agentic AI to deliver practical, scalable solutions. Job Objective As an experienced DevOps Engineer your primary role would be to build and deploy complex cloud environments for our customers using innovative automation tools and cutting-edge technologies. You will be responsible for setting up, maintaining, and evolving the AWS cloud architecture and supporting infrastructure monitoring and alerting, issues troubleshooting, and developing required automation for our customers. Responsibilities - Work alongside a team of DevOps Engineers to architect, build, and deploy AWS cloud infrastructure that is redundant, resilient, scalable, self-sustaining, and self-healing. - Promote infrastructure as code and automation as much as possible. To achieve this, you must have extensive knowledge of the DevOps tool sets like Terraform, Cloudformation, CircleCI, Gitlab, Jenkins, Docker, Kubernetes. - Design, build, and maintain our Kubernetes infrastructure, including clusters, namespaces, and deployments. - Work on projects independently with customers and stakeholders, gathering their requirements and informing them of new features and enhancements. - Create and improve current operational practices and procedures. - Ensure the implementation and maintenance of compliance and security practices in operational infrastructure. - Responsible for training and mentoring junior DevOps engineers and new team members. Requirements - Bachelor’s in software/computer engineering, computer science or related field. - Effective communication skills (proficiency in English language). - 5+ years of professional experience in DevOps engineering. - 4+ years of using CI/CD tools (Jenkins, CircleCI, Gitlab, or similar) - Experience in Infrastructure as code (Terraform / Cloudformation/CDK) and using container orchestration tools (Kubernetes/EKS or ECS, Rancher, etc.) - Experience of using configuration management tools (Puppet, Ansible, etc.) and working with containers (Docker or Docker-Compose, etc.) - Experience of using Scripting when necessary (Bash, Python, Ruby, Go or Javascript, etc.) - Self-driven, ability to work independently or as part of a project team with limited supervision. Technical Experience/Expertise - Strong hands-on expertise with solution architecting environments in AWS cloud space. - Strong hands-on experience with code pipeline design, development, and managing CI/CD workflows and tools. - Comprehensive understanding and hands-on experience of core DevOps concepts (CI/CD, Agile & Automation). - Extensive experience in designing, deploying, and managing Kubernetes clusters in production environments. - Expertise in containerization technologies such as Docker, including creating Docker images, managing Docker registries, and configuring Docker runtime environments. - Familiarity with monitoring and logging tools such as Prometheus, Opentelemetry, Grafana, ELK stack (Elasticsearch, Logstash, Kibana). - Extensive knowledge and hands-on experience of IP networking, VPN's, DNS, load balancing, and firewall. - Extensive knowledge and hands-on experience with designing a secure infrastructure using current cloud-native security standards. Good to have - AWS Certified Solutions Architect Associate/Professional and/or AWS Certified DevOps Professional - Certified Kubernetes Administrator (CKA)
Role Description Zur Verstärkung unseres Teams suchen wir einen erfahrenen DevOps/Platform Engineer (SRE) 100 % remote (w/m/d). - Mitverantwortung für die Systemverfügbarkeit: Du trägst aktiv zur Verfügbarkeit, Zuverlässigkeit und Effizienz unserer komplexen Systemarchitektur bei, die aus etwa 70 Servern bei Hetzner besteht. - Wartung und Automatisierung: Du unterstützt die Wartung und Automatisierung unserer bestehenden Infrastruktur, die auf Technologien wie Ubuntu, Percona MySQL Cluster, MinIO, Elasticsearch, Redis, NGINX, HAProxy, TiDB, Clickhouse und Kubernetes basiert. Dabei bringst du deine Ideen zur Optimierung ein. - Monitoring und Analyse: Du verbesserst unsere Monitoring-Strategien und führst umfassende Fehleranalysen durch. Dein Ziel ist es, sicherzustellen, dass wiederholte Fehler vermieden werden und die Systeme jederzeit optimal funktionieren. - Hohe Verfügbarkeit: Du bist bereit, in Ausnahmefällen auch nachts aufstehen zu müssen, um sicherzustellen, dass unsere Systeme reibungslos laufen. Dein Engagement für hohe Verfügbarkeit ist für uns unverzichtbar – du arbeitest hart daran, dass solche Situationen möglichst selten auftreten. - Software Entwicklung: Mehrjährige Erfahrung in einer oder mehreren Programmiersprachen (z. B. Rust, Java, Go, Typescript) ist notwendig. - Automatisierung: Gutes Verständnis von Systemautomatisierung und IT-Sicherheit. - Systeme: Praxiserfahrung mit einigen der folgenden Systeme: MySQL, TiDB, Kubernetes, MinIO und Elasticsearch. - Sprachkenntnisse: Du kommunizierst verhandlungssicher in Deutsch und hast gute Englischkenntnisse. - Persönliche Eigenschaften: Gute analytische Fähigkeiten, gepaart mit ausgeprägten Kommunikations- und Präsentationsfähigkeiten. - Leidenschaft: Erfahrung in der Mitwirkung an Open-Source-Technologien wäre wünschenswert. Qualifications - Software Entwicklung: Mehrjährige Erfahrung in einer oder mehreren Programmiersprachen (z. B. Rust, Java oder Go) ist notwendig. - Automatisierung: Gutes Verständnis von Systemautomatisierung und IT-Sicherheit. - Systeme: Praxiserfahrung mit einigen der folgenden Systeme: MySQL, Kubernetes, MinIO und Elasticsearch. - Sprachkenntnisse: Du kommunizierst sicher in Deutsch und hast gute Englischkenntnisse. - Persönliche Eigenschaften: Gute analytische Fähigkeiten, gepaart mit ausgeprägten Kommunikations- und Präsentationsfähigkeiten. - Leidenschaft: Erfahrung in der Mitwirkung an Open-Source-Technologien wäre wünschenswert. Benefits - Remote-First Team: Arbeite bundesweit von überall aus im Team. - Workation auf Mallorca: Unsere Mitarbeiter* haben die Möglichkeit, die angemietete Villa auf Mallorca für eine inspirierende Kombination aus Arbeit und Erholung zu nutzen. - Flexible Arbeitszeiten: Unsere Arbeitszeiten sind flexibel und werden im Team abgestimmt. - Motiviertes Team: Wir sind ein offenes, motiviertes und nettes Team mit flacher Hierarchie – kein künstlicher Druck! - Sichere Anstellung: 39 Stunden/Woche, 30 Tage Urlaub und faire, wettbewerbsfähige Vergütung. - Modernste Ausstattung: Aktuelle MacBook Pros stehen dir zur Verfügung. - Weiterbildung: Schulungen und Weiterbildungsmöglichkeiten im Wert von bis zu 1.500 Euro pro Jahr. - Givve Card, Jobrad. Company Description Die easybill GmbH ist nicht nur ein Unternehmen – sie ist ein Ort, an dem Innovation und Leidenschaft seit über 18 Jahren ein Zuhause gefunden haben. Als Pioniere im Bereich cloudbasierter Rechnungssoftware haben wir es uns zur Mission gemacht, kleinen und mittelständischen Unternehmen (KMU) nicht nur Werkzeuge, sondern echte Partner an die Hand zu geben. - Unsere Lösung besticht durch Benutzerfreundlichkeit, umfassende Funktionalität und die Vielzahl an Schnittstellen, die es ermöglichen, finanzielle Prozesse mit Leichtigkeit zu optimieren. - Es erfüllt uns mit Stolz, dass mehr als 22.000 aktive Kunden auf uns vertrauen und wir gemeinsam mit ihnen weiter wachsen.


