Job Closed
This listing is no longer active.
Site Reliability Engineer
Location
Brazil
Posted
30 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
Site Reliability Engineer
Bússola Social
Role Description Você se motiva a atuar com tecnologia, resolver desafios e fazer a diferença em ambientes dinâmicos? Como parte do time, você terá um papel essencial na sustentação e evolução da nossa infraestrutura, contribuindo diretamente para a estabilidade das soluções e para a continuidade dos nossos produtos. Seu desafio será atuar com olhar técnico e senso de responsabilidade na resolução de incidentes e no dia a dia do ambiente, garantindo respostas eficientes e bem conduzidas. Buscamos alguém que organize, documente e contribua para que os problemas não se repitam. Procuramos uma pessoa com iniciativa, repertório técnico e senso de priorização, capaz de avaliar cenários, propor caminhos viáveis e contribuir com melhorias contínuas. Também é importante ter um olhar crítico sobre o que já existe, colaborando com a evolução do ambiente de forma consistente e estruturada. Se você gosta de ambientes dinâmicos, colaborativos e com espaço para atuação prática no dia a dia, essa pode ser a oportunidade ideal para você. Vamos juntos evoluir a base que sustenta a nossa tecnologia! 🚀 Responsibilities - Gerenciar e evoluir ambientes cloud (Azure, AWS ou GCP) garantindo disponibilidade, escalabilidade e eficiência. - Manter e evoluir infraestrutura como código (Terraform, Ansible) e plataformas baseadas em containers (Kubernetes, Docker). - Definir, acompanhar e defender SLIs e SLOs como instrumentos reais de tomada de decisão. - Implementar e aprimorar observabilidade: monitoramento, logs e tracing (Prometheus, Grafana, Elastic/Sentry). - Responder a incidentes (on-call), reduzir MTTR e conduzir post-mortems com aprendizados concretos. - Automatizar processos operacionais, pipelines de CI/CD e eliminar toil de forma sistemática. - Monitorar e otimizar custos de infraestrutura (FinOps), garantindo uso eficiente dos recursos computacionais. - Apoiar decisões arquiteturais equilibrando custo, performance, segurança e confiabilidade. Qualifications - Cloud: Azure, AWS ou GCP — gerenciamento de ambientes, redes, IAM e custos - Kubernetes (kubectl, Helm) e Docker — operação e troubleshooting em produção - IaC: Terraform/Ansible - CI/CD: GitHub Actions/Jenkins - Linux avançado - Observabilidade: Prometheus, Grafana + ao menos uma ferramenta APM (Elastic, Sentry) - Redes e segurança: DNS, TCP/IP, Load Balancer, VPN, Firewalls, IAM - Scripting: Bash e Python - Inglês técnico (leitura de documentação) Differentials - Certificações: CKA, AWS, Azure ou Terraform Associate. - Mensageria: Kafka ou RabbitMQ - GitOps: ArgoCD ou FluxCD - Experiência com Platform Engineering - Conhecimento em FinOps e otimização de custos de infraestrutura Benefits - Cartão Multibenefício (Swile) - Plano de Saúde - Unimed - Conexa Plus + Psicologia Viva - Plano Odontológico - Metlife - Seguro de vida - Metlife - TotalPass - Day Off no mês do aniversário de vida - Parceria com curso de inglês e espanhol - 20 dias úteis de descanso
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Provide ongoing DevOps and security guidance to engineering and leadership • Review current infrastructure (cloud, CI/CD, access controls) and recommend improvements • Conduct periodic security audits and risk assessments • Advise on and help implement best practices across cloud security, IAM, and data protection • Support incident response for security-related events, as well as helping refine our incident response procedures • Review and strengthen deployment pipelines and system architecture • Assist with security tooling selection and implementation (monitoring, alerting, vulnerability scanning) • Help ensure alignment with SOC 2 and general compliance standards • Partner with engineering on secure system design and new builds when needed • Document recommendations and maintain lightweight security playbooks
Senior Network Reliability Engineer – DGX Cloud
NVIDIANVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! Applications for this job will be accepted at least until June 15, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
• Engage in 24/7 global shift rotations to provide remote support for network repairs and changes while collaborating across teams and updating customers on status and ticket information. • Drive operational improvements in change management and daily operations by following procedures. • Manage and operate large scale IP network technologies and infrastructures. • Utilize your skills in Peering and Datacenter interconnect technologies: PNI, Transit, Exchange, Passive DWDM, Wave circuits. • Monitor and support the network health of on-premises and cloud infrastructures. • Collaborate and develop workflow enhancements while documenting best practices.
Senior DevSecOps & Infrastructure Engineer
AssureSoft - CareersAssureSoft is a multinational software development and information technology company providing strategic consulting, technology services, and outsourcing business processes. We work to innovate and create quality software with motivated, passionate, and qualified teams that develop in an environment of professional, stable growth and continuous learning. Inclusive Opportunities for Every Talent. At AssureSoft, we believe that true innovation is born from diversity—of ideas, experiences, and perspectives. That’s why our hiring practices are inclusive and reflect a firm commitment to equity and equal opportunity. Here, every person—regardless of origin, gender, orientation, or beliefs—finds a space to grow, contribute, and be valued not only for their talent, but also for who they are.
Role Description - Infrastructure as Code (IaC): Design and maintain production-grade infrastructure using Terraform. - Cloud Networking: Manage complex AWS Layer 3 networking configurations, including VPCs, Transit Gateways, Direct Connect, and routing tables to ensure secure and performant cross-account connectivity. - Container Orchestration: Manage and scale production Kubernetes (K8s) clusters (EKS). - GitOps & CI/CD: Build and optimize deployment pipelines using GitHub Actions (GHA), implement and maintain GitOps workflows with ArgoCD. - Observability: Configure and maintain comprehensive monitoring, logging, and alerting using Datadog, Cloudwatch, and PagerDuty. - System Administration: Manage Linux-based systems, high-performance Nginx configurations, and secure SFTP gateways for data ingestion. - Collaboration: Act as a subject matter expert on Git workflows and branching strategies to support seamless developer experiences. - Drive security across all layers — secrets management, IAM least-privilege, vulnerability scanning, compliance-as-code. Qualifications - Experience: 4–7 years of hands-on experience in DevOps, Cloud Infrastructure, or related roles. - Cloud: Extensive experience with AWS (Compute, Storage, IAM, Cost Savings). - Networking: Deep understanding of OCI model. - Orchestration: Expert-level knowledge of Docker and Kubernetes (K8s). - Automation: Proven experience with Terraform and Ansible. - Continuous Delivery/GitOps: Hands-on experience with ArgoCD and GitHub Actions (GHA), Helm and Kustomize. - Linux/OS: Advanced Linux system administration and shell scripting. - Web Servers: Experience configuring and tuning Nginx. - Monitoring: Proficiency in Datadog for APM and Infrastructure monitoring. - Protocols: Experience managing secure data transfer protocols (SFTP). - Version Control: Expert knowledge of Git. - Security: Familiar with SOC2, NIST or HITRUST. - Background in software development or application architecture (microservices, APIs, backend patterns) — able to think like a developer, not just an operator. Requirements - AWS Certified Solutions Architect or SysOps Administrator. - CKA - Certified Kubernetes Administrator. Benefits - Great Place To Work certification. - A company with more than 15 years of experience. - Work with world-class clients and long-term projects. - English scholarships for an external institute. - English classes with company teachers. - State-of-the-art tools and resources. - Certifications for your professional growth. - Recreation and leisure activities. - Compliance with the regulations and labor rights of your region. Company Description AssureSoft is a multinational software development and information technology company providing strategic consulting, technology services, and outsourcing business processes. We work to innovate and create quality software with motivated, passionate, and qualified teams that develop in an environment of professional, stable growth and continuous learning. Inclusive Opportunities for Every Talent. At AssureSoft, we believe that true innovation is born from diversity—of ideas, experiences, and perspectives. That’s why our hiring practices are inclusive and reflect a firm commitment to equity and equal opportunity. Here, every person—regardless of origin, gender, orientation, or beliefs—finds a space to grow, contribute, and be valued not only for their talent, but also for who they are.
Role Description The Community You Will Join: - Own the Storage SRE technical roadmap across a 12+ month horizon, setting the direction for how the team deepens its operational model as it takes on new database technologies alongside its existing systems. - Lead and grow a team of engineers by providing mentorship, timely feedback, and career development support to build a high-performing, inclusive team. - Drive the generalization of cluster lifecycle, schema management, and observability tooling as the team broadens its database technology support. - Partner with engineering teams across Airbnb as the primary expert on reliable database adoption, helping them work with mission-critical storage systems safely and efficiently at scale. - Establish and uphold operational excellence standards covering on-call strategy, incident response, backup and disaster recovery, and systemic reliability improvements. - Collaborate with storage infrastructure and platform teams to ensure Storage SRE’s tooling and observability stay current as the broader storage platform evolves. - Improve the developer experience for engineers working with high-traffic transactional storage systems. - Drive performance, security, scalability, and availability initiatives across Airbnb’s database systems. - Communicate technical strategy and trade-offs clearly to engineers and senior leadership. Qualifications - 9+ years of relevant industry experience in database infrastructure, storage systems, or site reliability engineering. - 3+ years of engineering management experience leading SRE, infrastructure, or platform teams. - Demonstrated track record of building high-performing teams by hiring strong engineers, developing talent, and maintaining team health through periods of change. - Strong technical foundation with the ability to partner with technical leads on architectural decisions, roadmap tradeoffs, and delivery quality. - Proven ability to lead a team through a technology transition while maintaining operational rigor on existing systems. - Solid understanding of distributed systems, cloud infrastructure, and production database operations. - Strong communicator able to cut through ambiguity and represent the team credibly to senior leadership. Requirements - This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. - While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. - Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list. - If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from. Benefits - Our job titles may span more than one career level. The actual base pay is dependent upon many factors, such as: training, transferable skills, work experience, business needs and market demands. - The base pay range is subject to change and may be modified in the future. - This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits. - Pay Range: $212,000 — $265,000 USD. Company Description Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way.


