Engenheiro DevOps
Location
Brazil
Posted
4 days ago
Salary
0
Seniority
Senior
Job Description
Engenheiro DevOps
Phoebus
• Automação de processos de CI/CD • Gerência de configuração • Automação de provisionamento infraestrutura • Compreender os produtos da empresa bem como suas regras de negócio • Desenvolver os requisitos dos produtos com base na especificação • Reportar resultados de atividades bem como tirar dúvidas com o líder técnico e PO • Identificar, reportar e tratar dívidas técnicas • Analisar e corrigir problemas em ambiente produtivo.
Job Requirements
- Graduado em Ciências da Computação e afins (Análise e Desenvolvimento de Sistemas, Sistemas da Informação, Engenharia da Computação, Redes de Computadores)
- Conhecimento dos conceitos da cultura DevOps
- Processos de integração e entrega contínua (CI/CD)
- Ferramentas de CI/CD
- Ferramentas de gerência de configuração
- Conhecimento de arquitetura distribuída
- Terraform
- Docker e docker-compose
- Kubernetes
- Linux
- Shell script
- Amazon AWS
- GIT
- Gitlab e Nexus
- Saber programar em alguma linguagem (diferencial)
- Experiência em arquitetura com microsserviços (diferencial)
- Certificação Kubernetes (CKA) (diferencial)
- Certificações AWS (diferencial)
- Ecossistema Java (diferencial)
- Conhecimento sobre observabilidade (diferencial)
Benefits
- Vale alimentação
- Plano de Saúde
- Plano Odontológico
- Auxílio Home Office
- Auxílio Cultura
- GymPass com coparticipação (Wellhub)
- Bolsa de 50% para cursos de idiomas (Inglês ou Espanhol)
- Apoio a Capacitação Interna
- Horário flexível (Banco de Horas)
- Carga Horária semanal de 40h
- Seguro de vida em grupo.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Principal Site Reliability Engineer
SonicWallDelivering real-time breach detection and prevention solutions backed by SonicWall Capture Threat Network.
• Own the reliability, scalability, and operational excellence of our Cloud-based services. • Define and enforce reliability standards. • Drive the adoption of SRE practices across engineering teams. • Build the systems and tooling that keep our production infrastructure healthy. • Define, publish, and continuously refine Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) for all critical services, partnering with product and engineering leadership. • Own the error budget framework: track consumption, enforce error budget policies, and drive reliability investments when budgets are at risk. • Lead the design and implementation of comprehensive observability platforms — metrics, structured logging, and distributed tracing — to ensure full visibility into production systems. • Drive toil reduction initiatives by identifying and automating repetitive, manual operational work, targeting measurable reduction in operational burden across teams. • Design and execute chaos engineering programs to proactively uncover reliability weaknesses in our infrastructure and services before they impact customers. • Lead blameless postmortem culture: facilitate incident retrospectives, extract systemic learnings, and track corrective action items to completion. • Build and improve on-call incident response processes, runbooks, and escalation paths; manage and optimize on-call rotation health to prevent burnout. • Help design, build, and support infrastructure and security technologies within the cloud that offer resiliency, observability, and optimized cost. • Develop solutions for automated deployment of software and services on our production infrastructure hosted on AWS, applying reliability engineering principles throughout. • Shape how mission-critical enterprise software solutions are developed and deployed using optimized CI/CD pipelines that embed reliability and quality gates. • Develop management solutions for services across multiple cloud platforms and data centers, with a focus on fault tolerance and graceful degradation. • Collaborate with developers to bring new features and services into production using production-readiness reviews and launch checklists. • Champion reliability engineering best practices across the organization, embedding SRE principles into the software development lifecycle. • Mentor team members on SRE philosophy, technical decision-making, code reviews, and cloud engineering best practices. • Participate in roadmap planning, identify areas of improvement, and perform technology evaluation and selection.
• Collaborate with Technology Infrastructure teams to build and operate reusable, cloud-native platforms that abstract complexity and accelerate delivery while incorporating reliability from design through operations. • Work with business units and technical teams to improve application availability, observability, and reliability as our business applications are migrated to the Private Cloud. • Enhance platform reliability through automatic problem detection, self-healing systems, and well-architected notification and escalation protocols. • Use SLOs, SLIs, and KPIs to guide prioritization, measure impact, and drive continuous improvement. • Eliminate toil using intelligent automation and agentic workflows. • Conduct blameless retrospectives and share learnings across the organization. • Foster a culture of ownership, positive thinking, and continuous learning while remaining grounded in practicality, experimentation, and engineering excellence. • Integrate DevSecOps, zero-trust principles, and policy-as-code into every pipeline. • Produce and promote Architecture Decision Records (ADRs) and Cloud Well-Architected Frameworks that our business units can adopt to improve our technology standardization. • Maintain 24x5 active coverage with seamless regional handoffs and weekend escalation protocols.
Role Description As a Forward Deployed Engineer (FDEs) you will operate at the intersection of product engineering and customer impact. - Embedded within some of Okta’s most strategic customers, working side-by-side with their engineering teams. - Write production code, shape technical architecture, and influence how Okta's products are used at scale. - Act as a key liaison to understand customer needs and what engineering teams can build at scale. This role is ideal for engineers who want to: - Lead the development of Lab/Demo assets and AI prototyping. - Create "The Engine" for sales velocity by aligning pricing strategies and technical execution with field reality. - Develop reference architectures, whitepapers, and standardized POC guides. - Create centralized hubs and reporting dashboards to scale AI expertise globally. - Provide strategic guidance to SEs, TAMs, and Professional Services. - Act as a feedback loop, funneling field insights back to Product. - Represent Okta's AI strategy at conferences and analyst briefings. - Partner with the CPO, CISO, and Engineering to influence roadmaps with "AI-first" innovations. - Serve as a trusted advisor to CISOs and CIOs. - Lead high-impact POCs to secure "The Win." - Identify technical upsell opportunities through successful usage. - Partner with Post-Sales teams to develop runbooks and implementation guides. Qualifications - 10+ years in identity, cybersecurity, AI, or Data Science. - Experience driving technical customer success beyond the initial sale, including implementation, health monitoring, and expansion. - Deep understanding of AI/ML technologies (anomaly detection, LLMs) and Agentic Frameworks (A2A, MCP, and relevant protocols). - Strong credibility with enterprise CISOs and boards, with a proven track record of shaping cybersecurity strategy. - Ability to create repeatable playbooks and "backbone" assets that allow a sales organization to scale. - Prior experience as a CTO, Field CTO, or Distinguished Architect in a cybersecurity or SaaS company. - Advanced degree in a technical or business-related field. Requirements - Strong operational mindset. - Proven leadership background. Benefits - Health, dental, and vision insurance. - 401(k) plan. - Flexible spending account. - Paid leave, including PTO and parental leave.
Site Reliability Engineer
AlpacaDeveloper APIs for stocks and crypto trading, investing apps, and embedded fintech.
• Operate production day-to-day - oncall, incident response, postmortems, and the follow-ups that actually close the loop. • Own reliability practice - define and refine SLIs/SLOs and error budgets, and help product teams live within them. • Strengthen our observability across metrics, logs, traces, and alerting. • Ship infrastructure through code in a GitOps workflow - cloud resources and Kubernetes workloads alike. • Look after PostgreSQL: performance tuning, schema and migration review, online migrations on large tables, HA/DR, and CDC pipelines. • Mentor engineers on reliability and database fundamentals through code review, design review, and pairing.




