O sistema de gestão nascido na indústria que vive e respira processos industriais e distribuição 💙
Estágio DevOps
Location
Brazil
Posted
4 days ago
Salary
R$1.3K / month
Seniority
Senior
Job Description
Estágio DevOps
Viasoft Korp | Industry ERP
• Auxiliar em rotinas envolvendo: - microsserviços; - containers; - observabilidade; - alta disponibilidade; - integração contínua; - entrega contínua; - monitoramento; - deploy contínuo. • Auxiliar em rotinas de automação de infraestrutura em ambientes cloud e on-premise; • Auxiliar na implantação e evolução de ambientes Kubernetes; • Auxiliar na automatização de processos utilizando Ansible; • Auxiliar na implementação, evolução de monitoramentos e observabilidade com Grafana; • Atuar no auxílio de resolução de incidentes e troubleshooting entre serviços e ambientes; • Auxiliar na garantia de estabilidade, disponibilidade e performance dos ambientes; • Auxiliar na evolução de pipelines e ferramentas de CI/CD; • Documentar procedimentos, fluxos e configurações; • Participar ativamente da evolução tecnológica da plataforma da Korp.
Job Requirements
- Conhecimento em Linux e containers
- Conhecimento em Kubernetes
- Familiaridade com práticas de CI/CD
- Interesse por automação, observabilidade e infraestrutura moderna
- Capacidade analítica e boa resolução de problemas
- Facilidade para atuar em ambientes dinâmicos e de alta complexidade
- Interesse genuíno por tecnologia e aprendizado contínuo.
- Noções de Ansible
- Noções de Jenkins
- Noções de Grafana e observabilidade
- Noções de cloud computing
- Noções em microserviços e ambientes distribuídos.
Benefits
- Bolsa-auxílio de R$1.300,00
- Plano de carreira
- Seguro de vida
- Auxílio Home Office.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Owning the SRE infrastructure lifecycle from design reviews and pre-rollout readiness assessments through production sign-off and ongoing reliability management • Designing and implementing frameworks that reflect customer experience for load balancing services and driving action when error budgets are at risk • Building and maintaining observability pipelines from load-balancing components and system-level sources to dashboards that enable rapid incident triage • Leading technical incident response for complex NB/NLB failures, acting as the technical commander and driving root cause analysis and preventive follow-through • Developing and automating safe deployment workflows for phased releases, including bake-period monitoring, feature flag management, and validation across global datacenter rollouts • Reviewing design documents, product-requirement documents and producing actionable SRE input on operational risks, capacity implications, Day-2 concerns, and product strategy gaps • Building automation and tooling using Python or Go that reduces operational toil and improves team-wide operational capability
• designing, developing, testing, and operating critical services that support the reliability, scalability, and performance of our infrastructure • designing and implementing observability solutions, including monitoring, logging, alerting, and telemetry capabilities, to proactively detect and resolve issues • driving reliability improvements through automation, reducing operational toil and increasing the resilience of engineering processes • developing deep technical expertise in IAC systems and serving as a trusted technical resource, mentoring engineers and sharing best practices • collaborating with software engineering, infrastructure, and platform teams to investigate complex production issues, identify root causes, and implement long-term corrective actions • participating in an on-call rotation and providing leadership during incident response, driving timely service restoration, effective communication, and post-incident improvement efforts.
• Designing, developing, testing, and operating critical services that support the reliability, scalability, and performance of our infrastructure. • Designing and implementing observability solutions, including monitoring, logging, alerting, and telemetry capabilities, to proactively detect and resolve issues • Driving reliability improvements through automation, reducing operational toil and increasing the resilience of engineering processes. • Developing technical expertise in IAC systems and serving as a trusted technical resource, mentoring engineers and sharing best practices • Collaborating with software engineering, infrastructure, and platform teams to investigate complex production issues, identify root causes, and implement long-term corrective actions. • Participating in an on-call rotation and providing leadership during incident response, driving timely service restoration, effective communication, and post-incident improvement efforts.
Senior Site Reliability Engineer
DatavailWe help clients turn data into decisions no matter where it lives-in apps, on-prem, in a hybrid model, or in the cloud.
• Define and maintain SLIs/SLOs, monitor alignment and error budget usage • Lead incident response and postmortems, implement corrective measures • Automate operations tasks via tooling (e.g. auto-remediation, scaling rules) • Build, improve, and maintain CI/CD pipelines, canary deployments, blue/green strategies • Lead technical discussions with customers to align on reliability, scalability, and performance requirements • Drive continuous platform improvements across the service lifecycle, including architecture, monitoring, and operational processes • Implement and extend observability systems (metrics, tracing, log aggregation) • Optimize performance and cost by tuning cloud services, autoscaling, resource rightsizing • Design, deploy, and operate containerized workloads using Docker and Kubernetes in production environments • Collaborate with dev teams to integrate resilience patterns (circuit breakers, bulkheading) • Participate in architecture discussions around high availability, disaster recovery • Mentor mid and junior SREs; conduct reliability design reviews


