Job Closed
This listing is no longer active.
Senior DevOps SRE
Location
Brazil
Posted
29 days ago
Salary
0
Seniority
Senior
Job Description
Senior DevOps SRE
FCamara Consulting & Training
• Operate and evolve the platform on AWS (with a focus on networking and architecture for Kubernetes) • Administer and optimize Kubernetes clusters, ensuring availability, performance, and security • Work with GitOps using ArgoCD for deployments and release standardization • Maintain and improve CI/CD pipelines, including migrating pipelines from Jenkins to GitHub Actions • Work with Kong Gateway in service mesh patterns, defining policies and traffic flows • Implement and tune scaling strategies with Karpenter and KEDA • Automate provisioning and infrastructure patterns with IaC (Terraform) where needed
Job Requirements
- Strong experience with AWS (especially networking and architecture for Kubernetes workloads)
- Deep knowledge of Kubernetes (core, networking, resources, and troubleshooting)
- Hands-on experience with ArgoCD (GitOps) for deployments and configurations
- Experience with GitHub Actions for CI/CD and pipeline migrations (from Jenkins)
- Experience with Kong Gateway and integrations in service mesh architectures
- Knowledge of scaling with Karpenter and KEDA
- Terraform and IaC (even if not the primary focus, considered a plus)
- Git, DevOps/SRE culture, and automation
Benefits
- People with disabilities (PwD) welcome
- Company profit-sharing
- Support for continuing education (courses, events, and workshops)
- Flexible working hours
- Partially remote (hybrid)
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer
Sigma PrimeInformation security specialists & founders of Lighthouse, an open-source Ethereum consensus client
• Create testnets, devnets, tooling and monitoring systems to help developers identify issues and verify software implementations in a variety of network conditions • Maintain high-performance production instances of our software which require maintenance, monitoring and a very high standard of security practices and processes • Responsible for day-to-day activities as part of the devops team • Assist developers in building core decentralised network infrastructure • Work alongside a round-the-clock devops team
• Design and implement scalable, reliable, and fault-tolerant systems across cloud environments. • Develop and maintain observability tools, including monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK). • Automate infrastructure provisioning, deployment, and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation. • Optimize system performance, scalability, and incident response workflows to improve uptime. • Work closely with development and DevOps teams to improve system design for reliability. • Conduct root cause analysis (RCA) and implement preventative measures to minimize failures. • Ensure high availability by designing and maintaining load balancing, failover, and disaster recovery strategies. • Improve CI/CD pipelines to enhance deployment speed while maintaining stability. • Optimize cloud cost and resource utilization for AWS, Azure, or Google Cloud Platform (GCP). • Participate in on-call rotations to quickly address system failures and minimize downtime.
• Manage and optimize release pipelines to ensure smooth deployment of software updates. • Define and maintain versioning strategies, ensuring consistency across multiple environments. • Coordinate with engineering, QA, and DevOps teams to ensure timely and stable releases. • Automate and improve build, release, and deployment processes for efficiency and reliability. • Monitor and troubleshoot release-related issues, ensuring minimal downtime. • Maintain documentation for release workflows, rollback plans, and deployment strategies. • Ensure compliance with security, performance, and quality standards in all releases. • Work with CI/CD tools (e.g., Jenkins, GitHub Actions, GitLab CI, CircleCI) to manage automated releases. • Implement and maintain feature flagging strategies to enable controlled rollouts. • Analyze release performance and drive continuous improvements in deployment processes.
• Design and implement scalable, reliable, and fault-tolerant systems across cloud environments. • Develop and maintain observability tools, including monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK). • Automate infrastructure provisioning, deployment, and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation. • Optimize system performance, scalability, and incident response workflows to improve uptime. • Work closely with development and DevOps teams to improve system design for reliability. • Conduct root cause analysis (RCA) and implement preventative measures to minimize failures. • Ensure high availability by designing and maintaining load balancing, failover, and disaster recovery strategies. • Improve CI/CD pipelines to enhance deployment speed while maintaining stability. • Optimize cloud cost and resource utilization for AWS, Azure, or Google Cloud Platform (GCP). • Participate in on-call rotations to quickly address system failures and minimize downtime.


