Job Closed
This listing is no longer active.
Corporate Alumni Engagement & Management Platform For The Enterprise
DevOps, Cloud & Infrastructure Engineer
Location
Brazil
Posted
128 days ago
Salary
$50K - $70K / year
Seniority
Senior
Job Description
DevOps, Cloud & Infrastructure Engineer
EnterpriseAlumni
• Design, build, and maintain infrastructure on AWS (ECS, EKS, RDS) • Manage and scale Kubernetes clusters (EKS) using Helm • Develop and maintain infrastructure as code using Terraform / Terragrunt • Improve and maintain CI/CD pipelines (Jenkins) • Automate operational tasks using Bash and Python • Work with Docker to build and optimize containerized workloads • Implement and maintain observability solutions (Prometheus, Grafana, OpenSearch) • Ensure system reliability, scalability, and security (Linux hardening, OS-level tuning) • Troubleshoot production issues across infrastructure, networking, and applications • Collaborate with engineering teams and participate in architectural decisions
Job Requirements
- Senior-level experience (5+ years in DevOps / SRE / Platform roles)
- Upper-intermediate or fluent English
- Ability to work independently and take ownership
- Strong experience with Kubernetes (EKS) + Helm
- Solid hands-on experience with AWS
- Experience with Terraform (preferably Terragrunt)
- Strong knowledge of Docker and containerization
- Experience with CI/CD (Jenkins or similar)
- Good scripting skills (Bash and/or Python)
- Strong Linux/system administration background
- Good understanding of networking (TCP/IP, DNS, routing, load balancing)
- Experience with observability tools (Prometheus, Grafana, OpenSearch)
Benefits
- N/A
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer – SkillBridge Intern
ZscalerWe make it easy to secure your cloud transformation. Get fast, secure, and direct access to apps without appliances.
• Manage operational tasks for products in US Government classified environments, including deployments, on-call duties, and incident management. • Develop scripts, containerized services, and monitoring mechanisms to automate operations tasks and ensure minimal service disruption. • Create operations documentation and implement measures to prevent recurring incidents while contributing to DevOps best practices. • Build and enhance Zscaler services within classified environments, ensuring 24x7 coverage including night and holiday shifts.
Senior Manager, Site Reliability Engineering – SRE
UJETEnabling the development of electric vehicles of the future. From #materialscience to ultimate #emobility products.
• Build and lead a new SRE team, including hiring, onboarding, and career development • Define and implement SRE best practices: SLIs/SLOs, error budgets, incident management, on-call models, and postmortems • Establish clear operational ownership between SRE and product engineering teams • Drive reliability as a feature, balancing velocity and stability with data—not vibes • Reduce toil through automation and self-service platforms • Design and evolve incident response, escalation, and learning loops (no blame, lots of learning) • Partner with engineering leaders to influence architecture, capacity planning, and launch readiness • Own reliability metrics and communicate risk and performance clearly to technical and executive audiences
Senior Site Reliability Engineer
PandaDocPandaDoc is a computer software company that is working to empower clients “to streamline their process” to negotiate, generate, and sign a variety of documents and provide the
• Own and influence the incident management process end-to-end • Maintain and evolve on-prem observability stack • Keep production applications running smoothly by participating in the on-call rotation • Develop automations and tools to support platform reliability • Contribute to production services with performance and resiliency in mind • Collaborate with product engineers to foster SRE principles within the R&D organization • Be a mentor for the SRE team or product engineers
Senior Site Reliability Engineer
PandaDocPandaDoc is a computer software company that is working to empower clients “to streamline their process” to negotiate, generate, and sign a variety of documents and provide the
• Own and influence the incident management process end-to-end • Maintain and evolve on-prem observability stack • Keep production applications running smoothly by participating in the on-call rotation • Develop automations and tools to support platform reliability • Contribute to production services with performance and resiliency in mind • Collaborate with product engineers to foster SRE principles within the R&D organization • Be a mentor for the SRE team or product engineers



