Job Closed
This listing is no longer active.
We help companies achieve their goals and expand their business through technology.
DevOps Engineer
Location
Brazil
Posted
73 days ago
Salary
0
Seniority
Senior
Job Description
DevOps Engineer
Thaloz
• Design, deploy, and manage containerized workloads using Amazon ECS (Elastic Container Service) and Amazon EKS (Elastic Kubernetes Service). • Build and maintain CI/CD pipelines to automate software delivery workflows. • Develop and manage Docker container images, registries (ECR), and container lifecycle best practices. • Implement Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or CDK. • Monitor, troubleshoot, and optimize cloud infrastructure performance, availability, and cost. • Enforce security best practices across containerized environments (IAM roles, network policies, secrets management). • Collaborate with software engineers to containerize applications and migrate workloads to ECS/EKS. • Manage Kubernetes cluster configurations, namespaces, Helm charts, and service mesh integrations. • Define and maintain observability standards using tools like CloudWatch, Prometheus, Grafana, or Datadog. • Participate in on-call rotations and incident response processes.
Job Requirements
- 5+ years of experience in a DevOps, Platform Engineering, or Site Reliability Engineering role.
- Advanced expertise in Amazon ECS – task definitions, services, capacity providers, Fargate & EC2 launch types.
- Advanced expertise in Amazon EKS – cluster provisioning, node groups, autoscaling, RBAC, and networking (VPC CNI, CoreDNS).
- Deep knowledge of Docker and container best practices (multi-stage builds, image optimization, security scanning).
- Strong experience with Kubernetes concepts: Deployments, StatefulSets, DaemonSets, Ingress, ConfigMaps, Secrets, HPA/VPA.
- Proficiency in Infrastructure as Code (Terraform preferred).
- Solid understanding of AWS networking (VPC, subnets, security groups, ALB/NLB, Route 53).
- Experience with CI/CD tools such as GitHub Actions, Jenkins, GitLab CI, or AWS CodePipeline.
- Strong scripting skills in Bash, Python, or similar languages.
- Familiarity with GitOps workflows (ArgoCD, Flux).**
- Nice to Have:**
- AWS Certifications: AWS Certified DevOps Engineer – Professional, AWS Certified Solutions Architect.
- Kubernetes Certifications: CKA (Certified Kubernetes Administrator) or CKAD.
- Experience with service mesh technologies (Istio, AWS App Mesh).
- Knowledge of FinOps practices for container cost optimization.
- Experience with multi-account AWS Organizations and landing zone architectures.
- Familiarity with security tools such as Trivy, Snyk, or AWS Security Hub.**
- Soft Skills:**
- Strong problem-solving and analytical mindset.
- Excellent communication skills – able to translate complex infrastructure topics to non-technical stakeholders.
- Proactive, self-driven, and able to work in a fast-paced, agile environment.
- Team player with a collaborative approach to cross-functional work
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer
Cycloid - Sustainable Platform EngineeringPromote efficient software delivery along with digital sobriety: self-service portal, orchestration, finops, greenops
• Work with the product team on user story and UX/UI validation • Participate in backend development in Go • Contribute to Open Source software (Ansible, Concourse, Terraform, etc.) • Design, build, migrate, maintain and run the platform in a multi-cloud environment AWS, GCP, Azure • Collaborate with developers to fix problems and provide long-term solutions • Support presales/sales team as a solution architect • Define goals, agenda and priorities • Share ideas and knowledge with others
SRE, DevOps Engineer
Kraken Digital Asset ExchangeWe put the power in your hands to buy, sell, and trade digital currency 🌏
• Build and support infrastructure and tools, on-prem and in the cloud • Drive standardisation: Author RFCs and internal guides covering process improvements, reliability patterns, and best practices • Support, and guide engineers on SRE related topics • Partner with product-development teams to identify, and eliminate friction
• Collaborate closely with architects, developers, QA, and security teams to ensure smooth and reliable environment operations • Work in close partnership with the platform team, based on shared ownership, knowledge exchange, and mutual support • Own and operate containerized application platforms based on Docker and Kubernetes, ensuring reliability, scalability, and operational excellence • Design and deliver dynamic test environments at scale, including multiple parallel, per–merge request (branch-based) deployments • Build, maintain, and standardize CI/CD pipelines by creating reusable templates and components in GitLab CI • Drive deployment automation and GitOps practices • Identify operational bottlenecks and implement automation to reduce manual effort and improve delivery speed • Embed security-by-design across the SDLC, including pipeline hardening and automated security checks • Build and operate observability platforms: monitoring, logging, and diagnostics (Prometheus, Grafana, ELK/EFK/Loki, etc.) • Participate in on-call and incident response, including troubleshooting, root-cause analysis, and post-mortems • Take end-to-end ownership of the solutions you build (“you build it, you run it”).
Customer Reliability Engineer III
GitHub, Inc.GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub. Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are—because we know that people flourish when they can work on their own terms. Join us, and let’s change the world, together.
About GitHub GitHub is the world’s leading platform for agentic software development — powered by Copilot to build, scale, and deliver secure software. Over 180 million developers, including more than 90% of the Fortune 100 companies, use GitHub to collaborate, and more than 77,000 organisations have adopted GitHub Copilot. Locations In this role you can work from Remote, United Kingdom Overview GitHub is growing its Customer Success & Support team and we're seeking experienced professionals to elevate our technical customer support efforts. As a Customer Reliability Engineer III, you will efficiently manage and resolve customer issues and act as a liaison between customers and the engineering team. The ideal candidate will drive transformative customer experiences, ensuring long-term satisfaction and loyalty while fostering innovation and collaboration across teams. This role may require working non-standard working hours, including weekends and holidays on-call as part of a team-wide rotation. Responsibilities - Work with assigned customers via support tickets and/or real-time interaction (phone/screen sharing) to solve technical issues related to their usage of GitHub products often involving Linux servers, source code, and web application issues. - Act as a single point of contact for technical issues with ability to troubleshoot and resolve complex issues independently. - Collaborate with the Support and Engineering teams to resolve product issues requiring code changes. - Lead incident response for outages affecting assigned customers, followed by delivery of postmortem reports. - Act as a single point of contact for specific enterprise customers to provide performance, and best practice advice and assessment related to GitHub and customer's infrastructure. - Understand and maintain documentation around the customer infrastructure, workflows, and configuration of GitHub - Enterprise Server or GitHub Enterprise Cloud environment. - Coordinate and collaborate with other teams at GitHub when additional expertise is needed to resolve customer issues. - Manage customer incidents and outages, including joining Zoom/screen share sessions for live triage. - Perform incident postmortems, ticket analysis, and system health checks for Premium Support customers as needed. - Lead quarterly business reviews for the assigned accounts by presenting metrics, data, and health check summary and recommendations. - Organize and lead weekly/bi-weekly touchpoints with assigned accounts to review ongoing Support issues and projects. - Work proactively with customers on activities such as coordinating upgrades and ensuring their installation is running smoothly. - Set-up and onboard new assigned customers into the program. - Provide weekend on-call support as part of the team rotation (8 hour shifts, during normal work hours). - Update and maintain various repositories, including team and public documentation, and actively contribute to cross-organization strategy discussions. - Ensure that systems and processes comply with security standards and regulations, implementing best practices to protect customer data and maintain system integrity. Qualifications Required Qualifications: - 5+ years' experience in technical customer support, technical writing, system administration, or related roles, - OR bachelor's degree in computer science or related field AND 3+ years' experience in technical customer support, technical writing, system administration, or related roles, - OR equivalent experience. Preferred Qualifications: - Experience leveraging AI tools and technologies to enhance business processes and drive innovation. - Deep knowledge of Git, GitLFS, GitHub Administrator, and GitHub - Worked closely with large complex customer accounts in a technical capacity - Experience with production-level virtualization platform(s) and/or cloud provider(s) (e.g., VMware ESX, KVM, AWS, Azure) - Proficiency with and/or ability to understand and update code and scripts (e.g., Shell, Ruby, Go) - Proficiency in common applications in the web application stack (e.g., HAProxy, Nginx, MySQL, and Unicorn) GitHub values - Customer-obsessed - Ship to learn - Growth mindset - Own the outcome - Better together - Diverse and inclusive Manager fundamentals - Model - Coach - Care Leadership principles - Create clarity - Generate energy - Deliver success Who We Are GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub. Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are—because we know that people flourish when they can work on their own terms. Join us, and let’s change the world, together. Equal Employment Opportunity GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!




