Job Closed
This listing is no longer active.
Your Sidekick for AI, Cloud-Native & Real-Time Data Engineering | Scalable Innovations in Auto, EV, SaaS & Cybersecurity
DevOps Engineer
Location
Latin America
Posted
81 days ago
Salary
0
Seniority
Senior
Job Description
DevOps Engineer
NaNLABS
- Design, implement, and maintain scalable and secure cloud infrastructure in AWS
- Manage infrastructure as code using Terraform to ensure consistency, automation, and reliability across environments
- Build and improve CI/CD pipelines using GitHub Actions to enable efficient and reliable software delivery
- Manage Kubernetes infrastructure (EKS) and support networking, security, and access management within AWS environments
- Implement observability practices across services using modern monitoring and alerting tools
- Support and scale infrastructure for data and machine learning workloads
- Collaborate closely with engineering and data teams to ensure infrastructure supports product scalability and performance
- Define and promote DevOps best practices related to automation, reliability, and infrastructure standards
- Participate in incident response and help improve reliability through proactive monitoring and infrastructure improvements
- Communicate technical decisions, risks, and trade-offs clearly with both technical and non-technical stakeholders
Job Requirements
- Strong experience working in DevOps, SRE, or Cloud Infrastructure roles supporting production systems
- Deep experience with AWS infrastructure and Kubernetes (EKS)
- Solid experience managing infrastructure using Terraform
- Experience building and maintaining CI/CD pipelines with GitHub Actions
- Experience managing or supporting data infrastructure, including tools like Kafka or Databricks
- Familiarity with modern observability practices and monitoring tools
- Experience supporting scalable, distributed systems in production environments
- Ability to communicate technical concepts clearly and collaborate effectively with engineering teams and stakeholders
- Strong sense of ownership, autonomy, and proactive problem solving
- Experience anticipating infrastructure risks and implementing mitigation strategies
- English level B2 or higher.
Benefits
- fully flexible, self-managed vacation
- sick leave and personal days
- public holidays
- paternity and maternity leave
- study leave
- moving days
- training in best practices and tech
- books and light talks
- in-house English classes
- continuous feedback
- 1:1 career development sessions
- flexible working hours
- equipment and work materials provided
- internal events and team activities
- a day off on your birthday
More DevOps Engineer Jobs
Senior DevOps Engineer
Nearsure
Remove the barriers to growth by scaling your team fast with top-notch Latin American IT talent
- Design, build, and standardize CI/CD pipelines (Azure DevOps preferred) to support multiple development teams.
- Build and evolve a self-service, GitOps-driven platform on Azure.
- Contribute to the architecture, design, and implementation of Azure Kubernetes Service (AKS) environments.
- Contribute to architecture and security configuration decisions within Azure and Kubernetes.
- Implement Infrastructure as Code using Terraform and Helm to standardize and automate cloud environments.
- Establish reusable automation patterns and platform standards across teams.
- Integrate and automate security tooling within CI/CD pipelines in alignment with the dedicated security team.
- Maintain and optimize existing pipelines and platform components.
- Standardize CI/CD and deployment practices across 7–8 parallel teams.
- Enable development teams through platform improvements and self-service capabilities.
- Troubleshoot and debug production issues across services, infrastructure, and Kubernetes clusters.
- Participate in incident response and post-mortem activities when required.
- Collaborate closely with cross-functional and distributed teams to ensure consistent platform evolution.
- Take ownership of platform components, driving improvements proactively and independently.
Site Reliability Engineer
Flex Dental Solutions
Flex is a collection of smart and easy-to-use tools that pair with Open Dental to supercharge your patient engagement.
- Own the availability, performance, and resilience of our production systems.
- Partner closely with engineering, product, and leadership to reduce operational risk.
- Proactively monitor application health and performance across cloud infrastructure (AWS).
- Lead incident response, including triage, mitigation, and root cause analysis (RCA).
- Lead and participate in disaster recovery drills and security incident simulations.
- Build and maintain Infrastructure as Code (IaC) using AWS-native tooling.
- Collaborate with development teams to improve CI/CD reliability.
- Work closely with stakeholders and product teams to ensure technical reliability aligns with business needs.
- Reduce operational toil through automation, tooling, and process improvements.
- Champion best practices across security, availability, performance, and incident response.

- Manage complex cloud-based architectures using AWS services like EC2, Lambda, RDS, S3, EFS, and VPC to support scalable, high-performance game development workflows.
- Implement test automation and ensure pipeline efficiency across platforms.
- Develop and enforce best practices for configuration management, infrastructure as code (IaC), and version control.
- Define SLAs, SLOs, and SLIs, runbooks, and other monitoring and incident response patterns, including automation and notifications.
- Ensure OSE’s CI/CD cloud infrastructure is scalable, reliable, performant, and secure.
- Work with game teams to build out, maintain, and evolve the company’s build, test, and deployment systems and pipelines (CI/CD and test automation).
- Architect and optimize continuous integration and deployment pipelines for building, testing, and deploying PC/console games using Jenkins, Git, and Perforce.
- Design and implement custom metrics and dashboards to monitor performance, availability, and reliability of game servers, CI/CD pipelines, and backend services, ensuring operational transparency and actionable insights.
- Profile and optimize system resource usage (CPU, memory, disk I/O, and network) for both game servers and CI/CD systems, ensuring efficient resource allocation and minimizing bottlenecks in the pipeline and production environments.
- Implement log aggregation and analysis systems using AWS CloudWatch Logs to centralize logs across services, making troubleshooting and incident response efficient.
- Beyond gathering functional requirements, work diligently to surface quality attributes and other non-functional specifications.
- Develop and maintain concise software and technical design documents.
- Work well as part of a collaborative and flexible team.
- Stay up-to-date with industry trends and emerging technologies in DevOps and gaming, recommending improvements to enhance our development processes.
Senior DevOps Engineer – Azure, AKS
opinov8
Globally recognized digital and engineering solutions partner.
- Architect and implement scalable, production-grade infrastructure on Azure and AKS;
- Lead infrastructure delivery for an ML platform (TimescaleDB, message queues, autoscaling);
- Design and own CI/CD pipelines and deployment strategies;
- Set the bar for infrastructure quality: changes are tested, validated, and documented before sign-off;
- Establish best practices across the team.




