Discover the best solution for documenting fiber optic networks.
Senior Platform Engineer – DevOps
Location
Brazil
Posted
31 days ago
Salary
R$20K / month
Seniority
Senior
Job Description
Senior Platform Engineer – DevOps
OZmap
• Evolve and structure the observability stack (metrics, logs, and traces), ensuring operational visibility and reliability; • Build and improve CI/CD pipelines with a focus on security, predictability, and deployment efficiency; • Implement DevSecOps practices (SAST, DAST, hardening, and vulnerability analysis); • Manage and evolve AWS environments (EC2, networking, IAM, and multi-environment setups); • Work closely with the development team to raise the technical level of operations; • Perform advanced troubleshooting, incident analysis (RCA), and structured post-mortems; • Automate processes and drive continuous improvements in infrastructure; • Support the evolution of the architecture towards more modern and scalable environments.
Job Requirements
- Strong experience with AWS (especially EC2, networking, IAM, and security);
- Proficiency in Linux and networking, with strong troubleshooting skills;
- Solid experience with CI/CD and infrastructure automation;
- Experience with GitHub Actions;
- Experience with observability tools (Prometheus, Grafana, Loki, or similar);
- Practical knowledge of security (SAST, DAST, vulnerability analysis);
- Proactive mindset with the ability to identify issues and implement solutions;
- Experience with Kubernetes / EKS;
- Experience in high-scale or fast-growing environments;
- Experience with GitOps (Argo CD or similar);
- Knowledge of FinOps and cloud cost optimization;
- Experience with tools such as SonarQube, Trivy, OWASP ZAP, DefectDojo.
Benefits
- Equipment allowance – to ensure a comfortable work setup;
- Health benefits – because your well-being matters;
- Education support – we support your continuous development;
- Birthday gift – because we like to celebrate together;
- Service anniversary recognition – your time with us is valued;
- Language learning support – to help you go beyond borders;
- TotalPass (employee-only access);
- Paid leave after 12 months of employment;
- Online integration events and social gatherings.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Design, implement, and maintain CI/CD pipelines to automate build, test, and deployment processes. • Manage and optimize cloud infrastructure on AWS, Azure, or Google Cloud Platform (GCP). • Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, or Pulumi. • Deploy and manage containerized applications using Docker and Kubernetes. • Monitor system performance, identify bottlenecks, and improve infrastructure reliability. • Establish and enforce security best practices, including identity management, logging, and vulnerability management. • Work with development teams to improve deployment efficiency and troubleshoot infrastructure issues. • Optimize cloud resource utilization to ensure cost-effectiveness and scalability. • Automate infrastructure provisioning, configuration management, and system maintenance. • Stay up-to-date with emerging DevOps technologies, tools, and best practices.
Senior Software Engineer, Site Reliability
BabylistBabylist eases the path to parenthood, offering helpful content, a curated store, and a universal online baby registry through which new parents can discover, request, and buy prod
Who We Are Babylist is the leading registry, e-commerce, and content platform for growing families. More than 9 million people shop with Babylist every year, making it the go-to destination for seamless purchasing, trusted guidance, and expert product recommendations for new parents and the people who love them. What began as a universal registry has grown into a full ecosystem for new parents, including the Babylist Shop, Babylist Health, and a flagship showroom in Los Angeles. Hundreds of brands in baby and beyond partner with Babylist to engage meaningfully with families during one of life’s most important transitions. With over $1 billion in annual GMV, and more than $500 million in 2024 revenue, Babylist is reshaping the $320 billion baby product industry. We’re helping parents feel confident, connected, and cared for at every step. As we build the generational brand in baby, our mission remains simple: to connect growing families with everything they need to thrive.To learn more, visit www.babylist.com. Our Ways of Working Babylist is remote-first with team members across the U.S. and Canada who move fast, think smart, and use AI as part of how they work every day — not as an experiment, as an expectation. We come together twice a year to build the relationships behind the work, and we hire people who are genuinely excited about what's possible and prove it through how they show up. How We Build Babylist is in the middle of a fundamental shift in how software gets made, and we are not tiptoeing into it. We are rebuilding our engineering culture around a simple belief: AI changes everything. How teams are structured, how decisions get made, how fast ideas become working software. Our engineers own problems end to end, working directly with product, design, and business partners with short feedback loops and real stakeholder access. We ship, learn, and iterate fast. When something is not working, we throw it out and start over — project failure and personal failure are not the same thing here. AI tools are as natural to our workflow as an IDE or version control. We are not exploring this, we are living it. Our engineers use AI to explore tradeoffs, pressure-test designs, and move from problem to solution in hours instead of days. They generate code with AI so they can stay focused on the decisions that actually require human judgment — not the routine ones. More velocity means more time for craft: better test coverage, stronger architecture, and deeper customer understanding. We hold ourselves to a higher quality bar because of AI, not in spite of it. We are building this playbook in real time, and we are looking for people who want to build it with us. If you have already changed how you work because of AI — or you are ready to — and you care more about shipping something great than following a prescribed process, we should talk. Our Tech Stack - Ruby on Rails - React - AWS - Sidekiq - MySQL - Redis - Native iOS and Android What the Role Is Babylist is looking for a Senior Software Engineer, Site Reliability to join our Platform team. In this position, you will play a vital role in ensuring our systems and services' stability, scalability, and reliability. You will work closely with all Babylist Engineering teams to support shared infrastructure and developer tools. Your expertise in site reliability engineering, AWS cloud infrastructure, and modern DevOps practices will be instrumental in optimizing our systems and driving continuous improvement. Who You Are - 8+ years of experience as a Site Reliability Engineer or similar role, demonstrating a strong background in maintaining highly available and scalable systems - Experience supporting high-traffic consumer-facing websites, understanding the unique challenges and considerations in maintaining such systems - Proficiency with Terraform is a must, as you will be a member of the team responsible for managing and building our AWS infrastructure using Infrastructure as Code (IaC) practices - You possess strong experience working with AWS cloud-based infrastructure and services, ensuring their reliability, performance, and security - Proficiency with Docker and Kubernetes is essential, as you will contribute to the design, deployment, and management of containerized applications in our environment - You have a solid understanding of cloud-native systems design, including CDNs, load balancers, cloud networking, DNS, caching, and distributed systems - Troubleshooting and debugging are second nature to you, allowing you to quickly identify and resolve issues across various environments - Experience designing and supporting CI systems such as CircleCI, Jenkins, or GitHub Actions - You are familiar with monitoring and alerting best practices, utilizing tools like Datadog, Cronitor, Sentry, and PagerDuty to ensure proactive identification and resolution of issues - Proven experience in on-call management best practices, including effective incident response, escalation procedures, and post-incident reviews to drive continuous improvement and ensure system reliability - You have excellent verbal and written communication skills, and the ability to collaborate effectively with cross-functional teams - You're genuinely excited about what AI can do - not just as a concept, but as something you want to get your hands on. At Babylist, every team uses AI daily, and we're looking for people who lean in. How You Will Make An Impact - Manage and build our AWS infrastructure using Infrastructure as Code (IaC) tools like Terraform. You will ensure that our EKS clusters and databases are running up-to-date versions, optimizing performance and reliability - Improve the speed and reliability of our Continuous Integration (CI) systems to support the entire Engineering Team, enabling faster and more efficient development and deployment processes - Provide support to developers in troubleshooting issues across local development, staging, and production environments - Establish, communicate, and support best practices for monitoring and alerting. This will involve setting up effective monitoring systems and defining actionable alerts for proactive incident management About Compensation We use a market-based approach to compensation. The starting salary range for this role is: US: $186,818 to $224,183 Canada: $185,600 to $232,000 CAD Your starting salary will be based on your location, experience, and qualifications, with increases over time tied to performance, role growth, and internal pay equity. Why You Will Love Working At Babylist Our Culture - We work with focus and intention, then step away to recharge - We believe in exceptional management and invest in tools and opportunities to connect with colleagues - We build products that positively impact millions of people's lives - AI tools are as natural to how we work as your IDE or version control — we're not exploring this, we're living it. Growth & Development - Competitive pay and meaningful opportunities for career advancement - We believe technology and data can solve hard problems - We're committed to career progression and performance-based advancement Compensation & Benefits - Competitive salary with equity and bonus opportunities - Company-paid medical, dental, and vision insurance - Retirement savings plan with company matching and flexible spending accounts - Generous paid parental leave and PTO - Remote work stipend to set up your office - Perks for physical, mental, and emotional health, parenting, childcare, and financial planning Important Notices Recorded Interviews Babylist uses an interview recording tool to record and transcribe interviews for evaluation purposes in accordance with applicable privacy laws. By participating in an interview, you consent to this recording and transcription. Interview Integrity At Babylist, every team uses AI daily and we love it. During interviews though, we want to see you — your thinking, your problem-solving, your creativity. All interviews and assessments should be completed independently without AI tools or third-party assistance unless we tell you otherwise. We'll always be clear when AI is welcome. Misrepresentation during the process may result in removal from consideration. Protect Yourself from Scams All official communication comes from the Babylist Talent Team via @babylist.com email addresses. We will never ask for payment, bank information, or personal financial details. If you receive outreach via WhatsApp, Telegram, or a non-Babylist email, it's not us. Verify open roles on our careers page. Connections at Babylist In line with our conflict of interest policy, please let us know if you have a family member or close personal relationship with a current Babylist employee. This helps us keep our process fair for everyone. Text Message Updates You may opt in to receive SMS updates about your application. Opting out won't affect your status. Message and data rates may apply. Reply STOP to unsubscribe or HELP for assistance. See our Privacy Policy for details.
Senior Site Reliability Engineer
InnosphereOn a mission to provide our clients with the best staffing solutions possible
• Designing, building, and deploying solutions that increase product reliability and organizational efficiency • Motivating and guiding the creation of effective CI/CD pipelines • Providing mentorship and insight into DevSecOps best-practices • Working with product teams to expose their requirements and support the above • Improving reliability via root cause analyses, post-mortems, and using code to prevent recurrence • Implementing effective monitoring and security scanning • Assisting support teams in resolving issues • Demonstrating and evangelizing state of the art technologies and practices that can be used to build and improve better workflows • Discovering and implementing automation to reduce manual support requirements • Providing emergency after-hours support if needed
DevOps Engineer
AgileEngineAgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.
Role Description We are looking for a Senior DevOps Engineer to design, operate, and scale cloud infrastructure supporting production systems on AWS. You will manage containerized workloads with Kubernetes, build and maintain infrastructure as code with Terraform, and own CI/CD pipelines end to end. The role requires deep hands-on experience across the full infrastructure lifecycle, from environment initialization to production troubleshooting and root cause analysis. What You Will Do - Design and operate scalable AWS infrastructure; - Build applications with containerization technologies such as Docker and orchestration tools like Kubernetes; - Implement robust monitoring and logging tools; - Build and manage infrastructure as code using Terraform; - Improve CI/CD pipelines to accelerate delivery; - Troubleshoot production issues and lead root cause analysis; - Build and work in complex SDLC environments; - Collaborate with engineering teams to streamline development and deployment workflows. Qualifications - 6–10 years of experience in DevOps, SRE, or infrastructure engineering roles; - Strong hands-on experience with AWS in production environments; - Proven production experience with Kubernetes, including cluster operations, ingress, pods, networking, and troubleshooting; - Strong experience with Terraform or equivalent Infrastructure as Code tools, including building and maintaining production infrastructure; - Hands-on experience with CI/CD pipelines, with ownership or significant contribution to production delivery; - Strong scripting skills in Bash, Python, or similar; - Solid understanding of Linux/Unix systems fundamentals; - Experience building or initializing production environments; - Upper-intermediate English level. Nice to Haves - Experience working in an Agile environment; - Experience with observability tools such as Datadog or New Relic; - Exposure to GitOps practices; - Experience with Jenkins or GitHub Actions specifically; - Strong communication skills with excellent interpersonal effectiveness in one-on-one interactions and presentations; - Self-awareness and a desire to continually improve. Benefits - Professional growth: Mentorship, TechTalks, and personalized growth roadmaps. - Competitive compensation: USD-based pay with education, fitness, and team activity budgets. - Exciting projects: Modern solutions with Fortune 500 and top product companies. - Flextime: Flexible schedule with remote and office options.



