We make innovation simple, convenient and right...we just make it HAPPEN
DevOps – Site Reliability Engineer
Location
Brazil
Posted
76 days ago
Salary
0
Seniority
Senior
Job Description
DevOps – Site Reliability Engineer
Oowlish
• Deploy and manage web, mobile, and API applications across cloud environments • Implement and maintain monitoring and observability tools like NewRelic, Datadog, or Prometheus/Grafana • Design and optimize CI/CD pipelines using tools like Azure Pipelines, Jenkins, or CircleCI • Manage containerized environments with Docker, Kubernetes, and Helm • Build and manage cloud infrastructure on Azure, AWS, or GCP • Write automation scripts using Bash and other scripting languages • Develop and maintain incident response processes and disaster recovery strategies • Collaborate with development, product, and operations teams to improve system reliability and deployment efficiency
Job Requirements
- 3+ years of experience in a DevOps, Site Reliability Engineering (SRE), or related role
- Strong hands-on experience with the deployment of web, mobile, and API applications
- Expertise in monitoring and observability tools (e.g., NewRelic, Datadog, Prometheus/Grafana)
- Strong experience with CI/CD pipelines and associated tools (Azure Pipelines, Jenkins, CircleCI)
- Proficiency with Docker, Kubernetes, and Helm
- Experience working with cloud platforms like Azure, AWS, or GCP
- Scripting proficiency in Bash
- Familiarity with incident response and disaster recovery planning
Benefits
- Remote work (home office)
- Competitive compensation based on experience
- Career development plans with opportunities for significant growth within the company
- International projects
- Oowlish English Program (technical and conversational)
- Oowlish Fitness with TotalPass
- Games and competitions
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer
DocPlannerAt Docplanner Group, we’re on a mission to help people live longer, healthier lives. As the world’s largest healthcare platform, each month, we connect 24 million patients with 280k doctors across 13 countries. Our marketplaces, SaaS and AI tools simplify daily tasks and help doctors, clinics and hospitals work more efficiently. Real impact – We help doctors help patients. Your work truly makes a difference. At scale, yet agile – 3,000+ employees, but still fast, flexible, and hands-on. Shape the future, sustain growth – Make a difference now and build for long-term success.
• Participate in the monitoring, maintenance, and evolution of the system infrastructure supporting the TuoTempo web application. • Design, create, and support system distribution; test and monitor application code. • Support our internal development teams in the effective use of our organizational systems. • Propose and implement new solutions based on innovative technologies. • Planning and assignment of team activities. • Providing technical support to resolve complex issues. • Mentoring and professional development of team members. • Management of performance evaluations. • Availability for paid on-call shifts (1 or 2 weeks per month).
• Designing, implementing, and maintaining automation and shared tooling within Cloud Operations • Leading event, incident, case, and problem management, as well as service-request fulfilment • Ensuring security, latency, performance, efficiency, monitoring, emergency response, and capacity planning of IFS Cloud services • Demonstrating a strong commitment to service and process quality • Taking proactive action to prevent issues and resolving them quickly when they do occur • Contributing to knowledge management (KBAs, SOPs) and utilizing IFS support tools effectively • Actively participating in training and mentoring, both receiving and occasionally providing guidance
• Collaborating with engineering and development teams to evaluate and identify optimal solutions • Modifying and improving existing systems • Developing and maintaining solutions in accordance with best practices • Identifying, analyzing, and resolving infrastructure and application vulnerabilities • Performing system administration tasks including configuration, systems monitoring, troubleshooting, and support while innovating to automate as much as possible • Regularly reviewing existing systems and making recommendations for improvements • Performing software application installation, patching, and upgrades • Troubleshooting and resolving issues with the existing systems • Ownership of the custom build/deployment module #LI-DNI
Senior SRE Engineer – AWS Cloud
AM53 Smart SolutionsA tecnologia certa. O talento ideal. No momento exato.
• Develop, maintain and evolve CI/CD pipelines, ensuring continuous, stable and secure deliveries • Automate infrastructure and application deployment processes, reducing toil and increasing reliability • Continuously monitor and optimize the performance, availability and security of production environments • Administer and support AWS cloud environments, ensuring resilience and scalability • Serve as a technical reference for the development team, promoting best practices for continuous delivery • Ensure end-to-end observability with robust practices for metrics, logs, tracing, versioning and rollback • Manage and ensure availability and performance of MongoDB and PostgreSQL databases • Act as a FinOps mentor and point of reference, fostering a culture of cloud cost efficiency and governance • Lead the response to critical incidents — rapidly diagnose issues, coordinate resolution and ensure clear communication during crises • Conduct blameless post-mortems, turning incidents into lessons and concrete improvements




