Job Closed

This listing is no longer active.

CI&T logo
CI&T

Navigate Change

Senior SRE

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 5,001-10,000Since 1995H1B No SponsorCompany SiteLinkedIn

Location

Brazil

Posted

47 days ago

Salary

0

Seniority

Senior

Job Description

Senior SRE

CI&T

• Analyze application reliability, performance, and availability. • Monitor deployment issues for applications, addressing performance or security problems as they arise and capturing lessons learned to prevent similar incidents in the future. • Proactively manage the task backlog, identify opportunities for improvement, and propose effective collaborative solutions. • Maintain effective communication with teams responsible for different application journeys, ensuring a clear understanding of needs and priorities. • Stay up to date with industry trends, best practices, and emerging technologies related to cloud computing and DevOps/SRE.

Job Requirements

  • Experience as a Site Reliability Engineer (SRE) and familiarity with SRE metrics.
  • Experience monitoring Java backend applications.
  • Strong experience with FinOps practices and cloud cost management.
  • Experience working with observability tools such as Datadog, Grafana, Prometheus, and Thanos.
  • Experience with AWS-based platforms (ECS, EKS) and/or Kubernetes and Docker.
  • Experience with Linux.
  • Technical knowledge of GitHub, Jenkins, and Splunk (desirable).
  • Experience with CI/CD pipelines (GitHub Actions, CodeBuild, CodePipeline).
  • Infrastructure as Code (Terraform).
  • Analytical skills and strong problem-solving ability, with a desire to learn and adapt in a dynamic environment.
  • Performance testing and stress testing.
  • Understanding of chaos engineering concepts (what to test, what to validate, which failures to inject into the application, e.g., removing a database node and observing application behavior).
  • Ability to troubleshoot efficiently and propose continuous improvements (Splunk, dashboards, tracing tools).
  • Familiarity with mobile application monitoring (Android and iOS).
  • Knowledge of Google Analytics and Firebase Crashlytics.
  • Familiarity with any of the following (if applicable).
  • Knowledge of programming languages such as Java, Shell Script, Golang, Python.

Benefits

  • Health and dental insurance;
  • Food and meal vouchers;
  • Childcare assistance;
  • Extended parental leave;
  • Partnerships with gyms and health & wellness professionals via Wellhub (Gympass) and TotalPass;
  • Profit-sharing/Performance bonus (PLR);
  • Life insurance;
  • Continuous learning platform (CI&T University);
  • Employee discounts club;
  • Free online platform dedicated to physical and mental health and wellness;
  • Pregnancy and responsible parenting course;
  • Partnerships with online course platforms;
  • Language-learning platform;
  • And many more

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Full TimeRemoteTeam 10,001+Since 1967H1B Sponsor

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world. Job Description We are looking for a DevOps/Platform Engineer with 2–3 years of hands-on experience in cloud infrastructure and DevOps practices. The role focuses on supporting cloud platforms, CI/CD tooling, and containerized environments, primarily within Google Cloud Platform (GCP). Experience with Kubernetes and GKE is required. Key Responsibilities - Support and maintain infrastructure automation using Terraform or similar IaC tools. - Develop and update CI/CD pipelines for development teams. - Work with Kubernetes environments, including GKE cluster maintenance under senior guidance. - Participate in building internal developer tools and platform components. - Implement basic monitoring, logging, and alerting solutions. - Troubleshoot platform issues and improve team workflows. - Contribute to platform documentation and knowledge sharing. Candidate Requirements Experience - 2–3 years in DevOps, SRE, or cloud engineering roles. - Experience with production or staging Kubernetes environments (GKE preferred). - Hands-on work with CI/CD and automation tools. Technical Expertise Platform Engineering - IaC basics: Terraform or similar. - CI/CD: GitLab CI, GitHub Actions, Jenkins, or Cloud Build. - Containers: Docker. - Kubernetes fundamentals; mandatory experience with GKE. - Monitoring & Logging: Prometheus/Grafana basics or Cloud Operations. Development & Automation - Knowledge of at least one programming language: Python, Go, or JavaScript. - Basic scripting and automation skills. - Understanding of microservices and cloud-native concepts. GCP Expertise - Basic experience with: - GKE, Cloud Run, Cloud Functions. - Cloud Storage, Cloud SQL. - VPC and load balancers. - Cloud Logging & Monitoring. - Familiarity with Cloud Build and Artifact Registry. Soft Skills - Willingness to learn and grow. - Problem-solving mindset.a - Good communication with developers and operations teams. - Ownership of tasks and reliability. - Ability to work in Agile teams. Capgemini is an AI-powered global business and technology transformation partner, delivering tangible business value. We imagine the future of organizations and make it real with AI, technology and people. With our strong heritage of nearly 60 years, we are a responsible and diverse group of 420,000 team members in more than 50 countries. We deliver end-to-end services and solutions with our deep industry expertise and strong partner ecosystem, leveraging our capabilities across strategy, technology, design, engineering and business operations. The Group reported 2024 global revenues of €22.1 billion. Make it real | www.capgemini.com #LI-Remote

Ukraine
Element 84 logo

Senior DevOps Engineer – Kubernetes Focused

Element 84

Accelerating and scaling impactful projects with great software and design. Geospatial, cloud, and petabyte-scale data.

DevOps Engineer47 days ago
Full TimeRemoteTeam 51-200Since 2010H1B No Sponsor

• Participate in all aspects of the software and data product development lifecycle from user story generation, through design, development, automated testing and operational support • Improve quality by actively participating in code-reviews and adhering to team quality standards • Work alongside other engineers on the team to elevate technology and consistently apply best practices • Own the execution of medium-to-large sized features with higher-level technical support • Describe and document the details of your work fluidly and accurately for technical peers and non-technical stakeholders • Think holistically about the application and build with an eye towards long term maintainability and efficiency • Collaboratively provide estimates and other input to the client, project managers, or others about features to help determine their feasibility, complexity, cost, and priority level • Contribute to a culture of positivity, curiosity, and respect for all individuals

Pennsylvania + 1 moreAll locations: Pennsylvania | Virginia
$145K - $175K / year
Xenon Seven logo

Sagemaker DevOps Engineer

Xenon Seven

Human Experts Implementing Artificial Intelligence #AI #ArtificialIntelligence #HumanIntelligence

DevOps Engineer47 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

• Build DevOps automations to setup Sagemaker Unified Studio for enterprise • Implement Sagemaker Lifecycle configurations • Create CICD pipelines for end-users to deploy custom Docker images & Kernels in Sagemaker • Build alert & monitoring capabilities for Sagemaker projects to control costs and service quotas • MLOps automations for model and infrastructure deployments to higher environments

India
Qonto logo

Staff Site Reliability Engineer – Storage

Qonto

The finance solution that energizes SMEs and freelancers

DevOps Engineer47 days ago
Full TimeRemoteTeam 501-1,000H1B No Sponsor

• Ensure reliability, resilience, and safe operations of Qonto’s critical storage systems (PostgreSQL, Kafka, Redis) • Assess resilience maturity of Kafka and Redis stacks, identify risks, propose improvements • Deliver improvements on disaster recovery readiness, upgrades, alerting, and capacity planning • Act as a consultant for backend and product engineering teams, lead design reviews • Respond to and lead high-severity incidents on critical infrastructure

France
Job Closed