Job Closed
This listing is no longer active.
Making new achievements possible.
SRE Specialist
Location
Brazil
Posted
73 days ago
Salary
Not disclosed
Seniority
Senior
Job Description
SRE Specialist
Credsystem
- Define product infrastructure according to the architecture guidelines;
- Ensure environment resilience;
- Align and manage SLIs, SLAs, and SLOs;
- Troubleshoot application infrastructure (understand the problem, participate in the investigation, and propose solutions);
- Assist with application troubleshooting when requested by developers;
- Drive monitoring, logging, and automation solutions;
- Document product infrastructure;
- Understand and participate in capacity and cost planning for the infrastructure;
- Analyze application trends;
- Propose new solutions for the product;
- Participate in POCs and tests for new solutions;
- Infrastructure as Code (IaC);
- Deploy/create cloud infrastructure (Azure, OCI, AWS, and GCP);
- Request and follow up on on-premises infrastructure work with the respective teams.
Job Requirements
- Completed university degree;
- Hands-on experience developing technical solutions with the technologies relevant to this position;
- Ability to understand system architectures and apply them efficiently and strategically, focusing on performance, scalability, and alignment with business goals;
- Knowledge of distributed architectures, virtualization, and/or cloud computing;
- Knowledge of Linux operating systems;
- Experience operating cloud environments – Azure, OCI, AWS, and GCP;
- Knowledge of Docker/containers;
- Knowledge of Kubernetes;
- Knowledge of messaging systems, such as Kafka and others;
- Familiarity with DevOps culture and concepts;
- Experience with CI/CD (Azure DevOps, Argo CD, and others);
- Experience with process automation using market-standard tools: Ansible and Terraform;
- Knowledge of Shell scripting to support automation and solution development;
- Knowledge of Prometheus and Grafana for monitoring and metrics analysis;
- Experience with business-focused observability models, emphasizing strategic and operational indicators.
Benefits
- Meal and food allowance;
- Health insurance (for you and your dependents);
- Dental insurance (for you and your dependents);
- Two monthly psychologist consultations (at no cost);
- Two monthly nutritionist consultations (covered by Credsystem);
- Gympass (to support your health and well-being);
- Wellness time (weekly massage sessions to help you relax);
- Life insurance;
- Variable compensation, according to your role and achievement of goals;
- University discounts (to advance your career);
- Language school partnership (to develop and improve a second language);
- Birthday day off;
- Payroll-deductible loans available;
- Commuter benefits: public transit voucher, company shuttle, or parking support;
- Childcare assistance;
- Sesc membership/benefits.



