Com T.I., ajudamos empresas e projetos a atingir o máximo potencial de performance e crescimento sustentável.
Senior SRE – Datacenter
Location
Brazil
Posted
3 days ago
Salary
0
Seniority
Senior
Job Description
Senior SRE – Datacenter
GrooveTech
• Provide support and ensure the reliability of datacenter environments • Perform advanced troubleshooting on critical infrastructure • Monitor and ensure the availability of environments • Work with physical servers and administer operating systems • Support high-severity incidents and conduct root cause analysis • Collaborate with Infrastructure, Network, and Security teams • Identify improvements in performance, stability, and observability • Support technical documentation and operational procedures.
Job Requirements
- Solid experience as an SRE, in Infrastructure, or Datacenter operations
- Advanced knowledge of networking
- Experience with physical servers
- Experience with Windows and Linux operating systems
- Experience troubleshooting critical environments
- Knowledge of monitoring and observability
- Experience analyzing logs, connectivity, and performance
- Experience working in high-availability environments
- Nice-to-have: experience with monitoring tools
- Knowledge of virtualization
- Experience with automation and scripting
- Certifications related to infrastructure, networking, or cloud technologies.
Benefits
- Contractor (PJ)
- Remote work model - Duration: 4 weeks
- Wellhub
- Life insurance
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Developer
Tech And SolveTech And Solve es una empresa dedicada al desarrollo de software a la medida para clientes de los sectores retail, financiero y de servicios. Aquí, utilizamos la tecnología y nuestro ingenio para hacer la vida más fácil y generar progreso. Las personas que hacen parte de nuestro equipo se conocen como Solvers, profesionales que comparten el propósito de construir soluciones que generen valor real. Nuestra cultura promueve el crecimiento de las personas, el trabajo colaborativo y la búsqueda constante de soluciones simples para problemas complejos.
Role Description Este es un puesto de trabajo remoto. Buscamos un Desarrollador DevOps con más de 3 años de experiencia en: - Scripting en PowerShell avanzado - Dominio de herramientas de CI/CD (Azure DevOps, Jenkins, GitLab CI, etc.) - Experiencia sólida en contenedores y orquestación (Kubernetes) - Infraestructura como código (Terraform, CloudFormation, Ansible) - Monitoreo y observabilidad avanzada - Scripting avanzado (Python, Bash) - Administración avanzada de Windows Server 2019 Qualifications - Con Certificación en: OCI Foundations o Architect - Deseable: Certificación AZ-104 (Azure Administrator) - Conocimientos en FinOps y prácticas de seguridad avanzada Benefits - Ser parte de Tech And Solve significa trabajar en un entorno donde las personas son protagonistas - Horario flexible - Vacaciones parciales (puedes tomarlas cuando lo necesites) - BYOD / Auxilio para trabajar con tu propio equipo - Medio día libre en tu cumpleaños - Pago del 100% de incapacidades - Plataforma de gestión de beneficios - Acceso a plataforma de capacitación - Pólizas de vida y salud
Role Description We’re looking for a Senior DevOps Engineer to play a key role in building and maintaining the cloud platforms that underpin life-saving services. Working as part of our broader DevOps System Engineering team, you’ll design and implement scalable, secure cloud infrastructure and automation that keeps our systems running reliably. Your work will directly support clinicians, patients and donors across Australia. Key Responsibilities: - Design and implement Infrastructure as Code solutions in AWS using Terraform - Build and optimise CI/CD pipelines and automated deployment processes - Monitor system performance, troubleshoot issues and improve reliability - Work across infrastructure, integration, middleware and data layers - Drive DevOps best practice, standards and continuous improvement - Contribute to technology roadmaps and uplift platform capability - Build strong relationships with ICT, Cyber, Data and Architecture teams - Mentor engineers and contribute to the DevOps community of practice - Manage cloud usage and optimise cost without compromising performance Qualifications - Strong experience in AWS cloud environments - Proven expertise in Infrastructure as Code (Terraform) - Solid Linux and scripting experience - Experience with Python, Node.js or similar - Strong troubleshooting and problem-solving capability - Experience working in DevOps, Agile or iterative delivery environments - Ability to influence stakeholders and communicate technical concepts clearly - A proactive mindset with a focus on continuous improvement Benefits - Ability to purchase additional annual leave - Access to salary packaging with an option to package up to $15,900 for living expenses and $2,650 for meals and entertainment - 14 weeks paid parental leave - no waiting period for either parent - Our dedicated employee assistance program - Sonder that you can access 24/7 from an app on your phone - Plus, many more!
Dev Ops Engineer
VehloWe started Vehlo in 2019 with a simple goal: to be the industry’s favorite provider of repair shop technology. Across every part of the auto repair industry, Vehlo is igniting vehicle service success with software and financial solutions that unlock your potential. Our founder-led products power the entire service lane experience and keep customers coming back with streamlined tools that help you handle communication, workflow automation, touchless payments, valet pickup, and much more. We’re out to simplify the customer journey from start to finish and give power back to the people under the hood, making their jobs easier and your shop more profitable — just ask our over 30,000 customers, who generate more than 50M annual repair orders. At Vehlo, our only purpose is your success, and together, we’re reaching your goals faster than ever. Being a Veep comes with more than a comprehensive benefits package—our biggest benefit is opportunity: Opportunity to make an impact, opportunity for growth, and opportunity for recognition and rewards. This is not a mega-corporation where you wonder what people are doing all day - every Veep is moving the ball forward day in, day out for our customers or for each other.
Role Description We're hiring a platform engineer to deliver our 2026 strategy across IT performance visibility, vulnerability and risk management, APM, and FinOps, which in practice means rolling out New Relic across our AWS accounts, migrating to GitHub Enterprise Advanced Security, building the Jira and Confluence foundation for risk and performance reporting, and standardizing the Terraform modules and GitHub Actions workflows a small team relies on. Day to day, you'll also handle the operational work that keeps the platform running, including Dependabot triage, alert tuning, deploy support, and one off brand requests. This is an all hands on keyboard team where every engineer ships code, writes Terraform, resolves incidents, and helps each other through hard problems rather than guarding private corners. What You’ll Do: - New Relic & APM Implementation: - Deploy and manage New Relic APM, infrastructure, and browser agents across AWS services (ECS, Elastic Beanstalk, Lambda, EC2, EKS). - Establish standardized alert policies and dashboards using Terraform. - Optimize telemetry ingest, manage drop rules, and control observability costs. - GitHub Enterprise & Vulnerability Management: - Lead migration to GitHub Enterprise, implementing SSO, branch protections, CODEOWNERS, and Advanced Security features. - Develop reusable GitHub Actions workflows to streamline CI/CD. - Operationalize vulnerability data into actionable Jira workflows with SLA tracking and brand-level reporting. - Jira, Confluence & Reporting: - Design and manage Jira projects, workflows, automation rules, and permissions. - Administer Confluence spaces, templates, and backups. - Build centralized reporting to provide leadership with visibility into delivery performance, risk, and application health (APM). - FinOps & Cost Visibility: - Create domain-level cost dashboards leveraging AWS, New Relic, and SaaS data. - Drive cost optimization initiatives (e.g., S3 Intelligent Tiering, lifecycle policies, telemetry drop rules, resource decommissioning). - Support vendor renewal evaluations and cost analysis. - Standardization & Enablement: - Develop reusable Terraform modules, GitHub Actions workflows, and engineering templates. - Author reference documentation and promote adoption of best practices across teams. - Infrastructure & Daily Operations: - Own shared AWS infrastructure, including provisioning, access management, networking, and ongoing maintenance. - Triage Dependabot PRs, fine-tune alerts, support team migrations, participate in on-call rotations, and create/run operational runbooks. Qualifications - Experience: 3–6 years of experience in Cloud Engineering, DevOps, Site Reliability Engineering (SRE), Platform Engineering, or Developer Productivity. - Observability Expertise: Hands-on experience with observability platforms at scale (e.g., New Relic, Datadog, or similar). - GitHub Administration & Automation: Experience with GitHub at an organizational level, including teams, SSO, branch protection, OIDC, and reusable GitHub Actions workflows. - Atlassian Tools Familiarity: Working knowledge of Jira and Confluence as a user; familiarity with project configuration, workflows, and collaboration. - AWS Cloud Proficiency: Production experience with AWS services such as IAM, S3, Lambda, and at least one compute platform (e.g., ECS, EC2, EKS). - Infrastructure as Code: Experience using Terraform (or equivalent IaC tools). - Scripting & Automation: Proficiency in at least one scripting or programming language such as Python or Bash. - Communication Skills: Strong written communication skills with experience creating runbooks, technical design documents, and stakeholder-facing reports. Benefits - Medical, dental, vision, and life insurance - 401(k) with company match - Paid time off and holidays
Role Description We're hiring a platform engineer to deliver our 2026 strategy across IT performance visibility, vulnerability and risk management, APM, and FinOps. This includes: - Rolling out New Relic across our AWS accounts - Migrating to GitHub Enterprise Advanced Security - Building the Jira and Confluence foundation for risk and performance reporting - Standardizing the Terraform modules and GitHub Actions workflows Day to day, you'll also handle the operational work that keeps the platform running, including: - Dependabot triage - Alert tuning - Deploy support - One-off brand requests This is an all hands on keyboard team where every engineer ships code, writes Terraform, resolves incidents, and helps each other through hard problems rather than guarding private corners. Qualifications - 3–6 years of experience in Cloud Engineering, DevOps, Site Reliability Engineering (SRE), Platform Engineering, or Developer Productivity - Hands-on experience with observability platforms at scale (e.g., New Relic, Datadog, or similar) - Experience with GitHub at an organizational level, including teams, SSO, branch protection, OIDC, and reusable GitHub Actions workflows - Working knowledge of Jira and Confluence as a user - Production experience with AWS services such as IAM, S3, Lambda, and at least one compute platform (e.g., ECS, EC2, EKS) - Experience using Terraform (or equivalent IaC tools) - Proficiency in at least one scripting or programming language such as Python or Bash - Strong written communication skills with experience creating runbooks, technical design documents, and stakeholder-facing reports Requirements - Deploy and manage New Relic APM, infrastructure, and browser agents across AWS services (ECS, Elastic Beanstalk, Lambda, EC2, EKS) - Establish standardized alert policies and dashboards using Terraform - Lead migration to GitHub Enterprise, implementing SSO, branch protections, CODEOWNERS, and Advanced Security features - Design and manage Jira projects, workflows, automation rules, and permissions - Create domain-level cost dashboards leveraging AWS, New Relic, and SaaS data - Develop reusable Terraform modules, GitHub Actions workflows, and engineering templates - Own shared AWS infrastructure, including provisioning, access management, networking, and ongoing maintenance Benefits - Medical, dental, vision, and life insurance - 401(k) with company match - Paid time off and holidays Work Environment & Physical Requirements - Ability to remain in a stationary position (sitting or standing) for extended periods - Ability to operate a computer and standard office equipment (e.g., keyboard, mouse, headset) - Ability to view and interpret information on a computer screen for extended periods - Ability to communicate effectively via phone, video, and written communication - Ability to participate in virtual meetings with or without reasonable accommodation Remote Work Expectations - Maintain a dedicated, safe, and distraction-free workspace - Reliable high-speed internet connection sufficient for video conferencing and job-related systems - Ability to maintain productivity in a remote environment - Must reside in a state where the company is authorized to employ workers - Must be able to work core hours aligned to US business hours


