A partner that turns challenges into results!
Senior DevOps
Location
Brazil
Posted
10 days ago
Salary
0
Seniority
Senior
Job Description
Cadmus Soluções em TI
- Monitor environments and applications, troubleshooting and analyzing incidents to ensure critical problems are identified and resolved quickly;
- Implement, evolve, and spread DevOps best practices, promoting integration and collaboration between the Development and Infrastructure teams;
- Define, implement, and optimize CI/CD pipelines, automating build, test, and deploy processes;
- Administer and support containerized environments, ensuring the availability, performance, and scalability of applications;
- Automate operational and infrastructure processes, contributing to greater efficiency and reliability of the environments;
- Support development teams throughout the application delivery cycle, from staging to production;
- Take an active part in continuous-improvement initiatives, proposing solutions that increase the quality, security, and agility of software development and delivery processes.
Job Requirements
- Bachelor's degree in Information Technology, Computer Science, Software Engineering, Information Systems, or a related field;
- Solid experience with containerized environments using Docker and AWS ECS;
- Experience with source code version control and CI/CD pipeline automation using tools such as Git, Jenkins, AWS CodePipeline, or similar;
- Experience with monitoring, observability, and alert management tools such as AWS CloudWatch, SNS, Zabbix, or equivalents;
- Experience in cloud environments, with a focus on AWS and knowledge of Google Cloud Platform (GCP);
- Knowledge of and experience with Infrastructure as Code (IaC) using AWS CloudFormation;
- Experience troubleshooting and supporting mission-critical production environments;
- Prior software development experience using languages such as Java, Python, .NET, or similar;
- Knowledge of best practices for automation, continuous integration, continuous delivery, and cloud environment management.
Benefits
- 🍽 Meal and/or food vouchers (Ifood Beneficios)
- 🏥 Health insurance (Amil)
- 💼 Life insurance
- 😁 Dental plan (Amil)
- 🚗 Fuel allowance
- 🏋 Gympass/Wellhub: access to gyms offering a variety of training options.
- 🐶 Pet health plan
- 👶🏻 Childcare allowance
- 💳 Lincard: a partner discount club with up to 60% off at more than 4,000 establishments in Brazil.
More DevOps Engineer Jobs
Senior Engineer, Site Reliability
Syniti
There's nothing status quo about your data. It's time to put Data First.
- Design and build automated cloud infrastructure for Syniti-hosted SaaS workloads across Azure and AWS.
- Implement and manage CI/CD pipelines using GitHub Actions and ArgoCD GitOps workflows for multi-environment deployments (dev, preprod, prod, GovCloud).
- Develop and maintain Terraform modules for infrastructure provisioning, including EKS/AKS clusters, Aurora PostgreSQL, Redis, OpenSearch, and S3/Azure Storage.
- Integrate and maintain observability frameworks (Prometheus, Grafana, Loki, Mimir, Jaeger) and enforce structured logging of application, audit, and security events.
- Support and extend Istio service mesh configurations including mTLS policies, AuthorizationPolicies, and SPIFFE-based workload identity.
- Implement and maintain supply chain security controls: container image signing (Cosign), SBOM generation (Syft), provenance attestation (SLSA), and vulnerability scanning (Trivy, Inspector, Snyk).
- Collaborate with security teams to meet FedRAMP High, Cyber Essentials+, NIST 800-53, and SecNumCloud control objectives across Azure and AWS environments.
- Provide L3 incident response and lead root cause analysis efforts for application-tier outages across the Northstar platform.
- Support the automated compliance gate (11 controls: NAC, FIPS, STIG, IRSA, SAST, DAST, SBOM, Sign, Vuln, Audit, Auth) and ensure application services pass all gates.
- Manage Kubernetes workload autoscaling (Karpenter, KEDA, HPA) and pod security policies (Kyverno) for production services.
Staff Site Reliability Engineer, Core AI Infrastructure
Coinbase
We're building an open financial system for the world.
- Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros.
- Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments.
- Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines.
- Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence.
- Develop full-stack applications that power internal AI products and infrastructure with Go or Python.
Senior Site Reliability Engineer, Core AI Infrastructure
Coinbase
We're building an open financial system for the world.
- Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros.
- Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments.
- Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines.
- Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence.
- Develop full-stack applications that power internal AI products and infrastructure with Go or Python.
Role Description
- Own key parts of the DevOps effort for a global engineering team and a platform supporting millions of transactions per day, and more than doubling annually.
- Drive improvement across our development process and tooling: source control, build, test, packaging, release, and deployment.
- Drive improvement in our infrastructure configuration, management, and cost efficiency.
- Drive enhancement of monitoring and observability across all infrastructure and services, for both availability and performance.
- Partner with other technical leaders to improve our architecture's maintainability, scalability, and resilience.
- Serve as a technical lead to software and DevOps engineers on development and operations practices.
- Partner with other technical leaders to strengthen our information security posture.
- Contribute to the team's development and operations standards and processes.
Qualifications
- Deep experience designing automated pipelines end to end: git-based source control and branching strategies, build and test automation (Jenkins, GitHub Actions, or similar), artifact and dependency management, and progressive, low-risk deployment patterns.
- Strong, hands-on experience with Terraform (or similar): reusable modules, state management, and managing multi-environment infrastructure (staging, beta, production) as code.
- Experience operating a GitOps workflow with ArgoCD (or similar) as the source of truth for declarative, auditable deployments.
- Strong experience building and operating cloud infrastructure at scale: compute, VPC networking, storage, message queues, serverless, DNS, load balancing, IAM, and logging.
- Production experience running and operating Kubernetes at scale: cluster lifecycle and upgrades, workload scheduling and resource management, autoscaling (HPA/cluster/event-driven), networking and ingress, and diagnosing complex cluster issues.
- Authoring, configuring, and maintaining Helm charts for templated, repeatable application deployments across environments.
- Experience deploying, operating, and tuning a mix of data stores, including relational (PostgreSQL / CloudSQL), NoSQL document (MongoDB), wide-column (Cassandra), and cache (Redis), covering replication, backups, scaling, and performance troubleshooting.
- Demonstrated track record deploying and monitoring large-scale, mission-critical services: defining SLIs/SLOs, building actionable alerting, and driving incident response and blameless post-mortems.
- Solid grounding in cloud and infrastructure security: IAM, secrets management, network policy, and supply-chain hygiene.
- Bonus: Java application release engineering.
Requirements
- Great communication and documentation skills.
- An obsession with automation and a desire to leave things better than you found them.
- A customer-first mindset and strong attention to detail.
- 5+ years as a DevOps or Site Reliability Engineer.
- Bonus: A sense of humor.
Benefits
- Be part of a well-funded, fast-growing company tackling complex, relevant challenges in sustainable last-mile delivery.
- Competitive compensation and equity incentives.
- Health, vision, and dental insurance.
- 401(k) plan.
- Flexible time off.
- 100% remote, open to candidates across the U.S.
- Compensation: The targeted base salary range for this role is listed in the compensation section below. Actual salary may be above or below this range based on location, skills, and relevant experience. In addition, this position may include additional compensation in the form of bonuses, equity, or commissions.


