Rain is the world's first AI Financial Health Platform, serving 3.5 million employees at leading organizations like McDonald's, Marriott, and T-Mobile. Rain works in the background to optimize every employee's financial life to prevent shortfalls and build long-term stability. Backed by top investors including QED and Prosus, Rain has raised $150M in venture funding to fuel our next stage of hyper growth.
Senior DevOps Engineer
Location: Portugal
Posted: 20 days ago
Seniority: Senior
Job Description
Rain Technologies Inc.
Role Description
As a Senior DevOps Engineer at Rain, you will play a central role in designing, building, and operating our cloud infrastructure as we continue to scale to millions of users globally. You will work alongside a small, high-performing cloud team to drive automation, improve observability, and ensure the reliability and security of our platform. This role goes beyond keeping the lights on: you will actively shape how we build and operate infrastructure and influence architectural decisions.
- Design, build, and maintain scalable, secure cloud infrastructure on AWS using Terraform and Terragrunt (IaC, Infrastructure as Code)
- Manage and evolve our Kubernetes (EKS) clusters, including node group management, autoscaling with Karpenter, and workload reliability
- Own and improve our CI/CD pipelines (GitLab CI), ensuring fast, reliable, and secure delivery
- Drive observability initiatives: metrics, logging, alerting, and dashboards using Prometheus, Grafana, and related tooling
- Support and evolve our Kafka infrastructure in collaboration with backend engineering teams
- Champion infrastructure-as-code practices, ensuring consistent, reviewed, and well-documented changes
- Respond to production incidents, lead post-mortems, and drive improvements in incident response processes
- Collaborate with backend, security, and product engineering teams to support their infrastructure needs
- Leverage AI-assisted tooling (e.g., GitHub Copilot, AI-powered incident analysis, LLM-based automation) to increase productivity and quality

Qualifications
- You bring 5+ years of experience managing large-scale production environments and aren't afraid of architectural complexity
- You have a "code everything" mindset, replacing manual tasks with scalable, DRY Infrastructure as Code (IaC)
- You understand the "why" behind Kubernetes internals and cloud networking, not just the "how" of deployment
- You communicate complex infrastructure concepts clearly to both engineering peers and business stakeholders
- You treat security, secrets management, and observability as core features, not afterthoughts

Requirements
- Advanced AWS & EKS: Deep proficiency in EC2, RDS, S3, IAM, and VPC networking, specifically within multi-account EKS environments
- Kubernetes Internals: Hands-on experience with CNI, RBAC, Affinity/Taints, and managing complex workloads (StatefulSets/DaemonSets)
- IaC Mastery: Proven ability to scale infrastructure using Terraform and Terragrunt with modular, reusable patterns
- CI/CD & Helm: Expertise in designing secure GitLab CI pipelines and managing versioned Helm charts across environments
- Observability: Proficiency in building dashboards and alerting logic using Prometheus and Grafana
- Linux & Scripting: Strong Bash skills for environment management (Python proficiency is a significant plus)
More DevOps Engineer Jobs
Director, Security Engineer – DevSecOps
JLL - Jones Lang LaSalle
Jones Lang LaSalle (JLL) is a professional and financial services company that specializes in investment management and commercial real estate services. A Fortu
• Lead the technical security strategy for product and application security, defining architecture standards, security baselines, and secure coding guidelines aligned with OWASP ASVS, NIST SSDF, and BSIMM frameworks.
• Architect and implement a comprehensive DevSecOps pipeline, integrating SAST, DAST, SCA, and container scanning across all CI/CD pipelines serving 10 product verticals.
• Drive threat modeling practices across critical product flows, partnering with engineering leads to identify and mitigate security risks before they reach production.
• Design and implement a centralized security telemetry architecture, connecting application logs, WAF events, and fraud signals into a unified SIEM platform for real-time detection.
• Lead the technical evaluation, selection, and implementation of security tools (SAST/DAST, SIEM/SOAR, PAM, API Gateway security, container security scanners).
• Establish and mentor a team of 7-8 embedded DevSecOps engineers across product verticals, providing technical guidance and ensuring consistent security standards.
• Own the technical roadmap for reducing MTTD from >48h to <1h and fraud detection from D+1 to real-time through security engineering and automation.
• Live the mission: inspire and empower others by genuinely caring for your own wellbeing and your colleagues. Bring wellbeing to the forefront of work, and create a supportive environment where everyone feels comfortable taking care of themselves, taking time off, and finding work-life balance.
• Be highly organized - Drive revenue impact by leading 8–14 project deployments per quarter, contributing an average of $500K–$1M in deployed eARR once fully ramped (expected within 90 days).
• Execute and optimize deployments by owning end-to-end implementation processes and collaborating with internal teams to ensure timely delivery.
• Enhance operational efficiency by identifying and implementing process improvements, tooling enhancements, and automation.
• Partner across teams by working closely with Engineering and Product to stay current on product updates, provide feedback, and contribute to roadmap alignment.
• Contribute to knowledge sharing by documenting deployment processes, updates, and best practices.
• Support strategic scalability by helping address high pipeline demand, ensuring future quarters maintain delivery velocity while mitigating at-risk eARR.
• Maintain a habit of using AI tools to think, build, and ship faster; it's your default, not an afterthought.
Infrastructure, Cloud, and SRE Specialist
Grupo Salta Educação
The largest basic education group in Brazil, with the mission of delivering a better Brazil for the next generations.
• Technical Leadership: Be the team's technical reference for architecture decisions, infrastructure code (IaC) review, and the spread of good practices.
• Platform Engineering: Design and evolve the AWS and on-premises environment, supporting the primary backend based on EC2 (with ALB/NLB and API Gateway) and RDS, and manage critical tools running in containers (ECS) and serverless architectures.
• Observability: Consolidate the 360-degree monitoring strategy using Zabbix/Grafana and DataDog, ensuring end-to-end visibility.
• Automation: Reduce manual work through Terraform, Ansible, and automations via AWS Lambda and EventBridge.
• Hybrid Connectivity: Ensure the stability of the connectivity mesh (VPNs, routing, and links) between school units and the cloud.
• FinOps & Efficiency: Lead technical cost-optimization initiatives (rightsizing of EC2/RDS instances, reserved instances, and automated cleanup).
• Security: Work in partnership with SecOps to ensure that the architecture accounts for confidentiality, integrity, and availability (C.I.D.) and market standards.



