Job Closed
This listing is no longer active.
20% of Fortune 500 fintech companies trust Kunai for engineering talent.
Senior Site Reliability Engineer
Location
United States
Posted
3 days ago
Salary
0
Seniority
Senior
Job Description
Kunai
- Help clients modernize and evolve their business in financial services
- Work at the center of a large-scale, critical cloud transformation
- Collaborate with teams that demand openness to change and patience with regulations
Job Requirements
- Extensive experience with monitoring, alerting, and troubleshooting in production
- Splunk, Datadog, ServiceNow, etc. It's not about the tools but how you use them.
- Experience reducing the share of unactionable alerts reaching the team, starting from an observed 50% no-action rate
- Experience remediating noisy alerts
- Experience developing severity classes and alert playbooks
- Extensive experience with various tools within AWS
Benefits
- Competitive compensation
- Professional development opportunities
- Flexible work arrangements
More DevOps Engineer Jobs
Senior Site Reliability Engineer – Build
Remote
The easier way to employ globally. Remote builds belonging for your team with payroll, benefits, & compliance solutions.
- Infrastructure as code at scale. Design, implement, and maintain infrastructure-as-code patterns using Terraform and Kubernetes that support both standard connectors and custom builds. Make it easy for engineers to deploy and operate with confidence.
- Observability and incident response. Build and maintain comprehensive monitoring, logging, and alerting systems. Lead incident response efforts, conduct post-mortems, and drive continuous improvement in system reliability.
- Security and compliance in motion. Work with our Security team to embed security into every layer of Build infrastructure. Ensure we meet compliance requirements across 100+ jurisdictions without creating friction for developers or customers.
- Performance and cost optimisation. Continuously optimize system performance, resource utilization, and cloud costs. Make recommendations that improve both reliability and unit economics.
- Automation and operational leverage. Identify manual operational toil and systematically eliminate it. Build tools and processes that let teams operate efficiently without scaling headcount.
- Platform reliability and developer experience. Partner with platform teams to ensure APIs, MCP, and CLI are resilient and observable. Give infrastructure feedback that shapes how the platform evolves.
- Design, implement, and maintain CI/CD pipelines.
- Automate infrastructure provisioning and deployment using Terraform and Ansible.
- Manage and optimize cloud environments, primarily on AWS and Azure.
- Deploy and manage containerized applications using Docker, Kubernetes, and OpenShift.
- Monitor system performance, troubleshoot issues, and ensure high availability.
- Collaborate with development, QA, and operations teams to implement DevOps best practices.
- Manage source code repositories using Git.
DevSecops - Cloud Engineer
Sequoia Connect
Our core expertise lies in connecting Top Technologists with Top Companies through unparalleled IT headhunting solutions
Role Description
We are currently searching for a DevSecOps - Cloud Engineer:
- Design, develop, and maintain automation frameworks and scripts to streamline security processes and workflows.
- Deploy and manage AWS resources using Terraform, ensuring secure and scalable infrastructure.
- Implement innovative security solutions to reduce the mean time to detect and respond to threats.
- Implement and optimize AWS Step Functions and Lambda for serverless automation workflows.
- Leverage AWS security-centric services (e.g., IAM, Control Tower, KMS, Macie, GuardDuty, CloudTrail, EventBridge) to enhance cloud security.
- Collaborate with cross-functional teams to integrate security automation into CI/CD pipelines.
- Monitor and troubleshoot AWS infrastructure to ensure high availability, performance, and compliance.
- Stay updated on AWS best practices, security trends, and emerging technologies to drive continuous improvements.
- Perform hands-on support for a wide range of security technologies, including Pipeline security, DevSecOps, CloudFormation templates, Terraform, Docker, Kubernetes, SIEM, CSPM, and Vulnerability Scanners.
- Work independently with minimal supervision, while providing guidance and collaborating with the team as needed.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent work-related experience).
- 3+ years of professional experience in Python, Go, or equivalent development, focusing on automation.
- 3+ years of hands-on experience with AWS technologies, particularly security-centric services (IAM, KMS, Control Tower, Macie, GuardDuty, CloudTrail, EventBridge, etc.).
- Proficiency in writing and developing infrastructure-as-code using Terraform.
- Working knowledge of AWS Step Functions and AWS Lambda for serverless architectures.
- Experience with SIEM, IPS, and Vulnerability Scanners.
- Familiarity with cloud security best practices and compliance frameworks.
- Strong problem-solving skills and the ability to work independently or in a team environment.
- Excellent communication skills to collaborate with technical and non-technical stakeholders.
- High-Performance Mindset: Resilience, emotional intelligence, and a focus on agile delivery.
- Technologist DNA: A deep understanding of the difference between "coding" and "engineering."
Requirements
- AWS certification (e.g., AWS Certified Security - Specialty, AWS Certified Solutions Architect).
- Experience with CI/CD tools (e.g., Scalr, Harness, Jenkins) and containerization (e.g., Docker, Kubernetes).
- Knowledge of additional programming languages or scripting tools.
- Familiarity with other cloud platforms (e.g., Alibaba, GCP) is a plus.
- Familiarity with cloud-native foundations or AI coding assistants.
Languages
- Advanced Oral English.
- Advanced Spanish.
Benefits
- Fully remote.



