Job Closed
This listing is no longer active.
Market-leading solutions that empower governments to build thriving communities, grow businesses and protect citizens.
Senior Manager, Cloud Engineering – Operations
Location
United States
Posted
106 days ago
Salary
$180K - $200K / year
Seniority
Senior
Job Description
Senior Manager, Cloud Engineering – Operations
Accela
• Hire, develop, and retain a high-performing team of SREs and CREs • Establish clear performance expectations, career paths, and technical standards • Foster a culture of ownership, accountability, and continuous improvement • Serve as a mentor to senior engineers and emerging leaders • Own reliability outcomes across availability, performance, security, and compliance • Serve as executive escalation point for high-severity production incidents • Lead incident response strategy, root cause analysis (RCA), and corrective action planning • Mature Incident, Problem, and Change Management processes • Drive automation initiatives in partnership with DevOps, Security, and Database Engineering • Oversee infrastructure scalability and resiliency across Microsoft Azure • Champion Infrastructure as Code (Terraform) and configuration management (Ansible) standards • Improve observability through robust dashboards, metrics, logging, and monitoring practices • Partner with Product and Engineering to integrate reliability early in the PDLC • Engage executive leadership with operational metrics, risk posture, and strategic roadmaps • Manage vendor and partner relationships supporting SaaS production environments • Align reliability strategy with business growth and public sector compliance requirements
Job Requirements
- 10–12+ years of experience in software engineering and/or production systems engineering within a SaaS environment
- 3–5+ years of people leadership experience managing technical teams
- Proven track record of building, scaling, and retaining high-performing engineering teams
- Strong executive communication skills with experience presenting operational metrics and strategy
- Deep expertise in distributed systems, system design, and troubleshooting complex production environments
- Experience operating in Microsoft Azure environments
- Experience in Linux environments and software version control systems
- Strong scripting capability (Bash, Python, Ruby, or Go)
- Mastery of production monitoring, logging, and observability tools
- Demonstrated ability to lead full-stack incident response and root cause analysis
Benefits
- flexible time off
- comprehensive medical, dental, and vision plans
- family planning benefits
- 401(k) retirement savings plan with company match
- health savings account with company contributions
- flexible spending account
- life, accident, and disability coverage
- business travel insurance
- employee assistance programs
- other well-being benefits
Related Guides
Related Categories
Related Job Pages
More Cloud Engineer Jobs
• Design, build, and maintain event-driven architectures using Kafka (Amazon MSK) and IBM MQ. • Develop and deploy microservices on Kubernetes (EKS) to enable secure and scalable integrations. • Implement robust integration patterns, including idempotency, error handling, retry strategies, and secure messaging flows. • Ensure the reliability, scalability, and high availability of the platform in production environments. • Establish and maintain strong observability practices (monitoring, logging, metrics, and alerting) using CloudWatch, Prometheus, and Grafana. • Collaborate with technical and business teams to design solutions aligned with functional and technical requirements.
• Own and evolve Jira Cloud, Confluence Cloud, and Loom Enterprise as secure, scalable collaboration platforms. • Define and roll out governance standards for workflows, permissions, templates, and information architecture that scale across teams. • Reduce configuration sprawl and platform technical debt to improve maintainability, reporting consistency, and user experience. • Design and implement automation and integrations using Jira Automation, APIs, and scripting to replace manual processes with durable solutions. • Partner with Engineering and TPM teams to align workflows with SDLC best practices and enable consistent execution across orgs. • Improve visibility and confidence in reporting by standardizing configuration patterns and driving adoption of best practices. • Enable async first collaboration by strengthening Loom adoption and defining clear patterns for using video updates effectively. • Maintain platform reliability, security, and operational excellence, including incident response and durable follow through on root causes. • Create and maintain clear documentation, runbooks, and scalable operational practices.
• Design, deploy, and operate cloud infrastructure in AWS, Azure, and/or GCP • Build and manage managed Kubernetes clusters (EKS, AKS, GKE) • Implement best practices for cloud security , including IAM, network segmentation, and secure workloads • Design and maintain cloud networking and routing , including VPC/VNet architecture, load balancing, and private connectivity • Collaborate with application teams to support scalable, resilient deployments • Work within Agile development processes , contributing to planning, standups, and retrospectives • Use Git-based workflows for infrastructure and configuration management • Troubleshoot production issues and continuously improve reliability and performance
Azure Architect
Weekday (YC W21)We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent
• Design and implement a new Azure tenant from scratch • Establish governance frameworks, policies, access controls, and compliance standards • Architect the Azure environment to meet defined performance, scalability, and availability requirements • Configure and manage various Azure services, including Azure Traffic Manager and Azure Kubernetes Services • Implement security best practices and ensure high availability and business continuity • Collaborate with development teams and conduct knowledge transfer sessions




