The Operating Core for Legal
Senior Site Reliability Engineer – GCP
Location
United States
Posted
56 days ago
Salary
$130K - $180K / year
Seniority
Senior
Job Description
Senior Site Reliability Engineer – GCP
Filevine
• Provide strong leadership, mentoring, and sound judgment as the Reliability Engineering lead on your team. • Design and maintain autonomous systems for building, deploying, testing, and operating all Filevine products. • Act as the authoritative voice of reliability across the full software development lifecycle (SDLC). • Monitor, aggregate, dashboard, and alert on software/infrastructure events to ensure visibility and fast response. • Continuously enhance CI/CD pipelines, automation scripts, playbooks, and tools to streamline processes and reduce resolution time. • Proactively identify and resolve gaps in system availability, performance, and security while defending overall security posture. • Document processes, architecture, procedures, and best practices; research, adopt, or build reliable tools to boost engineer productivity. • Collaborate within your team (or independently), mentor junior engineers, participate in 24/7 on-call rotation for production support and emergency response, and communicate clearly with technical and management stakeholders.
Job Requirements
- 8+ years of hands-on technical experience in software engineering, infrastructure, or operations roles, including a minimum of 4 years dedicated to Site Reliability Engineering (SRE).
- Demonstrated curiosity, self-motivation, continuous learning mindset, passion for improvement, and proactive enthusiasm to enhance systems and processes daily without needing direction.
- Strong proficiency in Python, Bash, PowerShell, and other common SRE tooling and scripting technologies.
- Expert-level experience designing, building, and maintaining autonomous systems that handle software build, deployment, testing, monitoring, and operations with minimal human intervention.
- Deep proficiency with Google Cloud Platform (GCP) and its core SRE services, including Compute Engine, Kubernetes Engine/GKE, Cloud Monitoring, Cloud Logging, and IAM. Experience with AWS is a strong plus (e.g., EC2, EKS, CloudWatch, S3).
- Proficiency in all core skills expected of an SRE II, including monitoring/alerting, incident response, capacity planning, performance optimization, CI/CD pipeline enhancement, and reliability engineering best practices.
- Bachelor’s degree in Computer Science, Information Systems, or a related field; equivalent certifications (e.g., Google Cloud Professional certifications, AWS certifications); or substantial comparable direct work experience.
- Proven track record of independently driving reliability improvements, reducing toil through automation, and contributing to high-availability, scalable production systems in a fast-paced environment
Benefits
- A dynamic, rapidly growing company, focused on helping organizations thrive
- Medical, Dental, & Vision Insurance (for full-time employees)
- Competitive & Fair Pay
- Maternity & paternity leave (for full-time employees)
- Short & long-term disability
- Opportunity to learn from a dedicated leadership team
- Top-of-the-line company swag
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Monitoring and management of cloud environments (AWS, Azure, GCP) • Assisting with CI/CD pipelines and general code management practices • Develop and maintenance of tooling for deployment of Espresso Systems services
• Responsible for the architecture, design, and implementation of modern cloud solutions • Lead and execute AWS cloud migration projects • Migrate applications, infrastructure, and databases to AWS • Automate cloud environments and deployments • Serve as a technical sparring partner for client teams
• Lead and implement AWS cloud migration projects • Design and build secure landing zones • Migrate applications, infrastructure and databases to AWS • Support cutovers and production go-lives with a strong focus on stability and security • Design and maintain CI/CD pipelines • Implement Infrastructure as Code (Terraform) • Automate cloud environments and deployments • Ensure observability using monitoring, logging and alerting tools • Migrate and operate containerized workloads using Kubernetes (EKS) or ECS/Fargate • Improve reliability, scalability and performance of workloads • Act as a trusted technical counterpart for customer teams • Support migration scoping and technical estimations • Clearly communicate trade-offs and architectural decisions
• Lead and implement AWS cloud migration projects • Design and build secure landing zones • Migrate applications, infrastructure and databases to AWS • Support cutovers and production go-lives with a strong focus on stability and security • Design and maintain CI/CD pipelines • Implement Infrastructure as Code (Terraform) • Automate cloud environments and deployments • Ensure observability using monitoring, logging and alerting tools • Migrate and operate containerized workloads using Kubernetes (EKS) or ECS/Fargate • Improve reliability, scalability and performance of workloads • Act as a trusted technical counterpart for customer teams • Support migration scoping and technical estimations • Clearly communicate trade-offs and architectural decisions


