Job Closed
This listing is no longer active.
Boulevard powers the next generation of salons and spas so it’s easier for everyone to look and feel their best.
Staff Database Reliability Engineer
Location
United States
Posted
70 days ago
Salary
$183K - $225K / year
Seniority
Lead
Job Description
Staff Database Reliability Engineer
Boulevard
• Own and improve database reliability, performance, and scalability. • Drive architectural improvements to reduce incident frequency and impact. • Partner with engineering teams to design and operate scalable distributed systems. • Build tools and automation to eliminate operational overhead. • Elevate observability through actionable metrics and dashboards. • Mentor engineers and foster a culture of reliability.
Job Requirements
- 8–10+ years of experience in systems, infrastructure, or backend software engineering.
- Strong focus on RDBMS and NoSQL systems.
- Production experience with managed cloud databases (AWS Aurora/RDS).
- Expertise in deploying/managing infrastructure using infrastructure-as-code tools.
- Proven experience with reliability outcomes, SLOs, SLIs, and observability practices.
- Strong background in automation, scripting, and infrastructure-as-code (e.g., Terraform, Python, Go).
- Experience diagnosing and mitigating production incidents in high-availability systems.
- Excellent communication skills and ability to influence across engineering teams.
- Demonstrated ability to set technical standards and mentor engineers.
- Ability to navigate uncertainty and iterate toward meaningful outcomes.
Benefits
- 401(k) match plus dental, medical, vision, and life insurance.
- Flexible vacation day policy.
- Fully remote with a monthly work from home stipend.
- Family planning resources and specialized support programs.
- Equity opportunities.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
A position at White Cap isn’t your ordinary job. You’ll work in an exciting and diverse environment, meet interesting people, and have a variety of career opportunities. The White Cap family is committed to Building Trust on Every Job. We do this by being deeply knowledgeable, fully capable, and always dependable, and our associates are the driving force behind this commitment. Job Summary Responsible for bridging software development and IT operations, for engineering teams to deliver applications faster, more reliably, and with higher quality. Automates CI/CD and infrastructure, establishes best practices in monitoring, reliability, and incident response for all systems. Major Tasks, Responsibilities and Key Accountabilities - Design and operate cloud infrastructure to host front-ends, API layers, and platform integrations, ensuring high availability, security, and scalability. Containerize services and operate container orchestration platforms and manage container registries. - Collaborate closely with product, frontend, backend, QA, and vendor support specialists to enable page-level launches and maintain performance SLAs. - Build and maintain infrastructure as code to provision and manage production and non-production environments. - Create and maintain robust CI/CD pipelines for frequent, safe deployments of page-level changes and backend services. - Harden platform and application security, manage secrets, and support compliance efforts. - Build observability, monitoring, logging, tracing, and alerting. - Automate repeatable runbooks and incident response; participate in on-call rotations and post-incident reviews. - Implement deployment strategies that support page-by-page migration, including rollback safety, canary/blue-green releases, and feature-flag-driven launches. Nature and Scope - Consults with senior management on solution development for complex strategic and technical business issues. Independently solves unique and complex problems that have a broad impact on the business. Assignments are large scope, high impact, high cost, and high importance. - Establishes operational plans for assigned area. Acts as a strategic advisor and uses expert skills to contribute to the development of strategic company objectives. Receives general administrative and business direction as needed. Typically operates with broad latitude in a complex environment. - Guides and mentors staff at all levels, provides advice to senior management on complex and strategic issues, and may lead or manage complex projects with dotted line responsibility. Work Environment - Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable. - Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles. - Typically requires overnight travel less than 10% of the time. Education and Experience - Typically requires a bachelor’s degree and 10+ years of experience in a related field OR MS/MA and generally 8+ years of experience in a related field. Maintains expert knowledge in area of responsibility with a strong understanding in adjacent areas for the development of creative solutions. Preferred Qualifications - Strong experience with AWS, Azure, or GCP — including core services (compute, networking, load balancers, CDN, databases). Azure experience preferred. - Hands-on experience containerizing applications (Docker) and operating Kubernetes in production (EKS/AKS/GKE). - Proven experience building and operating CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, CircleCI, etc.). - Experience implementing deployment strategies such as canary, blue/green, rolling, and feature-flag driven releases. - Experience with performance tuning and capacity planning for web applications. - Experience with Azure middleware and platform services (Azure App Service, Azure API Management, Azure Functions, Service Bus, Logic Apps. - Strong Linux administration, networking, and security fundamentals (firewalls, VPC/VNet, IAM, TLS, secrets management). - Demonstrated skills in observability and incident management (metrics, logs, tracing, alerting, runbooks). - Solid scripting and programming skills (Bash, Python, Node.js/TypeScript or similar) to automate tasks and build tooling. If you’re looking to play a role in building America, consider one of our open opportunities. We can’t wait to meet you.
Senior DevOps Engineer
Perseus Group, Constellation SoftwareWe recognize the value and importance of diversity and inclusion in our communities and in the workplace. We celebrate diversity and one of our goals as an employer is to create an inclusive work environment for all employees. We are an equal opportunity employer and do not discriminate against any employee or applicant because of: Race Religion Sex Sexual orientation including gender identity or expression Pregnancy National origin Age Marital status Veteran status Disability status Any other category or characteristic protected by law Applicants with disabilities who would like to require a reasonable accommodation related to any part of the application process may contact us at Perseus_HR@constellationhbs.com . NOTE: If an applicant is selected to receive a conditional offer of employment, and in accordance with applicable law, a criminal background check may be conducted before the offer becomes final and employment begins. Pursuant to the San Francisco Fair Chance Ordinance, and other applicable laws, we will consider for employment qualified applicants with arrest and conviction records.
• Design, implement, and maintain CI/CD pipelines using Azure DevOps • Manage cloud infrastructure (primarily Azure) to ensure scalability, security, and high availability • Implement backup strategies, high availability, and disaster recovery solutions • Automate infrastructure provisioning and configuration using Infrastructure as Code (IaC) tools • Monitor system performance and implement alerting and logging solutions • Collaborate with developers to optimize deployment workflows, database standards, schema design and troubleshoot issues • Design, implement, and manage SQL Server environments in Azure Cloud • Ensure compliance with security and operational standards • Participate in agile ceremonies and contribute to sprint planning and retrospectives • Create and maintain technical documentation and support release deployments
• Build, maintain, and operate scalable production infrastructure. • Own reliability and availability for key services and environments. • Contribute to the design and operation of Kubernetes-based infrastructure. • Develop and maintain Infrastructure-as-Code frameworks (e.g., Terraform). • Improve monitoring, alerting, and observability across systems. • Participate in on-call rotations and respond to production incidents. • Investigate root causes of incidents and contribute to postmortems and reliability improvements. • Improve system performance, availability, and fault tolerance. • Contribute to CI/CD pipeline improvements to increase release safety and predictability. • Support the deployment and operation of data platforms and ML workloads. • Help standardize environments and infrastructure across internal systems and customer deployments. • Troubleshoot issues across infrastructure, services, and deployment pipelines. • Work closely with QA and engineering teams to improve production readiness and release stability. • Contribute to automation efforts that reduce operational toil.
• Collaborate closely with architects, developers, QA, and security teams to ensure smooth and reliable environment operations • Work in close partnership with the platform team, based on shared ownership, knowledge exchange, and mutual support • Own and operate containerized application platforms based on Docker and Kubernetes, ensuring reliability, scalability, and operational excellence • Design and deliver dynamic test environments at scale, including multiple parallel, per–merge request (branch-based) deployments • Build, maintain, and standardize CI/CD pipelines by creating reusable templates and components in GitLab CI • Drive deployment automation and GitOps practices • Identify operational bottlenecks and implement automation to reduce manual effort and improve delivery speed • Embed security-by-design across the SDLC, including pipeline hardening and automated security checks • Build and operate observability platforms: monitoring, logging, and diagnostics (Prometheus, Grafana, ELK/EFK/Loki, etc.) • Participate in on-call and incident response, including troubleshooting, root-cause analysis, and post-mortems • Take end-to-end ownership of the solutions you build (“you build it, you run it”).




