Job Closed
This listing is no longer active.
Enhancing the Retail Experience.
Database Reliability Engineer
Location
Arizona + 16 moreAll locations: Arizona | Colorado | Connecticut | Florida | Idaho | Kansas | Nebraska | Nevada | New Jersey | North Carolina | Ohio | Oregon | Massachusetts | Michigan | Pennsylvania | Texas | Virginia
Posted
152 days ago
Salary
$120K - $170K / year
Seniority
Senior
Job Description
Database Reliability Engineer
RT²
• Install, update, and maintain SQL Server database software across multiple versions (2012–2022) • Manage database performance by developing protocols, monitoring systems, and resolving processing and programming issues • Implement database security policies and disaster recovery (DR) procedures to ensure data integrity and accessibility • Support database migrations from on-premises to cloud environments, with a focus on Azure databases and SQL in cloud/Azure • Design, implement, and maintain highly available and performant SQL Server and MySQL database systems • Monitor database health, performance, and capacity using industry-standard tools and custom scripts • Automate routine database operations and deployments using Infrastructure as Code (IaC) and CI/CD pipelines • Upgrade SQL Server versions and optimize systems for performance and reliability • Create and optimize queries, stored procedures, and indexing strategies • Maintain and administer SSAS, SSIS, and SSRS solutions for reporting and analytics • Provide guidance and support to developers, including schema design, code review, and query tuning • Collaborate with DevOps to install, configure, and manage SQL Server instances and databases in hybrid environments • Design, deploy, and maintain SQL Server availability groups across data centers • Develop and maintain database standards, documentation, and failover runbooks • Automate database maintenance tasks and processes to improve efficiency • Be available for on-call support to address urgent database issues
Job Requirements
- Proven experience with SQL Server (2012–2022), Azure databases, and SQL on VMs
- 3+ years of Terraform/Ansible/Powershell experience
- Experience with Azure SQL, Azure Managed Instances, and on-prem SQL Server
- Hands-on experience with disaster recovery (DR) solutions for SQL Server
- Expertise in SSAS, SSIS, and SSRS, with exposure to Power BI preferred
- Demonstrated ability to migrate on-premises databases to cloud environments, including Azure
- Strong skills in updating SQL Server versions and implementing new database systems
- Proficiency in SQL query language, query optimization, and troubleshooting
- Strong experience with PowerShell scripting for automation and configuration management
- Exposure to Infrastructure as Code (IaC) tools, such as Terraform, Ansible, or ARM templates
- Advanced knowledge of database security, performance tuning, and backup/recovery standards
- Familiarity with dimensional and relational data modeling concepts
- Strong programming skills, including experience with PL/SQL coding
- Experience with Unix and PowerShell scripting for database management tasks
- Knowledge of emerging database technologies and the ability to recommend and implement solutions
- Excellent problem-solving skills and attention to detail
- Strong communication skills to collaborate effectively with technical and non-technical teams.
Benefits
- Competitive compensation
- Generous STI and LTI provisions
- Health, Dental and Vision Insurance
- Paid Annual Leave
- Paid Sick Leave
- 401K, and more
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior SysAdmin – DevOps Experience
Salvo SoftwareWe provide affordable custom software solutions specialized in ERP Systems, Business Automation, and Blockchain.
• Global Endpoint Management: Own the full lifecycle (provisioning, configuration, and support) of NUC computers and devices for our distributed workforce. • Identity & Network Ownership: Maintain Windows domains (Active Directory), VPNs, and secure access controls. • Data Security: Define and enforce strict user access levels for sensitive, licensed OEM data to ensure project-based compliance. • Infrastructure Projects: Independently execute hardware upgrades and configurations, including factory routers, firewalls, and switches. • Technical Support: Act as the sole Tier 2 escalation point for the team, ensuring internal productivity never stalls. • Cloud Architecture: Manage and optimize cloud infrastructure (AWS, Azure, or GCP) to support both internal needs and client-facing digital transformations. • Automation & IaC: Reduce manual overhead by building and maintaining Infrastructure as Code using Terraform, Ansible, or CloudFormation. • CI/CD Support: Maintain and troubleshoot CI/CD pipelines (GitHub Actions, Jenkins, etc.) to ensure rapid, high-quality software delivery. • Containerization: Oversee Docker and Kubernetes environments to ensure scalable and reliable application hosting. • Observability: Manage monitoring, logging, and alerting systems to proactively address performance issues.
Field & Cloud DevOps Engineer – Edge Infrastructure
VOLT AIImproving security effectiveness while reducing costs through advanced AI
• Own end-to-end infrastructure operations spanning customer sites, edge devices, and AWS production environments • Plan and execute customer deployments in collaboration with customer IT teams, including leading technical discussions around network architecture, security requirements, and deployment constraints, translating customer IT policies into executable deployment plans, and driving deployment readiness and timelines to ensure on-time launches • Deploy, configure, and maintain edge compute hardware in customer environments, including installing and maintaining Linux-based operating systems, creating, deploying, and updating standardized golden images, managing OS patches, drivers, firmware, and security updates, and diagnosing hardware, OS, and performance issues remotely and on-site • Design, integrate, and troubleshoot customer-side networking, including VLANs, firewall rules, NAT, routing, and bandwidth constraints, IP camera and video infrastructure (RTSP, ONVIF, PoE, managed switches), and operating effectively within locked-down or highly regulated networks • Build and operate AWS-based infrastructure supporting edge systems, including Kubernetes (EKS) clusters and containerized workloads, CI/CD pipelines for deployment and upgrades, and logging, monitoring, and alerting across distributed systems • Develop and maintain automation and internal tooling, including Python for deployment automation, operational tooling, and system workflows, shell scripting for provisioning, configuration, and diagnostics, and tools that reduce manual effort and improve deployment repeatability • Ensure reliable edge-to-cloud connectivity and data flow, debugging failures across networking, software, and infrastructure boundaries • Lead incident response and operational debugging across edge and cloud systems, driving issues to root cause and permanent resolution • Own infrastructure cost visibility and optimization, including monitoring and analyzing AWS spend, implementing right-sizing, scaling, and cost controls, and working with external cost-optimization partners and tooling • Feed real-world deployment and operations insights back into product and infrastructure design, improving reliability and scalability
• Working in an agile environment as part of a full design and development team, participating in agile ceremonies, interacting with and supporting our customer • Continuously improving the client infrastructure and platform for users • Supporting developer teams in deploying, maintaining, and troubleshooting web applications • Deploying new CI/CD workflows and improving the performance of existing ones • Serving as the subject matter expert for containerization of Node.js applications and DevSecOps and provide relevant advice to team members and customers
Senior Site Reliability Engineer – Infrastructure
Underdog FantasyUnderdog Fantasy describes itself as one of the fastest-growing sports companies on the market, bringing "fun, approachable contests and games to the masses." A
• Own and maintain the incident response process, including defining procedures, tools, and best practices • Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems • Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs • Develop and implement disaster recovery plans, including regular testing and regulatory compliance • Collaborate with teams on architecture decisions to ensure high availability and scalability • Manage launch and event planning for high-traffic occasions, focusing on infrastructure preparation and capacity management (a.k.a. Launch Readiness) • Act as an internal expert and consultant for monitoring tools like Datadog and Pagerduty and infrastructure like AWS and Kubernetes • Emphasis on automation and tooling to scale our workload • Contribute across codebases in Ruby, Python, Go, TypeScript, Swift, and Kotlin as needed to support the initiatives described above.




