Where enterprise AI runs and outcomes scale
Staff Database Reliability Engineer
Location
India
Posted
7 days ago
Salary
0
Seniority
Lead
No structured requirement data.
Job Description
Staff Database Reliability Engineer
Rackspace Technology
Role Description - Expertise in SQL Server, Oracle, PostgreSQL, and MySQL for administration and migration. - Experience with PostgreSQL/MySQL/PAAS DB, scripting, and automation. - Experience in AWS/Azure Cloud environments. Key Responsibilities - Create and maintain SOPs for migration execution. - Formulate and monitor database management policies, procedures, and standards. - Provide design standards and guidance for projects and technical roadmaps. - Mentor and train Level 1 and Level 2 DBAs. - Engage with customers to streamline project deliverables. - Forecast database growth and plan for hardware and storage requirements. - Plan and deploy high availability and disaster recovery strategies. - Execute database software upgrades, patches, and service packs. - Investigate and resolve complex database-related issues. - Develop automation tools to streamline tasks. - Set up monitoring and alerting systems. - Generate performance reports and analysis. - Maintain up-to-date documentation of database configurations. - Resolve incidents, changes, and service requests under client SLAs. - Create detailed RCA reports for problem management. - Evaluate and manage third-party tools for database management. - Participate in sales implementations and vendor calls. - Provide 24x7 production support for database operations. - Execute large, complex database projects. Knowledge and Skills - Proficient in SQL Server, Oracle, PostgreSQL, and MySQL for architecture, installation, configuration, performance tuning, high availability, and disaster recovery. - Experience in planning and executing database migrations and upgrades. - Cloud-enabled for project-based migrations, automation, and administration. - Familiarity with industry best practices for database administration. - Effective communication of technical information. - Leadership skills for unexpected situations. - Experience with automation using Python, AWS CLI, PowerShell, Shell. - Knowledge of infrastructure as code with CloudFormation, Terraform, GitHub. - Ability to deploy, manage, and troubleshoot HADR configurations. - Knowledge of deploying and managing RDBMS in cloud platforms. - Understanding of monitoring tools like Datadog, Azure Monitor, AWS CloudWatch. - Usage of ITIL-based ticket tools like ServiceNow. Company Description We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Collaborate closely with fellow devops engineers and the development team to deploy and maintain application infrastructure. • Assist in the development and support of tooling to streamline the deployment and maintenance of our products. • Work with Github, Jenkins, and Chef to deploy applications from development through to production environments. • Support both in-house and third-party applications, including handling deployments, upgrades, and troubleshooting. • Build and manage automation pipelines for application deployment and maintenance. • Engage in the day-to-day management of Linux servers via the command line. • Create monitoring dashboards and alerts in Grafana leveraging Prometheus and Alertmanager. • Document processes and best practices clearly and concisely. • Participate in incident solving on-call rotation
• Own end-to-end release and deployment lifecycle: build → package → deploy → verify → rollback • Develop and support **Octopus Deploy** projects, lifecycles, channels, variables, and deployment processes • Implement deployment automation with **Ansible** (playbooks/roles, inventories, idempotent changes) • Maintain Git-based release workflows in **GitHub** (branching, tagging, versioning, release notes) • Build/maintain CI pipelines in GitHub Actions (or existing tooling) to produce artifacts and trigger Octopus releases • Standardize deployment patterns across applications (templates, shared steps, reusable Ansible roles) • Manage environment configuration and secrets in a controlled way (variable sets, permissions, auditing) • Improve deployment safety: approvals, health checks, smoke tests, automated validation, and rollback strategies • Support production releases, troubleshoot deployment failures, and drive root-cause analysis • Maintain release documentation, runbooks, and change management practices • Collaborate with developers, QA, and operations to plan releases and reduce downtime
• Act as technical lead for DevOps/Platform/Release engineering: set direction, standards, and best practices • Architect and govern end-to-end delivery: infrastructure provisioning, configuration management, CI/CD, release processes, and operations • Design and support Windows-based high availability solutions, with deep ownership of Windows clustering (failover/HA patterns, maintenance, upgrades, troubleshooting) • Lead Linux automation and platform standardization (configuration, patching, hardening, performance tuning) • Own Infrastructure as Code strategy with Terraform (modules, environments, state, governance) • Own automation strategy with Ansible (reusable roles, inventories, secure secrets handling, idempotency) • Build and standardize deployments using Octopus Deploy, GitHub, and Ansible (templates, shared steps, release promotion, rollback) • Design and mature CI/CD pipelines (artifact versioning, approvals, promotion strategy, policy-as-code where applicable) • Establish observability standards using VictoriaMetrics/Prometheus (metrics strategy, alerting, SLO/SLA monitoring, dashboards) • Provide production leadership: incident response, RCA/postmortems, reliability improvements, capacity planning • Mentor engineers, review designs/code, and raise overall engineering quality across teams • Produce and maintain architecture docs, runbooks, and platform roadmaps
• Build and maintain CI/CD pipelines for application builds, automated testing, packaging, and deployment activities. • Implement automation solutions for environment provisioning, operational workflows, release processes, and infrastructure support tasks. • Support secure delivery practices including code scanning, dependency validation, secrets management, and policy enforcement activities. • Troubleshoot and resolve build, deployment, pipeline, and environment-related issues across multiple applications and services. • Collaborate with development and QA teams to improve release quality, deployment reliability, and software delivery timelines. • Support cloud-based infrastructure and shared platform services in coordination with engineers, architects, and operations teams. • Maintain documentation for deployment pipelines, environment configurations, release procedures, and operational support processes. • Participate in incident response efforts, root cause analysis, and continuous process improvement initiatives. • Monitor system and pipeline performance and recommend improvements to automation, tooling, and workflow efficiency. • Support change management, deployment coordination, and release readiness activities across production and non-production environments. • Contribute to various projects and initiatives as assigned, demonstrating adaptability and a collaborative mindset.


