Job Closed
This listing is no longer active.
Senior Database Reliability Engineer, Architect
Location
Georgia
Posted
71 days ago
Salary
0
Seniority
Senior
Job Description
Alex Staff Agency
This position is open at a global product-led IT company specializing in infrastructure stability and security solutions. Their products are recognized as the industry standard in the Hosting and Enterprise segments, powering over 500,000 servers worldwide. In 2025, the company is evolving its data management strategy, shifting from traditional database administration to an Internal Database-as-a-Service (DBaaS) model. This role requires a visionary engineer to design resilient distributed systems, automate infrastructure through code, and transform databases into a reliable service for product teams. This is an ideal opportunity for those ready to handle petabytes of data and build high-scale platform solutions.

**Key Challenges & Responsibilities:**
- Designing and implementing a self-service platform (Terraform + Ansible) for deploying HA clusters (PostgreSQL, ClickHouse, MongoDB, Redis) in a heterogeneous environment (Bare Metal, OpenNebula, K8s, Public Clouds).
- Managing rapidly growing analytics clusters (12+ clusters, tens of terabytes), focusing on sharding, ReplicatedMergeTree, and building reliable S3 backup pipelines under high load.
- Maintaining and scaling infrastructure for Apache Airflow and Redash, ensuring the reliability of ETL pipelines and visualization tools.
- Implementing SRE practices in data management: replacing manual incident response with automated self-healing mechanisms and defining SLO/SLIs.
- Migrating legacy solutions to modern cloud patterns and implementing Kubernetes operators for stateful workloads.
- Serving as a technical authority for product teams to optimize data schemas and SQL queries for high-load systems.

**Tech Stack:**
- **DB:** PostgreSQL 15+ (Patroni, PgBouncer), ClickHouse (Sharded/Replicated), MongoDB, Redis, Kafka.
- **Data & Analytics:** Apache Airflow, Redash.
- **Infrastructure:** Hybrid Cloud (3+ private DCs, OpenNebula, K8s, Bare Metal, AWS, GCP, Azure, DO).
- **IaC & CI/CD:** Terraform, Ansible, Python/Go, GitLab, Jenkins, Gerrit.
- **Observability:** VictoriaMetrics, Grafana, Loki.
Job Requirements
**Must have:**
- 5+ years of PostgreSQL expertise: deep knowledge of MVCC, locking mechanics, expert-level Patroni/PgBouncer configuration, and experience with seamless major version upgrades under load.
- ClickHouse mastery: experience operating large clusters, understanding ZooKeeper/ClickHouse Keeper, sharding, replication internals, and performance diagnostics at the data-part level.
- Engineering mindset (SRE/DevOps): experience writing complex Terraform modules and Ansible roles; proficiency in Python or Go for automation is a major asset.
- Hybrid environment experience: understanding the nuances of running DBs on Bare Metal vs. Kubernetes vs. Public Cloud, with the ability to optimize TCO and disk subsystem performance (NVMe, Network Storage).
- Systems approach: understanding the full stack from network packets to business logic, including security standards (FIPS, Audit logs) and Disaster Recovery.
**Nice to Have:**
- Experience building an Internal Developer Platform (IDP).
- Experience operating databases in Kubernetes via operators (CloudNativePG, Altinity Operator).
- Background working with Cloud or Hosting providers on similar services.
Benefits
- Fully remote work from any location worldwide and flexible working hours.
- Opportunity to impact architectural decisions for services used by thousands of companies globally.
- 24 days of vacation, 10 national holidays, and unlimited paid sick leave.
- Compensation for private medical insurance.
- Reimbursement for co-working spaces and gym/sports activities.
- Dedicated budget for education, training, and conferences.
- Reward program for innovative ideas that lead to company patents.
More DevOps Engineer Jobs
• Configure a complete CI/CD pipeline from scratch and continuously improve it
• Use automated deployment tools like Gitlab or Jenkins
• Implement automated end-to-end testing frameworks
• Manage deployment and resources in the AWS ecosystem
• Administer Linux systems and utilize scripting languages for process automation
**Position Description:** This position focuses on creating and modifying pipelines using GitHub Enterprise Cloud repositories. The role requires expertise in developing and maintaining pipelines using Jenkins servers and troubleshooting deployment issues. Candidates should incorporate metrics such as Mean Time To Build (MTTB) and Mean Time To Deploy (MTTD). Experience with multiple CI/CD tools, GitHub Actions, and code scanning tools like CodeQL, Fortify, SonarQube, and Nexus is desired. Familiarity with automation tools such as Selenium, Cucumber, Maven, and AWS CodeBuild/CodeDeploy is advantageous.
Role Description
We are looking for an experienced professional to join our team as our first dedicated Senior DevOps / Platform Engineer (f/m/x) - someone who will own this area end-to-end and build the foundation for everything that follows. This is a senior individual contributor role with full ownership. Your mission is to make MAIA's platform reliable, secure, auditable, and developer-friendly, at a stage where every decision you make has lasting impact.
- You take full ownership of our infrastructure - Hetzner first, AWS, GCP, and Azure for selected services.
- You own our CI/CD pipelines (GitHub Actions) and deployment workflows, continuously improving rollout strategies, versioning, and rollback procedures.
- You build, improve and maintain our observability stack (Grafana, Loki, Sentry, Posthog) - metrics, logs, traces, and alerting that surfaces problems before customers do.
- You implement and improve our security fundamentals: IAM, secrets management, TLS, vulnerability scanning, and patch management.
- You implement the technical controls required for our ISO 27001 certification and ensure evidence is continuously and auditably produced.
Qualifications
- Strong experience running production systems in a SaaS environment.
- Solid understanding of Linux systems and networking fundamentals, containers and reverse proxies and API gateways (e.g., Traefik, Kong).
- Strong security fundamentals: IAM and least privilege, secrets management, vulnerability scanning, and patching.
- Proven experience with Infrastructure as Code (Terraform or equivalent), CI/CD pipelines (GitHub Actions or similar), and observability tooling (Grafana/Loki or similar).
- PostgreSQL operations basics: availability, backup and restore, performance awareness.
- Fluent in English. German is a plus.
Requirements
- Strong ownership mentality - you take responsibility end-to-end, without needing to be managed.
- People enjoy working with you - not just because of what you know, but because of how you interact with them.
- Pragmatic builder - you improve systems without adding unnecessary complexity, and you can defend why a simpler approach was the right call.
- AI is part of how you work - not just as a tool you use, but something you actively explore. You experiment with models, prompts, and workflows, and you have opinions about what actually works.
- You prioritize based on risk and business impact, not technical interest alone.
- Comfortable working in a fast-moving startup environment with ambiguity and shifting priorities.
- You communicate clearly with both technical and non-technical stakeholders.
Nice to have
- Experience with ISO 27001 implementation, especially technical controls.
- Experience with NixOS.
- Experience with self-hosted Supabase and Postgres-based platforms.
- Experience with SRE practices: SLOs, error budgets, and incident review culture.
- Familiarity with multi-cloud usage patterns (Hetzner, AWS, Azure, GCP).
Benefits
- The opportunity to build and own an entire platform engineering discipline from the ground up at a well-funded, fast-growing startup.
- Short decision-making paths, real ownership, and a team that trusts you to lead your area.
- Direct impact on product reliability, security posture, and company growth.
- Flexible working hours and fully remote.
- Access to a WellPass fitness membership for your physical and mental wellbeing.
- Competitive salary of 70,000 - 80,000 EUR and VSOP (Virtual Stock Option Plan) participation opportunities.
- We are a remote-first company and have been since day one. Most of our team is based in Leipzig, but remote team members are a natural part of how we work - not an exception.
- We do bring the full team together in Leipzig, or somewhere else, a few times a year for team events and planning sessions, with all travel and accommodation costs covered.
- If you are somewhere in Germany you are in scope.
Hiring timeline
- We are aiming to have this role filled by May 2026.
- The process consists of three stages: an intro call (30min), a technical interview (90min) and a final conversation with leadership.
- We move quickly and will keep you informed at every step.
Company Description
We are driven by our passion for innovation! MAIA is the AI platform built for industrial companies - the places where generic AI tools break down because the data is complex, the stakes are high, and precision actually matters. We focus on the hard problems: integrating with real engineering workflows, understanding complex technical documents, and turning implicit organizational knowledge into something the whole company can build on. Our team brings together sharp organizational thinkers, creative problem solvers, and engineers who care deeply about what they build.
Azure DevOps Specialist
Sparrow Connected
Transforming how companies communicate and engage with their entire workforce.
• Take over ownership and responsibility for our build, release, and environments with the full support of the rest of the product team.
• Work closely with the CTO and our senior developers as a key member of the product team.
• Championing DevOps thinking to the dev team and during the feature spec'ing process.
• Designing, building, testing, automating, monitoring and supporting significant components of our environments in Azure.
• Monitoring and optimizing cost, efficiency, resiliency, and more.
• Contributing to technical decisions and direction in a collaborative team environment, including architecture, estimation, product planning, user story/requirement creation.
• Implementing modern Continuous Delivery processes for releasing software to production.
• Applying industry best practices and patterns across infrastructure and application components e.g. security, elasticity, performance.
• Applying configuration management, infrastructure provisioning, and container orchestration tooling to solve business problems.
• Develop best of breed solutions.
• Work in an Agile environment.




