Senior Database Reliability Engineer, Architect
Location
Poland
Posted
33 days ago
Salary
0
Seniority
Senior
Job Description
Senior Database Reliability Engineer, Architect
CloudLinux
• DBaaS Architecture: Design and implement a self-service platform based on Terraform and Ansible, enabling the deployment of HA clusters (PostgreSQL and ClickHouse, MongoDB, Redis) in a heterogeneous environment (Bare Metal + OpenNebula + Kubernetes + Public Clouds). You will turn infrastructure into a product. • Scaling ClickHouse: Manage exponentially growing analytics clusters (12+ clusters, tens of terabytes of data). You will tackle sharding, table engine optimization (ReplicatedMergeTree), and building reliable S3 backup pipelines under high load. • Data Platform & Analytics Support: Maintain and scale the infrastructure for Apache Airflow and Redash. You will ensure the reliability of ETL pipelines and visualization tools, bridging the gap between raw infrastructure and the data analytics team. • Reliability as Code: Implement SRE practices in data management. Replace manual incident response with automated self-healing mechanisms. Define and implement SLO/SLI for all databases. • Stack Modernization: Lead the migration process from legacy solutions to modern cloud patterns. Participate in decision-making regarding the implementation of Kubernetes operators for stateful workloads. • Expertise & Mentorship: Serve as the technical authority for product teams, helping them optimize data schemas and SQL queries for high-load systems.
Job Requirements
- AI-Augmented Engineering: You don't view AI as a replacement for deep technical fundamentals, but as a high-leverage tool. We actively use AI agents (Claude, Codex, Gemini, etc.) to automate boilerplate, analyze complex logs, and speed up research. We expect you to be open to modern workflows and integrate AI into your day-to-day operations, allowing you to focus your brainpower on the true architectural challenges.
- Deep PostgreSQL Expertise (5+ years): You know MVCC internals, understand locking mechanics, can configure Patroni and PgBouncer "with your eyes closed," and have experience with seamless major version upgrades under load.
- ClickHouse Mastery: Experience operating large clusters, understanding ZooKeeper/ClickHouse Keeper, sharding, replication internals, and the ability to diagnose performance issues at the data-part level.
- Engineering Mindset (SRE/DevOps): You hate doing the same task twice by hand. Experience writing complex Terraform modules and Ansible roles is mandatory. Programming skills in Python or Go for automation are a huge plus.
- Hybrid Environment Experience: You understand the differences between running DBs on Bare Metal vs. Kubernetes vs. Cloud and know how to optimize TCO and disk subsystem performance (NVMe, Network Storage).
- Systems Approach: You see the big picture - from the network packet to the application business logic. You understand the importance of security (FIPS, Audit logs) and Disaster Recovery.
- Nice to Have:**
- Experience building an Internal Developer Platform (IDP).
- Experience operating databases in Kubernetes (CloudNativePG, Altinity Operator).
- Experience working in Cloud and Hosting providers on similar services.
Benefits
- A focus on professional development.
- Interesting and challenging projects.
- Fully remote work with flexible working hours, which allows you to schedule your day and work from any location worldwide.
- Paid 24 days of vacation per year, 10 days of national holidays, and unlimited sick leaves.
- Compensation for private medical insurance.
- Co-working and gym/sports reimbursement.
- Budget for education.
- The opportunity to receive a reward for the most innovative idea that the company can patent.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Cloud Data DevOps Engineer
AAAProud to serve our 62+ million members, help travelers see the world and drive real change to improve road safety.
• Design, deploy, and maintain scalable and reliable cloud infrastructure on AWS and GCP. • Build scalability, fault-tolerance, security, and performance into our data platforms to meet our growing data & analytics needs. • Develop cloud data ingestion and processing frameworks. • Automate routine tasks to improve efficiency and reduce manual intervention. • Implement and maintain CI/CD pipelines for data applications. • Optimize data pipelines and processes for performance and scalability. • Troubleshoot and resolve issues related to data pipelines and data processing. • Work with our data engineering teams, data science teams, and analytical business users to review and ensure best practices are followed and high-quality code gets implemented. • Partner with our Information Security, Application Security, and Cloud Infrastructure Services teams to ensure data is secure both at-rest and in-flight. • Ensure data platforms remain current to take advantage of the latest features and support. • Mentor, teach, champion, and encourage an Automation mindset amongst the IT department and other business units. • Communicate the status of assigned work to management and follow agile practices, standard procedures, and policies. • Seek guidance when direction is needed and speak up about technology risks identified.
• Lead and support on-site deployments of autonomous robotic systems, including installation, calibration, commissioning, and startup activities. • Troubleshoot and resolve mechanical, electrical, and software issues in real time to minimize downtime and restore system performance. • Perform preventive maintenance, inspections, and system health checks to ensure reliable operation. • Support remote diagnostics and respond to escalations in partnership with Engineering and Technical Support teams. • Train customer operators and site personnel on system operation, safety procedures, and best practices. • Document all field service activity including repairs, troubleshooting steps, photos, and site conditions for internal tracking and continuous improvement. • Travel frequently to customer sites (up to 100%) and work independently in dynamic warehouse and industrial environments.
Senior ServiceDesk Reliability Engineer – SDRE
TabbyOn a mission to create financial freedom. No interest. No fees. Shariah-Compliant.
• Tabby creates financial freedom in the way people shop, earn and save by reshaping their relationship with money. • The company’s flagship offering allows shoppers to split their payments online and in-store with no interest or fees. • Tabby generates over $10 billion in annual transaction volume for its partner brands and is the highest-rated, most-reviewed, largest, and fastest-growing FinTech in the GCC region. • Tabby launched in 2019 and has since raised +$1 billion in equity and debt funding from global and regional investors, and is now valued at $4.5 billion.
Senior ServiceDesk Reliability Engineer – SDRE
TabbyOn a mission to create financial freedom. No interest. No fees. Shariah-Compliant.
- Tabby creates financial freedom in the way people shop, earn and save by reshaping their relationship with money. Over 15 million users choose Tabby to stay in control of their spending and make the most out of their money. - The company’s flagship offering allows shoppers to split their payments online and in-store with no interest or fees. Over 40,000 global brands and small businesses, including Amazon, Noon, IKEA, and SHEIN use Tabby to accelerate growth and gain loyal customers by offering easy and flexible payments online and in stores. - Tabby generates over $10 billion in annual transaction volume for its partner brands and is the highest-rated, most-reviewed, largest, and fastest-growing FinTech in the GCC region. - Tabby launched in 2019 and has since raised +$1 billion in equity and debt funding from global and regional investors, and is now valued at $4.5 billion.



