Xayn is pioneering next-gen AI for lawyers.
Lead DevOps Engineer
Location
Germany
Posted
22 hours ago
Salary
€95K - €118K / year
Seniority
Senior
Job Description
- Own and optimize Noxtua's infrastructure across OTC and our self-hosted GPU servers, ensuring efficient architecture, reliable operation, and cost control.
- Lead and grow a team of 4–5 DevOps engineers, setting technical direction, supporting their development, and bringing a strong ownership mindset.
- Operate our self-managed GPU server fleet (provisioning, driver installation, hardening, and connectivity via Ansible) and manage provider SLAs to keep heavy AI workloads running reliably.
- Build and maintain infrastructure automation using Infrastructure as Code (Terraform & Ansible).
- Run our container platform on Kubernetes, support teams with Docker, and keep our services (APIs) stable, accessible, and secure.
- Set up and maintain monitoring and alerting (e.g., Prometheus, Grafana) to ensure system reliability and performance.
- Develop and maintain CI/CD pipelines and collaborate with the development and AI teams to automate deployments and support AI-driven workloads.
Job Requirements
- Leadership: Experience leading or mentoring a team, setting technical direction, and balancing hands-on operations with people responsibility.
- Managing server fleets: You've managed a fleet of servers and understand the methodology behind it — not just rented cloud instances.
- Experience with GPU servers is a strong plus, but not required.
- Strong proficiency in Linux and Bash, plus a scripting language such as Python.
- Proven track record designing, operating, and cost-managing cloud-based architectures — ideally OTC (Open Telecom Cloud), or transferable experience from AWS, Azure, or Google Cloud — with solid networking fundamentals (DNS, OSI model).
- Strong focus on automating provisioning and configuration with Terraform and Ansible.
- Expertise in containerizing applications with Docker and running them at scale on Kubernetes.
- Able to set up and maintain monitoring/alerting tools (e.g., Prometheus, Grafana), aggregate data, visualize insights, and derive actions.
Benefits
- 100% remote work possible (given a German residence), other countries upon request
- Flexible working hours
- Vacation: 26 days + December 24th & 31st off, + 1 additional vacation day per year of employment (up to 30 days)
- Discounts: e.g., Urban Sports Club Membership, depending on location
- Equipment: Laptop (Lenovo or Mac), plus €1,000 net home office setup budget (paid with your first salary)
More DevOps Engineer Jobs
DevOps Engineer
BCC Software
BCC Software is the leading postal and presort software solutions provider with in-depth data marketing services.
Role Description
The DevOps Engineer is responsible for supporting and enhancing the engineering infrastructure, managing secure engineering operations, and maintaining internal development tools and software. This role includes automating engineering workflows, maintaining cloud-based infrastructure, and contributing to product development efforts. The DevOps Engineer will work closely with Development, QA, and IT teams to streamline and secure all aspects of the CI/CD pipeline while advancing cloud infrastructure, automation, and operational excellence across the engineering organization.
Key Responsibilities
- Design, build, and maintain CI/CD workflows and automation frameworks to support secure software delivery across on-premise and public cloud-hosted engineering environments and applications, in alignment with IT and security access policies.
- Champion and help implement established CI/CD, release automation, and build security best practices across engineering teams, partnering with IT, security, and architecture teams to align with enterprise standards.
- Continuously improve deployment speed, release reliability, and service operations through automation, pipeline quality improvements, and observability capabilities.
- Be a key contributor in developing and maintaining all aspects of public cloud infrastructure solutions, including automated virtual machine deployment and ongoing support for the team's infrastructure needs.
- Continuously monitor and improve build systems for performance, security, scalability, and automation.
- Work closely with Engineering, QA, and IT to troubleshoot infrastructure and deployment issues in development, QA, staging, and production environments.
Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent professional experience.
- Strong understanding of CI/CD principles, with hands-on experience building pipelines/actions within the GitHub and/or ADO ecosystem.
- 5-7 years of experience in DevOps, infrastructure engineering, or a similar role.
- Experience supporting container-based solutions using Docker and Kubernetes.
- Familiarity with Microsoft Azure PaaS and SaaS services, including compute, networking, storage, containers, and identity management.
- Experience working with Infrastructure as Code (IaC), particularly using tools like Terraform, Bicep, and ARM.
- Knowledge of public and private cloud infrastructure security best practices.
- Proficiency in PowerShell or similar scripting languages for automation and system management.
- Experience working within Lean and Agile development environments.
Benefits
- This position offers remote work flexibility, with a preference for candidates located in the Rochester, NY area to enable increased in-person and onsite collaboration as business needs evolve.
Physical Demands
- This is primarily an office-based role requiring extended periods of sitting and frequent use of hands for typing, writing, and operating standard office equipment.
- The position requires effective verbal and written communication.
- Occasional standing, walking, bending, reaching, or lifting of office supplies or small equipment up to 25 pounds may be required.
- Visual acuity sufficient for reading, computer work, and document review is necessary.
- Reasonable accommodations may be made to enable qualified individuals with disabilities to perform the essential functions of the position.
Position Type and Expected Hours of Work
- This is a full-time exempt position.
- Some flexibility in hours is allowed, but the employee must be available during the "core" work hours of Monday through Friday, 8:00 a.m. to 5:00 p.m.
- Additional hours, including evening and weekend work, may be required as job duties demand.
AAP/EEO Statement
- BCC Software provides equal employment opportunity to all individuals regardless of age, race, color, creed, religion, ancestry, sex (including pregnancy, childbirth or related medical conditions), gender, sexual orientation, gender identity or expression (including transgender status), national origin, veteran or military status, marital status, genetic information, physical or mental disability, familial status, reproductive health decisions, status as a victim of domestic violence, or any other basis protected by applicable laws and regulations.
- Further, the company takes affirmative action to ensure that all applicants and employees are treated without regard to any of these characteristics during the application process and/or employment.
- Discrimination of any type will not be tolerated.
Compensation
- USD 130,000 - USD 140,000 yearly
Staff Devops Engineer II (DBOps)
Housecall Pro
Mission control for your business - Housecall Pro is a digital tool that lets you run and grow your business on the go.
Role Description
As a Staff Devops Engineer II (DBOps), you are an expert communicator who bridges the gap between schema design, site reliability engineering, and infrastructure automation. You possess deep relational database expertise and are comfortable collaborating cross-functionally across engineering teams to optimize query performance, architecture, and system scalability. You take a high level of ownership over database health, proactively addressing scaling bottlenecks, sharding, and replication topologies before they impact production. You bring an SRE mindset to continuously improve the observability, efficiency, and disaster recovery posture of core data systems. Our team is passionate, empathetic, hardworking, and above all else focused on improving the lives of our service professionals (our Pros). Our success is their success.
Compensation: 10,000 - 10,700 USD per month (B2B)
What You'll Do Each Day
- Own the observability, monitoring, and alerting of production database clusters and replication topologies
- Optimize database engine configurations for performance, stability, compliance, and security
- Design scalable database schemas and storage architectures to support high-growth application workloads
- Implement advanced data scaling strategies including sharding, partitioning, and high-availability clustering
- Triage critical production database incidents and establish long-term preventative engineering solutions
- Lead cross-functional initiatives for disaster recovery testing, data classification, and major engine upgrades
- Build cloud infrastructure using Infrastructure as Code to automate deployment pipelines and environments
- Collaborate with software development teams on complex migrations, schema reviews, and query optimization
- Participate in systemic architecture reviews to provide authoritative database engineering expertise
- Automate operational tasks and third-party API integrations using modern scripting languages
Qualifications
- 5+ years of hands-on experience designing, implementing, and operating relational databases in production, plus 3+ years in DevOps, SRE, or software engineering
- Production-scale expertise with relational technologies (e.g., MySQL, PostgreSQL) including backups, replication topologies, indexing, and partitioning
- Advanced knowledge of database architecture, data modeling, query optimization, and high-availability / backup & recovery solutions
- Extensive experience building cloud infrastructure using Infrastructure as Code (e.g., Terraform, CloudFormation, Pulumi, CDK) within public cloud environments (e.g., AWS)
- Proficiency in programming languages (e.g., Python, Golang) to drive automation and tool integration
- Strong operational familiarity with Linux operating systems, containerization (e.g., Docker), and container orchestrators (e.g., Kubernetes, ECS)
- Hands-on experience with continuous integration and deployment pipelines (e.g., GitLab, TravisCI, CircleCI, Jenkins)
- Demonstrated experience deploying monitoring and observability platforms (e.g., Datadog, New Relic)
- Demonstrated ability to leverage AI tools to improve workflows, streamline execution, or enhance outputs
- Bachelor's degree in Computer Science, Information Technology, or equivalent work experience
What Will Help You Succeed
- Exceptional breadth of interest shown through tangible, self-initiated ventures or deep community involvement; you love trying new things and may possess a demonstrated history of successfully pivoting or starting over in life and work
- Strong systems thinking and the ability to balance long-term technical objectives with immediate operational priorities
- Deep technical comfort explaining complex architectural trade-offs, cardinality, data types, and engine execution behaviors to other developers
- A genuine passion for data-driven decision-making and troubleshooting distributed architectures
- Familiarity with data streaming or ETL architectures (e.g., Kafka, Confluent Cloud)
Benefits
- Paid holidays and flexible, take-it-as-you-need-it scheduled time off
- A culture built on innovation that values big ideas, no matter where they come from
- A MacBook set up and ready from day one, plus a $500 stipend to design your ideal workspace
- Central European Time (CET) hours to support a balanced schedule for our Poland-based team
- Equity in a rapidly growing startup backed by top-tier VCs
- Design, implement, and automate large-scale distributed systems
- Build tools and automation that help Five9 improve availability, scalability, latency, and efficiency
- Work with Engineering teams to deliver high-quality software in a fast-paced environment
- Monitor production and development environments to build preventive measures and provide a seamless customer experience
- Work with delivery teams on software improvements to achieve higher availability and lower MTTD
- Participate in the on-call rotation (8-hour shifts, 7 days a week, every 3-4 weeks)
- Work closely with developers to prototype and design new infrastructure features
- Deploy, install, configure, and maintain sophisticated trading/finance and related software
- Configure bare-metal instances using Infrastructure as Code
- Build and maintain CI/CD pipelines
- Make key decisions regarding scalability, reliability, and availability
- Install and manage in-house and third-party monitoring systems
- Design, deploy, and configure cloud-based servers and networks; provision servers and storage; configure firewalls, VPNs, monitoring, etc.
- Administer UNIX infrastructure: installation, configuration, and maintenance
- Manage Nexus and Git repositories




