Job Closed
This listing is no longer active.
Promote efficient software delivery along with digital sobriety: self-service portal, orchestration, FinOps, GreenOps
Senior DevOps Engineer
Location
Germany
Posted
73 days ago
Salary
0
Seniority
Senior
Job Description
Cycloid - Sustainable Platform Engineering
- Work with the product team on user story and UX/UI validation
- Participate in backend development in Go
- Contribute to Open Source software (Ansible, Concourse, Terraform, etc.)
- Design, build, migrate, maintain and run the platform in a multi-cloud environment (AWS, GCP, Azure)
- Collaborate with developers to fix problems and provide long-term solutions
- Support the presales/sales team as a solution architect
- Define goals, agenda and priorities
- Share ideas and knowledge with others
Job Requirements
- OS: Linux
- Automation: Terraform, Ansible
- Containers: Docker/Kubernetes
- Monitoring: Prometheus, Grafana
- DB: MySQL, Redis, Elasticsearch
- Development/scripting: Bash, Python, Golang
- Cloud Providers: AWS and/or GCP and/or Azure
- Design and maintain infrastructures
- English proficiency
- Experience with CI/CD tools
- Cloud Provider Certifications
- Dev expertise in Go
- Open Source contribution
- French speaker
Benefits
- A high degree of autonomy
- A collaborative environment where your opinion matters
- 100% remote work: live and work from home across Europe
- Home office setup up to €1200 (including device)
- Choose-your-own-device policy
- 1 retreat per year somewhere in the EU
More DevOps Engineer Jobs
SRE, DevOps Engineer
Kraken Digital Asset Exchange
We put the power in your hands to buy, sell, and trade digital currency 🌏
- Build and support infrastructure and tools, on-prem and in the cloud
- Drive standardisation: author RFCs and internal guides covering process improvements, reliability patterns, and best practices
- Support and guide engineers on SRE-related topics
- Partner with product-development teams to identify and eliminate friction
- Collaborate closely with architects, developers, QA, and security teams to ensure smooth and reliable environment operations
- Work in close partnership with the platform team, based on shared ownership, knowledge exchange, and mutual support
- Own and operate containerized application platforms based on Docker and Kubernetes, ensuring reliability, scalability, and operational excellence
- Design and deliver dynamic test environments at scale, including multiple parallel, per-merge-request (branch-based) deployments
- Build, maintain, and standardize CI/CD pipelines by creating reusable templates and components in GitLab CI
- Drive deployment automation and GitOps practices
- Identify operational bottlenecks and implement automation to reduce manual effort and improve delivery speed
- Embed security-by-design across the SDLC, including pipeline hardening and automated security checks
- Build and operate observability platforms: monitoring, logging, and diagnostics (Prometheus, Grafana, ELK/EFK/Loki, etc.)
- Participate in on-call and incident response, including troubleshooting, root-cause analysis, and post-mortems
- Take end-to-end ownership of the solutions you build (“you build it, you run it”)
Customer Reliability Engineer III
GitHub, Inc.
GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub. Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are, because we know that people flourish when they can work on their own terms. Join us, and let’s change the world, together.
About GitHub
GitHub is the world’s leading platform for agentic software development, powered by Copilot to build, scale, and deliver secure software. Over 180 million developers, including more than 90% of the Fortune 100 companies, use GitHub to collaborate, and more than 77,000 organisations have adopted GitHub Copilot.
Locations
In this role you can work from Remote, United Kingdom
Overview
GitHub is growing its Customer Success & Support team and we're seeking experienced professionals to elevate our technical customer support efforts. As a Customer Reliability Engineer III, you will efficiently manage and resolve customer issues and act as a liaison between customers and the engineering team. The ideal candidate will drive transformative customer experiences, ensuring long-term satisfaction and loyalty while fostering innovation and collaboration across teams. This role may require working non-standard hours, including on-call weekends and holidays as part of a team-wide rotation.
Responsibilities
- Work with assigned customers via support tickets and/or real-time interaction (phone/screen sharing) to solve technical issues related to their usage of GitHub products, often involving Linux servers, source code, and web application issues.
- Act as a single point of contact for technical issues, with the ability to troubleshoot and resolve complex issues independently.
- Collaborate with the Support and Engineering teams to resolve product issues requiring code changes.
- Lead incident response for outages affecting assigned customers, followed by delivery of postmortem reports.
- Act as a single point of contact for specific enterprise customers to provide performance and best-practice advice and assessment related to GitHub and the customer's infrastructure.
- Understand and maintain documentation around the customer infrastructure, workflows, and configuration of the GitHub Enterprise Server or GitHub Enterprise Cloud environment.
- Coordinate and collaborate with other teams at GitHub when additional expertise is needed to resolve customer issues.
- Manage customer incidents and outages, including joining Zoom/screen share sessions for live triage.
- Perform incident postmortems, ticket analysis, and system health checks for Premium Support customers as needed.
- Lead quarterly business reviews for the assigned accounts by presenting metrics, data, and health check summary and recommendations.
- Organize and lead weekly/bi-weekly touchpoints with assigned accounts to review ongoing Support issues and projects.
- Work proactively with customers on activities such as coordinating upgrades and ensuring their installation is running smoothly.
- Set up and onboard new assigned customers into the program.
- Provide weekend on-call support as part of the team rotation (8 hour shifts, during normal work hours).
- Update and maintain various repositories, including team and public documentation, and actively contribute to cross-organization strategy discussions.
- Ensure that systems and processes comply with security standards and regulations, implementing best practices to protect customer data and maintain system integrity.
Qualifications
Required Qualifications:
- 5+ years' experience in technical customer support, technical writing, system administration, or related roles,
- OR bachelor's degree in computer science or related field AND 3+ years' experience in technical customer support, technical writing, system administration, or related roles,
- OR equivalent experience.
Preferred Qualifications:
- Experience leveraging AI tools and technologies to enhance business processes and drive innovation.
- Deep knowledge of Git, Git LFS, GitHub Administrator, and GitHub
- Worked closely with large complex customer accounts in a technical capacity
- Experience with production-level virtualization platform(s) and/or cloud provider(s) (e.g., VMware ESX, KVM, AWS, Azure)
- Proficiency with and/or ability to understand and update code and scripts (e.g., Shell, Ruby, Go)
- Proficiency in common applications in the web application stack (e.g., HAProxy, Nginx, MySQL, and Unicorn)
GitHub values
- Customer-obsessed
- Ship to learn
- Growth mindset
- Own the outcome
- Better together
- Diverse and inclusive
Manager fundamentals
- Model
- Coach
- Care
Leadership principles
- Create clarity
- Generate energy
- Deliver success
Equal Employment Opportunity
GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!
- Ensure the reliability, security, and performance of backend systems.
- Establish best practices in DevOps, observability, and CI/CD.
- Work closely with development teams to deploy code, automate processes, and maintain high-availability distributed systems.
- Design and implement scalable infrastructure using Terraform, Kubernetes, and containerized environments.
- Develop monitoring, logging, and alerting solutions to maintain system health and minimize downtime.
- Build and optimize CI/CD pipelines, ensuring efficient deployment of backend services and smart contracts.
- Identify and resolve bottlenecks in distributed systems to improve scalability and efficiency.
- Implement and enforce security policies, including key management, access controls, and network security.




