Ubuntu is a community-developed, Linux-based operating system that is published and commercially supported by software development firm Canonical. Like Canonica
Senior Site Reliability Engineer, GitOps
Location
Worldwide
Posted
69 days ago
Salary
0
Seniority
Senior
Job Description
Senior Site Reliability Engineer, GitOps
Canonical
• Drive the development of automation, Gitops in your team as an embedded tech lead • Closely collaborate with the IS architect to align your solutions with the IS architecture vision • Design and architect services that IS can offer to the organization as products • Apply your experience of IaC to develop infrastructure as code practice within IS by constantly increasing automation and improving IaC processes • Automate software operations for re-usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems • Maintain operational responsibility for all of Canonical’s core services, networks, and infrastructure • Develop skills in troubleshooting, capacity planning, and performance investigation, Setting up, maintaining and using observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain monitoring and alerting for various systems and services • Provide assistance and work with globally distributed engineering, operations, and support peers • Be given uninterrupted development time to focus on larger projects and automation of manual tasks • Share your experience, know-how and best practices with other team members in design sessions, mentorship and ‘doing work together’ • Carry final responsibility for time-critical escalations
Job Requirements
- A modern view on hosting architecture, driven by infrastructure as code across both private and public clouds.
- A product mindset thriving to develop products rather than solutions.
- Python software development experience, with large projects
- Experience working with Kubernetes or other container orchestration systems.
- Proven exposure to manage and deploy cloud infrastructure with code.
- Practical knowledge of Linux networking, routing, and firewalls
- Affinity with various forms of Linux storage, from Ceph to Databases
- Hands-on experience administering enterprise Linux servers
- Extensive knowledge of cloud computing concepts and technologies
- Bachelor's degree or greater, preferably in computer science or related engineering field
- Able to communicate clearly and effectively in English over email, chat, video or voice calls and in-person
- Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate
- A willingness to be flexible and able to learn new things quickly
- Be inspired by the needs of fast-changing environments
- Happy to work within distributed teams
- Be passionate and familiarized about open-source, especially Ubuntu or Debian
Benefits
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Team Member Assistance Program & Wellness Platform
- Opportunity to travel to new locations to meet colleagues
- Priority Pass and travel upgrades for long-haul company events
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Director of DevOps
PortProMeet PortPro's drayOS 2. The latest and most premier operating system for drayage carriers.
• Define and execute the company's DevOps roadmap, ensuring alignment with business and engineering goals. • Lead and mentor a team of DevOps engineers, SREs, and automation specialists. • Drive a culture of automation, continuous improvement, and collaboration across development and operations teams. • Design, implement, and manage scalable, secure, and cost-effective cloud infrastructure (AWS, Azure, GCP). • Oversee containerization and orchestration (Docker, Kubernetes, Helm). • Establish infrastructure-as-code (IaC) best practices using tools like Terraform, CloudFormation, or Pulumi. • Develop and maintain CI/CD pipelines to support rapid, reliable software releases (GitHub Actions, Jenkins, GitLab CI, ArgoCD). • Drive automation for deployment, monitoring, and self-healing systems. • Implement observability and logging solutions (ELK, Prometheus, Grafana, Datadog). • Ensure DevSecOps principles are integrated into the software development lifecycle. • Maintain compliance with industry standards (SOC 2, ISO 27001, GDPR etc.).
Senior Site Reliability Engineer
CertifIDCertifID provides identity protection services to help prevent wire fraud. Focused on securing digital financial transactions, the company strives to reduce the financial and emoti
• Own and improve the reliability, availability, and performance of production systems while defining and operationalizing SLIs/SLOs and error budgets. • Design and implement autonomous and semi-autonomous AI agents for monitoring distributed systems and applications. Build agents capable of consuming multi-source observability data (metrics, logs, traces, etc.). • Participate in and help lead an on-call rotation, serving as an escalation point for major incidents and facilitating blameless postmortems. • Build automated workflows to eliminate manual work and design/maintain Infrastructure-as-Code with Terraform. • Improve metrics, logs, traces, and alerting using tools like Datadog or Prometheus to reduce noise and increase signal. • Partner with application teams to implement reliability best practices and mentor junior engineers to foster a culture of knowledge sharing.
Senior DevOps Engineer – Lead Role
Bikeleasing-Service DeutschlandDienstrad-Leasing leicht gemacht | Beste Konditionen für Arbeitgeber, Arbeitnehmer & Selbstständige
• Responsible for the cloud infrastructure (AWS, Azure, Terraform) • Ensure reliable deployments through the CI/CD landscape (GitHub Actions, ArgoCD) • Develop self-service capabilities together with the Enabler team • Define clear SLIs and SLOs for critical services • Responsible for technical security and compliance posture • Lead the DevOps team and establish processes
DevOps Engineer
Group 1001We are a financial services enterprise creating useful and intuitive solutions and products for everyone.
• Design, implement, and optimize CICD pipelines to automate software delivery processes. • Manage and maintain our cloud infrastructure on AWS, including ECS, RDS, DocumentDB, Redis, etc. • Convert cloud infrastructure to code using Terraform. • Implement and enforce cloud security measures and best practices. • Monitor system performance, troubleshoot issues, and ensure high availability and scalability. • Collaborate with development teams to streamline deployment processes and improve system reliability. • Implement measures to optimize cloud costs and resource utilization. • Stay up-to-date with the latest cloud technologies and best practices.



