Job Closed
This listing is no longer active.
Powerful edge and cloud solutions for media business and the entertainment industry
DevOps Engineer, Cloud AIaaS
Location
Cyprus
Posted
76 days ago
Salary
Not specified
Seniority
Senior
Job Description
Gcore
- Design, develop, and maintain infrastructure for AI inference workloads, including GPU scheduling, model deployment pipelines, and data access patterns in on-prem environments
- Build and manage monitoring and observability tools for AI inference platforms, including dashboards, alerts, and runbooks for model health and system performance
- Collaborate with ML engineers and platform teams to design system architecture for AI workloads, integrate inference runtimes, and test performance at scale
Job Requirements
- Hands-on experience with containerization and container orchestration: Kubernetes, Helm, Docker/CRI-O
- Linux and networking
- Programming and Scripting: Python/Go/Bash
- Infrastructure as Code (IaC) approach: Ansible, Terraform
- Creating CI/CD pipelines: GitLab/GitHub Actions
- Experience with Cluster API or any other "Kubeception" technology
- Deep experience with Kubernetes CNI, CSI, and Operators
- Nice to have: knowledge of Kubernetes-related technologies such as ArgoCD and Helmfile
- Experience with Prometheus stack
- Experience with other cloud-native technologies
Benefits
- Competitive compensation
- Flexible working hours and hybrid or remote options, depending on your role
- Work from anywhere in the world for up to 45 days per year
- Private medical insurance for you and your family*
- Extra paid vacation and sick leave days*
- Support for life’s important moments and celebrations
- Language courses to help you connect and grow
- Modern, welcoming offices with snacks, drinks, and entertainment*
- Team sports and social activities*
*Benefits may vary depending on your location.
More DevOps Engineer Jobs
Senior DevOps Engineer – Cybersecurity Platform
Sigma Software Group
We support enterprises, product houses, and startups with custom software solutions development and IT consulting.
- Architect, scale, and maintain self-managed Redis, Kafka, Elasticsearch/OpenSearch, and MongoDB clusters
- Collaborate with Product and Engineering teams to design resilient architectures for high-scale, real-time cloud operations
- Proactively identify bottlenecks and security risks, implementing robust solutions with minimal supervision
- Ensure high availability and disaster recovery for distributed systems through Infrastructure-as-Code and CI/CD best practices
- Optimize performance and reliability of cloud-native systems, focusing on observability and automated recovery processes
- Mentor team members and contribute to DevOps knowledge-sharing across the organization
SRE, Site Reliability Engineer
Arthur Grand Technologies
Arthur Grand Technologies is focused on Digital Transformation initiatives for our federal and commercial customers.
- Looking for a strong SRE professional for a remote position
- Deep expertise in reliability engineering, automation, and cloud platforms


