Job Closed
This listing is no longer active.
Custom-Built Software Engineering Teams
Lead or Middle or Senior DevOps Engineer
Location
Portugal + 7 more
All locations: Portugal | Poland | Egypt | Georgia | Croatia | Serbia | Armenia | Kazakhstan
Posted
64 days ago
Seniority
Lead
Job Description
Lead or Middle or Senior DevOps Engineer
Akvelon, Inc.
Role Description
We are looking for a Lead, Middle, or Senior DevOps Engineer to join a research infrastructure team building an on-demand GPU platform for advanced compute workflows. The role focuses on enabling secure, scalable, and user-friendly access to high-performance GPU resources through automation, scheduling, and modern platform tooling.

Locations: Serbia, Georgia, Armenia, Kazakhstan, Poland, Croatia, Portugal, Egypt.

Tasks
- Build and improve an on-demand GPU workstation platform with lightweight containerization or virtualization;
- Implement scheduling, reservation, registration, image management, storage mounting, SSH with SSO, and developer-friendly access flows;
- Automate cluster namespace configuration across CPU, GPU, memory, and storage allocations;
- Support hierarchical capacity allocation models with RBAC-based administration;
- Automate storage import, export, and archival workflows as allocations change;
- Build monitoring, alerts, and automated incident ticket creation for large-scale cluster environments;
- Improve integrations between source control, CI/CD, package distribution, and GPU-connected development workflows;
- Contribute automation, scripts, and agentic tooling that improve infrastructure and day-to-day research workflows.

Requirements
- Strong hands-on experience with Kubernetes and platform orchestration;
- Solid understanding of scheduling, reservation, or namespace-based resource management systems;
- Experience with GPU infrastructure, virtualization, slicing, or containerized workstation environments;
- Strong scripting and automation skills;
- Practical Azure experience and familiarity with secure infrastructure operations.
Nice to Have
- Experience with Prometheus, Grafana, incident automation, or on-call paging workflows;
- Experience with developer platforms, devcontainers, or remote development tooling such as VS Code integrations;
- Exposure to AI-assisted monitoring, trend analysis, or agentic infrastructure tooling.

Engagement Type
- B2B contract.

Location / Timezone
- Remote work from Serbia, Georgia, Armenia, Kazakhstan, Poland, Croatia, Portugal, Egypt.
- European working hours.
- Occasionally available for meetings up to 10:00 AM PST (US overlap).



