Job Closed
This listing is no longer active.
Creating Digital Leaders. Digital Transformation Consultancy Services and Solutions
Senior AWS DevOps Engineer, SRE – AI
Location
Poland
Posted
111 days ago
Salary
zł22K - zł30K / month
Seniority
Senior
Job Description
Xebia
- Building and supporting the tools, processes and infrastructure empowering the faster delivery and scaling of software iterations
- Ensuring availability, reliability and scalability of application infrastructure
- Building and supporting continuous integration/delivery and release tools
- Ensuring the right metrics are collected and monitored
Job Requirements
- 5+ years of experience working with DevOps practices and Continuous Delivery
- Practical knowledge of AWS services, infrastructure and networking
- Solid experience with Kubernetes (ideally EKS on AWS) and container orchestration
- Python knowledge
- Experience with Claude Code
- Experience working with AI agents
- Experience with FastMCP or other MCP libraries
- Hands-on with GitOps practices, preferably with ArgoCD
- Strong skills in Terraform and Helm
- Proficiency in Bash and PowerShell scripting
- Experience with CI/CD pipelines and tooling (GitLab CI/CD, GitHub Actions, or similar)
- Experience with monitoring, observability, and logging tools, such as Prometheus, Grafana, AppDynamics, and OpenSearch
- Security awareness (OWASP, encryption, secrets management)
- Very communicative and collaborative, with a strong sense of ownership
- Upper intermediate/advanced English (B2/C1)
- Working from the EU and holding a valid EU work permit are required
Benefits
- Personal development budgets
- Flexible work arrangements
- Support for tech communities
- Meetups and events
More DevOps Engineer Jobs
Senior Site Reliability Engineer
Selector
Industry-leading AIOps platform for operational intelligence.
- Serve as a senior technical expert in deploying and maintaining Selector’s operational analytics platform across on-premises and SaaS environments.
- Lead complex customer installations, including deployments in air-gapped and highly regulated environments.
- Partner directly with customers via Zoom/Teams to troubleshoot, triage services, and resolve installation or performance nuances.
- Author, review, and maintain Infrastructure as Code (IaC) using Terraform/OpenTofu, ensuring scalable and maintainable infrastructure design.
- Deploy and manage containerized applications using Kubernetes (including RKE) and Kustomize in production environments.
- Triage and resolve issues across distributed systems, Kafka pipelines, CI/CD workflows (Jenkins), and Google Cloud infrastructure.
- Provide structured, actionable feedback to Platform Engineering and DevOps teams to improve reliability, scalability, and performance.
- Participate in and help mature on-call processes, ensuring high availability and operational excellence.
- Perform root cause analysis for production incidents and implement long-term corrective and preventative solutions.
- Research, evaluate, and implement new tools or architectural improvements to address infrastructure and operational challenges.
- Mentor junior engineers and promote SRE best practices across reliability, observability, and automation.
- Improve internal tooling, automation, and operational workflows to enhance developer productivity and system stability.

- Write and maintain Bash scripts to automate operational and deployment tasks
- Create, support and improve CI/CD pipelines for application delivery
- Help manage and monitor cloud environments (dev, test, production)
- Perform basic troubleshooting of infrastructure and deployment issues
- Work with Linux systems (services, processes, permissions, networking basics)
- Assist with monitoring, logging, and alerting solutions
- Document procedures, configurations, and runbooks
- Collaborate with developers and senior engineers to improve system reliability
- Assist in building and maintaining cloud infrastructure using Terraform and other IaC tools

- Design, manage, and operate compute, storage, network, and IAM across AWS and GCP using infrastructure as code (Terraform).
- Build and maintain secure networking and connectivity.
- Manage production Kubernetes platforms (EKS, GKE).
- Implement observability using Prometheus, Grafana, and, optionally, Datadog.




