DevOps/Platform Engineer
Location
Worldwide
Posted
52 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
DevOps/Platform Engineer
Comma Soft AG
Role Description Als „DevOps/Platform Engineer (m/w/d)“ stellst du für unseren KI-Plattform Alan eine sichere, skalierbare, beobachtbare Plattform bereit und etablierst das Prinzip „You build it, you run it“ im Team. Du unterstützt die produktiven Teams auf „paved paths“ (Self-Service, Guardrails) und sorgst für vorhersehbare Performance und Kosten. - Ownership für zentrale Plattform-/Serving-Komponenten übernehmen - K8s-Cluster, Networking (Ingress), Storage (Datenbanken, Snapshots) und OS/Kernel-Patching betreiben und deren sicheren und stabilen Betrieb sicherstellen - Multi-Cloud-Ressourcen (insb. Open Telekom Cloud) per Konsole und IaC (Terraform) modellieren - CI/CD-Pipelines und Release-/Versionierungs-/Rollback-Strategien aufbauen - OpenTelemetry-basiertes Tracing, Metrics und Logs im Bereich Observability & Site Reliability Engineering implementieren, SLIs/SLOs, Alerting und Error Budgets definieren - Plattform für Model Serving gemeinsam mit unseren AI Engineers bereitstellen: GPU-Scheduling, Autoscaling, Inference-Gateways, Observability (Latency/QPS/Token-Kosten) Qualifications - Masterstudium oder Promotion in einem der MINT-Fächer oder einem geisteswissenschaftlichen Fach mit MINT-Vertiefung erfolgreich abgeschlossen - Mindestens 2 Jahre relevante Berufserfahrung in den Bereichen DevOps, Site Reliability Engineering oder Platform Engineering - Nachweisliche Verantwortung für Kubernetes, IaC, CI/CD, Observability sowie den produktiven Betrieb – idealerweise im SaaS-Umfeld - Praxis-Know-how in Git-basierten Deployments, modularer IaC, Secret-/Config-Management sowie Incident-Erfahrung - Security-Fachwissen in Netzwerksicherheit, Secrets, Härtung (CIS), Software-Supply-Chain und Zugriffsprinzipien (Least Privilege) - Idealerweise erste Praxiserfahrung im Betrieb von Inferenz-Workloads (vLLM o. ä.), GPU-Capacity-Management, Autoscaling und Observability - Neugier und Wissbegierde sowie ausgeprägte Problemlösungs- und Kommunikationsfähigkeit - Überzeugende und effiziente Kommunikation in deutscher und englischer Sprache Benefits - Arbeiten an einer hochmodernen, skalierbaren AI-Plattform mit viel Gestaltungsspielraum - Früh Verantwortung für zentrale Infrastruktur- und Architekturentscheidungen übernehmen - Fachlicher Austausch auf Augenhöhe mit zukünftigen Kolleg:innen - Budget und Zeit für eigene Innovationsprojekte - Fachliche und persönliche Weiterentwicklung durch speziell abgestimmte Weiterbildungen, Zertifizierungen und Laufbahnprogramme - Schwerpunktsetzung und Ausbau in Spezialgebieten - Attraktives Fixgehalt zzgl. Umsatz- und Ergebnisbeteiligung - Überstunden ausgleichen und Reisezeiten als Arbeitszeit buchen - Freie Wahl des Arbeitsorts und flexible Arbeitszeit - Top ausgestatteter Arbeitsplatz, JobRad, Body & Mind Workout, GamesNights, Grillen auf der Dachterrasse - Team-Aktionen mit unternehmungslustigen Kolleg:innen, Sommerfeste mit Familienmitgliedern und viele weitere Benefits
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior ML Ops/Devops - Hybrid (She/He/They)
CapcoCapco, a Wipro company, is a management & technology consultancy dedicated to the financial services & energy industries
CAPCO POLAND *We are looking for Poland based candidate. Hybrid work. Warsaw is preferred. At Capco Poland, we’re not just another consultancy - we’re the spark behind digital transformation in the financial world. As a global leader in technology and management consulting, we thrive on helping clients tackle the toughest challenges across banking, payments, capital markets, wealth, and asset management. Our secret? A culture that’s fast, flexible, and fiercely entrepreneurial. We move quickly, think creatively, and always put our people first. We’re passionate about growth - both for our clients and ourselves - and that means attracting the very best talent to join us on this exciting journey. We’re proud to be: • Trailblazers in banking, payments, capital markets, wealth, and asset management • Champions of an agile, nimble, and innovative work environment • Dedicated to building a team of top-notch professionals who share our drive and vision THINGS YOU WILL DO Platform Engineering for ML - Design, build, and improve MLOps platform components that support the full model lifecycle (development à validation à deployment à monitoring). - Create reusable templates and standardized pipelines to reduce time-to-production and improve consistency across teams. Model deployment & release engineering - Implement robust deployment patterns for credit risk models (primarily batch; other patterns as required). - Build & maintain CI/CD pipelines using Jenkins and GitHub, with appropriate quality gates and traceability. - Automate environment configuration and repeatability using Ansible. Monitoring, observability & operational readiness - Implement model and pipeline monitoring covering operational health, data quality signals, and model performance/drift indicators. - Establish dashboards, alerting, and runbooks; partner with stakeholders to ensure alerts are actionable and aligned to business impact. - Drive continuous improvement through post-release reviews and reliability enhancements (no on-call requirement). Collaboration & stakeholder management - Work closely with credit risk modellers to productionise models built with tools such as TensorFlow, MLFlow, and similar. - Translate modelling needs into scalable engineering solutions, balancing pace with control expectations. - Mentor junior team members (nice-to-have) and contribute to shared engineering standards and documentation. SKILLS & EXPERIENCES YOU NEED TO GET THE JOB DONE - 5+ years’ experience across MLOps/DevOps/Platform Engineering, with a track record of delivering production-grade ML or data solutions. - Strong experience building CI/CD and automation using Jenkins and GitHub. - Strong experience with Airflow (Bash), Bash itself, and Groovy for pipeline automation. - Hands-on configuration automation using Ansible. - Strong coding/scripting capability in Python (including PySpark), plus working knowledge of Spark. - Experience with ML tooling such as MLFlow, TensorFlow, and similar, including model packaging and deployment considerations. - Proven ability to implement observability (metrics/logs/dashboards/alerting), with tooling flexibility (e.g., Grafana, Splunk, or similar). - Comfortable working in hybrid environments; experience with Hadoop and an ability to integrate with cloud services (preference for GCP). Nice to have: - Exposure to GCP services, especially Vertex AI and BigQuery. - Experience with secrets management (tooling flexible). - Familiarity with Model Risk Management, SDLC controls, and data lineage concepts in regulated environments. - Prior experience mentoring junior engineers and helping teams adopt standard patterns. ONLINE RECRUITMENT PROCESS STEPS - Screening call with the Recruiter - Capco Hiring Manager Interview - Client’s interview - Feedback/Offer We offer a flexible collaboration model based on a B2B contract, with the opportunity to work on diverse projects.
• Supporting developers through internal productivity platforms and responding to internal customer requests, leveraging team collaboration and AI tools where appropriate • Owning and maintaining developer productivity services such as GitHub, Jenkins, and Artifactory, ensuring reliability and scalability • Creating and managing Kubernetes operators, while troubleshooting infrastructure (network)‑related issues (it's always DNS ^^) • Automating recurring tasks for teams and developers using an AI‑first engineering approach, including building integrations with third‑party systems and APIs • Participating in the on‑call rotation for critical services, maintaining internal technical documentation, and continuously learning and experimenting to improve how software is built
Senior Java Developer
Poland and Eastern EuropeXebia is a global tech company with a journey in CEE that started with two Polish companies – PGS Software and GetInData. We are a team of 1,000+ experts delivering top-notch work across cloud, data, and software. We work on impactful projects across various sectors including fintech, e-commerce, aviation, logistics, media, and fashion, helping clients build scalable platforms and cutting-edge applications. Our clients include notable names like McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, and InPost.
Hello, let’s meet! Who We Are While Xebia is a global tech company, our journey in CEE started with two Polish companies – PGS Software, known for world-class cloud and software solutions, and GetInData, a pioneer in Big Data. Today, we’re a team of 1,000+ experts delivering top-notch work across cloud, data, and software. And we’re just getting started. What We Do We work on projects that matter – and that make a difference. From fintech and e-commerce to aviation, logistics, media, and fashion, we help our clients build scalable platforms, data and AI solutions, and cutting-edge applications to shape the future of tech. Our clients include McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, InPost, and many, many more. We value smart tech, real ownership, and continuous growth. We use modern, open-source stacks, and we’re proud to be trusted partners of Databricks, dbt, Snowflake, Azure, GCP, and AWS. Fun fact: we were the first AWS Premier Partner in Poland! Beyond Projects What makes Xebia special? Our community. We support tech communities, organize meetups (Software Talks, Data Tech Talks), and have a culture that actively support your growth via Guilds, Labs, and personal development budgets — for both tech and soft skills. It’s not just a job. It’s a place to grow. What sets us apart? Our mindset. Our vibe. Our people. And while that’s hard to capture in text – come visit us and see for yourself. You will be: - developing and maintaining backend applications with a strong focus on API development, - participating in the migration and upgrade of Java, - designing and implementing scalable and maintainable solutions using Java and Spring Boot, - working with messaging systems (e.g. Kafka) to support data processing and integration, - collaborating with the team to deliver high-quality solutions and improve existing systems, - analysing requirements and contributing to technical solutions with a focus on code quality and performance. Your profile: - 5+ years of professional experience in Java backend development, - strong experience with Java (8-21/25), - hands-on experience with Spring and Spring Boot, - experience with jOOQ or similar ORM tools, - experience with Java Server Faces (JSF) and PrimeFaces, - experience with Java version upgrades, - basic experience with Apache Kafka, - understanding of working with APIs and backend services, - practical experience using AI-powered assistants (e.g. Claude Code, GitHub Copilot, Cursor) to improve productivity, quality, or decision-making in software delivery, - strong analytical and critical thinking skills, - good command of English (min. B2). - Work from the European Union region and a work permit are required. Nice to have: - familiarity with Apache Strom, - experience with KSQL, - experience applying GenAI in a more structured way within the SDLC, including defined workflows, prompt patterns, or tool integrations embedded into daily work, - interest in and familiarity with emerging AI-driven practices (e.g. agent-based workflows, automation patterns, AI-augmented development), with a willingness to explore and experiment beyond standard approaches. Recruitment Process: CV review – HR call – Technical Interview – Decision
Administrador/a Cloud DevOps Openshift - Kubernetes Sr - Teletrabajo 100%
Logicalis SpainSomos Arquitectos Del Cambio, ayudamos a las organizaciones a tener éxito en un mundo cada vez más digitalizado.
En Logicalis Spain buscamos incorporar un/a Administrador/a Kubernetes/OpenShift Senior. La persona seleccionada se incorporará en nuestra SL de Cloud y participará en proyectos de gran relevancia tecnológica trabajando en un entorno estable y con posibilidades de crecimiento. Responsabilidades principales: - Administrar y mantener plataformas de contenedores (Kubernetes, OpenShift, AKS, EKS, etc.). - Operar entornos híbridos (cloud pública y on-premise). - Automatizar tareas de operación e infraestructura utilizando Ansible, Helm y Terraform. - Ejecutar despliegues, actualizaciones, backups y tareas de troubleshooting en entornos productivos. - Mantener y optimizar pipelines de CI/CD (Jenkins, ArgoCD, etc.). - Gestionar configuraciones y repositorios de código (Git). - Monitorizar el estado de los sistemas y garantizar su rendimiento y disponibilidad. - Colaborar con equipos de desarrollo y operaciones en entornos DevOps. - Asegurar el cumplimiento de buenas prácticas de operación, seguridad y documentación técnica. Requisitos: - Experiencia en administración de plataformas de contenedores (Kubernetes, OpenShift). - Experiencia en herramientas de integración y despliegue continuo (Jenkins, SonarQube, ArgoCD). - Manejo de sistemas de control de versiones (Git). - Experiencia en entornos híbridos (cloud y on-premise). - Experiencia en operación de entornos productivos y resolución de incidencias. Valorable: - Soluciones de backup en entornos de contenedores (Velero, Kasten). - Automatización mediante scripting (Bash, Python) y herramientas como Ansible o Terraform. - Conocimientos básicos de almacenamiento en contenedores (ODF, Rook Ceph, etc.). - Experiencia en entornos de nube pública: AWS, Azure o Google Cloud Platform (GCP). - Experiencia previa en servicios gestionados o entornos multicliente. Sobre Logicalis Spain: Logicalis es un grupo internacional con más de 20 años de experiencia en el sector IT. Estamos especializados en proyectos y servicios de alto nivel en Data Centers, Ciberseguridad y Analytics. ¿Qué ofrecemos en Logicalis? - Proyecto estable y de larga duración - Modalidad de trabajo 100% remoto - Jornada intensiva todos los viernes y durante verano. - Día libre en tu cumpleaños + día adicional de asuntos propios - Acceso a plan de retribución flexible (tarjeta restaurante, transporte, guardería…) - Seguro médico privado y acceso a Wellhub (Gympass) - Descuentos en formación, tecnología, viajes, retail… - Formación continua y certificaciones técnicas - Oportunidades reales de desarrollo profesional en un entorno internacional Si quieres formar parte de un equipo innovador y participar en proyectos estratégicos de alto impacto tecnológico, ¡te estamos esperando! 🚀


