M

MediaMarktSaturn Retail Group

Remote Jobs

1 open roleLatest: May 11, 2026, 12:00 AM UTC
Post Date
Minimum Salary
Experience

1 Jobs

Role Description - AI Platform Operations & Monitoring: Build and maintain comprehensive observability solutions for our AI Platform, including real-time monitoring, evaluation frameworks, and anomaly detection systems to ensure optimal performance and reliability. - FinOps & Cost Optimization: Implement and manage cost optimization strategies and resource management practices across our AI infrastructure, ensuring efficient budget allocation and transparent cost tracking for AI workloads. - Configuration Management: Design and maintain robust configuration management systems for prompt engineering workflows, policy configurations, and operational parameters that support scalable AI agent deployments. - Foundation Engineering: Contribute to the core engineering efforts that establish the foundational infrastructure for our AI Platform, focusing on reliability, scalability, and operational excellence. - Cross-functional Collaboration: Work closely with engineering teams to establish best practices for AI operations, providing guidance on monitoring strategies and cost-effective resource utilization. - Documentation & Knowledge Sharing: Create comprehensive documentation and operational runbooks to enable team members to effectively manage and troubleshoot AI platform components. Qualifications - Experience: 3-5+ years in Software Engineering, DevOps, or Platform Engineering with demonstrated experience in AI/ML operations and infrastructure management. - Technical Expertise: Strong proficiency in cloud platforms (preferably GCP), containerization technologies, and Infrastructure-as-Code tools. Experience with monitoring and observability platforms is essential. - AI/ML Operations: Practical knowledge of MLOps practices, model monitoring, and experience working with AI/ML frameworks and deployment pipelines. - Cost Management: Understanding of cloud cost optimization strategies and experience with FinOps practices in AI/ML environments. - Problem-Solving: Proven ability to design monitoring solutions for complex, non-deterministic systems with focus on proactive issue detection and resolution. - Communication: Excellent English skills with the ability to collaborate effectively in a remote team environment and translate technical concepts into actionable insights. Benefits - Location: Ingolstadt, München - Department: HQ - IT - Entrylevel: Professional Level - Type of Employment: Full Time - Working Hours: 37.5 - Persona: Job Requisition Tech Employee - Recruiter: Laura Schröder

Germany
Job Closed