Job Closed
This listing is no longer active.
Infios, a global supply chain software company headquartered in Bad Nauheim, Hesse, Germany, was formed in 2025 through the rebranding of Körber Supply Chain S
Senior AI Platform Engineer
Location
California
Posted
4 days ago
Salary
$170K - $190K / year
Seniority
Senior
Job Description
Senior AI Platform Engineer
Infios
• Define AI feature specifications upfront — including acceptance criteria, evaluation metrics, prompt contracts, and expected behaviors — and champion this spec-driven approach across the team. • Own end-to-end AI feature delivery across the full AI SDLC: spec definition, prototyping, development, evaluation, deployment, and production monitoring. • Build production-grade LLM and agentic AI applications using Spring AI — including RAG pipelines, agent orchestration, tool-use patterns, guardrails, and human-in-the-loop workflows. • Architect and operate AWS AI infrastructure (Bedrock, Bedrock Agents, Agent Core, SageMaker) alongside core AWS services (ECS/EKS, Lambda, S3, DynamoDB, RDS, API Gateway). • Design and implement scalable microservices and distributed systems in Java, TypeScript, and Python that power the Archer AI platform. • Build CI/CD pipelines for AI workloads — including LLM evaluation pipelines and automated regression testing for AI outputs — using Terraform, CloudFormation, Docker, Kubernetes, and GitHub Actions. • Drive AI-specific operational practices: observability, drift detection, quality scoring, feedback loops, and incident response for non-deterministic systems. • Communicate technical concepts clearly to both technical and non-technical stakeholders; author AI specs, design documents, and architectural decision records. • Mentor engineers, conduct thorough code reviews, and champion engineering excellence.
Job Requirements
- Spec-Driven AI SDLC: Deep expertise in the AI software development lifecycle with a specification-first mindset.
- Experience authoring AI feature specs (acceptance criteria, evaluation metrics, prompt contracts) and driving the full lifecycle from prototyping through evaluation frameworks, A/B testing, deployment of non-deterministic systems, and production monitoring (drift detection, quality scoring, feedback loops).
- Track record of shipping AI-powered features through multiple product cycles with engineering rigor.
- AWS AI Infrastructure: Strong hands-on experience with Amazon Bedrock, Bedrock Agents, Agent Core, SageMaker, and Amazon Q.
- Solid knowledge of core AWS infrastructure including compute (ECS/EKS, Lambda), databases (RDS, DynamoDB, ElastiCache), networking (VPC, ALB, CloudFront), and security (IAM, KMS, Secrets Manager).
- Experience architecting AI infrastructure pipelines with cost optimization and high availability.
- LLM Frameworks & Agentic AI: Hands-on experience building production applications with Spring AI.
- Solid understanding of LLM application patterns (prompt management, RAG, context orchestration, vector stores, evaluation) and agentic workflows (multi-step agents, tool-use orchestration, planning loops).
- Java, TypeScript & Python: 5+ years of professional software engineering with strong proficiency across all three languages — Java (Spring Boot, Spring Cloud), TypeScript (Node.js, modern frameworks), and Python (AI tooling, evaluation frameworks).
- Comfortable choosing the right language for each task.
- Enterprise & Large-Scale Systems: Experience designing and operating distributed systems at scale.
- Familiarity with event-driven architectures, message brokers (Kafka, SQS/SNS), caching (Redis, ElastiCache), and relational/NoSQL database design.
- DevOps & Infrastructure: Proficiency in CI/CD pipelines, Infrastructure as Code (Terraform, CloudFormation), containerization (Docker, Kubernetes/EKS), and GitOps workflows.
- Problem Solving & Communication: Excellent analytical skills and the ability to tackle complex, ambiguous challenges independently. Outstanding written and verbal communication — able to articulate technical concepts to diverse audiences and collaborate effectively across teams.
- Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field (or equivalent practical experience).
Benefits
- Competitive Medical, Dental, and Vision insurance
- 401K matching program
- Flexible Time Off
- 11 paid holidays
- 1 volunteer day per year
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
• Lead the architecture and design of complex, large-scale distributed systems • Design, develop, and maintain scalable, fault-tolerant microservices for both real-time and batch data processing • Build and maintain high-performance REST APIs with a focus on scalability, reliability, and maintainability • Design and implement data models and queries for efficient storage and retrieval across SQL and NoSQL databases • Lead data quality initiatives to ensure consistency, accuracy, and reliability of data across the platform • Collaborate with cross-functional teams to translate platform requirements into effective technical solutions • Drive technical standards and best practices for code quality, testing, and observability across engineering teams • Mentor and provide technical guidance to engineers through code reviews, design discussions, and direct support • Take end-to-end ownership of the software development lifecycle, including deployment, monitoring, and incident response
• Lead the design, development, integration, deployment, and ongoing support of the organization's AI platform and related engineering capabilities • Define and implement platform architecture, engineering standards, and development practices • Partner with product, data, infrastructure, security, and business teams to align platform capabilities with enterprise needs • Oversee engineering teams in building reusable platform services, tools, and components • Drive modernization and integration efforts to connect existing systems and processes with AI platform capabilities • Monitor platform performance and operational health • Establish effective operating models, support processes, and governance practices • Communicate platform priorities, delivery progress, technical risks, and investment needs to senior leaders
Executive Director, AI Platform Engineering Lead
NovartisNovartis is a leading global pharmaceutical and healthcare research and solutions company dedicated to improving patient lives by uncovering solutions to curren
• Define and lead the engineering strategy, roadmap, and operating model for the organization’s agentic AI platform. • Oversee the design, development, integration, deployment, and ongoing operation of platform capabilities. • Establish platform engineering standards, architectural principles, and development practices. • Lead cross functional engineering teams in building reusable platform services, tools, and components. • Partner with product, data, infrastructure, security, and business teams. • Drive platform performance, operational excellence, and continuous improvement. • Guide the modernization of existing systems and processes. • Communicate platform strategy, technical direction, delivery progress, risks, and investment needs to senior executives and key stakeholders.
Platform Engineer II
First AmericanFirst American is on a mission to deliver a variety of real estate-focused services and solutions. As an employer, First American has been recognized for its ex
• Administer Oracle E-Business Suite R12.2.x, including installation, cloning, upgrades, online patching, and lifecycle management. • Manage Oracle Database 19c RAC environments, including backup and recovery, high availability, disaster recovery, and performance tuning. • Support Oracle Fusion Middleware and WebLogic administration, deployments, monitoring, and performance tuning. • Execute database, EBS technology stack, middleware, and security patching, including Linux administration and vulnerability remediation for Oracle application and database servers. • Support cloud or hybrid environments across OCI, AWS, or Azure, including automation, monitoring, and production operations. • Use approved AI tools to explore enterprise use cases and improve productivity, automation, documentation, and service delivery.



