Head of Platform & Infrastructure (AI SaaS)
Location
Germany
Posted
43 days ago
Salary
0
Seniority
Lead
Job Description
Head of Platform & Infrastructure (AI SaaS)
LDB Gruppe
Role Description Gestalte die technologische Grundlage einer skalierenden AI-SaaS-Plattform aktiv mit. Wir suchen eine erfahrene Persönlichkeit mit strategischem Blick auf Plattformarchitektur, Infrastruktur und technische Skalierung. In dieser Rolle verantwortest du die Weiterentwicklung unserer technischen Plattform- und Infrastrukturstrategie und stellst sicher, dass unsere Systeme den Anforderungen eines wachsenden AI-SaaS-Unternehmens nachhaltig gerecht werden. Du verbindest technologisches Verständnis mit wirtschaftlichem Denken und sorgst dafür, dass Skalierbarkeit, Stabilität, Security und Effizienz in einer modernen Plattformarchitektur zusammengeführt werden. Dabei arbeitest du eng mit Geschäftsführung, Produktmanagement und Engineering zusammen und gestaltest zentrale Infrastrukturentscheidungen aktiv mit. Aufgaben - Entwicklung und Weiterentwicklung der Zielarchitektur unserer AI- und Voice-basierten SaaS-Plattform - Definition und Umsetzung einer skalierbaren, stabilen und wirtschaftlich effizienten Infrastrukturstrategie - Sicherstellung der Skalierbarkeit bei steigender Nutzung und wachsender Systemlast - Verantwortung für Cloud-, Hosting- und Infrastrukturkonzepte auf Basis moderner Plattformtechnologien - Weiterentwicklung von Security-, Governance- und Compliance-Strukturen - Einführung und Optimierung von Observability-, Monitoring- und Performance-Standards - Optimierung der Infrastrukturkosten mit Blick auf Skalierung, Stabilität und Wirtschaftlichkeit - Aufbau klarer Infrastruktur-Roadmaps sowie Priorisierung technischer Initiativen - Steuerung externer Technologie- und Infrastrukturpartner - Enge Zusammenarbeit mit Engineering, Produktmanagement und Geschäftsführung bei Architektur- und Plattformentscheidungen - Sicherstellung klarer Entscheidungs- und Priorisierungsprozesse zur Vermeidung von Verzögerungen in Infrastruktur- und Plattforminitiativen Qualifications - Mehrjährige Erfahrung in technischen Architektur-, Plattform- oder Infrastrukturrollen - Erfahrung im Aufbau oder in der Weiterentwicklung skalierbarer SaaS-Plattformen - Sehr gutes Verständnis moderner Cloud-Infrastrukturen, idealerweise Azure, AWS oder GCP - Sehr gutes Verständnis von Linux-basierten Systemen und deren Betrieb in produktiven Cloud-Umgebungen - Erfahrung mit AI-/LLM-basierten Systemen oder datenintensiven Plattformumgebungen - Fundierte Kenntnisse in Infrastruktur-Architektur, Plattformdesign und DevOps-nahen Strukturen - Erfahrung mit Security-, Governance- und Compliance-Anforderungen in modernen Cloud-Umgebungen - Fähigkeit, technische Architekturentscheidungen unter Berücksichtigung von Skalierbarkeit, Kosten und Umsetzbarkeit unter Zeitdruck und mit klarer Priorisierung zu treffen - Erfahrung in der Zusammenarbeit mit Engineering-Teams sowie in der Abstimmung mit Business-Stakeholdern - Fähigkeit, komplexe technische Themen verständlich und adressatengerecht zu kommunizieren - Strukturierte, analytische und lösungsorientierte Arbeitsweise - Hohes Maß an Eigenverantwortung, Entscheidungsstärke und Umsetzungsorientierung Benefits - Eine strategisch wichtige Rolle mit direktem Einfluss auf die Weiterentwicklung unserer Plattform - Gestaltungsspielraum in einem dynamischen, technologiegetriebenen Umfeld - Direkte Zusammenarbeit mit der Geschäftsführung und kurzen Entscheidungswegen - Möglichkeit, zentrale Infrastruktur- und Architekturthemen nachhaltig mitzugestalten - Moderne AI- und SaaS-Technologien in einem wachstumsorientierten Umfeld - Hoher Impact auf Skalierbarkeit, Stabilität und Zukunftsfähigkeit unserer Plattform - Flexible und hybride Arbeitsweise mit deutschlandweitem Remote-Setup - Zusammenarbeit mit erfahrenen Produkt-, Engineering- und Technologie-Teams
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
• Develop, implement, and maintain governance frameworks across the Power Platform ecosystem. • Administer and support Power Platform environments, including Power Apps. • Design and deliver hands-on application solutions, including automation workflows and data architecture. • Maintain a comprehensive understanding of broader platform capabilities. • Design, build, and deploy scalable solutions using Power Pages, Power Apps, Power Automate, and Dataverse.
Platform Engineer, Windows Infrastructure
SimplesenseSimplesense, also stylized as SimpleSense, is on a mission to help “those who help others” by solving the issues surrounding information-sharing in emergencies. As an employer,
Title: Platform Engineer, Windows Infrastructure Job Description: Simplesense builds, deploys, and sustains the Installation Resilience Platform that enables mission operators to rapidly adapt and respond. The Platform protects critical infrastructure from cyber attack while unlocking previously siloed information to monitor, diagnose, and improve response times to incidents. Our adversaries rapidly adopt the latest technology: we help defense users respond in kind. Simplesense is a non-traditional defense contractor and prime on the Air Force's Installation Resilience Operations Command and Control (IROC) program, which is now expanding to five additional Air Force, Space Force, and Army installations from the one prototype installation, Tyndall Air Force Base. Our team combines over 100 years of direct mission experience solving hard problems with 50 years technical expertise deploying DevSecOps, cybersecurity, and cloud infrastructure, giving us a deep appreciation for our customers’ mission and end users’ priorities. We build for scale, architecting and prioritizing technical work for long term sustainability. Platform Engineer, Windows Infrastructure Location: Denver, CO (Hybrid), San Antonio, TX (Hybrid), Brooklyn, NY or Remote (US Based) About the Role: As a Platform Engineer, Windows Infrastructure, you will play a critical role in building and maintaining the Windows-based infrastructure that underpins Simplesense’s operational platform helping to secure critical infrastructure. You will support the implementation, automation, and ongoing improvement of Windows domain infrastructure and partner closely with the Platform, Security, and Operations teams to ensure our systems remain resilient, secure, and mission-ready. Work Model: We prioritize candidates in the Denver, CO, San Antonio, TX, and Brooklyn, NY area, but are open to remote talent. - Locals: 2 days/week onsite. - Remote: Quarterly travel for team meetings. What Success Looks Like: - 30 Days: Onboard into the platform environment, gain familiarity with Windows infrastructure, IaC workflows, and internal tools. - 60 Days: Develop and deploy Windows automation scripts and infrastructure updates and begin contributing to tasks following the development workflow. - 90 Days: Take ownership of small- to medium-sized infrastructure tasks, contribute to system reliability and automation efforts, and collaborate cross-functionally to support operations and platform enhancements. What You’ll Do - Implement Infrastructure-as-Code solutions: Deploy features, fixes, and updates to Windows infrastructure using infrastructure-as-code (IaC) practices, ensuring reliability, performance, and security. - Automate Windows environments and processes: Develop, test, and maintain automations for system configuration, deployments, and operational workflows, including CI/CD pipelines. - Collaborate across teams to improve systems: Partner with Platform, Security, and Operations teams to troubleshoot issues, integrate new assets, and continuously improve system performance and resilience. - Contribute to technical delivery and growth: Actively participate in code reviews, follow engineering best practices, and contribute to solving technical challenges while continuing to build your skillset. What You Bring Required Qualifications: Experience: 3+ years of experience using software development and Infrastructure-as-Code practices to manage Windows infrastructure. Technical Skills: - Strong experience with self-hosted Windows Server environments, including Active Directory, Group Policy, and system performance tuning - Proficiency in PowerShell scripting for automation (modules, system configuration, networking, services) - Experience with CI/CD tools (e.g., GitHub Actions, Jenkins, GitLab CI, Azure DevOps) - Familiarity with version control (Git) and infrastructure deployment workflows - Experience with monitoring and logging tools, including analyzing Windows Event Logs and system metrics Familiarity with AWS infrastructure tools (e.g., CloudFormation, Systems Manager) - Domain Knowledge: Understanding of secure system configuration, automation best practices, and infrastructure reliability in mission-critical or regulated environments. - Travel: Quarterly travel required for team planning and collaboration. Clearance & Eligibility: - Must be a U.S. Citizen - Must be able to obtain a DoD NIPR network account and Common Access Card (CAC) - Must have, or be able to obtain, a Secret Clearance - Security relevant certification (e.g., CompTIA Security+, Cloud+) or willingness to obtain Preferred Qualifications: - Based in Denver, CO, San Antonio, TX, or Brooklyn, NY - Experience working within DoD, Air Force, or Federal contractor environments - Bachelor’s degree in Computer Science, Software Engineering, Information Systems, or related field (or equivalent experience) - AWS certification (e.g., AWS Certified Cloud Practitioner) - Experience with PowerShell DSC - Experience using Ansible for Windows configuration and automation - Hands-on experience with Windows domain administration Our Culture At Simplesense, we value high-trust autonomy. We look for people who can navigate ambiguity and are driven by the mission. - Safety & Innovation: You embed security and reliability practices into daily work to drive continuous improvement and mitigate risk. - People & Communication: You invite vigorous debate and offer "kindly blunt" feedback, always maintaining empathy and assuming noble intent. - Integrity & Ethics: You build trust by honoring commitments, acting ethically, and resolving conflict through direct, honest communication. - Strategic Problem Solving: You focus on high-priority issues to create documented, and scalable solutions—avoiding shortcuts. - Agility: You move quickly to fix small problems, learn from the past, and pivot transparently when the mission requires it. Compensation and Benefits Pay Range: $115k-$145k per year. Compensation is determined based on experience, skill level, and location. We review ranges regularly to ensure market competitiveness. Competitive Benefits - Equity - Medical, Life, Short-Term Disability, and AD&D insurance - Medical travel coverage - Dental coverage - Vision coverage - 401k matching Our Typical Hiring Process - Find Your Fit: Your journey starts here. Explore and apply to our open positions to find the right role for your skills. - Initial Chat: A brief call with our recruiting team to learn about your background and answer your initial questions about Simplesense. - Values & Vision: A conversation with a hiring manager to discuss how your aspirations align with our mission and goals of the team. - Show Your Skills: Complete a technical assessment that reflects the work you’d be doing. - Team Interview: Interview with the team to discuss your experience and see if we’re a great match. - Final Handshake: A final conversation to ensure we’ve answered all your questions before making a decision. - Welcome to Simplesense! Simplesense is an equal opportunity employer committed to a policy of merit-based employment. All employment decisions—including recruitment, hiring, promotion, compensation, benefits, training, and termination—are made based on individual qualifications, performance, and business needs. We strictly prohibit discrimination or harassment of any kind on the basis of protected characteristics as recognized by federal, state, or local law. As a U.S. government contractor, Simplesense complies with all applicable equal employment opportunity laws, Section 503 of the Rehabilitation Act, and the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA). If you need a reasonable accommodation to complete the application or take part in the interview process, please contact People Operations
Site Reliability Engineering and Platform Engineer
WorkdayWorkday is a computer software company that provides cloud-based applications for the finance and human resources industries. Founded by co-CEOs Dave Duffield a
Title: P3 SRE and Platform Engineer - Sana Search - US Federal Location: USA, GA, Atlanta Work Type: Hybrid, Full Time Job ID: 0106531 Job Description: Your work days are brighter here. We're obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing people, money, and agents, we're shaping the future of work so teams can reach their potential and focus on what matters most. The minute you join, you'll feel it. Not just in the products we build, but in how we show up for each other. Our culture is rooted in integrity, empathy, and shared enthusiasm. We're in this together, tackling big challenges with bold ideas and genuine care. We look for curious minds and courageous collaborators who bring sun-drenched optimism and drive. Whether you're building smarter solutions, supporting customers, or creating a space where everyone belongs, you'll do meaningful work with Workmates who've got your back. In return, we'll give you the trust to take risks, the tools to grow, the skills to develop and the support of a company invested in you for the long haul. So, if you want to inspire a brighter work day for everyone, including yourself, you've found a match in Workday, and we hope to be a match for you too. About the Team Workday, founded in 2005, stands as a groundbreaking force in the human capital and financial management industry, with a global presence and a diverse array of customers. Across our offices worldwide, our teams are united by a shared dedication to innovation, collaboration, and excellence! The Workday Sana Search Team is responsible for creating the world's most powerful, platform-agnostic enterprise search product, transforming the way people and agents interact with knowledge, inside and outside the Workday ecosystem. We are in the process of building the definitive discovery platform: a standalone-ready, hybrid-by-design service that orchestrates enterprise data at scale. From federated gateways to agentic text retrieval pipelines and open personalization frameworks, we provide the blueprints for modern search, engineered for universal portability and precision. Joining our team means embarking on a journey of opportunity to advance your career and contribute to impactful solutions that shape industries. Whether you thrive with solving sophisticated business problems, collaborating with agile teams, or championing innovation and software design, Workday offers an environment where your talents can thrive. About the Role This role will support one or more direct or indirect contracts with the U.S. Federal Government which, due to federal government security requirements, mandates that all Workday personnel working on the contracts be United States citizens (naturalized or native). Federal Security Requirement: This role supports one or more direct or indirect contracts with the U.S. Federal Government. Due to federal government security requirements, all personnel working on these contracts must be United States citizens (naturalized or native). What You Will Do Infrastructure & Platform Engineering - Infrastructure Management: Provision and manage AWS resources (EC2, Lambda, ElastiCache, S3, RDS) using Infrastructure as Code (IaC) tools like Terraform or CloudFormation. - Self-Service Platforms: Build platforms and tools that empower application developers to interact with production in a self-service manner. - Containerization: Manage Docker images and Kubernetes manifests (using Kustomize/Helm) to support and scale microservices. - Automation: Define, design, implement, test, and deploy automation infrastructure for configuration management and service deployment to improve operational efficiency. CI/CD & Deployment - Pipeline Maintenance: Support and troubleshoot CI/CD pipelines (e.g., Jenkins, TeamCity, Argo CD), ensuring builds are fast and deployments are reliable. - Operational Scaling: Drive the "commit to production" workflow, automating manual touchpoints where reasonable to help scale the team. Reliability & Observability - Monitoring & Alerting: Configure CloudWatch, Prometheus, and ELK dashboards to ensure team visibility into system health. - Production Response: Triage, fix, and resolve issues identified by production monitoring. Conduct retrospectives and act on incidents to continually improve systems. - On-Call Support: In time, you will participate in an infrequent on-call rotation to ensure high availability for critical systems. Collaboration & Growth - Cross-Functional Partnership: Build and maintain strong relationships with peers and partners; work closely with developers to debug environment-specific issues and optimize application performance. - Documentation: Maintain clear, concise documentation for deployment processes, infrastructure diagrams, and reliability practices. - Innovation Culture: Engage in a culture of learning and innovation through hackathons, online course offerings, and employee-led special interest guilds. About You Basic Qualifications: - 5+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering. - 5+ years of experience with AWS (Compute, Storage, Networking, and Control Plane). - Citizenship: Must be a U.S. Citizen (required for Federal Government contract compliance). Other Qualifications: - Orchestration: Experience managing production workloads in Kubernetes. - Automation: Deep familiarity with CI/CD tools and IaC frameworks. Workday Pay Transparency Statement The annualized base salary ranges for the primary location and any additional locations are listed below. Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process. Each candidate's compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things. For more information regarding Workday's comprehensive benefits, please click here. Primary Location: USA.GA.Atlanta Primary Location Base Pay Range: $117,400 USD - $176,000 USD Additional US Location(s) Base Pay Range: $111,500 USD - $199,800 USD Our Approach to Flexible Work With Flex Work, we're combining the best of both worlds: in-person time and remote. Our approach enables our teams to deepen connections, maintain a strong community, and do their best work. We know that flexibility can take shape in many ways, so rather than a number of required days in-office each week, we simply spend at least half (50%) of our time each quarter in the office or in the field with our customers, prospects, and partners (depending on role). This means you'll have the freedom to create a flexible schedule that caters to your business, team, and personal needs, while being intentional to make the most of time spent together. Those in our remote "home office" roles also have the opportunity to come together in our offices for important moments that matter
Staff Platform Engineer – CI/CD, Build Systems
HashgraphHashgraph, formerly Swirlds Labs, is a software company home to some of the brightest minds in web3.
• Architect and evolve scalable CI/CD systems that support complex multi-product release workflows across internal and open-source platforms • Design and build developer tooling, deployment automation, and release orchestration systems using technologies such as GitHub Actions, Kubernetes, and cloud-native infrastructure • Lead the engineering strategy for build pipelines, artifact management, release governance, and deployment reliability • Improve developer experience by reducing friction in build, test, deployment, and operational workflows • Build and maintain highly reliable Kubernetes-based infrastructure powering release automation and engineering productivity systems • Drive observability and operational excellence across release systems through metrics, monitoring, and performance instrumentation • Partner closely with Platform Engineering, DevOps, Security, Program Management, and Product Engineering teams to align release infrastructure with business and technical priorities • Serve as a senior technical leader and multiplier within the organization through architecture guidance, mentorship, and operational rigor



