Building a better world with better data.
Lead AI Platform Engineer
Location
United Kingdom
Posted
137 days ago
Salary
0
Seniority
Senior
Job Description
Lead AI Platform Engineer
Prolific
• Bridge the gap between research and real-world application. • Ensure high-performance infrastructure, automated pipelines, and deployment strategies. • Design and maintain scalable cloud environments (GCP/AWS) using Terraform. • Manage GPU/TPU resource allocation for training, fine-tuning, and interactive notebooks. • Build internal services and CLI tools for the AI team. • Design CI/CD and training pipelines using tools such as GitHub Actions, MLFlow, Vertex AI Pipelines. • Develop reusable patterns for model serving and manage service deployments to Kubernetes. • Manage and optimize vector databases and embedding pipelines for RAG-based systems. • Implement techniques to reduce latency and increase throughput.
Job Requirements
- 5+ years experience with cloud infrastructure and infrastructure as code.
- Previous experience with the ML and LLM lifecycle - training, hosting, optimisation, observability.
- Used to working closely with researchers and data scientists - taking experiments from worksheets into production.
- Strong grasp of ML fundamentals and modern GenAI stack.
Benefits
- Competitive salary
- Remote working
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
AI Platform Engineer – Lead
KayzenKayzen powers the world's best mobile marketing teams to take programmatic in-house.
• Design and build internal AI frameworks, SDKs, and shared libraries • Enable teams to integrate AI features with minimal friction • Set up standardized patterns for using LLMs, embeddings, agents, and workflows • Build reusable components for prompt management, evaluation, observability, and safety • Define best practices for AI usage, cost control, and reliability • Evangelise AI internally through documentation, examples, and hands-on guidance • Rapidly prototype AI-powered features and turn them into reusable building blocks • Own AI tooling from experimentation to production
Senior Platform Engineer
vCluster LabsvCluster Labs is a venture-backed tech startup headquartered in San Francisco, California, with a distributed, remote-first team spanning eight time zones. Foun
• Infrastructure Management: Own and improve our multi-cloud infrastructure spread across AWS, GCP, and Digital Ocean. You will manage Kubernetes clusters, handle patching, manage access, and enhance to ensure our tooling has robust alerts and metrics. • CI/CD Optimization: Drive the improvement of GitHub CI pipelines. You will be responsible for creating secure, repeatable testing environments and automating pipeline updates to streamline the developer experience. • Internal Services Architecture: Architect and host infrastructure for engineering development, including internal services and vCluster-specific platforms (e.g., loft.rocks, vCluster Cloud). You will empower engineers to build pipelines securely through education and tooling. • Customer Zero: Act as the first and most critical user of our products. You will push vCluster features to their limits to create useful internal tools, discovering bugs and providing feedback to Engineering to shape the future of our software. • Terraform Automation: Focus on automating updates and managing infrastructure as code using Terraform Spacelift. You will give the team the ability to create infrastructure on demand, ensuring scalability and consistency. • Execution: Manage a variety of Kanban tasks via Linear, ranging from improving observability to handling GitHub policy requests, release engineering, and access management.
Cyber Range Platform Engineer
Horizon3.aiContinuous, autonomous pentesting, powered by NodeZero. Are your systems secure? Don't wait for a breach to find out!
• Implement highly scalable, secure, resilient cloud-native and on-premise application platforms that host vulnerable applications, configurations, and other services vital to research and development of our product. • Identify, design, and implement improvements in deployment processes so engineers can begin development anytime with confidence. • Design and implement tooling for the internal platform to collect telemetry data used to gauge its effectiveness in supporting H3's engineers. • Help create an effective documentation culture within the Engineering Organization (Confluence, Documentation as code). • Participate in effective project management and work allocation (Jira, Agile Concepts). • Create Terraform modules and Ansible playbooks that are extensible across the infrastructure to reliably and effectively deploy vulnerable testing scenarios for engineering development efforts. • Be a positive example of automation for infrastructure-as-code, platform operations, and overall CI/CD methodologies. • Evaluate new technologies and patterns for automation, application hosting, and improving infrastructure provisioning. Recommend and define the work needed.
Principal Data Platform Engineer
ExperianBased in Dublin, Leinster, Ireland, Experian is a global information services company that operates in 40 countries around the world and has additional headquar
• Lead global teams in design and architecture discussions with partners, developers, and platform engineers • Implement scalable and secure database architectures for enterprise systems across multi-availability-zone, multi-region cloud-native platforms (AWS, and private cloud) • Optimize applications and databases to use the full capabilities and cost efficiencies of AWS cloud-native RDS & EMR platforms • Review, configure, and maintain globally distributed data platforms that are available and scalable • Collaborate with IT, DevOps, and application teams to ensure seamless integration with existing systems • Manage automated operations, deployments, patching, upgrades, and capacity planning through CI/CD pipelines and Infrastructure as Code (IaC) tools ensuring secure and scalable database operations across multi-cloud environments • Monitor, troubleshoot, and fine-tune database performance to ensure scalability across multi-cloud environments • Implement data protection strategies, including encryption and access controls • Support audit readiness and security reviews • Manage regular maintenance windows to ensure system uptime • Develop enterprise data models and enforce data standards, best practices, and governance frameworks to ensure data integrity and consistency • Research new technologies to enhance database efficiency • Create clear, comprehensive documentation for database designs, configurations, and procedures



