Job Closed
This listing is no longer active.
The leading provider of digital identity verification and fraud solutions. Salesinfo@socure.com
Senior Manager, AI Platform Engineering
Location
United States
Posted
136 days ago
Salary
$190K - $210K / year
Seniority
Senior
Job Description
Senior Manager, AI Platform Engineering
Socure
• Develop and own the roadmap for Socure’s AI/ML platform, including data and feature engineering workflows, training infrastructure, experimentation tooling, model deployment/serving, monitoring, and governance. • Define architecture and standards that create clear, scalable, and secure paths for building and operating AI systems. • Assess technology options and drive consolidation across the company to reduce fragmentation and improve consistency across the ML toolchain. • Partner with Data Science, Engineering, Product, and Sales-Enablement teams to develop AI infrastructure that delights Customers. • Lead the design and operation of the end-to-end ML lifecycle: data ingestion, feature engineering, experimentation, training, model registry, deployment, and continuous monitoring. • Guide the team to deliver high-quality platform capabilities with predictable timelines and strong technical rigor. • Implement and enforce best practices around model versioning, auditability, lineage tracking, data governance, and security controls. • Lead, mentor, and grow both senior and junior ICs across ML infrastructure, MLOps, and distributed systems.
Job Requirements
- 8+ years of professional software engineering experience, including time spent building or operating large-scale ML, data, or distributed systems platforms.
- 3+ years of engineering leadership experience managing multiple teams or engineering managers.
- Strong technical background in ML infrastructure, data engineering, and/or cloud-native distributed systems.
- Demonstrated experience delivering complex, cross-functional platform initiatives.
- Excellent communication and stakeholder management skills, with the ability to translate between technical detail and business priorities.
- Experience working in fast-paced, iterative environments using modern development practices.
Benefits
- Offers Equity
- Offers Bonus
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
• Bridge the gap between research and real-world application. • Ensure high-performance infrastructure, automated pipelines, and deployment strategies. • Design and maintain scalable cloud environments (GCP/AWS) using Terraform. • Manage GPU/TPU resource allocation for training, fine-tuning, and interactive notebooks. • Build internal services and CLI tools for the AI team. • Design CI/CD and training pipelines using tools such as GitHub Actions, MLFlow, Vertex AI Pipelines. • Develop reusable patterns for model serving and manage service deployments to Kubernetes. • Manage and optimize vector databases and embedding pipelines for RAG-based systems. • Implement techniques to reduce latency and increase throughput.
AI Platform Engineer – Lead
KayzenKayzen powers the world's best mobile marketing teams to take programmatic in-house.
• Design and build internal AI frameworks, SDKs, and shared libraries • Enable teams to integrate AI features with minimal friction • Set up standardized patterns for using LLMs, embeddings, agents, and workflows • Build reusable components for prompt management, evaluation, observability, and safety • Define best practices for AI usage, cost control, and reliability • Evangelise AI internally through documentation, examples, and hands-on guidance • Rapidly prototype AI-powered features and turn them into reusable building blocks • Own AI tooling from experimentation to production
Senior Platform Engineer
vCluster LabsvCluster Labs is a venture-backed tech startup headquartered in San Francisco, California, with a distributed, remote-first team spanning eight time zones. Foun
• Infrastructure Management: Own and improve our multi-cloud infrastructure spread across AWS, GCP, and Digital Ocean. You will manage Kubernetes clusters, handle patching, manage access, and enhance to ensure our tooling has robust alerts and metrics. • CI/CD Optimization: Drive the improvement of GitHub CI pipelines. You will be responsible for creating secure, repeatable testing environments and automating pipeline updates to streamline the developer experience. • Internal Services Architecture: Architect and host infrastructure for engineering development, including internal services and vCluster-specific platforms (e.g., loft.rocks, vCluster Cloud). You will empower engineers to build pipelines securely through education and tooling. • Customer Zero: Act as the first and most critical user of our products. You will push vCluster features to their limits to create useful internal tools, discovering bugs and providing feedback to Engineering to shape the future of our software. • Terraform Automation: Focus on automating updates and managing infrastructure as code using Terraform Spacelift. You will give the team the ability to create infrastructure on demand, ensuring scalability and consistency. • Execution: Manage a variety of Kanban tasks via Linear, ranging from improving observability to handling GitHub policy requests, release engineering, and access management.
Cyber Range Platform Engineer
Horizon3.aiContinuous, autonomous pentesting, powered by NodeZero. Are your systems secure? Don't wait for a breach to find out!
• Implement highly scalable, secure, resilient cloud-native and on-premise application platforms that host vulnerable applications, configurations, and other services vital to research and development of our product. • Identify, design, and implement improvements in deployment processes so engineers can begin development anytime with confidence. • Design and implement tooling for the internal platform to collect telemetry data used to gauge its effectiveness in supporting H3's engineers. • Help create an effective documentation culture within the Engineering Organization (Confluence, Documentation as code). • Participate in effective project management and work allocation (Jira, Agile Concepts). • Create Terraform modules and Ansible playbooks that are extensible across the infrastructure to reliably and effectively deploy vulnerable testing scenarios for engineering development efforts. • Be a positive example of automation for infrastructure-as-code, platform operations, and overall CI/CD methodologies. • Evaluate new technologies and patterns for automation, application hosting, and improving infrastructure provisioning. Recommend and define the work needed.



