We amplify pride and create connections for all fans around the world.
Senior Platform Engineer
Location
New York
Posted
135 days ago
Salary
$129.2K - $183.6K / year
Seniority
Senior
Job Description
Senior Platform Engineer
Fanatics, Inc.
• Own and deliver medium-to-large platform projects from implementation through production • Build and maintain internal platform services and tooling that improve developer experience and delivery velocity • Partner with application teams to unblock delivery and align platform capabilities with developer needs • Drive cloud cost efficiency and optimization across our cloud and infrastructure • Improve operational excellence through observability, automation, and incident response • Help evolve CI/CD pipelines and GitOps-based workflows • Mentor junior engineers through code reviews and knowledge sharing • Participate in on-call rotations and incident response
Job Requirements
- 5–7 years of experience in platform, infrastructure, DevOps, or software engineering
- Proficiency with Python and Java. Familiarity with Kotlin, Go a plus
- Deep Kubernetes and container orchestration knowledge
- Proven CI/CD and GitOps experience
- Infrastructure as Code expertise (Terraform)
- Strong AWS and cloud-native experience
- Experience operating distributed, high-availability systems
- Strong ownership mindset and ability to deliver defined projects independently
- Clear communication and strong collaboration skills
Benefits
- Medical
- Dental
- Vision
- 401K
- paid time off
- GymPass
- Pet Insurance
- Family Care Benefits
- Free Ship deliveries
- $700 to set up your home office
Related Guides
Related Categories
Related Job Pages
More Platform Engineer Jobs
• Architect, implement, and manage cloud-native infrastructure (Kubernetes, Terraform, Terragrunt, Docker, Azure) to support scalable and reliable platforms. • Champion SRE principles: drive service reliability, availability, and performance through automation, monitoring, and incident management. • Lead platform automation, CI/CD, and infrastructure-as-code initiatives to improve deployment velocity and system consistency. • Own observability, monitoring, and alerting using Datadog and related tools, ensuring actionable insights and rapid incident response. • Collaborate with software engineers to ensure robust integration between application and platform layers, and to improve developer experience (DevX). • Provide expert support for critical incidents and production issues, demonstrating high ownership and urgency (including out-of-hours support when required). • Mentor and guide engineers in platform, SRE, and DevOps best practices, fostering a culture of reliability and continuous improvement. • Identify and address gaps in tooling, automation, and platform reliability to proactively improve engineering outcomes.
AI Platform Engineer
Thinkahead Consultant Psychologist Pty LtdWe get to the heart of the matter.....real people......real solutions
• Architect and manage Kubernetes clusters tailored to AI/ML workloads. • Implement Run:ai and operators for GPU resource orchestration and workload scheduling. • Develop and maintain Python-based automation scripts and ML pipelines; automate infrastructure provisioning with Terraform and configuration management with Ansible. • Create and manage Jupyter Notebooks for experimentation and collaboration. • Integrate and optimize NVIDIA Enterprise Suite components (CUDA, NeMo Framework, Triton, TensorRT, GPU drivers) for accelerated computing. • Establish and maintain MLOps best practices for model lifecycle management, CI/CD, and monitoring (e.g., MLflow, Kubeflow). • Work closely with data scientists and platform engineers to ensure efficient resource utilization and scalability across environments.
Senior Manager, AI Platform Engineering
SocureThe leading provider of digital identity verification and fraud solutions. Salesinfo@socure.com
• Develop and own the roadmap for Socure’s AI/ML platform, including data and feature engineering workflows, training infrastructure, experimentation tooling, model deployment/serving, monitoring, and governance. • Define architecture and standards that create clear, scalable, and secure paths for building and operating AI systems. • Assess technology options and drive consolidation across the company to reduce fragmentation and improve consistency across the ML toolchain. • Partner with Data Science, Engineering, Product, and Sales-Enablement teams to develop AI infrastructure that delights Customers. • Lead the design and operation of the end-to-end ML lifecycle: data ingestion, feature engineering, experimentation, training, model registry, deployment, and continuous monitoring. • Guide the team to deliver high-quality platform capabilities with predictable timelines and strong technical rigor. • Implement and enforce best practices around model versioning, auditability, lineage tracking, data governance, and security controls. • Lead, mentor, and grow both senior and junior ICs across ML infrastructure, MLOps, and distributed systems.
• Bridge the gap between research and real-world application. • Ensure high-performance infrastructure, automated pipelines, and deployment strategies. • Design and maintain scalable cloud environments (GCP/AWS) using Terraform. • Manage GPU/TPU resource allocation for training, fine-tuning, and interactive notebooks. • Build internal services and CLI tools for the AI team. • Design CI/CD and training pipelines using tools such as GitHub Actions, MLFlow, Vertex AI Pipelines. • Develop reusable patterns for model serving and manage service deployments to Kubernetes. • Manage and optimize vector databases and embedding pipelines for RAG-based systems. • Implement techniques to reduce latency and increase throughput.



