FICO logo
FICO

FICO is an analytics company helping businesses make better decisions that drive higher levels of growth and success.

Principal Platform Engineer

Platform EngineerPlatform EngineerFull TimeRemoteLeadTeam 1,001-5,000Since 1956H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

69 days ago

Salary

$150.5K - $236.5K / year

Seniority

Lead

Bachelor Degree10 yrs expEnglishAWSDockerKubernetesPythonTerraform

Job Description

Principal Platform Engineer

FICO

• Lead the design, implementation, and evolution of our cloud-native platform infrastructure. • Build and maintain scalable, resilient systems that empower our engineering teams to deliver innovative solutions rapidly and reliably. • Design, deploy, and manage scalable cloud solutions on AWS public cloud platform via Infrastructure as Code. • Manage infrastructure as code (IaC) leveraging Terraform, CloudFormation and GoLang. • Design and implement Kubernetes-based platform solutions with focus on scalability, reliability, and security. • Support and maintain large Kubernetes clusters in production environments. • Implement security best practices and ensure compliance with industry standards and regulations. • Work closely with development, operations, and security teams to integrate infrastructure as code practices. • Develop automation to build and deploy Docker Containers through CI/CD pipelines for engineering teams deploy and test services. • Write policy & standard validation tests and integrations with Security Scanning software to ensure compliance. • Implement and support Observability solutions to ensure platform performance, reliability, and scalability. • Create Dashboards and integrate into Backstage IDP for visibility into system health. • Provide guidance and mentorship to team members on best practices in GitOps, CI/CD, and infrastructure management.

Job Requirements

  • 10+ years of experience in cloud & platform engineering, DevOps, developer enablement, or infrastructure roles with increasing responsibility.
  • Expert-level proficiency with AWS cloud services especially in compute, storage, networking, implementing cloud native services, and developing resilient infrastructure architecture.
  • Strong software engineering skills and methodologies.
  • Deep Knowledge of AWS EKS or running multiple Kubernetes instances, VPC Configuration, AWS Waf, S3, RDS, and other cloud services.
  • Experience with ArgoCD, Argo Workflows, CircleCI, or GitHub Actions.
  • Experience with Crossplane a plus.
  • Strong hands-on experience with developing golden paths for engineers and managing CI/CD pipelines leveraging GitOps workflows for Kubernetes application delivery.
  • Deep expertise in Infrastructure as Code using Terraform for multi-cloud resource provisioning and full stack deployments.
  • Proven experience with Kubernetes infrastructure management and service mesh technologies like Istio and API gateway solutions.
  • Strong understanding of Kubernetes architecture, operators, custom resources, and ecosystem tools .
  • Advanced programming skills in GoLang and Python for building platform tooling and automation.
  • Experience with Grafanna, OTEL, and Backstage.
  • Demonstrated ability to lead technical initiatives and influence architectural decisions across engineering teams.
  • Excellent communication and collaboration skills with both technical and non-technical.
  • Background in SRE or Software Engineering practices and principles.
  • Bachelor's Degree in Computer Science or related discipline, or relevant experience.
  • AWS certifications (Solutions Architect Professional, DevOps Engineer Professional).

Benefits

  • An inclusive culture strongly reflecting our core values: Act Like an Owner, Delight Our Customers and Earn the Respect of Others.
  • The opportunity to make an impact and develop professionally by leveraging your unique strengths and participating in valuable learning experiences.
  • Highly competitive compensation, benefits and rewards programs that encourage you to bring your best every day and be recognized for doing so.
  • An engaging, people-first work environment offering work/life balance, employee resource groups, and social events to promote interaction and camaraderie.

Related Categories

Related Job Pages

More Platform Engineer Jobs

engaige GmbH logo

Working Student AI Platform Engineer

engaige GmbH

We give your data purpose - Tech Company focused on innovation and artificial intelligence.

Part TimeRemoteTeam 1-10Since 2021H1B No Sponsor

• Responsible for designing and implementing complex AI components — from initial concept to production • Enhance and scale existing AI solutions to improve performance and efficiency • Develop robust data pipelines, backends, and API interfaces for seamless integration • Build and maintain automated deployment pipelines to streamline releases and testing • Ensure quality and performance of backend processes and NLP algorithms through continuous optimization • Work closely with the product team to translate data-driven insights into market-ready, innovative solutions

United States
engaige GmbH logo

AI Platform Engineer – m/f/d

engaige GmbH

We give your data purpose - Tech Company focused on innovation and artificial intelligence.

Full TimeRemoteTeam 1-10Since 2021H1B No Sponsor

• Design and build AI components — from concept to production • Scale and optimize existing AI solutions (performance, cost, stability — the full triathlon package) • Develop data pipelines, backends & APIs that feel good to both users and for logging • Automate CI/CD and deployments so releases don’t feel like an adventure • Ensure quality & performance — including continuous optimization of backend processes and NLP pipelines • Work closely with Product to turn data-driven insights into market-ready features

United States
Job Closed
Jobgether logo

Sr Platform Engineer

Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Full TimeRemoteH1B No Sponsor

Role Description This role focuses on designing, building, and evolving scalable platforms to support computer vision and machine learning (CVML) workloads. The Sr Platform Engineer will enable ML teams by developing reliable infrastructure, tooling, and workflows that accelerate experimentation, training, and deployment of models at scale. This position combines hands-on engineering with strategic platform guidance, balancing improvements to legacy systems with delivery of new, high-impact platform capabilities. You will collaborate closely with ML engineers, robotics teams, and product stakeholders, shaping platform architecture, reliability, and performance across cloud and on-prem environments. The ideal candidate thrives in a fast-moving, multi-team environment and is motivated by creating durable, widely adopted technical solutions. This role offers the opportunity to influence platform strategy while executing tangible improvements that impact real-world applications in computer vision, robotics, and AI-powered systems. - Design, implement, and evolve platform capabilities for ML training, batch inference, and model deployment workflows - Own and maintain core platform components including compute orchestration, data pipelines, and inference systems used across multiple teams - Enhance platform reliability, scalability, and performance while addressing real-world ML workload requirements - Enable ML engineers with intuitive tools, workflows, and documentation across the full model lifecycle - Develop and optimize hybrid compute environments, leveraging Kubernetes, Slurm, and cloud platforms (AWS preferred) - Evaluate and improve system architecture, balancing incremental improvements with long-term platform health - Mentor junior engineers, provide technical guidance, and drive adoption of platform best practices Qualifications - 5+ years of professional experience in platform, infrastructure, or systems engineering - Strong technical judgment in evolving legacy platforms and delivering new, cross-team components - Proficiency in Python for production systems, tooling, and platform components - Solid understanding of ML systems and the end-to-end model lifecycle from experimentation to deployment - Hands-on experience with cloud platforms (AWS preferred) and container orchestration systems such as Kubernetes and Slurm - Ability to translate requirements from ML, robotics, and product teams into scalable platform solutions - Experience ramping quickly on new domains, tools, and complex systems - Preferred: Golang experience, ML pipeline integration (Kubeflow, Airflow), distributed training and inference (Ray), computer vision or robotics ML systems Requirements - 5+ years of professional experience in platform, infrastructure, or systems engineering - Strong technical judgment in evolving legacy platforms and delivering new, cross-team components - Proficiency in Python for production systems, tooling, and platform components - Solid understanding of ML systems and the end-to-end model lifecycle from experimentation to deployment - Hands-on experience with cloud platforms (AWS preferred) and container orchestration systems such as Kubernetes and Slurm - Ability to translate requirements from ML, robotics, and product teams into scalable platform solutions - Experience ramping quickly on new domains, tools, and complex systems - Preferred: Golang experience, ML pipeline integration (Kubeflow, Airflow), distributed training and inference (Ray), computer vision or robotics ML systems Benefits - Competitive US-based annual salary range of $160,000 - $287,000, plus eligibility for bonus programs - Comprehensive benefits package including healthcare, retirement, and paid time off - Opportunity to work remotely within the United States - Exposure to cutting-edge CVML platforms, AI research, and robotics applications - Mentorship, career development, and learning opportunities in a collaborative, inclusive environment - Chance to influence platform strategy and technical direction across multiple teams

United States
$160K - $287K / year
Job Closed
Full TimeRemoteTeam 51-200Since 2005H1B No Sponsor

Role Description Buscamos um(a) profissional sênior com forte experiência em AWS para atuar na sustentação, monitoramento e evolução de uma plataforma de dados e analytics em ambiente cloud. Esta posição possui caráter estratégico e exige atuação ponta a ponta, incluindo análise e resolução de incidentes, monitoramento de infraestrutura, atuação proativa na identificação de riscos e proposição de melhorias técnicas e de governança. O profissional atuará em um modelo de AMS estruturado, com foco em suporte N2/N3, confiabilidade da plataforma e evolução contínua do ambiente, apoiando diretamente a operação e a estabilidade dos serviços de dados e analytics. Responsibilities - Sustentação e suporte técnico - Atuar na análise e resolução de incidentes em ambiente AWS (nível N2/N3) - Investigar causas raiz de falhas em serviços de dados e infraestrutura - Realizar troubleshooting envolvendo performance, disponibilidade e custo - Apoiar tecnicamente os níveis iniciais de atendimento - Monitoramento e observabilidade - Monitorar e analisar métricas de serviços AWS, incluindo: - Redshift (CPU, filas, queries, armazenamento) - EC2 (CPU, memória, disco, status checks) - EMR (jobs, uso de recursos, HDFS) - Athena (queries, custo, performance) - SQS (backlog, throughput) - DynamoDB (throttling, latência) - Lambda (erros, duração, concorrência) - S3 (armazenamento e erros) - Identificar gargalos de performance e riscos operacionais - Criar e evoluir mecanismos de alerta e monitoramento - Atuação proativa - Identificar oportunidades de melhoria em performance, custo e estabilidade - Propor ações preventivas para evitar incidentes - Automatizar rotinas operacionais e de monitoramento - Governança e evolução da plataforma - Apoiar na definição de boas práticas de arquitetura em cloud - Contribuir com a evolução da governança da plataforma de dados - Apoiar a análise e direcionamento de vulnerabilidades - Gestão e comunicação - Apoiar na elaboração de relatórios técnicos mensais (saúde da plataforma, riscos e melhorias) - Interagir com clientes e stakeholders técnicos - Documentar incidentes, análises e soluções Qualifications - Experiência sólida com AWS em ambientes produtivos - Vivência prática com: - EC2 (monitoramento e troubleshooting) - CloudWatch (métricas, logs e alarmes) - S3 - Experiência em análise e resolução de incidentes de infraestrutura - Atuação prévia em suporte técnico nível N2/N3 ou AMS - Experiência com análise de performance (CPU, memória, disco, I/O) - Conhecimento de arquitetura em cloud, preferencialmente voltada a dados - Capacidade de atuação autônoma em cenários críticos Requirements - Experiência com ferramentas AWS como Amazon Redshift, DynamoDB, EMR e Lambda - Conhecimento em Athena e em mensageria (SQS) - Experiência com arquiteturas serverless - Experiência com práticas de FinOps (otimização de custos em cloud) - Conhecimento em segurança e vulnerabilidades em AWS - Experiência com ferramentas de observabilidade e monitoramento avançado - Vivência com metodologias ITIL ou AMS estruturado - Experiência em ambientes de dados e analytics - Inglês Intermediário ou avançado Profile Expected - Perfil analítico, com forte capacidade de investigação e diagnóstico - Proatividade na identificação e resolução de problemas - Organização e senso de priorização - Boa comunicação - Capacidade de atuar de forma prática, sem perder a visão estratégica Benefits - Vales Alimentação e Refeição (Swile) - Flexibilidade para crédito em Auxílio Home-Office (Swile) - Cobertura de até 100% em Plano de Saúde e Odontológico - Seguro de Vida em grupo - Trabalho remoto - Convênio Saúde Mental - psicoterapia online e presencial - Incentivo a certificações e cursos - Convênio para cursos de pós-graduação e MBA (Esalq/USP) - Parceria com escolas de idiomas - Parceria com academias e apps de bem-estar (Wellhub) - Palestras e rodas de conversa internas - Bônus por indicação - Happy hours - Mimos em datas comemorativas

Worldwide
Job Closed