Creai

Custom artificial intelligence solutions

Platform Engineer

Engineer | Full Time | Remote | Senior | Team 51-200 | Since 2023 | H1B No Sponsor

Location

Mexico

Posted

34 days ago

Salary

0

Seniority

Senior

Job Description

  • Design, implement, and maintain infrastructure on AWS and Azure using Terraform or Pulumi.
  • Define Creai's multi-provider cloud strategy, ensuring that all infrastructure is reproducible, secure, and versioned.
  • Design and operate robust, reusable continuous integration and delivery pipelines for all engineering teams, supporting deployments of applications and ML/AI models with automated testing, quality gates, and rollback strategies.
  • Design, deploy, and operate production Kubernetes clusters (EKS/AKS).
  • Manage namespaces, RBAC, network policies, Helm/Kustomize, and autoscaling strategies for AI workloads.
  • Build and maintain Creai's MLOps platform: training pipelines, model registry and versioning, deployment as scalable endpoints, and production performance monitoring.
  • Implement specialized infrastructure for generative AI workloads, including GPU resource management and RAG architectures.
  • Be the main driver of developer experience: build tools, templates, and abstractions that let engineering and data science teams focus on creating value without operational friction.
  • Build security into every layer of the platform: secrets management, IAM, encryption, and least-privilege enforcement.
  • Define and track SLAs/SLOs. Lead incident response and post-mortems.
  • Design for high availability and disaster recovery.
  • Implement complete observability stacks (metrics, logs, and traces) with tools such as Prometheus, Grafana, Datadog, or OpenTelemetry, ensuring visibility into the state of all services and models in production.
  • As the first member of the Platform team, build not only the infrastructure but also the team's culture, processes, and standards.
  • Actively influence architectural decisions across the organization and mentor future platform engineers.
  • Occasionally take part in technical conversations with clients to define infrastructure requirements, present architectures, and ensure that platform solutions meet each project's expectations.
  • Continuously evaluate and improve the platform stack, tools, processes, and operating practices, optimizing the efficiency and reliability of the solutions.
  • Communicate clearly and in a structured way with technical and non-technical stakeholders, presenting architecture and infrastructure decisions in an accessible manner.

Job Requirements

  • More than 4 years of experience in Platform Engineering, DevOps, SRE, or Infrastructure Engineering roles, with direct responsibility for production infrastructure at scale.
  • Solid, demonstrable experience with AWS and Azure, including compute, networking, storage, identity (IAM/Entra ID), and managed Kubernetes (EKS/AKS) services.
  • Mastery of Terraform. Experience with remote state management, reusable modules, and IaC pipelines in CI/CD.
  • Advanced experience designing and operating production Kubernetes clusters: RBAC, network policies, Helm, Kustomize, operators, and scaling strategies (HPA, VPA, Cluster Autoscaler).
  • Experience designing complex CI/CD pipelines on platforms such as GitHub Actions, GitLab CI, Azure DevOps, or Jenkins.
  • Mastery of Docker: building optimized images, multi-stage builds, and registry management (ECR, ACR). Experience with vulnerability scanning (Trivy, Snyk).
  • Experience implementing observability stacks with Prometheus, Grafana, Datadog, OpenTelemetry, or ELK/Loki.
  • Strong scripting skills in Python and Bash for automating operational tasks and developing internal tools.
  • Proven ability to work independently, make complex technical decisions, and own results end-to-end in highly ambiguous contexts.
  • Ability to explain infrastructure decisions to technical and business audiences.
  • Fluent written and verbal communication in Spanish and English.
  • Experience with tools such as MLflow, Kubeflow, Seldon Core, KServe, SageMaker Pipelines, or Azure ML Pipelines for managing the ML model lifecycle (nice to have).
  • Experience managing GPU infrastructure (spot instances, scheduling) and deploying LLMs or embedding models in production (nice to have).
  • AWS (Solutions Architect, DevOps Engineer) or Azure (AZ-104, AZ-400) certifications (nice to have).
  • Experience with Istio, Linkerd, or Consul for traffic management, mTLS, and network observability (nice to have).
  • Experience operating vector databases such as Pinecone, Weaviate, or pgvector in production (nice to have).

Benefits

  • 100% remote work with a schedule aligned to CST.
  • Unlimited PTO: We trust you to manage your time effectively.
  • Annual development budget: Access to courses, certifications, and conferences.
  • Equipment budget: Set up your ideal remote workspace.
  • Health benefit: Access to private medical coverage or health insurance subsidies.
  • Growth opportunities: Career plan and mentoring with experts in AI and technology.
  • Dynamic, flexible startup environment: Autonomy to make decisions and propose ideas, with a focus on results rather than hours worked.
  • Work-life balance: A culture that prioritizes flexibility and well-being, letting you manage your time without sacrificing your personal life.
