iHerb is an award-winning ecommerce retailer that specializes in wellness products and nutritional supplements. The company started in 1996 with a focus on prom
Principal Engineer – AI Platform
Location
United States
Posted
5 days ago
Salary
$204K - $260K / year
Seniority
Lead
Job Description
Principal Engineer – AI Platform
iHerb
• Define and own the AI platform architecture: retrieval infrastructure, model lifecycle, evals framework, guardrails, and GenAI product feature design. • Lead the build of the shared AI platform layer consumed by all AI product features and reusable by internal business teams. • Hands-on contributor: build production AI systems, write proofs of concept, and validate architecture through working software. • Set and enforce technical standards for the AI Platform team; drive architecture reviews and model quality reviews. • Coordinate with the Personalization team to define clear boundaries between the GenAI product layer and existing ML personalization infrastructure. • Contribute AI platform-specific patterns and lessons into iHerb's shared AI-driven SDLC golden path. • Drive the hardest cross-cutting technical decisions across multiple teams and shared platform services. • Establish and evolve iHerb's AI-driven SDLC golden path: shared standards, Claude Code skills, guardrails, and automation patterns. • Lead complex multi-team technical efforts by coordinating architecture reviews, aligning peer Principals and EMs, and resolving competing approaches. • Mentor and raise the technical bar across the engineering organization through code review, architecture review, and direct coaching of senior engineers. • Represent engineering in cross-functional conversations with product, data science, security, and infrastructure. • Feed architectural decisions into the shared knowledge base so institutional knowledge compounds across the organization.
Job Requirements
- Generally requires a minimum of 10+ years of software engineering experience, with a significant portion at senior, staff, or principal IC level.
- AI-driven SDLC (required): demonstrated use of AI-assisted development tools such as Claude Code, GitHub Copilot, or Cursor to ship production systems.
- Architecture at scale: experience designing and evolving large-scale distributed systems across multiple teams and years: APIs, data pipelines, event-driven architectures, or high-traffic platforms.
- Cross-org technical leadership: track record of driving architectural standards, technical roadmaps, or platform initiatives that span multiple teams or organizations.
- Engineering quality mindset: deeply held opinions on code quality, observability, CI/CD, test automation, and maintaining velocity without accumulating hidden debt.
- Communication and influence: able to write clear architecture documents, present to technical and non-technical audiences, and build consensus without formal authority.
- Experience working in distributed teams across the US, China, and Latin America.
- Production experience designing and operating AI platform systems: RAG pipelines, vector search, embedding infrastructure, or retrieval-augmented applications at scale.
- Hands-on experience with LLM evaluation: building eval frameworks, defining quality metrics, and iterating on model and prompt quality using data.
- MLOps experience: model deployment, monitoring, lifecycle management, and cost governance in a production environment.
- Experience building and operating agentic systems using frameworks such as LangChain, LlamaIndex, or equivalent.
- GenAI in production is required; prototype or research-only experience is not sufficient at this level.
- High degree of accuracy and attention to detail.
- Excellent organization skills and ability to multi-task.
Benefits
- Employees (and their families) that meet eligibility criteria as outlined in applicable plan documents are eligible to participate in our medical, dental, vision, and basic life insurance programs and may enroll in our company’s 401(k) plan.
- Employees will also be eligible for Time Off and Paid Sick Leave pursuant to the company’s policies. Employees will enjoy paid holidays throughout the calendar year.
- Hired applicant may be awarded Restrict Stock Units and receive annual bonuses pursuant to eligibility and performance criteria defined in the respective plan documents and policies.
Related Guides
Related Job Pages
More AI Engineer Jobs
• Play a critical role in bridging data science and software engineering • Work closely with data scientists, software engineers, and product teams • Deliver robust, scalable AI solutions • Ensure seamless integration of AI models into production systems • Optimize performance and maintain reliability of AI-driven features
ML Research Engineer - AI Trainer
MercorCincinnatus is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for full-time and long-term contingent roles. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Roles hired through Cincinnatus are not project-based or freelance engagements. They are structured, role-based positions that typically involve full-time or fixed-term commitments, close collaboration with a client's internal teams, and integration into standard enterprise workflows. Cincinnatus is a legal entity separate from Mercor. While opportunities may be discovered through Mercor's platform, employment, onboarding, payroll, and benefits for these roles are administered by Cincinnatus. Equal Employment Opportunity Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. Cincinnatus is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans throughout the job application process.
Role Description Mercor connects elite creative and technical talent with leading AI research labs. The position is for a Human Baseliner for Open-Ended ML Research Tasks. - Attempt open-ended machine learning research tasks under a fixed time and compute budget. - Work independently in a sandboxed Linux environment with internet access. - Use preferred tools, including IDEs and AI coding assistants like Cursor, Claude Code, and ChatGPT. - Record full working sessions via screen recording. - Complete pre-task and post-task questionnaires. - Submit final work product, screen recording, and completed questionnaires for evaluation. Qualifications - 3+ years of machine learning experience. Time in a PhD program counts. - Attended a top-100 university or worked at FAANG or a comparable company. - Experience with PyTorch, JAX, or TensorFlow. - Deep expertise in at least one focus area: pretraining, PPO, reward shaping, fine-tuning, LoRA, RLHF, architecture design, contrastive training, generative modeling, multilingual experience, or data pipelines. Requirements - Practical experience in Pretraining, Reinforcement learning, Post-training, Dataset curation, or Model architecture. - One baseline attempt per contractor per task. - Each task may only be attempted once. - All work is confidential and covered by NDA. - Compute and environment are provided; no personal GPU required. Application Process - Upload resume - AI interview based on your resume - Submit form Resources & Support - For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome - For any help or support, reach out to: support@mercor.com - Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Role Description Onpoint Healthcare Partners builds Iris, a Medical Agent AI platform that takes administrative work off the plates of providers and care teams. Our agents handle charting, coding, care gap closure, and care coordination across the full patient journey, combining AI automation with expert clinical oversight. More than 2,000 providers across 35+ specialties rely on the platform every day, which means the systems behind it have to be accurate, reliable, and auditable without exception. We are seeking a Staff AI Engineer to design, build, and scale the agent systems behind Iris. This role combines strong software engineering fundamentals with deep experience building production AI systems. The work you ship directly determines how much time providers get back for patient care. The ideal candidate has shipped AI solutions to production that deliver measurable business value, and will evolve and be a good steward of the agent platform architecture. Key Responsibilities - Design and develop AI applications and agent workflows. - Build scalable backend services, APIs, and integrations supporting AI solutions. - Evolve and steward the agent platform architecture, including orchestration, runtime safety, and prompt governance. - Treat prompts and tool schemas as versioned code with staged rollouts and rollback. - Develop and maintain evaluation frameworks for AI agents and models, with CI gates that block bad changes before release. - Design retrieval, memory, and context management strategies. - Build observability, monitoring, and debugging capabilities, including full trace and replay for incident resolution. - Build the MLOps foundation the team is currently missing: training and retraining pipelines, model versioning, model registry, and feature stores, so model work stops being reactive. - Stand up A/B testing infrastructure and automated drift detection that triggers retraining, so model quality is monitored and maintained without manual firefighting. - Track token and tool costs per run, workflow, and tenant, and keep costs predictable as usage scales. - Improve reliability, performance, safety, and cost efficiency of production AI systems. - Partner with Product, Clinical, Data Science, and Engineering teams to deliver AI capabilities. - Own AI deployment, monitoring, and operational excellence. Qualifications - 8+ years of software engineering experience. - 3+ years building AI and ML applications. - Strong Python and/or C# development experience. - Experience deploying AI systems into production environments. - Experience with LLMs, RAG architectures, agent frameworks, and AI evaluation. - Experience with AWS and distributed systems. - Hands-on experience building MLOps infrastructure such as training pipelines, model registries, feature stores, A/B testing, and drift detection. - Strong debugging, observability, and operational skills. Preferred Qualifications - Healthcare industry experience. - Experience with HIPAA, PHI, and regulated environments. - Experience with vector databases, embeddings, and knowledge graphs. - Experience building AI systems at large scale. Success Metrics - Reliable, scalable, and cost efficient AI systems running in production. - Incidents get resolved fast because tracing and replay make root cause obvious. - Cost per workflow stays predictable as volume grows. - Model training, versioning, and retraining run as a managed pipeline rather than a reactive, manual effort. - Strong adoption and measurable business impact. - High trust in AI quality, safety, and operational performance. Physical Demands The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. While performing the duties of this job, the employee is regularly required to speak, hear, read, and type. This is largely a sedentary role. This position requires the ability to occasionally lift office products and supplies up to 40 pounds. Work Environment To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed above are representative of the knowledge, skill and/or ability required. Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time, with or without notice.
Analista de Engenharia de Software – Plataforma de IA
Vivo (Telefônica Brasil)Com a conexão, queremos que você descubra novos pontos de vista e aproveite tudo o que realmente importa.
• Projetar, desenvolver e manter **APIs, SDKs e serviços de plataforma**. • Implementar integrações e camadas de orquestração para agentes e serviços de IA. • Garantir **segurança, performance e observabilidade** em sistemas distribuídos em cloud. • Colaborar com squads multidisciplinares, apoiando decisões técnicas e evolução da arquitetura. • Contribuir para a melhoria contínua de pipelines CI/CD e práticas de **DevOps/Infra as Code**. • Participar de discussões técnicas, propondo soluções alinhadas a boas práticas de engenharia. • Apoiar e orientar desenvolvedores menos experientes, fortalecendo a cultura de qualidade.


