We provide the best software engineering solutions by investing in our people first.

AI/ML Engineer: RAG & API Pipelines

AI EngineerMachine Learning EngineerFull Time Remote Mid LevelTeam 1,001-5,000Since 2003H1B No SponsorCompany Site LinkedIn

Location

Worldwide

Posted

48 days ago

Salary

Seniority

Mid Level

No structured requirement data.

Job Description

Role Description We're looking for a Senior AI/ML Engineer to act as the bridge between data infrastructure and customer-facing AI products. You'll specialize in: - Building low-latency API layers - Production-grade RAG systems - Complex ingestion pipelines - Human-in-the-Loop workflows You'll be working alongside Data Engineers to turn raw data lakes into live AI features. Qualifications - Overall Experience: 7+ years in Backend Software Engineering and AI Application Engineering, including exposure to Distributed Systems - AI & RAG Integration: 2+ years engineering production-grade RAG pipelines, managing vector retrieval context, and implementing secure validation layers for LLMs - Prototyping & Collaboration: Proven track record working synchronously with Data Engineers to rapidly turn raw data lakes and streams into production-ready AI feature prototypes - Proficiency in Advanced API Design & GraphQL Architecture - Proficiency in RAG, Data Flows & Ingestion Pipelines - Proficiency in State Management & Human-in-the-Loop (HITL) Automation - Production fluency in Python - Working knowledge of C# (.NET Core), Java, or Node.js/TypeScript for enterprise ingestion systems Requirements - Token-aware pagination for GraphQL/REST endpoints (LLM context-safe) - Custom Model Context Protocol (MCP) server development - GraphQL schema implementation using Apollo Server or AWS AppSync - Amazon Bedrock APIs (foundational model invocation and chaining) - Knowledge Bases for Amazon Bedrock (chunking, metadata extraction, vector sync from S3) - Hybrid retrieval using OpenSearch/Elasticsearch, pgvector, and MemoryDB / Redis OSS - High-throughput ingestion workers for embedding and vector generation - Ingestion pipelines with Amazon SQS, MSK (Kafka), and log sources - Third-party SaaS analytics API integration (e.g., Pendo, Hotjar, Google Analytics) - Autonomous Bedrock Agents with action groups - Amazon Bedrock Guardrails (prompt injection blocking, PII redaction, safety alignment) - Stateful orchestration with AWS Step Functions or LangGraph - Durable HITL gating workflows (pause, persist state, resume on human approval) - Idempotent processing with automated state rollbacks and dead-letter queues (DLQ) - End-to-end event tracing with OpenTelemetry (oTel) and Datadog Benefits - Remote work - 13 floating holidays - 15 vacation days per year completed - Good working environment Company Description Every qualified candidate who meets the requirements outlined in the job description will be considered in this hiring process without distinction. Furthermore, Jalasoft is an equal opportunity employer. We wholeheartedly embrace our responsibility to make employment decisions without regard to race, age, marital or social status, national origin, disability, sex, gender identity or expression, or any other characteristic or group of candidates or employees unrelated to their qualifications and suitability for the position. Our management is committed to upholding this policy with respect.

Related Categories

AI Engineer Machine Learning Engineer AI Research Scientist LLM Engineer Computer Vision Engineer NLP Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More AI Engineer Jobs

AI/ML Engineer: Agentic Workflows

Jalasoft

We provide the best software engineering solutions by investing in our people first.

AI Engineer48 days ago

Full Time RemoteTeam 1,001-5,000Since 2003H1B No Sponsor

Company Site LinkedIn

Role Description We're looking for a Senior AI/ML Engineer to design and build the foundational AI layers, multi-step agentic workflows, and developer tooling for the broader engineering organization. You'll focus on the internal runtime for AI: - Implementing prompt frameworks - Enforcing deterministic structured outputs - Deploying production-grade Human-in-the-Loop state automations - Turning raw context into reliable, zero-hallucination agent-based actions Qualifications - Overall Experience: 7+ years in Backend Software Engineering and AI/ML Application Engineering - Agentic Architecture: 2+ years engineering multi-step LLM workflows, autonomous agent loops, and multi-agent orchestration frameworks in production - Developer Platform Enablement: experience building internal SDKs, tools, or structured pipeline abstractions used by other product engineering teams to ship AI features rapidly - Proficiency in Multi-Step Agentic Workflows & Tooling - Proficiency in Advanced Amazon Bedrock & Prompt Engineering - Proficiency in Structured Output Pipelines & Performance Ingestion - Proficiency in State Management, Automation Consistency & HITL - Production fluency in Python - Proficiency in TypeScript/Node.js for Vercel AI SDK and AWS edge workers Requirements - Vercel AI SDK, LangGraph, or LlamaIndex for agentic orchestration - Reliable tool-calling (function-calling) interfaces for LLM-driven internal system navigation - Short-term and long-term agent memory architectures with ultra-low-latency in-memory stores (Amazon MemoryDB / Redis OSS / Valkey) - Amazon Bedrock APIs (foundational model invocation and chaining for agent loops) - Programmatic prompt optimization, few-shot prompting, dynamic context injection, and prompt caching - Amazon Bedrock Guardrails (safety gating, prompt injection mitigation, PII redaction) - Type-safe schema enforcement with Pydantic (Python) or Zod (TypeScript) - Ingestion workers consuming Amazon MSK (Kafka), SQS, and log sources - Optimistic Concurrency Control (OCC) and async write-delegation to prevent PostgreSQL deadlocks - Deterministic multi-step pipelines with AWS Step Functions or LangGraph - Durable HITL workflows (pause prior to DB writes or API transactions, persist state, resume on approval) - Idempotent execution layers for safe agent loop retries without record duplication - Working knowledge of C# (.NET Core) or Java for enterprise pipeline integrations Benefits - Remote work - 13 floating holidays - 15 vacation days per year completed - Good working environment Company Description Every qualified candidate who meets the requirements outlined in the job description will be considered in this hiring process without distinction. Furthermore, Jalasoft is an equal opportunity employer. We wholeheartedly embrace our responsibility to make employment decisions without regard to race, age, marital or social status, national origin, disability, sex, gender identity or expression, or any other characteristic or group of candidates or employees unrelated to their qualifications and suitability for the position. Our management is committed to upholding this policy with respect.

View details: AI/ML Engineer: Agentic Workflows

Worldwide

Apply

Engenheiro de IA Sênior

Extractta

EXTRACTTA | Informações que geram Soluções

AI Engineer49 days ago

Full Time RemoteTeam 201-500Since 2005H1B No Sponsor

Company Site LinkedIn

• Mapear oportunidades em crédito e cobrança onde agentes de IA podem substituir ou ampliar processos manuais, considerando fluxos de trabalho, riscos operacionais e critérios de sucesso em ambientes regulados • Projetar e manter arquiteturas de **Retrieval Augmented Generation (RAG)**, alimentando agentes com informações precisas e atualizadas • Gerenciar pipelines de ingestão de dados não estruturados (PDFs, áudios, textos, imagens), incluindo OCR e extração de entidades • Desenvolver e orquestrar sistemas multi-agente utilizando frameworks como **LangGraph, CrewAI ou AutoGen** • Implementar padrões arquiteturais como **ReAct, Plan-and-Execute e Reflexion** • Integrar agentes com APIs internas, sistemas legados e fontes de dados de crédito e cobrança • Estruturar e versionar prompts de sistema, instrução e contexto com técnicas avançadas (**chain-of-thought, few-shot prompting, structured outputs, self-consistency**), garantindo comportamentos determinísticos e auditáveis • Avaliar a qualidade das respostas dos agentes com métricas especializadas (**Faithfulness, Answer Relevancy, Context Precision, Hallucination Rate**) • Implementar **guardrails** e camadas de segurança para prevenção de alucinações, prompt injection e vazamento de dados sensíveis, em conformidade com a LGPD • Operar pipelines de **LLMOps** para monitoramento contínuo de custos, latência, qualidade e drift comportamental dos agentes em produção • Garantir versionamento de prompts, modelos e bases de conhecimento com rastreabilidade completa • Acompanhar o desempenho dos agentes em produção, iterando sobre base de conhecimento, lógica de orquestração e estratégias de fallback • Medir o impacto em KPIs de negócio (redução de tempo de análise de crédito, aumento na taxa de recuperação, melhoria na experiência do cliente) • Elaborar documentação técnica conforme padrões internos e exigências regulatórias, garantindo auditabilidade das decisões automatizadas

Docker Kubernetes Python React

View details: Engenheiro de IA Sênior

Brazil

Apply

Job Closed

Senior AI Engineer

Affinity.co

The CRM for private capital—with Relationship Intelligence.

AI Engineer49 days ago

Full Time RemoteTeam 201-500Since 2014

Company Site LinkedIn

• Architect, prototype, and deploy RAG pipelines, combining vector search, hybrid retrieval, reranking and contextual compression techniques. • Contribute to design and orchestration of multi-agent LLM systems using community frameworks and custom orchestration layers. • Work on a variety of information extraction, information storage and information retrieval problems for both structured and unstructured data. • Partner with cross-functional (product, infra, data engineering, and software engineering) to build robust, high-scale systems that underlie all of our data processing and ML Operations.

View details: Senior AI Engineer

Canada

$160K - $220K / year

Apply

Job Closed

Senior AI Engineer – Agentes, Plataforma & RAG

Leega

Inteligência, Inovação e Tecnologia.

AI Engineer49 days ago

Full Time RemoteTeam 201-500Since 2010H1B No Sponsor

Company Site LinkedIn

• Você vai construir a plataforma de agentes de IA da Guanabara e o ecossistema que distribui IA generativa governada à companhia. • Orquestração de agentes — construir agentes com LangGraph no padrão Supervisor/ReAct, com memória de curto/longo prazo e estado durável (PostgresSaver). • Camada semântica e RAG — evoluir a camada semântica determinística (Cube.js) e a recuperação semântica com Qdrant (busca vetorial + re-ranking) que ancoram as respostas em dado real. • Skills, subagentes e MCP — criar e versionar skills (SKILL.md), subagentes e ferramentas MCP (FastMCP) para o Claude Code, em fluxo spec-driven, conectando agentes a sistemas corporativos. • Verificação e avaliação — manter o verificador (LLM-as-judge) e suítes de eval (offline/online) com amostragem de traces para pegar drift de qualidade. • Plataforma de modelos — operar o gateway LiteLLM (Virtual Keys, budget, RBAC, roteamento multi-provider) otimizando custo e latência. • Adaptação e serving de modelos — quando RAG e prompting não bastam, fazer fine-tuning supervisionado e PEFT (LoRA/QLoRA), quantização e destilação; servir modelos abertos/privados com vLLM e roteamento SLM↔LLM. • Governança e segurança — aplicar guardrails de PII (Presidio), defesa contra prompt injection e observabilidade ponta a ponta (Langfuse).

JavaScript Neo4j Python React

View details: Senior AI Engineer – Agentes, Plataforma & RAG

Brazil

Apply

AI/ML Engineer: RAG & API Pipelines

Job Description

Related Guides

Related Categories

Related Job Pages

More AI Engineer Jobs

AI/ML Engineer: Agentic Workflows

Engenheiro de IA Sênior

Senior AI Engineer

Senior AI Engineer – Agentes, Plataforma & RAG