Dataiku

Everyday AI, Extraordinary People

Generative AI Engineer

LLM EngineerMachine Learning EngineerFull Time Remote SeniorTeam 1,001-5,000H1B SponsorCompany Site LinkedIn

Location

New York

Posted

21 days ago

Salary

$160K - $240K / year

Seniority

Senior

Bachelor DegreeEnglishAWS Azure JavaScript Node.js Python Vue.js Go

Job Description

• Design end-to-end AI solutions on Dataiku's platform, leveraging Dataiku Agent Hub, Prompt Studio, LLM Mesh, and Knowledge Banks (Vector Stores), or Python-based frameworks where needed. • Build and orchestrate multi-agent systems using Dataiku's Visual Agents (simple and structured), as well as code-based frameworks (LangGraph, CrewAI, Claude Agent SDK, OpenAI Agents SDK) as appropriate. • Integrate and optimize LLM APIs across providers (OpenAI, Anthropic, Google Gemini, AWS Bedrock, Azure, open-source models via Dataiku's LLM Mesh), applying model routing strategies to balance cost, latency, and quality. • Implement Retrieval-Augmented Generation (RAG) pipelines, including agentic RAG and GraphRAG, using Dataiku's Knowledge Banks with reranking, dynamic filtering, and document extraction capabilities. • Work exclusively with the Marketing organisation, partnering across functions such as Demand Generation, Content Marketing, Product Marketing, Field Marketing, Marketing Operations, Brand, and Communications. • Engage marketing stakeholders to gather business requirements, then go further: identify the underlying user or team pain points those requirements represent, and design solutions that address both the stated need and the deeper problem. • Own projects end-to-end, from requirements intake and solution design through to build, deployment, and handover. • Develop autonomous and semi-autonomous AI agents, using Dataiku's Agent Builder, custom Python-based architectures (LangGraph, CrewAI, Claude Agent SDK, etc.), or a combination of both. Exercise judgment on when to leverage platform capabilities and when to build custom solutions. • Design and build Agent Tools beyond documented examples, including custom API integrations, data retrieval modules, decisioning logic, and automated workflows, pushing past out-of-the-box patterns to deliver solutions tailored to specific business problems. • Build, publish, and consume MCP (Model Context Protocol) servers to enable agent-to-tool integration across systems, including designing custom MCP servers where needed. • Develop evaluation and monitoring approaches for agent systems, combining Dataiku's built-in capabilities with custom instrumentation to measure reliability, accuracy, cost, and business impact in production. • Design and maintain evaluation frameworks (evals) for LLM-based systems, measuring accuracy, latency, cost, and reliability in production. • Adhere to data governance, security, and regulatory compliance requirements (EU AI Act awareness, responsible AI practices) for all AI solutions. • Leverage Dataiku's Cost Guard and Quality Guard features to manage LLM spend, enforce usage policies, and maintain output quality standards. • Work closely with analytics and data engineering teams to maintain metadata on reference datasets for LLM consumption. • Create front-end user interfaces for AI applications using HTML, CSS, and JavaScript, within Dataiku's webapps framework, Dataiku Answers for chat-based interfaces, or standalone applications built with Vue.js and Node.js. • Collaborate on UX design, ensuring internal stakeholders find AI solutions intuitive and responsive. • Provide product feedback to the development team to improve the platform. • Stay current with the rapidly evolving AI engineering landscape, agent frameworks, model capabilities, evaluation practices, governance requirements, and tools like MCP and A2A protocols.

Job Requirements

Must have strong Python skills (including familiarity with typical data science and AI engineering libraries).
Must have hands-on experience building agentic AI systems, multi-agent orchestration, tool chaining, autonomous decision-making, and production deployment of AI agents.
Experience with Go-to-Market motions, nomenclature, and technology, including Salesforce, HubSpot, 6sense, Gong, Outreach, and related tools.
Experience with modern agent orchestration frameworks (LangGraph, CrewAI, Claude Agent SDK, OpenAI Agents SDK, or similar); familiarity with LangChain is still relevant but not sufficient on its own.
Understanding of RAG architectures (vector databases, embedding strategies, agentic RAG, GraphRAG) and when to apply each approach.
Familiarity with MCP (Model Context Protocol) for agent-to-tool integration, or demonstrated ability to quickly adopt new integration standards.
Experience with structured outputs, function/tool calling, and prompt engineering across multiple LLM providers.
Web development fundamentals (HTML, CSS, JavaScript); experience with Vue.js and Node.js preferred.
Exposure to AI evaluation practices, building evals, monitoring model/agent performance in production, and iterating based on metrics.
Comfort with AI-assisted development tools (GitHub Copilot, Cursor, Claude Code, or similar).
Familiarity with Dataiku a bonus.

Benefits

stock options
medical, dental, and vision plans
flexible spending accounts
pre-tax commuter benefits
401k company match
paid vacations and sick leave
paid parental leave
employer paid disability coverage
additional health and wellbeing perks and benefits

Related Categories

LLM Engineer AI Engineer Machine Learning Engineer AI Research Scientist Computer Vision Engineer NLP Engineer

Related Job Pages

LLM Engineer Jobs in New York Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More LLM Engineer Jobs

Senior Software Engineer, DGX Cloud AI Infrastructure

NVIDIA

LLM Engineer22 days ago

Full Time RemoteTeam 10,001+Since 1993H1B Sponsor

Company Site LinkedIn

• Lead bring-up, validation, and debugging of large-scale AI clusters, infrastructure, and end-to-end workloads, setting the standard for how the team operates. • Bring up, tune, and benchmark AI pre-training, post-training, and inference workloads using PyTorch, NeMo / Megatron, TensorRT-LLM, and adjacent NVIDIA AI software stacks. • Profile and optimize end-to-end workload performance across compute, memory, networking, and communication layers using tools such as Nsight Systems, NCCL tests, and custom microbenchmarks. • Analyze scaling efficiency for distributed LLM workloads using data, tensor, pipeline, and expert parallelism across modern GPU clusters, and translate findings into concrete tuning guidance. • Own root-cause analysis of complex failures — hangs, performance regressions, topology sensitivity in large distributed environments. • Define and build the resilience and failure-attribution stack: detecting, triaging, and attributing node, fabric, and workload failures across the cluster at scale. • Build repeatable benchmark suites, automation, acceptance criteria, and qualification workflows on new platforms. • Tune runtime settings, communication parameters, and deployment configurations in close partnership with framework, systems, and platform teams. • Deliver actionable, data-driven recommendations based on profiling, benchmark results, and cluster characterization. • Mentor engineers, drive technical standards, and act as a force multiplier across the broader performance and infrastructure organization.

Distributed Systems Node.js Python PyTorch

View details: Senior Software Engineer, DGX Cloud AI Infrastructure

California + 3 more

$184K - $356.5K / year

Apply

Generative AI Engineer – LATAM Candidates Only

Talentus Global

We facilitate talent & software solutions across the globe. Near-shore, managed services, ERP's, CRM's, EdTech/HigherEd.

LLM Engineer23 days ago

Full Time RemoteTeam 201-500Since 2020H1B No Sponsor

Company Site LinkedIn

• Design, develop, and deploy Generative AI solutions leveraging Large Language Models (LLMs) and multimodal AI technologies. • Build and maintain scalable AI applications using cloud platforms such as Azure, AWS, or GCP. • Develop and optimize Retrieval-Augmented Generation (RAG) architectures, vector databases, and knowledge retrieval systems. • Fine-tune, evaluate, and monitor foundation models to improve performance, accuracy, and reliability. • Implement prompt engineering strategies and AI orchestration frameworks to support business use cases. • Collaborate with software engineering, data science, DevOps, and security teams to integrate AI solutions into production environments. • Develop APIs, microservices, and AI-powered applications following software engineering best practices. • Ensure compliance with AI governance, security, privacy, and responsible AI standards. • Monitor AI workloads, model performance, and operational costs, recommending continuous improvements. • Stay current with emerging Generative AI technologies, frameworks, and industry trends.

AWS Azure Cloud Docker Google Cloud Platform Kubernetes Microservices Python

View details: Generative AI Engineer – LATAM Candidates Only

Colombia

Apply

Senior AI Infrastructure Engineer

Pyyne

LLM Engineer24 days ago

Full Time RemoteTeam 51-200Since 2020H1B No Sponsor

Company Site LinkedIn

• Design and deploy AI platforms that integrate with infrastructure tools • Develop AI-powered workflows to automate operational tasks • Build AI-driven automation for incident response and operational workflows • Implement AI-powered monitoring and anomaly detection capabilities • Create intelligent operational dashboards with actionable insights • Ensure AI platforms operate reliably in production environments • Develop AI solutions for cost optimization and predictive capacity planning

Ansible AWS Azure Cloud Google Cloud Platform ITSM Python Terraform

View details: Senior AI Infrastructure Engineer

Brazil

Apply

LLM Engineer, Freelancer

Monterail

Delivering Innovative Software

LLM Engineer24 days ago

Part Time RemoteTeam 51-200Since 2011H1B No Sponsor

Company Site LinkedIn

• Add AI functionality into existing Python, Node.js, and Ruby codebases • Build LLM-powered features: chat, summaries, classification, smart search, document Q&A • Design lightweight RAG pipelines using embeddings and vector search • Work with vector DBs (pgvector, Pinecone, Qdrant) • Implement safe, reliable LLM endpoints (OpenAI, Anthropic, Azure) • Work directly with clients to shape AI features and reduce manual effort • Advise clients when NOT to use AI and navigate trade-offs around latency, accuracy, and cost

Azure JavaScript Node.js Python Ruby

View details: LLM Engineer, Freelancer

Poland

Apply

Generative AI Engineer

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More LLM Engineer Jobs

Senior Software Engineer, DGX Cloud AI Infrastructure

Generative AI Engineer – LATAM Candidates Only

Senior AI Infrastructure Engineer

LLM Engineer, Freelancer