Job Closed

This listing is no longer active.

Datadog

Datadog offers a monitoring and security platform to help cloud-based companies keep their systems secure and their customers happy. As an employer, Datadog enc

Senior AI Engineer – APM Integrations

AI Engineer | Machine Learning Engineer | Full Time | Remote | Senior | Team 6,500 | Since 2010

Location

Germany

Posted

132 days ago

Salary

Not specified

Seniority

Senior

6 yrs exp | English | Java | Microservices | .NET

Job Description

  • Build agent workflows that take an integration need from plan to implementation and validation with humans approving at the right checkpoints.
  • Create systems that synthesize context from codebases, docs, specs, telemetry, and historical incidents to make changes that match Datadog conventions and customer expectations.
  • Generate and evolve integration code and tests, including end-to-end scenarios that reflect real customer workloads and product features.
  • Design evaluation harnesses that prevent silent regressions: golden sets, scenario baselines, semantic checks, performance thresholds, and release gating.
  • Build portfolio-level automation: proactive updates for upstream breaking changes, tracer feature rollouts across the catalog, migrations to new schemas/semantics, and targeted coverage expansion.
  • Partner tightly with PM, support engineers, and integration-owning teams to make the system adoptable, trustworthy, and embedded in daily engineering workflows.

Job Requirements

  • 6+ years building backend systems (Go, Java, or .NET) with strong focus on simplicity, correctness, and performance.
  • Proven experience delivering LLM/agent features to production (prompting, tooling, evals, safety/guardrails).
  • Comfortable navigating ambiguity, iterating from prototype to production, and measuring impact with clear metrics.
  • Solid grasp of the ML lifecycle (task definition, dataset construction, modeling, evaluation, deployment, iteration) and statistics for experiments.
  • Fluency with offline/online evals: golden sets, automated regressions, and evaluation harnesses that prevent silent quality drift.
  • Experience with microservices performance: tracing, latency breakdowns, concurrency, resiliency patterns.
  • Production operations mindset: monitoring, alerting, and participating in on‑call rotations where applicable.
  • Hands‑on with distributed tracing stacks (OpenTelemetry/Datadog APM), profilers, and logs/metrics pipelines (bonus).
  • Experience with planning/agent frameworks, tool‑use orchestration, RAG, and retrieval/indexing over large context (bonus).
  • Experience building developer tools (IDEs, static analysis, compilers, code transformation pipelines) or code-generation systems (bonus).
  • Familiarity with semantic conventions / schema migrations, scenario-based testing, and cross-language porting at scale (bonus).

Benefits

  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
  • Continuous professional development, product training, and career pathing
  • Intradepartmental mentor and buddy program for in-house networking
  • An inclusive company culture and the ability to join our Community Guilds (Datadog employee resource groups)
  • Access to Inclusion Talks, our internal panel discussions
  • Free, global mental health benefits for employees and dependents age 6+
  • Competitive global benefits
