About the Role

Datadog AI Research (DAIR) seeks an AI Research Engineer to collaborate with research scientists on building systems for observability foundation models, SRE autonomous agents, and production code repair agents. The role involves developing training and evaluation pipelines, implementing models at scale, and integrating AI capabilities into Datadog's product ecosystem.

Responsibilities

Build and operate datasets, training and evaluation pipelines, benchmarks, and internal tooling
Implement models, run experiments at scale, and profile for reliability, performance, and cost
Orchestrate distributed training and distributed RL with Ray, including scheduling, scaling, and failure recovery
Make the research stack observable, reproducible, and easier to use
Establish rigorous automated benchmarks and regression tests for forecasting, anomaly detection, multi-modal analysis, agents, and code repair tasks
Collaborate with Research Scientists, Product, and Engineering to integrate advanced AI capabilities into Datadog’s product ecosystem and to harden prototypes into reliable services
Contribute high-quality code, documentation, and open-source artifacts

Requirements

Strong software engineering skills with experience in observability, SRE, or security
Depth in distributed computing and ML systems for training and inference at scale

Wfh

San Francisco · US · 8+ employees

WFH.team provides remote job intelligence for candidates and employers, offering confirmed remote job listings and resume-based matching. They also provide various employer-facing hiring tools and public resources for remote work workflows.

Machine Learning Engineer/Machine Learning Scientist , Multi Modality

Altos Labs

SAN DIEGO · US

$140K–$273K

SAN DIEGO · US

$140K–$273K

AI Research Engineer – Datadog AI Research (DAIR)

About the Role

Responsibilities

Requirements

Wfh

Staff Solutions Architect, AI (Remote)

Research Engineer, AI for Science

AI Engineer

Related Jobs

AI Engineer & Researcher - Inference

AI Research Engineer

Nice to Have

Tech Stack

DataOps / MLOps Engineer (Strong DevOps Focus)

AI Research Manager/Scientist, Post-training

Machine Learning Engineer/Machine Learning Scientist , Multi Modality

Senior Research Engineer - Cohere

Principal Research Scientist – Foundation Models for Vision AI & Physical AI

[Remote] AI/ML Research Engineer, LLM Post-Training & Evaluation