Vol. I · Field Notes

Datadog | The Monitor blog

This is the official engineering blog of Datadog, a leading cloud monitoring and security platform. The blog covers how Datadog builds and operates its own massive infrastructure, from Kubernetes and databases to AI and security. It's a great read for anyone curious about real-world observability at scale, especially if you use or evaluate Datadog's products.

9 May 2026 · 100 posts · 9 clusters
Reading Posture
From the Field
Datadog's engineering blog: monitoring everything, everywhere, all at once.
Verdict: Reach for it
Reach for it when

Read this when you want to understand how a top-tier observability platform builds, secures, and scales its own infrastructure and products.

Look elsewhere when

Skip it if you need vendor-neutral advice or deep dives into non-Datadog ecosystems.

In context

Compared to other vendor engineering blogs, this one offers unusually deep, platform-spanning technical content with a strong emphasis on AI and security.

Complexity: ●●● Heavy
Read time: ~1200 minutes
Language: Blog
Runtime: web
Dependencies: 0 total

Start Here

A recommended reading path through the blog

  1. 01

    Start here because it showcases their cutting-edge work on LLM observability, a top topic.

  2. 02

    A classic infrastructure deep dive that demonstrates their operational expertise.

  3. 03

    Core to their security narrative, showing how they handle real-time threat intelligence.

  4. 04

    Excellent example of their database performance debugging content.

  5. 05

    Key for understanding their APM and instrumentation strategy.

  6. 06

    Illustrates their code security and DevSecOps approach.

  7. 07

    Advanced topic showing how they integrate cost analysis with infrastructure-as-code.

What's inside

9 sections of the blog

Posting History

Activity over time

Posting activity: 100 posts, 2026-02 to 2026-05

The Archive

Every post, searchable and filtered

All Posts: 100 of 100
2026-05

Diagnose and resolve database performance issues faster with Database Investigator

6m

Datadog Database Monitoring introduces Database Investigator, an agentic feature that surfaces root causes and remediation steps for database performance issues.

Database & Query Performance · #database #performance
2026-05

Analyze cloud costs with flexible spreadsheets in Datadog Sheets

5m

Datadog Sheets enables flexible spreadsheet-style analysis of live cloud cost data within Cloud Cost Management.

Cloud Cost & Governance · #product-engineering #developer-tools
2026-05

Datadog for Government achieves FedRAMP® High certification

4m

Datadog for Government achieves FedRAMP High certification to support sensitive agency workloads with unified observability, security, and NIST compliance.

Security & Compliance · #security #infra
2026-05

Turn security signals into structured investigations with Case Management in Datadog Cloud SIEM

5m

Datadog Cloud SIEM's Case Management provides end-to-end workflows to transition from security signals to structured investigations.

Security & Compliance · #security #incident-report
2026-05

Inside Datadog’s AI Research Lab: Meet two PhD candidates behind Toto

7m

Two PhD candidates at Datadog's AI Research Lab discuss their contributions to Toto, a timeseries foundation model.

AI & LLM Observability · #ml-infra #culture
2026-05

Monitor and optimize Supabase query performance with Datadog Database Monitoring

5m

Datadog Database Monitoring provides Supabase developers with query-level visibility, explain plans, and one-click setup for diagnosing performance issues.

Database & Query Performance · #database #performance
2026-05

This Month in Datadog - April 2026

3m

April 2026's This Month in Datadog covers the MCP Server, Datadog Experiments, Bits AI Security Analyst, and more.

Cloud Cost & Governance · #product-engineering #developer-tools
2026-05

Add dynamically updating context to logs with Reference Tables and Observability Pipelines

6m

Datadog Reference Tables and Observability Pipelines enable central enrichment of logs before routing to SIEM or data lake destinations.

Observability & APM · #observability #infra
2026-04

Introducing ARFBench: A time series question-answering benchmark based on real incidents

7m

ARFBench is a time series question-answering benchmark built from real Datadog incidents to evaluate AI anomaly reasoning.

AI & LLM Observability · #ml-infra #incident-report
2026-04

Test network paths with TCP, UDP, and ICMP in Datadog

5m

Datadog supports TCP, UDP, and ICMP protocols in network path testing to diagnose application performance issues.

Engineering & Developer Tools · #networking #performance
2026-04

The product signal latency gap slowing your growth

6m

The post discusses latency between product signals in experiments and how prioritizing fixes can drive growth.

Product Analytics & Experimentation · #product-engineering #performance
2026-04

How to investigate cloud credential compromise with Bits AI Security Analyst

6m

Bits AI Security Analyst handles time-intensive steps in cloud credential compromise investigations, letting engineers focus on human judgment.

AI & LLM Observability · #security #incident-report
2026-04

Turn developer feedback into operational insight with Datadog Forms and Sheets

4m

Datadog Forms and Sheets collect structured developer feedback and analyze it alongside operational data.

Product Analytics & Experimentation · #dx #developer-tools
2026-04

Evaluate, optimize, and secure your Google Cloud AI stack with Datadog

6m

Datadog helps Google Cloud teams evaluate AI agents, optimize GPU/TPU infrastructure, and strengthen security.

Cloud Cost & Governance · #ml-infra #security
2026-04

Bringing observability data hosting to the UK on AWS

4m

Datadog's UK availability zone on AWS enables organizations to host observability data in the UK with end-to-end visibility.

Cloud Cost & Governance · #infra #observability
2026-04

Steganography at scale: Embedding share URLs in Datadog widget screenshots

8m

Datadog embeds widget metadata into screenshots using invisible watermarks for self-describing visualizations at scale.

Engineering & Developer Tools · #deep-dive #scaling
2026-04

Identify and fix code issues faster with Datadog’s Azure DevOps Source Code integration

5m

Datadog's Azure DevOps Source Code integration enables code health analysis, accelerated troubleshooting, and quality enforcement.

Code Security & Quality · #developer-tools #dx
2026-04

Centralize observability management with Datadog Governance Console

5m

Datadog Governance Console centralizes usage insights and automates policy enforcement to reduce risk and control costs.

Cloud Cost & Governance · #infra #product-engineering
2026-04

Every team should be A/B testing

5m

The post argues that A/B testing is valuable for a wide variety of engineering purposes beyond growth and product.

Product Analytics & Experimentation · #culture #product-engineering
2026-04

Spotting CI/CD misconfigurations before the bots do: Securing GitHub Actions with Datadog IaC Security

5m

Datadog IaC Security catches GitHub Actions misconfigurations in the diff before they reach production.

Security & Compliance · #security #infra
2026-04

Route OTel data from AI apps to ClickHouse and Datadog using Observability Pipelines

8m

Datadog Observability Pipelines helps teams transform and normalize logs and metrics from OpenTelemetry for routing to ClickHouse and Datadog.

Observability & APM · #observability #infra
2026-04

Manage service tracing across hosts with Single Step Instrumentation rules

6m

Single Step Instrumentation rules allow control over which services are traced by Datadog APM to reduce unnecessary trace data.

Observability & APM · #performance #developer-tools
2026-04

Detect runtime threats in Python Lambda functions with Datadog AAP

7m

Datadog App and API Protection provides in-process security monitoring for Python AWS Lambda functions to detect application-level attacks.

Cloud Cost & Governance · #security #serverless
2026-04

Offline evaluation for AI agents: Best practices

9m

Best practices for running offline evaluations to optimize AI agents in pre-production.

AI & LLM Observability · #ml-infra #tutorial
2026-04

Introducing our open source AI-native SAST

7m

Datadog's open source SAST solution uses AI to surface code vulnerabilities more accurately and efficiently.

Code Security & Quality · #open-source #security
2026-04

Integrate Recorded Future threat intelligence with Datadog Cloud SIEM

6m

The Recorded Future integration enriches logs, ingests alerts, and prioritizes threats in Datadog Cloud SIEM with real-time intelligence.

Security & Compliance · #security #integration
2026-04

Instrument and monitor Boomi integration flows with OpenTelemetry and Datadog

8m

Instrument Boomi integration flows with OpenTelemetry and Datadog to collect and correlate process, JVM, and database telemetry.

Observability & APM · #observability #tutorial
2026-04

Platform engineering metrics: What to measure and what to ignore

10m

Guidance on which platform engineering metrics to collect and how to interpret them to quantify the platform's impact on software delivery.

Observability & APM · #culture #product-engineering
2026-04

Not all index scans are equal: How we cut query latency by over 99%

12m

How misaligned predicates and column order hurt index scan performance and how to detect this pattern using DBM to cut query latency by over 99%.

Database & Query Performance · #database #performance
2026-04

CI/CD security: How to secure your GitHub ecosystem

8m

Applying a detection-based threat model to secure the GitHub ecosystem by identifying key inputs, identities, and associated risks.

Security & Compliance · #security #tutorial

Datadog | The Monitor blog — Blog Dispatch · Archaeologist