Best MCP servers for observability
Observability is about answering questions you did not predict in advance: why is latency up, which release introduced the regression, what does this trace tell us about a slow request. An agent that can query your metrics, logs, and traces directly turns those questions into answers without dashboard archaeology. The right servers depend on your stack (a full-platform vendor, a high-cardinality tracing tool, an open dashboard-and-Prometheus layer, or an event-based analytics engine), but the recurring need is the same: let the agent run real queries against live telemetry. The servers below cover the common shapes; each is a real MCP server with a verified, current install config.
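At the protocol level, "running a real query" means the agent's MCP client sends the server a JSON-RPC 2.0 `tools/call` request. The sketch below shows roughly what that message looks like. The tool name `search_logs` and its argument names are hypothetical, chosen only for illustration; each server publishes its own tool names and input schemas.

```python
import json
from itertools import count

# JSON-RPC request ids should be unique within a session.
_ids = count(1)

def tool_call_request(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(message)

# Hypothetical tool name and arguments (assumptions, not any vendor's real schema).
request = tool_call_request(
    "search_logs",
    {"query": "status:error service:checkout", "from": "now-15m"},
)
print(request)
```

In practice the agent's MCP client builds this message for you; the point is only that every server below is driven through the same call shape, with different tools behind it.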
Datadog
Datadog
Datadog's official remote MCP server lets agents search logs, query metrics, pull APM traces, inspect monitors, and investigate incidents.
Datadog's official server lets an agent search logs, query metrics, pull APM traces, and inspect monitors, giving broad coverage of the observability surface from one tool.
Grafana
Grafana Labs
Grafana Labs' official MCP server: query dashboards, Prometheus, Loki, incidents, alerts, and OnCall from your agent.
Grafana's official server queries dashboards, Prometheus, Loki, alerts, and OnCall, which makes it ideal for teams whose observability is built on the open Grafana stack.
Honeycomb
Honeycomb
Honeycomb's official MCP server: query traces, metrics, and logs, run BubbleUp, and manage Boards, Triggers, and SLOs from your agent.
Honeycomb's official server queries high-cardinality traces and metrics and runs BubbleUp. It is strongest when you need to slice events by arbitrary dimensions to find the outlier.
Prometheus
pab1it0
A maintained Prometheus MCP server: run PromQL instant and range queries, discover metrics and metadata, and inspect scrape targets from your agent.
The Prometheus server lets an agent run PromQL directly against your metrics. It is the lowest-level option here, best suited to teams that run their own Prometheus instances.
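To make "instant and range queries" concrete: Prometheus exposes them over its HTTP API as `/api/v1/query` and `/api/v1/query_range`. The sketch below only builds the request URLs and sends nothing. The base URL, the example PromQL expression, and the helper names are assumptions for illustration, and it is also an assumption that an MCP server in front of Prometheus issues requests of roughly this form.

```python
from urllib.parse import urlencode

def instant_query_url(base_url: str, promql: str) -> str:
    """URL for an instant query: evaluate the expression at a single point in time."""
    return f"{base_url.rstrip('/')}/api/v1/query?" + urlencode({"query": promql})

def range_query_url(base_url: str, promql: str, start: int, end: int, step: str) -> str:
    """URL for a range query: evaluate the expression at each step between start and end."""
    params = {"query": promql, "start": start, "end": end, "step": step}
    return f"{base_url.rstrip('/')}/api/v1/query_range?" + urlencode(params)

# Assumed local Prometheus address and an example expression.
BASE = "http://localhost:9090"
EXPR = 'sum(rate(http_requests_total{status="500"}[5m]))'

print(instant_query_url(BASE, EXPR))
# start and end are Unix timestamps; step is the resolution between samples.
print(range_query_url(BASE, EXPR, start=1700000000, end=1700003600, step="30s"))
```

An instant query answers "what is the error rate now", while a range query returns the series you would plot to see when it started climbing.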