Skip to main content
Glama
us-all

datadog-mcp-server

incident-triage-snapshot

Aggregate incident metadata, related events, similar past incidents, and log spike detection into a single structured response for faster incident triage.

Instructions

Aggregated incident triage: get-incident + get-events (lookback window) + search-incidents (similar on same service, last 14d) + aggregate-logs (error spike on incident's service) in one call. Replaces the 5-step triage-incident Prompt orchestration with a single structured response (incident metadata, related events, similar past incidents, log-spike heuristic, and a summary block with severity/duration/spike flag). Uses Promise.allSettled — per-fetcher failures populate caveats[].

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
incidentIdYesIncident ID. Use list-incidents or search-incidents to find one.
lookbackMinutesNoMinutes before incident creation to scan for related signals (default 60, max 720)
serviceNoOverride the service tag scan. By default, derived from incident.fields.services[0].
includeLogSpikeNoRun an aggregate-logs spike detection over the window for the incident's service
includeSimilarNoSearch for incidents on the same service in the last 14 days
extractFieldsNoComma-separated dotted paths to project from response (e.g. 'id,name,owner.name,columns.*.name'). Use `*` as wildcard for arrays/objects. Wrap field names with dots in backticks. Reduces response tokens dramatically on large entities.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses that the tool uses Promise.allSettled and that per-fetcher failures populate a caveats[] array, providing transparency about partial failures. It also lists the sub-tools included. Security or rate limit information is not mentioned, but the read-only nature is implied.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single dense paragraph that efficiently conveys purpose, components, and response structure. It is front-loaded with the core purpose. Could be slightly more structured with bullets for readability, but still clear and not overly verbose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite having no output schema, the description thoroughly explains the response structure: incident metadata, related events, similar past incidents, log-spike heuristic, summary block with severity/duration/spike flag, and caveats array. This completeness allows the agent to understand what to expect without needing an output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description adds significant meaning beyond the schema. For each parameter, it provides context: incidentId tells how to find it, lookbackMinutes gives defaults and max, service explains override behavior, includeLogSpike and includeSimilar describe toggling options, and extractFields explains syntax and benefit.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly defines the tool as an aggregated incident triage that combines multiple specific queries (get-incident, get-events, search-incidents, aggregate-logs) into one call. It distinguishes itself from sibling tools by explicitly replacing a 5-step orchestration and providing a single structured response.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description states that this tool replaces the `triage-incident` Prompt orchestration, implying when to use it. It also directs users to list-incidents or search-incidents for finding the incident ID. However, it does not explicitly mention when not to use it or provide alternatives beyond the replaced orchestration.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/us-all/datadog-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server