Observability
Tools and platforms for comprehensive system observability including logs, metrics, and traces. Enables monitoring, troubleshooting, and performance analysis of applications and infrastructure.
MCP ServersBrowse all →
AlicenseBqualityCmaintenanceEnables AI assistants to interact with Rollbar error tracking and monitoring service, allowing them to diagnose issues, view item details, list deployments, retrieve session replays, and update item properties through natural language commands.Last updated81,99225MIT
LangSmith MCP Serverofficial
AlicenseAqualityDmaintenanceEnables language models to access LangSmith observability platform features including fetching conversation history, managing prompts, retrieving traces and runs, working with datasets and examples, and analyzing experiments.Last updated13107- AlicenseAqualityBmaintenanceCryptographic accountability for AI agents. Ed25519-signed receipts for every MCP tool call. Constraints, chains, AI judgment, invoicing, and local dashboard included.Last updated824MIT

Scout Monitoring MCPofficial
AlicenseAqualityCmaintenanceEnables AI assistants to access Scout Monitoring performance and error data through Scout's API. Provides traces, errors, metrics, and insights for Rails, Django, FastAPI, Laravel and other applications to help identify and fix performance issues like N+1 queries, slow endpoints, and memory bloat.Last updated1328MIT
Logfire MCP Serverofficial
AlicenseAqualityFmaintenanceA Model Context Protocol server that enables LLMs to retrieve and analyze OpenTelemetry traces and metrics from Logfire, supporting exception tracking and custom SQL queries against telemetry data.Last updated4161MIT- AlicenseAqualityBmaintenanceAriadne's thread — a way out of the microservice maze. Local cross-service semantic chain hinter for microservices (GraphQL/HTTP/Kafka/frontend)Last updated145MIT
- AlicenseAqualityBmaintenanceEvery agent action is recorded in a SHA-256 hash chain. Prove to clients that your agent did what it said it did. Record, query, verify, and export agent activity.Last updated353151MIT
- AlicenseAqualityBmaintenanceProvides read-only access to Jaeger distributed tracing data through the Model Context Protocol. Enables Claude and other MCP-capable agents to search traces, inspect spans, and analyze service dependencies directly within conversations.Last updated25MIT
- AlicenseAqualityBmaintenanceMCP-native agent evaluation and observability server. Log traces, evaluate output quality with 12 built-in rules (PII detection, prompt injection, cost thresholds), and track agent costs. Real-time dashboard, OTel-compatible spans. Self-hosted, MIT licensed.Last updated29586MIT
- AlicenseAqualityBmaintenanceConnects AI assistants to Warpmetrics telemetry data to monitor AI agent performance, execution runs, and LLM costs. It allows users to query success rates, latency, and spend metrics directly through natural language interfaces.Last updated2014MIT

mcp-server-cloudflareofficial
AlicenseAqualityBmaintenanceLets you use Claude Desktop, or any MCP Client, to use natural language to accomplish things on your Cloudflare account.Last updated23,3653,707Apache 2.0- AlicenseAqualityBmaintenanceMCP server for Portkey Admin API - 116 tools for prompts, configs, analytics & more.Last updated67100214MIT

Foreman MCP Serverofficial
AlicenseBqualityCmaintenanceEnables interaction with Foreman instances to manage systems through the Model Context Protocol. It provides access to Foreman resources and tools, such as security update reports, directly within AI-powered environments like VSCode and Claude Desktop.Last updated58GPL 3.0
AgentOps MCPofficial
AlicenseBqualityCmaintenanceThe AgentOps MCP server provides access to observability and tracing data for debugging complex AI agent runs. This adds crucial context about where the AI agent succeeds or fails.Last updated411014MIT- AlicenseAqualityCmaintenanceA professional-grade MCP server that enables AI assistants to monitor web services, APIs, and HTTP endpoints with enterprise-level security.Last updated11MIT

Sentry MCP Serverofficial
FlicenseBqualityDmaintenanceA Model Context Protocol server that lets AI assistants interact with the Sentry API to retrieve and analyze error data, manage projects, and monitor application performance.Last updated1110- AlicenseAqualityCmaintenanceEnables interaction with Beszel system monitoring tool to query system and container statistics, manage alerts, and monitor infrastructure through its PocketBase backend.Last updated61MIT
- AlicenseAqualityCmaintenanceA comprehensive MCP server providing over 26 tools for querying, monitoring, and analyzing NewRelic data through NRQL queries and entity management. It enables interaction with NewRelic's NerdGraph API for managing alerts, logs, and incidents directly within Claude Code sessions.Last updated2461MIT
- AlicenseBqualityCmaintenanceEnables sending rich notifications to Discord and/or Slack webhooks with automatic service detection, retry logic, and support for embeds, blocks, and attachments. Provides secure webhook management with comprehensive input validation and rate limiting.Last updated3TypeScriptMIT
- AlicenseAqualityBmaintenanceMCP server for analyzing Perfetto traces with LLMs — query .pftrace files in PerfettoSQL via Claude Code or any MCP clientLast updated1213Apache 2.0
- AlicenseAqualityBmaintenanceMCP server for Prometheus metrics and observability. Give Claude (or any MCP-capable agent) read access to your Prometheus instance — query metrics with PromQL, inspect active alerts, and explore scrape targets without leaving the conversation. Tools: prometheus_list_metrics, prometheus_query, prometheus_query_range, prometheus_list_alerts, prometheus_list_targets.Last updated5MIT
- AlicenseBqualityBmaintenanceEnables AI assistants to interact with New Relic monitoring and observability data through programmatic access to New Relic APIs. Supports APM management, NRQL queries, alert policies, synthetic monitoring, dashboards, infrastructure monitoring, and deployment tracking.Last updated265MIT
- AlicenseAqualityBmaintenanceeBPF-based GPU causal observability agent with MCP server. Traces CUDA Runtime and Driver APIs via kernel uprobes and host events via tracepoints to build causal chains explaining GPU latency. 7 tools: get_check, get_trace_stats, get_causal_chains, get_stacks, run_demo, get_test_report, run_sql. Telegraphic compression reduces token usage ~60%. Supports stdio and HTTPS (TLS 1.3) transport.Last updated1181
- AlicenseAqualityBmaintenanceA comprehensive Model Context Protocol (MCP) server for Grafana, Prometheus, Kafka UI, and Datadog. Features a secure "Bring Your Own Key" (BYOK) architecture where credentials stay local. Provides tools for metrics querying, dashboard inspection, Kafka lag monitoring, and unified health checks.Last updated2331MIT
- AlicenseBqualityCmaintenanceA TypeScript-based MCP server that enables users to interact with Prometheus metrics using PromQL queries and discovery tools. It allows LLMs to retrieve time-series data, metadata, alerts, and system status directly from a Prometheus instance.Last updated10441MIT
- AlicenseBqualityCmaintenanceAn MCP server for perfSONAR that enables querying historical network measurements, discovering global testpoints, and scheduling active network tests. It provides tools for monitoring throughput, latency, and packet loss through integration with measurement archives and pScheduler.Last updated13MIT
- AlicenseAqualityCmaintenanceMCP server for CronAlert uptime monitoring — manage monitors, check results, and incidents from any MCP-compatible AI client.Last updated917MIT
- AlicenseBqualityBmaintenanceAn MCP server for querying Grafana Loki directly with a discovery-first workflow — labels, values, series, and LogQL queries without requiring Grafana.Last updated52MIT
- AlicenseAqualityBmaintenanceAn MCP server that exposes GPU-accelerated anomaly detection to AI assistants via the Model Context Protocol. Provides two MCP tools: waveguard_scan (send training + test data in one call, returns per-sample anomaly scores and top explanatory features) and waveguard_health (check API and GPU status). Works on time series, JSON, numbers, text, and images — fully stateless.Last updated33MIT
- AlicenseAqualityCmaintenanceCross-cloud observability for AI agents. Discover resources, correlate logs, and diagnose infrastructure issues across AWS, GCP, Vercel, and Cloudflare — without leaving your editor.Last updated4101MIT
MCP ConnectorsBrowse all →
EU-hosted website monitoring + 17-framework compliance MCP. One anonymous tool, four authenticated.
290+ quality-scored API capabilities for AI agents across 27 countries via MCP.
AI-ready vendor incident status with public active incidents and plan-scoped history.
Connect engineering metrics, DORA performance, and deploy risk scoring to any AI assistant. Score PRs for deployment risk using a 36-signal model, query team health, incidents, coverage, and more.
Service level agreement monitoring for the Hive agent fleet
Enable secure connectivity between Sentry issues and debugging data, and LLM clients, using a Model Context Protocol (MCP) server.
Free MCP tools: the only MCP linter, health checks, cost estimation, and trust evaluation.
performance-review MCP — wraps StupidAPIs (requires X-API-Key)
The Google GKE MCP server is a managed Model Context Protocol server that provides AI applications with tools to manage Google Kubernetes Engine (GKE) clusters and Kubernetes resources. It exposes a structured, discoverable interface that allows AI agents to interact with GKE and Kubernetes APIs, enabling them to inspect cluster configurations, retrieve Kubernetes resource YAMLs, monitor operations like cluster upgrades, diagnose issues, and optimize costs—all without needing to parse text output or use complex kubectl commands.
- mcpA
Gain visibility into the performance, availability, and health of your apps and infrastructure.
Live status, API pricing and rate limits for ChatGPT, Claude, Gemini, Cursor and 42+ AI tools.
ThousandEyes MCP Server for network intelligence: outages, anomalies, alerts, events, and tests.
Enterprise AI governance: spend, guardrails, policy, budgets, compliance, and provider health.
RUM platform for web performance analytics, Core Web Vitals, and third-party script monitoring.
Real-time health monitoring and heartbeat tracking for agent services
DORA OS Conductor — 16-tool meta-orchestrator for DORA compliance workflow automation.
DriftOracle - 15 tools for model/data drift monitoring: PSI, KS-test, alerts, evidence packs.
ResilienceOracle - 10 operational resilience tools: BIA, RTO/RPO, scenario testing.