KubeWhisper
Allows querying recent GitHub Actions workflow runs for a specified repository.
Allows running instant PromQL queries against a Prometheus instance to retrieve metrics.
Allows sending incident or remediation notes to a Slack channel via a bot token or webhook.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@KubeWhisperwhich pods are crash-looping in staging?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
π£οΈ KubeWhisper
An AI-DevOps MCP server β talk to your infrastructure in plain English, safely.
KubeWhisper is a Model Context Protocol (MCP) server that gives an LLM like Claude read-only-by-default access to a live environment, so you can ask:
"Which pods are crash-looping in
staging?" "What's the p95 latency right now?" "Did the last GitHub Actions run onmainpass?" "Scale thewebdeployment to 5 β but show me the preview first."
β¦and the model answers grounded in your actual cluster, metrics, and CI β not generic guesses.
MCP crossed ~97M monthly SDK downloads in March 2026 and is now adopted by Anthropic, OpenAI, Google, Microsoft and Amazon. Plenty of people use MCP servers; far fewer have built one for real DevOps work. This is a small, honest, safety-first example of how.
β¨ Tools
Tool | Mode | What it does |
| π’ read-only | list configured clusters + their routing hints |
| π’ read-only | decide which cluster an alert/error belongs to |
| π’ read-only | pods / deployments / services / nodes / events, with status |
| π’ read-only | run an instant PromQL query |
| π’ read-only | list recent workflow runs for a repo |
| π΄ write | scale a Deployment β disabled by default, previews first, every call audited |
| π‘ outbound | post an incident/remediation note to Slack |
Related MCP server: InfraClaude
π Multi-cluster + error-context routing
Drop a clusters.json next to the server (or point KUBEWHISPER_CLUSTERS at it).
Each cluster maps to a kubeconfig context, a Prometheus URL, and routing hints:
{
"default_cluster": "prod-mumbai",
"clusters": [
{ "name": "prod-mumbai", "kube_context": "gke_proj_asia-south1_prod",
"prometheus_url": "http://prometheus.prod-mumbai:9090",
"match": { "keywords": ["prod","mumbai","live","p1"], "namespaces": ["streaming","ingest"] } },
{ "name": "staging", "kube_context": "gke_proj_asia-south1_staging",
"prometheus_url": "http://prometheus.staging:9090",
"match": { "keywords": ["staging","qa"], "namespaces": ["streaming-staging"] } }
]
}Now the agent can route by the error context: given an alert like
"P1: transcode pods OOMing on the Mumbai live stream", it calls resolve_cluster,
matches mumbai / live / transcode β prod-mumbai, and targets that cluster's
kube context + Prometheus automatically. Every cluster-scoped tool also accepts an
explicit cluster argument. With no clusters.json, it falls back to your current
kube context. See clusters.example.json.
π¬ Slack bot
Two ways to use Slack:
Outbound notes from any MCP client β the
send_slack_messagetool (setSLACK_BOT_TOKENorSLACK_WEBHOOK_URL).A full Slack bot (
slack_bridge.py) β @mention it in a channel ("@KubeWhisper which pods are failing in prod?") and it runs a Claude tool-use loop over these tools, routes to the right cluster, and replies in-thread.
pip install -r requirements-slack.txt
export ANTHROPIC_API_KEY=... # console.anthropic.com
export SLACK_BOT_TOKEN=xoxb-... # scopes: app_mentions:read, chat:write
export SLACK_APP_TOKEN=xapp-... # Socket Mode
python slack_bridge.pyThe bot is read-only by default β scale_deployment stays preview-only unless an
operator sets KUBEWHISPER_ALLOW_WRITES=true on the bot process.
π Safety model (the whole point)
Read-only by default. The single write tool is off unless
KUBEWHISPER_ALLOW_WRITES=true.Preview, then confirm. A write with
confirm=Falseonly describes the change.Audit trail. Every write intent β refused, previewed, or applied β is appended to a JSONL log.
No destructive verbs.
delete/drain/cordonare deliberately not implemented in v0.
See docs/architecture.md for the diagram.
π Quickstart
# 1. Install
python -m venv .venv && source .venv/bin/activate # (Windows: .venv\Scripts\activate)
pip install -r requirements.txt
# 2. Point at a cluster + Prometheus (a local `kind` cluster is perfect)
export PROMETHEUS_URL=http://localhost:9090
# KUBEWHISPER_ALLOW_WRITES stays unset β read-only
# 3. Run it
python -m kubewhisper.serverUse it from Claude Desktop
Copy the block in config.example.json into your
claude_desktop_config.json, fix the cwd path, restart Claude Desktop, and ask it about your cluster.
π§ͺ Tests
pip install pytest
pytest -q # guardrail tests prove writes are refused unless explicitly enabledCI runs lint + import check + guardrail tests on every push (see .github/workflows/ci.yml).
πΊοΈ Roadmap
v0.1 β read-only tools + guarded scale + audit (this release)
v0.2 β Terraform
plan(read-only) tool + deploy "what-changed" diffv0.3 β eval suite proving destructive-op refusal + OPA policy layer over writes
v0.4 β one-command
docker run, publish to an MCP registry
β οΈ Disclaimer
A learning / portfolio project. Run it against clusters you own. Keep it in read-only mode unless you fully understand the write path. Built independently, on personal infrastructure.
π License
MIT β see LICENSE.
Built by Abdulhussain Kanchwala Β· Portfolio Β· LinkedIn Β· GitHub
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/abdulhusainahk/kubewhisper'
If you have feedback or need assistance with the MCP directory API, please join our Discord server