mcp-peek
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@mcp-peekPeek schema of https://api.example.com/users"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Your agent just burned 12K tokens on a 200KB JSON response — to read one field. With mcp-peek: ~250 tokens for the same field. Same call, ~50× less context. See a live demo.
mcp-peek is an MCP server that wraps arbitrary REST API calls so your agent first sees a compact schema, then pulls only what it asked for through a jq mask. Big responses stay out of context until you actually need a slice — and every response carries next_step_hints plus structured error envelopes, so the agent self-corrects instead of guessing.
Use it in
Drop the snippet for your client into its MCP config. The desktop / IDE clients all spawn npx -y mcp-peek under the hood; the wrapping JSON differs slightly per client. Docker is an alternative wrapper for any of them — see the last entry.
{
"mcpServers": {
"mcp-peek": {
"command": "npx",
"args": ["-y", "mcp-peek"]
}
}
}Config file lives at ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows). Restart Claude Desktop after editing.
claude mcp add mcp-peek -- npx -y mcp-peekOr edit ~/.claude.json (user scope) / .mcp.json (project scope) directly with the same mcpServers shape as Claude Desktop.
{
"servers": {
"mcp-peek": {
"type": "stdio",
"command": "npx",
"args": ["-y", "mcp-peek"]
}
}
}VS Code uses servers (not mcpServers) and requires type. Workspace-scoped — commit it for your team.
{
"mcpServers": {
"mcp-peek": {
"command": "npx",
"args": ["-y", "mcp-peek"]
}
}
}{
"mcpServers": {
"mcp-peek": {
"command": "npx",
"args": ["-y", "mcp-peek"]
}
}
}Cline stores this in cline_mcp_settings.json (open from the Cline panel → MCP Servers → Configure).
{
"mcpServers": {
"mcp-peek": {
"command": "npx",
"args": ["-y", "mcp-peek"]
}
}
}{
"mcpServers": {
"mcp-peek": {
"command": "docker",
"args": [
"run", "-i", "--rm",
// Same-path bind mount + PEEK_FILES_ROOT: the agent passes
// ordinary host paths under /home/me/data and they "just work"
// inside the container. Outside-the-root paths are rejected
// with a clear `invalid_input` error.
"-v", "/home/me/data:/home/me/data",
"-e", "PEEK_FILES_ROOT=/home/me/data",
"ghcr.io/ed-smartass/mcp-peek:latest"
]
}
}
}The same-path bind mount is the recommended Docker pattern — agent paths translate transparently. Drop PEEK_FILES_ROOT if you want no path constraint (and you're sure about the security tradeoffs).
Related MCP server: Code Execution MCP
What you get
http_requestruns the call, caches the body, returns the structure, not the bytes.http_readpulls fields out of the cached body viajq.http_inspectre-renders the schema in another format — no second HTTP call.server_infodebug helper — version, runtime, effective env limits.
One MCP install replaces every curl your agent would run, so you authorize once at config time instead of approving each call.
Built for the agent
mcp-peek is shaped around how an agent actually reads an API, not how a human runs curl. Four structural choices follow from that:
Schema-first responses.
http_requestreturns the shape of the body, not the body itself. The agent learns what's in there without burning tokens on bytes it will throw away.next_step_hintson every response. JSON responses come back with advisory jq masks inferred from the top-level shape — the agent has a starting point instead of guessing field paths.Structured error envelopes. Every failure is a typed
error.kindwithmessageanddetail.hint. The agent branches programmatically on cause and acts on the hint, instead of parsing a stack trace.Re-read instead of re-fetch. A
cache_idlets the agent inspect the same body in another schema format, or extract a different slice, without a second HTTP call.
The agent-side consequences:
~50× less context on real-world payloads (see savings table below).
One permission grant at install time, not per-call prompts.
A self-correcting loop — typed errors and hints keep the agent from spiralling on guesses.
The rest of this README describes the operational surface; the design above is what makes that surface worth using.
Run modes & file paths
Which mode to pick
Mode | Pick when | Trade-offs |
npx (default) | The MCP server runs on the same machine as your agent. | Simplest setup. Paths are local to your host — what the agent passes is what the server sees. |
Docker | You want isolation, or are running the server alongside other tooling in containers. | Paths are local to the container. Use the same-path bind mount + |
Remote MCP (over HTTP / SSE) | The server runs on shared infrastructure separate from the agent. | Surprising default: any path you pass for |
Where do file paths resolve?
Field | npx | Docker | Remote MCP |
| host (agent's) | container — use same-path mount | server's filesystem |
| host (agent's) | container — use same-path mount | server's filesystem |
| host (agent's) | container — use same-path mount | server's filesystem |
When PEEK_FILES_ROOT is set, all three are also constrained to that root (canonicalised — .. traversal cannot escape).
Multipart compatibility
Multipart uploads stream files via chunked transfer encoding (no Content-Length header). Most modern HTTP servers handle this fine. If you hit a server that rejects chunked uploads — typically primitive test servers or some legacy reverse proxies — file an issue with the response body, and we'll consider a non-chunked fallback in v0.3.
Tools
The four tools below are what the agent calls — you don't run them by hand. Examples show the agent's call sequence, not a CLI invocation.
Default flow: the agent calls
http_requestto get a schema +cache_id, thenhttp_readwith a jq mask to extract just the field(s) it needs. Keeps the agent's context small even on multi-MB responses. Settingbody_mode: "inline"is rarely the right call — see body modes for cost framing.
Tool | What it does |
| Run an HTTP request; return a schema (and optionally a preview / full body, governed by |
| Read a cached body, optionally filtered by a |
| Re-render the cached body's schema in another format — no second HTTP call. |
| Debug helper. No params. Returns version, runtime, |
Body modes
http_request returns a schema by default. The body_mode parameter controls how much (if any) of the actual body comes back inline:
Mode | Returns | When the agent picks it |
| schema only | The agent will follow up with |
| schema + | A quick peek to decide what to extract next. |
| schema + the full body | The response is known-small and every field is needed. Capped by |
| server picks based on byte thresholds | Inline under |
The actual mode picked, and the thresholds in effect, come back on every response in meta.body_inclusion so the agent can introspect what auto resolved to. JSON responses also get next_step_hints — advisory jq mask suggestions inferred from the top-level shape.
Debugging unexpected behaviour: server_info
When a path is rejected with invalid_input, or a tool behaves differently than you'd expect (Docker vs. host mode, a stale env var, the wrong version), call server_info first. It returns:
{
"version": "0.3.0",
"runtime": "docker" /* or "npx" / "unknown" */,
"cwd": "/app",
"files_root": "/home/me/data", // null if PEEK_FILES_ROOT is unset
"effective_limits": {
"default_timeout_ms": 30000,
"max_response_bytes": 52428800,
"inline_threshold_bytes": 8192,
"head_preview_threshold_bytes": 65536,
"inline_body_cap_bytes": 262144,
"max_inline_file_bytes": 10485760,
/* …10 more fields… */
}
}No params. Cheap to call. Beats guessing why a path rejection said "/home/me/data" when you swore you set PEEK_FILES_ROOT=/data.
Schema formats
The same /users endpoint, four ways:
paths (default) — flat path listing, type + one example per leaf:
data[].id : int (e.g. 42)
data[].name : string (e.g. "alice")
data[].roles[] : string (e.g. "admin")
data[].profile.bio : string|null
meta.total : int (e.g. 1247)
meta.next_cursor : string|null
# 187 KB · data[]: 50 itemsshape — TypeScript-like tree, more compact for deep data:
{
data: [{
id: int, name: string, roles: string[], profile: { bio: string|null }
}] (50 items),
meta: { total: int, next_cursor: string|null }
}
# 187 KBsample — first item kept verbatim, rest collapsed; long strings auto-truncated:
{
"data": [
{ "id": 42, "name": "alice", "roles": ["admin"], "created_at": "2026-05-09T..." },
"...49 more"
],
"meta": { "total": 1247, "next_cursor": "abc123" }
}json_schema — standard JSON Schema (draft 2020-12), inferred via genson-js. Useful when feeding the schema back into a typed pipeline.
Pick the format that matches what the agent is doing: paths for "what fields exist", shape for "what's the structure", sample for "show me one realistic record", json_schema for downstream tooling.
jq cheatsheet
# Pick specific fields .data | map({id, name})
# Drop heavy fields .data | map(del(.payload, .raw_html))
# Filter rows .data | map(select(.role == "admin"))
# Filter + pick .data | map(select(.active) | {id, email})
# First N .data[:5]
# Pluck single value .meta.total
# Pagination cursor .meta.next_cursor // empty
# Group by .data | group_by(.tag) | map({tag: .[0].tag, n: length})
# Stats .data | length, (map(.score) | add / length)
# Errors only .results | map(select(.error))
# Flatten nested [.. | objects | select(.id?) | {id, type}]
# Search by substring .items | map(select(.title | test("regex"; "i")))
# Sort + take top N .events | sort_by(.created_at) | reverse | .[:10]output_mode: "all" (default) returns single-output filters as their value, multi-output filters as an array. output_mode: "first" collapses to the first emitted value.
Real-world examples
Each block below is what the agent does — a sequence of MCP tool calls in its loop. You don't type these.
1. Explore an unknown REST endpoint
The agent does:
http_request {method: "GET", url: "https://api.someservice.io/v1/widgets"}
→ schema (paths) shows what's there: data[].id, data[].name, meta.next_cursor, …
http_read {cache_id, mask: ".data | map({id, name})"}
→ just the slice the agent needs2. Pull only id and created_at from GitHub issues
The agent does:
http_request {
method: "GET",
url: "https://api.github.com/repos/anthropics/claude-cookbooks/issues",
headers: {accept: "application/vnd.github+json"}
}
http_read {cache_id, mask: ".[] | {id, created_at}"}3. Upload an image via multipart
The agent does:
http_request {
method: "POST",
url: "https://upload.example.com/photos",
headers: {authorization: "Bearer …"},
multipart: { files: { photo: { path: "/host/photo.jpg", content_type: "image/jpeg" } } }
}For remote-MCP setups (or any time a server-side path makes no sense), use the inline variant — bytes travel in the JSON-RPC frame, no PEEK_FILES_ROOT constraint applies:
http_request {
method: "POST",
url: "https://upload.example.com/photos",
multipart: { files: { photo: {
content_base64: "<base64 bytes>",
filename: "photo.jpg",
content_type: "image/jpeg"
} } }
}Inline payloads are capped at PEEK_MAX_INLINE_FILE_BYTES (10 MB pre-base64 by default).
4. Stream a binary download to disk
The agent does:
http_request {
method: "GET",
url: "https://example.com/big.zip",
download_to: "/tmp/big.zip"
}
→ response is never buffered in agent context; sha256 + byte count returnedConfiguration
All env vars are optional. Defaults match common-sense limits.
Env var | Default | Purpose |
| 30000 | per-request HTTP timeout |
| 52428800 | hard cap on cached body size (50 MB) |
| 600 | cache entry lifetime (10 min) |
| 8192 |
|
| 65536 |
|
| 5 | array items kept verbatim in |
| 200 | string truncation length in |
| 262144 | hard cap on |
| 10485760 | hard cap on |
| 5000 | per-mask jq timeout |
| 0 | switch to subprocess jq (reserved, not heavily exercised) |
| 0 | skip TLS verification |
| 10 | recursion depth for schema renderers |
| 200 | per-object key cap |
| 100 | string truncation in samples |
| (unset) | restricts |
How much context does this actually save?
Approximate token costs for a single agent turn that wants one slice of a real-world API response. Tokenizer-dependent (Anthropic Claude tokens, English-heavy JSON, ~4 chars/token); your numbers will vary by ±30%.
Endpoint | Raw response | Raw tokens | Peek schema | Peek tokens | Savings |
GitHub Issues — | ~200 KB | ~12 000 | ~1 KB | ~250 | ~48× |
Stripe Charges — | ~80 KB | ~5 000 | ~0.6 KB | ~150 | ~33× |
OpenWeather — | ~30 KB | ~2 000 | ~0.5 KB | ~120 | ~17× |
Once the agent has the schema, http_read {cache_id, mask: "..."} returns just the slice — typically a handful of tokens.
Compared to alternatives
|
|
| |
All HTTP methods | ✅ | ❌ | ✅ |
Custom headers | ✅ | ✅ | ✅ |
Multipart file uploads | ✅ | ❌ | ✅ |
Schema-first responses | ✅ | ❌ | ❌ |
Field filtering (jq) | ✅ | ❌ | manual |
Doesn't dump 200KB into agent context | ✅ | ❌ | ❌ |
Single permission grant (no per-call prompt) | ✅ | ✅ | ❌ |
Structured errors + | ✅ | ❌ | ❌ |
License
MIT — see LICENSE.
Contributing
See CONTRIBUTING.md for branch naming, conventional-commit style, and the PR flow. Bug reports and feature ideas go in GitHub Issues.
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/ed-smartass/mcp-peek'
If you have feedback or need assistance with the MCP directory API, please join our Discord server