Delta-MCP
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Delta-MCPlist tools"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Delta-MCP
Token-efficient MCP reimplementation. Same JSON-RPC 2.0 wire format. Leaner discovery model. 89% fewer tokens on tool definitions, measured.
Why
Standard MCP has two token bloat problems:
Tool-definition bloat. Every tool's full JSON schema loads into context at startup — even tools the model never uses. With 10 tools you're paying 850+ tokens before any work happens. With 50 tools across enterprise servers, thousands.
Tool-result bloat. Large outputs (file reads, search results, API responses) route through LLM context unfiltered. One 50KB file read can destroy your context budget.
Delta-MCP fixes both.
Related MCP server: Code Executor MCP Server
Numbers
Standard MCP | Delta-MCP | |
6-tool server init | 943 tokens | 118 tokens |
20-tool server init | ~3600 tokens | 378 tokens |
Definition overhead (1/6 tools used) | 943 tokens upfront | 229 tokens on-demand |
Tool-selection accuracy (Opus 4) | 49% | 74% |
Tool-selection accuracy (Opus 4.5) | 79.5% | 88.1% |
Compact-json wire reduction | — | −17.7% |
Accuracy numbers from Anthropic lazy tool loading research. Token numbers from conformance/scenarios/07-benchmark.test.ts against a 6-tool server with realistic schemas.
How It Works
Progressive disclosure
Delta-MCP replaces eager schema loading with a two-tier model negotiated at initialize:
tools/list → names + ≤60-char descriptions only (~115 tokens for 6 tools)
tools/describe → full schema, on-demand, cached (~30 tokens per tool)The 60-char description cap is enforced at registration — longer descriptions throw at startup. This is intentional: the schema is the right place for detail, not the discovery index. Counter-intuitively, shorter descriptions improve tool-selection accuracy. More detail increases execution steps by 67% and regresses 16% of cases.
// Standard MCP: model sees all of this before doing anything
{ name: "search", inputSchema: { type: "object", properties: { query: { type: "string", description: "Full-text search query string. Supports boolean operators AND, OR, NOT..." }, limit: { ... }, filters: { type: "object", properties: { dateRange: { enum: [...] }, language: { ... } } } } } }
// Delta-MCP tools/list: model sees this
{ name: "search", description: "Search docs and return top results" }
// Delta-MCP tools/describe (only when model decides to use it):
{ name: "search", inputSchema: { ... full schema ... } }Result handler
Every tool result passes through the result handler before hitting LLM context:
Input type | Output |
String over budget |
|
Array over page size |
|
Object over budget |
|
Upstream 429 |
|
Rate limits become tool results the model can reason about, not exceptions that terminate the agent loop. Pagination params (page, pageSize) flow automatically from tool call args — the model requests subsequent pages without the server needing explicit pagination logic.
Compact wire encoding
Negotiated at initialize, auto-fallback to standard JSON for unaware clients. Both sides switch codecs after the handshake — the initialize response itself is always plain JSON so the client can read it before the switch.
Standard: {"jsonrpc":"2.0","method":"tools/list","result":{"tools":[...]}}
Compact: {"j":"2.0","m":"tools/list","r":{"t":[...]}}CBOR binary encoding is available over HTTP via the optional cbor-x dependency. Stdio clamps to compact-json because CBOR is binary and cannot be safely newline-delimited.
The HTTP transport decodes requests by Content-Type and encodes responses by the client's Accept header. The MCP-Protocol-Version header is required on all requests except initialize — the client doesn't know the version until the handshake completes.
Sessions follow the MCP Streamable HTTP model: the server assigns an Mcp-Session-Id on initialize and the client echoes it on every subsequent request. Negotiation state (progressive disclosure, encoding) is keyed per session, so one server instance can serve delta-aware and standard MCP clients concurrently without cross-talk. Requests without a known session id get standard MCP behavior (full schemas).
Browser-originated requests (any request carrying an Origin header) are rejected with 403 unless the origin is listed in allowedOrigins — the spec-required DNS-rebinding protection. Non-browser clients are unaffected. Behind a reverse proxy, set trustProxy: true so the per-IP rate limiter keys on X-Forwarded-For instead of the proxy's address.
OAuth 2.1 (resource-server only)
Delta-MCP validates tokens, never issues them. Stateless by design:
Client → POST /mcp
Server → 401 WWW-Authenticate: Bearer resource_metadata="/.well-known/oauth-protected-resource"
Client → GET /.well-known/oauth-protected-resource (RFC 9728 PRM)
Client → discovers AS, gets token via PKCE (mandatory, no implicit flow)
Client → POST /mcp Authorization: Bearer <token>
Server → validates JWT + RFC 8707 audience binding → processes requestThe HTTP transport runs in one of two modes:
Full OAuth — pass the oauth option and the transport serves the RFC 9728 PRM document at /.well-known/oauth-protected-resource, validates tokens for audience (RFC 8707) + expiry + signature, and emits spec WWW-Authenticate challenges with error reasons. oauth mode requires either verifySignature or introspectionEndpoint — the handler refuses to start with neither, because tokens would otherwise be forgeable. Tokens without an exp claim are rejected by default. This is the production path:
createHttpHandler(handler, {
oauth: {
resourceUrl: "https://mcp.example.com", // must equal the token `aud`
authorizationServers: ["https://auth.example.com"],
verifySignature: async (token, header, payload) => verifyWithJwks(token),
},
});Presence-only (dev) — without oauth, the bearer check is presence-only: any non-empty token passes. Dev-grade. A validateToken hook narrows it without the full PRM machinery; set authRequired: false for an explicitly open server.
The full flow is exercised end-to-end in conformance CS-09 (401 → PRM fetch → authenticated call → audience/expiry rejection), not just unit-tested in isolation. A runnable server (zero deps, RS256 + curl walkthrough) lives in examples/http-oauth-server — DeltaServer.startHttp({ port, oauth }) wires all of this in one call.
The MCP-Protocol-Version header carries the baseline MCP version (2025-11-25) for ecosystem interop; Delta-MCP extensions are advertised separately in the initialize result's capabilities.
Examples
Example | What it shows |
Minimal server with two tools — start here | |
Real-world server: read/write/list/search files, path sandbox, auto-truncation + pagination | |
Connect with | |
1 000-record server, 6 scenarios: manual nav, jump-to-page, custom page size, filter+paginate, auto-fetch all, last page | |
Truncation, pagination, object summarization, and rate-limit handling in isolation | |
HTTP server with full OAuth 2.1 resource-server mode (RS256, PRM discovery, curl walkthrough) |
Quick Start
npm install @delta-mcp/server @delta-mcp/clientimport { DeltaServer } from "@delta-mcp/server";
class MyServer extends DeltaServer {
constructor() {
super({
name: "my-server",
version: "1.0.0",
resultHandler: { maxTokens: 500, paginateAfter: 50 },
});
this.tool({
name: "search",
description: "Search docs and return top results", // ≤60 chars, enforced
inputSchema: {
type: "object",
properties: {
query: { type: "string" },
page: { type: "number" },
},
required: ["query"],
},
});
}
protected async callTool(name: string, args: Record<string, unknown>): Promise<unknown> {
if (name === "search") return performSearch(args.query as string);
throw new Error(`Unknown tool: ${name}`);
}
}
new MyServer().startStdio();CLI
npx @delta-mcp/cli list node ./server.js # list tools
npx @delta-mcp/cli describe node ./server.js search # full schema
npx @delta-mcp/cli call node ./server.js search '{"query":"x"}' # call tool
npx @delta-mcp/cli bench node ./server.js # benchmarkArchitecture
┌──────────────────────────────────────────────────────┐
│ Delta-MCP Client │
│ negotiate capabilities → get index → fetch schema │
│ on demand → cached → call tool │
└─────────────────────┬────────────────────────────────┘
│ JSON-RPC 2.0 (unchanged wire)
┌─────────────────────▼────────────────────────────────┐
│ Delta-MCP Server │
│ │
│ ProgressiveToolRegistry Result Handler │
│ names + 60-char desc truncate / paginate / │
│ schemas on-demand rate-limit → result │
│ │
│ stdio / HTTP transport OAuth 2.1 resource-server│
└──────────────────────────────────────────────────────┘Packages
Package | Purpose |
| Types, transport, progressive disclosure, encoding, auth, result handler |
|
|
|
|
|
|
Conformance
114 tests across 13 scenarios (plus 46 package unit tests). Run with:
npm run conformanceScenario | Coverage |
CS-01 | Initialize handshake, capability negotiation, codec negotiation |
CS-02 | Progressive disclosure: list, describe, cache, 60-char cap |
CS-03 | tools/call: results, errors, structured output |
CS-04 | Result handler: truncation, pagination, summarization, rate limits |
CS-05 | Wire encoding: CBOR negotiation, compact-json roundtrip |
CS-06 | OAuth 2.1 primitives: PRM document, JWT validation, RFC 8707 audience |
CS-07 | Benchmark: token reduction, latency, overhead targets |
CS-08 | HTTP transport: version header exemption, codec round-trip |
CS-09 | OAuth 2.1 end-to-end: 401 → PRM discovery → authenticated call, bad-token rejection |
CS-10 | Protocol soundness: version-skew downgrade, notification semantics, honest capabilities, HTTP 202 |
CS-11 | HTTP hardening: body limit (413), rate limit (429), timeout (504), Origin validation (403), crash containment (500) |
CS-12 | Official MCP SDK compatibility: baseline clients get standard behavior |
CS-13 | HTTP session isolation: Mcp-Session-Id, concurrent delta + standard clients |
Full results: docs/benchmarks/results.md
Compatibility
Baseline: MCP 2025-11-25 — Streamable HTTP + stdio transports (older date versions 2025-06-18 / 2025-03-26 accepted on the
MCP-Protocol-Versionheader)Node.js: ≥20.0.0
Module format: ESM only —
importworks,require()does notWire format: JSON-RPC 2.0 — unchanged, fully interoperable
Standard MCP clients connecting to a Delta-MCP server get standard MCP behavior automatically (
capabilities: { tools: { listChanged: false } }, full schemas, MCPisErrorresults for tool execution failures)
License
MIT
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/norhther/delta-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server