⚡ Overview

distill-mcp-v2 is a high-performance, network-dependency-free Python FastMCP server designed to aggressively optimize Large Language Model (LLM) context windows. It provides specialized tools for compressing and analyzing massive AI‑agent payloads without losing critical semantic information.

By filtering noise, stabilizing cache prefixes, and estimating token costs for multiple models locally, Distill v2 reduces API costs and helps preserve LLM reasoning quality on heavy payloads such as very long logs, monolithic API schemas, and multi-agent war room transcripts.


🚀 Performance & Token Compression

In the project's stress-test benchmarks (run with pytest and the chaos blueprints), distill-mcp-v2 achieves up to 99.7% token compression while retaining all of the context that the test suite checks as critical.

| Scenario | Payload Profile | Raw Tokens | Distilled Tokens | Savings % |
|---|---|---|---|---|
| Trace Avalanche | Heavy Java Stacktraces | 150,027 | 546 | 99.6% |
| Schema Monolith | Massive Microservice JSON | 56,588 | 1,355 | 97.6% |
| Incident War Room | Multi-Agent Chat Logs | 117,952 | 371 | 99.7% |

(Tested against Claude-3-Opus budgets. Scenario 1 reduced costs from $2.25/call to $0.008/call.)
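
The per-call figures follow from simple arithmetic on the token counts. The sketch below reproduces them under one assumption that this README does not state: an input price of $15 per million tokens, with only input tokens counted. The function names are illustrative and are not part of Distill.

```python
# Assumed input price in USD per million tokens; this rate is NOT stated in the README.
ASSUMED_USD_PER_MILLION_INPUT_TOKENS = 15.00


def input_cost_usd(tokens: int, usd_per_million: float = ASSUMED_USD_PER_MILLION_INPUT_TOKENS) -> float:
    """Return the input-side cost of one call for the given token count."""
    return tokens * usd_per_million / 1_000_000


def savings_percent(raw_tokens: int, distilled_tokens: int) -> float:
    """Return the share of tokens removed by distillation, as a percentage."""
    return (1 - distilled_tokens / raw_tokens) * 100


# Token counts for the Trace Avalanche scenario reported above.
raw, distilled = 150_027, 546
print(f"raw:       ${input_cost_usd(raw):.2f}/call")
print(f"distilled: ${input_cost_usd(distilled):.3f}/call")
print(f"savings:   {savings_percent(raw, distilled):.1f}%")
```

With a different price per million tokens the absolute costs change, but the savings percentage depends only on the two token counts.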

Read the full Benchmark & Execution Report for deeper insights.

🛠 Features & Toolset

Distill v2 exposes 8 precise tools to agents via the Model Context Protocol:

  1. distill_json: Compresses raw JSON payloads, retaining anomalies, exceptions, and errors.

  2. distill_logs: Compresses raw .log files, preserving head/tail contexts and stack traces.

  3. distill_schema: Compacts massive MCP tool catalogs and JSON schemas to structural parameters.

  4. distill_response: Progressively prunes, minifies, and truncates outputs to fit strict token budgets.

  5. distill_conversation: Extracts goals, decisions, blockers, and actions from multi-agent transcripts without leaking raw ISO timestamps.

  6. stabilize_for_cache: Maps chaotic raw identifiers (UUIDs, hex IDs) to sequential placeholders to stabilize LLM prompt caching.

  7. analyze_tokens: Accurately estimates token counts using tiktoken (cl100k_base).

  8. compare: Computes detailed diffs and cost-savings analyses between raw and distilled payloads.
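
To illustrate the idea behind stabilize_for_cache, here is a minimal sketch of identifier stabilization. It is not the project's implementation: the function name `stabilize_ids`, the `<ID_n>` placeholder format, and the restriction to canonical UUIDs are all assumptions made for this example.

```python
import re

# Matches canonical 8-4-4-4-12 hex UUIDs only; other ID formats are out of scope here.
UUID_RE = re.compile(
    r"\b[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{12}\b"
)


def stabilize_ids(text: str) -> tuple[str, dict[str, str]]:
    """Replace each distinct UUID with a sequential placeholder (hypothetical format)."""
    mapping: dict[str, str] = {}

    def substitute(match: re.Match[str]) -> str:
        key = match.group(0).lower()
        if key not in mapping:
            # Placeholders are numbered in order of first appearance.
            mapping[key] = f"<ID_{len(mapping) + 1}>"
        return mapping[key]

    return UUID_RE.sub(substitute, text), mapping


# Two payloads that differ only in their request IDs.
first = "req 3f2b8c1e-9d4a-4e6f-8b7a-1c2d3e4f5a6b failed; retry 3f2b8c1e-9d4a-4e6f-8b7a-1c2d3e4f5a6b"
second = "req 7a1c9e2d-0b3f-4c5d-9e8f-6a7b8c9d0e1f failed; retry 7a1c9e2d-0b3f-4c5d-9e8f-6a7b8c9d0e1f"

stable_first, _ = stabilize_ids(first)
stable_second, _ = stabilize_ids(second)
print(stable_first)
print(stable_first == stable_second)
```

Because both payloads collapse to the same text, a prompt prefix built from either one is byte-identical, which is the property prompt caching relies on. The returned mapping lets a caller translate placeholders back to the original identifiers.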

📦 Quick Start

Installation

Distill v2 requires Python 3.10+. We recommend using uv or pip in an isolated virtual environment.

# Clone the repository
git clone https://github.com/yatinkoul/distill.git
cd distill

# Create a virtual environment and activate it
python -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -e .

Running the Server

Run the FastMCP server, which natively binds to a stateless HTTP endpoint.

# Run the FastMCP server (default host: 0.0.0.0, port: 8000)
distill --host 0.0.0.0 --port 8000
# Or using uvicorn directly:
.venv/bin/uvicorn src.main:app --host 0.0.0.0 --port 8000

The server exposes a JSON‑RPC endpoint at http://localhost:8000/mcp.
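
As a rough sketch, a client could build a JSON-RPC 2.0 request for this endpoint with the Python standard library alone. `tools/list` is a standard MCP method and the `Accept` header follows the MCP Streamable HTTP transport. Whether this server answers without a prior `initialize` handshake is an assumption that this README does not confirm, and the helper names are hypothetical.

```python
import json
import urllib.request

MCP_URL = "http://localhost:8000/mcp"  # endpoint stated above


def build_tools_list_request(url: str = MCP_URL, request_id: int = 1) -> urllib.request.Request:
    """Build (but do not send) a JSON-RPC 2.0 request asking the server to list its tools."""
    payload = {"jsonrpc": "2.0", "id": request_id, "method": "tools/list"}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Streamable HTTP servers may reply with plain JSON or an SSE stream.
            "Accept": "application/json, text/event-stream",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 10.0) -> str:
    """Send the request and return the raw response body; requires a running server."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


request = build_tools_list_request()
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

With the server running, `send(build_tools_list_request())` returns the raw response body. If the server rejects the call, a full MCP client that performs the `initialize` exchange first is the safer route.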

🧪 Stress Testing & Development

The repository ships with an exhaustive stress-test runner that dynamically allocates ports, executes deterministic payload scenarios, and validates the integrity of the distilled outputs.
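
A common way to allocate a port dynamically is to bind to port 0 and let the operating system choose one. The sketch below shows that technique; it is an illustration, not necessarily how the project's runner does it, and `find_free_port` is a hypothetical helper name.

```python
import socket


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        # getsockname() returns (host, port); the port is the one the OS assigned.
        return sock.getsockname()[1]


port = find_free_port()
print(f"a test server could listen on port {port}")
```

The port is released when the socket closes, so another process could claim it before the test server binds. Runners that use this approach usually start the server immediately and retry on a bind failure.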

To run the complete test suite (118/118 passing):

# Install development dependencies
pip install -e ".[dev]"

# Run the test suite
pytest

To manually execute the stress-testing blueprints and regenerate the benchmark report:

python stress_tests_blueprints/run_blueprints.py

📖 Documentation

🤝 Contributing

Contributions, issues, and feature requests are welcome! A CONTRIBUTING.md with guidelines on how to propose improvements is coming soon. Until then, ensure all tests and linting (ruff) pass before submitting pull requests.

📜 License

This project is licensed under the MIT License – see the LICENSE file for details.
