distill-mcp-v2
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type @ followed by the MCP server name and your instructions, e.g., "@distill-mcp-v2 distill these server logs keeping stack traces".
That's it! The server will respond to your query, and you can continue using it as needed.
Overview
distill-mcp-v2 is a high-performance, network-dependency-free Python FastMCP server designed to aggressively optimize Large Language Model (LLM) context windows. It provides specialized tools for compressing and analyzing massive AI-agent payloads without losing critical semantic information.
By filtering noise, stabilizing cache prefixes, and running multi-model token cost estimations locally, Distill v2 dramatically reduces API costs and preserves LLM reasoning abilities when dealing with heavy payloads like infinite logs, monolithic API schemas, and multi-agent war room transcripts.
Performance & Token Compression
Our rigorous, independent stress-testing benchmarks (audited via pytest and chaos blueprints) prove that distill-mcp-v2 achieves up to 99.7% token compression while retaining 100% of the crucial context.
| Scenario | Payload Profile | Raw Tokens | Distilled Tokens | Savings % |
| --- | --- | --- | --- | --- |
| Trace Avalanche | Heavy Java Stacktraces | 150,027 | 546 | 99.6% |
| Schema Monolith | Massive Microservice JSON | 56,588 | 1,355 | 97.6% |
| Incident War Room | Multi-Agent Chat Logs | 117,952 | 371 | 99.7% |
(Tested against Claude-3-Opus budgets. Scenario 1 reduced costs from $2.25/call to $0.008/call.)
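The savings percentages and per-call costs can be reproduced from the token counts in the table. The sketch below is illustrative only: the function names are hypothetical, and the rate of $15 per million input tokens is an assumption chosen because it reproduces the quoted $2.25 figure; the source does not state the rate it used.

```python
# Token counts copied from the benchmark table: (raw, distilled).
SCENARIOS = {
    "Trace Avalanche": (150_027, 546),
    "Schema Monolith": (56_588, 1_355),
    "Incident War Room": (117_952, 371),
}

# Assumption: USD per one million input tokens. Not stated in the source;
# 15.0 is used because it matches the quoted $2.25 per call for Scenario 1.
ASSUMED_USD_PER_MILLION_INPUT_TOKENS = 15.0


def savings_percent(raw_tokens: int, distilled_tokens: int) -> float:
    """Share of tokens removed, as a percentage of the raw count."""
    return 100.0 * (raw_tokens - distilled_tokens) / raw_tokens


def input_cost_usd(
    tokens: int,
    usd_per_million: float = ASSUMED_USD_PER_MILLION_INPUT_TOKENS,
) -> float:
    """Input-token cost of a single call at the assumed rate."""
    return tokens * usd_per_million / 1_000_000


for name, (raw, distilled) in SCENARIOS.items():
    print(
        f"{name}: {savings_percent(raw, distilled):.1f}% saved, "
        f"${input_cost_usd(raw):.2f} -> ${input_cost_usd(distilled):.3f} per call"
    )
```

Only input tokens are counted here; output tokens and any prompt-caching discounts are outside this estimate.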
Read the full Benchmark & Execution Report for deeper insights.
Features & Toolset
Distill v2 exposes 8 precise tools to agents via the Model Context Protocol:
distill_json: Compresses raw JSON payloads, retaining anomalies, exceptions, and errors.
distill_logs: Compresses raw .log files, preserving head/tail contexts and stack traces.
distill_schema: Compacts massive MCP tool catalogs and JSON schemas to structural parameters.
distill_response: Progressively prunes, minifies, and truncates outputs to fit strict token budgets.
distill_conversation: Extracts goals, decisions, blockers, and actions from multi-agent transcripts without leaking raw ISO timestamps.
stabilize_for_cache: Maps chaotic raw identifiers (UUIDs, hex IDs) to sequential placeholders to stabilize LLM prompt caching.
analyze_tokens: Accurately estimates token counts using tiktoken (cl100k_base).
compare: Computes detailed diffs and cost-savings analyses between raw and distilled payloads.
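To make the idea behind stabilize_for_cache concrete, here is a minimal sketch of mapping identifiers to sequential placeholders. It is not the project's implementation: the function name `stabilize_ids`, the `<ID_n>` placeholder format, and the choice to handle only canonical UUIDs are all assumptions made for illustration.

```python
import re

# Matches canonical 8-4-4-4-12 UUIDs. Bare hex IDs are left out of this
# sketch because they are easy to confuse with ordinary numbers.
UUID_PATTERN = re.compile(
    r"\b[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-"
    r"[0-9a-fA-F]{4}-[0-9a-fA-F]{12}\b"
)


def stabilize_ids(text: str) -> tuple[str, dict[str, str]]:
    """Replace each distinct UUID with a sequential placeholder.

    Returns the rewritten text and the original -> placeholder mapping,
    so a caller can translate placeholders back if needed.
    """
    mapping: dict[str, str] = {}

    def replace(match: re.Match) -> str:
        original = match.group(0).lower()
        if original not in mapping:
            # Hypothetical placeholder format, numbered by first appearance.
            mapping[original] = f"<ID_{len(mapping) + 1}>"
        return mapping[original]

    return UUID_PATTERN.sub(replace, text), mapping
```

Because placeholders are numbered by first appearance, two payloads that differ only in their UUID values rewrite to the same text, which is what lets a prompt prefix stay byte-identical across calls.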
Quick Start
Installation
Distill v2 requires Python 3.10+. We recommend using uv or pip in an isolated virtual environment.
# Clone the repository
git clone https://github.com/yatinkoul/distill.git
cd distill
# Create a virtual environment and activate it
python -m venv .venv
source .venv/bin/activate
# Install dependencies
pip install -e .
Running the Server
Run the FastMCP server, which natively binds to a stateless HTTP endpoint.
# Run the FastMCP server (default host: 0.0.0.0, port: 8000)
distill --host 0.0.0.0 --port 8000
# Or using uvicorn directly:
.venv/bin/uvicorn src.main:app --host 0.0.0.0 --port 8000
The server exposes a JSON-RPC endpoint at http://localhost:8000/mcp.
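As a rough illustration of talking to that endpoint, the sketch below builds a JSON-RPC 2.0 request for the MCP `tools/call` method and posts it with the standard library. Several things here are assumptions: the helper names are hypothetical, the argument key `text` is a guess at the tool's input schema, and whether the server accepts a tool call without a prior `initialize` exchange depends on how its stateless mode is configured.

```python
import json
import urllib.request

MCP_URL = "http://localhost:8000/mcp"  # endpoint stated in the text


def build_tool_call(tool: str, arguments: dict, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 request for the MCP `tools/call` method."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }


def post_request(payload: dict, url: str = MCP_URL, timeout: float = 10.0) -> str:
    """POST the payload and return the raw response body as text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # MCP's HTTP transport may reply with JSON or an event stream.
            "Accept": "application/json, text/event-stream",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example (needs a running server; "text" is a hypothetical argument name):
#   print(post_request(build_tool_call("analyze_tokens", {"text": "hello"})))
```

In practice an MCP client library would handle the handshake and response parsing; the raw request is shown only to make the wire format visible.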
Stress Testing & Development
The repository ships with an exhaustive stress-test runner that dynamically allocates ports, executes deterministic payload scenarios, and validates the integrity of the distilled outputs.
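One common way to allocate ports dynamically is to bind to port 0 and let the operating system pick an unused one. The sketch below shows that technique; it is an assumption about how such a runner could work, not a description of the repository's code, and `find_free_port` is a hypothetical name.

```python
import socket


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        # getsockname() returns (host, port) for AF_INET sockets.
        return sock.getsockname()[1]
```

The port is released when the socket closes, so another process could claim it before the server starts; test runners that use this approach usually start the server immediately and retry on a bind failure.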
To run the complete test suite (118/118 passing):
# Install development dependencies
pip install -e ".[dev]"
# Run the test suite
pytest
To manually execute the stress-testing blueprints and regenerate the benchmark report:
python stress_tests_blueprints/run_blueprints.py
Documentation
Project Goals & Strategy: High-level strategy and milestones.
Benchmark Report: Exhaustive stress-test results.
Testing Infrastructure: Notes on the testing architecture and CI/CD readiness.
Contributing
Contributions, issues, and feature requests are welcome!
A CONTRIBUTING.md with guidelines on how to propose improvements is coming soon. Until then, ensure all tests and linting (ruff) pass before submitting pull requests.
License
This project is licensed under the MIT License; see the LICENSE file for details.