Chain of Draft (CoD) MCP Server

Overview

This MCP server implements the Chain of Draft (CoD) reasoning approach as described in the research paper "Chain of Draft: Thinking Faster by Writing Less". CoD is a novel paradigm that allows LLMs to generate minimalistic yet informative intermediate reasoning outputs while solving tasks, significantly reducing token usage while maintaining accuracy.

Key Benefits

Efficiency: Significantly reduced token usage (as little as 7.6% of standard CoT)
Speed: Faster responses due to shorter generation time
Cost Savings: Lower API costs for LLM calls
Maintained Accuracy: Similar or even improved accuracy compared to CoT
Flexibility: Applicable across various reasoning tasks and domains

Features

Core Chain of Draft Implementation
- Concise reasoning steps (typically 5 words or less)
- Format enforcement
- Answer extraction
Performance Analytics
- Token usage tracking
- Solution accuracy monitoring
- Execution time measurement
- Domain-specific performance metrics
Adaptive Word Limits
- Automatic complexity estimation
- Dynamic adjustment of word limits
- Domain-specific calibration
Comprehensive Example Database
- CoT to CoD transformation
- Domain-specific examples (math, code, biology, physics, chemistry, puzzle)
- Example retrieval based on problem similarity
Format Enforcement
- Post-processing to ensure adherence to word limits
- Step structure preservation
- Adherence analytics
Hybrid Reasoning Approaches
- Automatic selection between CoD and CoT
- Domain-specific optimization
- Historical performance-based selection
OpenAI API Compatibility
- Drop-in replacement for standard OpenAI clients
- Support for both completions and chat interfaces
- Easy integration into existing workflows

Setup and Installation

Prerequisites

Python 3.10+ (for Python implementation)
Node.js 18+ (for JavaScript implementation)
Anthropic API key

Python Installation

Clone the repository
Install dependencies:
pip install -r requirements.txt
Configure API keys in .env file:
ANTHROPIC_API_KEY=your_api_key_here
Run the server:
python server.py

JavaScript Installation

Clone the repository
Install dependencies:
npm install
Configure API keys in .env file:
ANTHROPIC_API_KEY=your_api_key_here
Run the server:
node index.js

Claude Desktop Integration

To integrate with Claude Desktop:

Install Claude Desktop from claude.ai/download
Create or edit the Claude Desktop config file:
~/Library/Application Support/Claude/claude_desktop_config.json
Add the server configuration (Python version):
{ "mcpServers": { "chain-of-draft": { "command": "python3", "args": ["/absolute/path/to/cod/server.py"], "env": { "ANTHROPIC_API_KEY": "your_api_key_here" } } } }
Or for the JavaScript version:
{ "mcpServers": { "chain-of-draft": { "command": "node", "args": ["/absolute/path/to/cod/index.js"], "env": { "ANTHROPIC_API_KEY": "your_api_key_here" } } } }
Restart Claude Desktop

You can also use the Claude CLI to add the server:

# For Python implementation
claude mcp add chain-of-draft -e ANTHROPIC_API_KEY="your_api_key_here" "python3 /absolute/path/to/cod/server.py"

# For JavaScript implementation
claude mcp add chain-of-draft -e ANTHROPIC_API_KEY="your_api_key_here" "node /absolute/path/to/cod/index.js"

Available Tools

The Chain of Draft server provides the following tools:

Tool	Description
`chain_of_draft_solve`	Solve a problem using Chain of Draft reasoning
`math_solve`	Solve a math problem with CoD
`code_solve`	Solve a coding problem with CoD
`logic_solve`	Solve a logic problem with CoD
`get_performance_stats`	Get performance stats for CoD vs CoT
`get_token_reduction`	Get token reduction statistics
`analyze_problem_complexity`	Analyze problem complexity

Developer Usage

Python Client

If you want to use the Chain of Draft client directly in your Python code:

from client import ChainOfDraftClient

# Create client 
cod_client = ChainOfDraftClient()

# Use directly
result = await cod_client.solve_with_reasoning(
    problem="Solve: 247 + 394 = ?",
    domain="math"
)

print(f"Answer: {result['final_answer']}")
print(f"Reasoning: {result['reasoning_steps']}")
print(f"Tokens used: {result['token_count']}")

JavaScript Client

For JavaScript/Node.js applications:

import { Anthropic } from "@anthropic-ai/sdk";
import dotenv from "dotenv";

// Load environment variables
dotenv.config();

// Create the Anthropic client
const anthropic = new Anthropic({
  apiKey: process.env.ANTHROPIC_API_KEY,
});

// Import the Chain of Draft client
import chainOfDraftClient from './lib/chain-of-draft-client.js';

// Use the client
async function solveMathProblem() {
  const result = await chainOfDraftClient.solveWithReasoning({
    problem: "Solve: 247 + 394 = ?",
    domain: "math",
    max_words_per_step: 5
  });
  
  console.log(`Answer: ${result.final_answer}`);
  console.log(`Reasoning: ${result.reasoning_steps}`);
  console.log(`Tokens used: ${result.token_count}`);
}

solveMathProblem();

Implementation Details

The server is available in both Python and JavaScript implementations, both consisting of several integrated components:

Python Implementation

AnalyticsService: Tracks performance metrics across different problem domains and reasoning approaches
ComplexityEstimator: Analyzes problems to determine appropriate word limits
ExampleDatabase: Manages and retrieves examples, transforming CoT examples to CoD format
FormatEnforcer: Ensures reasoning steps adhere to word limits
ReasoningSelector: Intelligently chooses between CoD and CoT based on problem characteristics

JavaScript Implementation

analyticsDb: In-memory database for tracking performance metrics
complexityEstimator: Analyzes problems to determine complexity and appropriate word limits
formatEnforcer: Ensures reasoning steps adhere to word limits
reasoningSelector: Automatically chooses between CoD and CoT based on problem characteristics and historical performance

Both implementations follow the same core principles and provide identical MCP tools, making them interchangeable for most use cases.

License

This project is open-source and available under the MIT license.

Install Server

HTTP connection URL

security – no known vulnerabilities

license - not found

quality - confirmed to work

How are these scores calculated?

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Tools

Implements the Chain of Draft reasoning approach to generate minimalistic intermediate reasoning outputs while solving tasks, significantly reducing token usage while maintaining accuracy.

Related Resources

Reddit Discussion about this server

Related MCP Servers

DeepSeek-Claude MCP Server
HarshJ23
A
security
A
license
A
quality
Enhances Claude's reasoning capabilities by integrating DeepSeek R1's advanced reasoning engine for intricate multi-step reasoning tasks with precision and efficiency.
Last updated -
1
35
Python
MIT License
DeepSeek-Claude MCP Server
ygboy1986
-
security
A
license
-
quality
Enhances Claude's reasoning capabilities by integrating DeepSeek R1's advanced reasoning engine, allowing Claude to tackle complex multi-step reasoning tasks with greater precision.
Last updated -
Python
MIT License
Sequential Thinking MCP Server
zalab-inc
-
security
-
license
-
quality
Provides a tool for dynamic and reflective problem-solving by breaking complex problems into manageable steps with support for revision, branching, and hypothesis generation.
Last updated -
13,824
JavaScript
think-mcp-server
marcopesani
-
security
A
license
-
quality
A minimal MCP Server that provides Claude AI models with the 'think' tool capability, enabling better performance on complex reasoning tasks by allowing the model to pause during response generation for additional thinking steps.
Last updated -
525
1
TypeScript
MIT License

View all related MCP servers

Chain of Draft (CoD) MCP Server