Skip to main content
Glama

Chain of Draft (CoD) MCP Server

by stat-guy

Chain of Draft (CoD) MCP Server

Overview

This MCP server implements the Chain of Draft (CoD) reasoning approach as described in the research paper "Chain of Draft: Thinking Faster by Writing Less". CoD is a novel paradigm that allows LLMs to generate minimalistic yet informative intermediate reasoning outputs while solving tasks, significantly reducing token usage while maintaining accuracy.

Key Benefits

  • Efficiency: Significantly reduced token usage (as little as 7.6% of standard CoT)

  • Speed: Faster responses due to shorter generation time

  • Cost Savings: Lower API costs for LLM calls

  • Maintained Accuracy: Similar or even improved accuracy compared to CoT

  • Flexibility: Applicable across various reasoning tasks and domains

Features

  1. Core Chain of Draft Implementation

    • Concise reasoning steps (typically 5 words or less)

    • Format enforcement

    • Answer extraction

  2. Performance Analytics

    • Token usage tracking

    • Solution accuracy monitoring

    • Execution time measurement

    • Domain-specific performance metrics

  3. Adaptive Word Limits

    • Automatic complexity estimation

    • Dynamic adjustment of word limits

    • Domain-specific calibration

  4. Comprehensive Example Database

    • CoT to CoD transformation

    • Domain-specific examples (math, code, biology, physics, chemistry, puzzle)

    • Example retrieval based on problem similarity

  5. Format Enforcement

    • Post-processing to ensure adherence to word limits

    • Step structure preservation

    • Adherence analytics

  6. Hybrid Reasoning Approaches

    • Automatic selection between CoD and CoT

    • Domain-specific optimization

    • Historical performance-based selection

  7. OpenAI API Compatibility

    • Drop-in replacement for standard OpenAI clients

    • Support for both completions and chat interfaces

    • Easy integration into existing workflows

Setup and Installation

Prerequisites

  • Python 3.10+ (for Python implementation)

  • Node.js 18+ (for JavaScript implementation)

  • Anthropic API key

Python Installation

  1. Clone the repository

  2. Install dependencies:

    pip install -r requirements.txt
  3. Configure API keys in .env file:

    ANTHROPIC_API_KEY=your_api_key_here
  4. Run the server:

    python server.py

JavaScript Installation

  1. Clone the repository

  2. Install dependencies:

    npm install
  3. Configure API keys in .env file:

    ANTHROPIC_API_KEY=your_api_key_here
  4. Run the server:

    node index.js

Claude Desktop Integration

To integrate with Claude Desktop:

  1. Install Claude Desktop from claude.ai/download

  2. Create or edit the Claude Desktop config file:

    ~/Library/Application Support/Claude/claude_desktop_config.json
  3. Add the server configuration (Python version):

    { "mcpServers": { "chain-of-draft": { "command": "python3", "args": ["/absolute/path/to/cod/server.py"], "env": { "ANTHROPIC_API_KEY": "your_api_key_here" } } } }

    Or for the JavaScript version:

    { "mcpServers": { "chain-of-draft": { "command": "node", "args": ["/absolute/path/to/cod/index.js"], "env": { "ANTHROPIC_API_KEY": "your_api_key_here" } } } }
  4. Restart Claude Desktop

You can also use the Claude CLI to add the server:

# For Python implementation claude mcp add chain-of-draft -e ANTHROPIC_API_KEY="your_api_key_here" "python3 /absolute/path/to/cod/server.py" # For JavaScript implementation claude mcp add chain-of-draft -e ANTHROPIC_API_KEY="your_api_key_here" "node /absolute/path/to/cod/index.js"

Available Tools

The Chain of Draft server provides the following tools:

Tool

Description

chain_of_draft_solve

Solve a problem using Chain of Draft reasoning

math_solve

Solve a math problem with CoD

code_solve

Solve a coding problem with CoD

logic_solve

Solve a logic problem with CoD

get_performance_stats

Get performance stats for CoD vs CoT

get_token_reduction

Get token reduction statistics

analyze_problem_complexity

Analyze problem complexity

Developer Usage

Python Client

If you want to use the Chain of Draft client directly in your Python code:

from client import ChainOfDraftClient # Create client cod_client = ChainOfDraftClient() # Use directly result = await cod_client.solve_with_reasoning( problem="Solve: 247 + 394 = ?", domain="math" ) print(f"Answer: {result['final_answer']}") print(f"Reasoning: {result['reasoning_steps']}") print(f"Tokens used: {result['token_count']}")

JavaScript Client

For JavaScript/Node.js applications:

import { Anthropic } from "@anthropic-ai/sdk"; import dotenv from "dotenv"; // Load environment variables dotenv.config(); // Create the Anthropic client const anthropic = new Anthropic({ apiKey: process.env.ANTHROPIC_API_KEY, }); // Import the Chain of Draft client import chainOfDraftClient from './lib/chain-of-draft-client.js'; // Use the client async function solveMathProblem() { const result = await chainOfDraftClient.solveWithReasoning({ problem: "Solve: 247 + 394 = ?", domain: "math", max_words_per_step: 5 }); console.log(`Answer: ${result.final_answer}`); console.log(`Reasoning: ${result.reasoning_steps}`); console.log(`Tokens used: ${result.token_count}`); } solveMathProblem();

Implementation Details

The server is available in both Python and JavaScript implementations, both consisting of several integrated components:

Python Implementation

  1. AnalyticsService: Tracks performance metrics across different problem domains and reasoning approaches

  2. ComplexityEstimator: Analyzes problems to determine appropriate word limits

  3. ExampleDatabase: Manages and retrieves examples, transforming CoT examples to CoD format

  4. FormatEnforcer: Ensures reasoning steps adhere to word limits

  5. ReasoningSelector: Intelligently chooses between CoD and CoT based on problem characteristics

JavaScript Implementation

  1. analyticsDb: In-memory database for tracking performance metrics

  2. complexityEstimator: Analyzes problems to determine complexity and appropriate word limits

  3. formatEnforcer: Ensures reasoning steps adhere to word limits

  4. reasoningSelector: Automatically chooses between CoD and CoT based on problem characteristics and historical performance

Both implementations follow the same core principles and provide identical MCP tools, making them interchangeable for most use cases.

License

This project is open-source and available under the MIT license.

Deploy Server
A
security – no known vulnerabilities
F
license - not found
A
quality - confirmed to work

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Implements the Chain of Draft reasoning approach to generate minimalistic intermediate reasoning outputs while solving tasks, significantly reducing token usage while maintaining accuracy.

  1. Overview
    1. Key Benefits
      1. Features
        1. Setup and Installation
          1. Prerequisites
          2. Python Installation
          3. JavaScript Installation
        2. Claude Desktop Integration
          1. Available Tools
            1. Developer Usage
              1. Python Client
              2. JavaScript Client
            2. Implementation Details
              1. Python Implementation
              2. JavaScript Implementation
            3. License

              Related MCP Servers

              • A
                security
                A
                license
                A
                quality
                Enhances AI model capabilities with structured, retrieval-augmented thinking processes that enable dynamic thought chains, parallel exploration paths, and recursive refinement cycles for improved reasoning.
                Last updated -
                16
                MIT License
                • Apple
              • A
                security
                A
                license
                A
                quality
                Chain of Draft Server is a powerful AI-driven tool that helps developers make better decisions through systematic, iterative refinement of thoughts and designs. It integrates seamlessly with popular AI agents and provides a structured approach to reasoning, API design, architecture decisions, code r
                Last updated -
                1
                14
                24
                MIT License
              • A
                security
                A
                license
                A
                quality
                An enhanced sequential thinking tool optimized for programming tasks that helps break down complex coding problems into structured, self-auditing thought steps with branching and revision capabilities.
                Last updated -
                1
                238
                236
                MIT License
                • Apple
              • A
                security
                A
                license
                A
                quality
                Provides structured sequential thinking capabilities for AI assistants to break down complex problems into manageable steps, revise thoughts, and explore alternative reasoning paths.
                Last updated -
                16
                MIT License

              View all related MCP servers

              MCP directory API

              We provide all the information about MCP servers via our MCP API.

              curl -X GET 'https://glama.ai/api/mcp/v1/servers/stat-guy/chain-of-draft'

              If you have feedback or need assistance with the MCP directory API, please join our Discord server