Token Analyzer MCP

IMPLEMENTATION_NOTES.md•7.95 kB

# Token Analyzer MCP Server - Implementation Notes **Date:** 2025-11-12 **Implementation:** Option C - Built real MCP server from functional CLI tool --- ## What Was Done ### 1. Replaced Stub Implementation with Real Token Counting **Before:** - MCP server returned hardcoded mock data - Tools like `analyze_token_usage` returned static values (total_tokens: 15420, etc.) - No actual token counting occurred **After:** - Integrated real TokenMeasurementEngine from the functional CLI version - All tools now perform actual token counting and analysis - Uses character-based approximation (~3.5-4 chars/token for technical content) ### 2. New MCP Server Tools The server now exposes 6 functional tools: #### `analyze_mcp_servers` - Analyzes token consumption of all configured MCP servers - Provides detailed breakdown per server - Includes incremental impact analysis (cumulative effect) - Generates optimization recommendations - **Uses real analysis**: Connects to MCP servers and measures actual token overhead #### `count_text_tokens` - Counts tokens in arbitrary text - Returns token count, character count, word count - Shows average chars per token - **Perfect for**: Counting tokens in config files, code, documentation #### `analyze_context_efficiency` - Evaluates total context window usage - Shows overhead percentage (builtin tools + MCP servers) - Provides status: optimal/acceptable/moderate/high - **Returns**: Available context tokens and percentage #### `get_mcp_configuration` - Shows current MCP server configuration - Lists all servers with their status (active/disabled) - Displays configuration path - **No server connection needed**: Fast configuration review #### `quick_overhead_estimate` - Fast estimation without connecting to servers - Uses conservative estimate (1500 tokens per server) - Shows active servers and estimated overhead - **Use case**: Quick check before full analysis #### `analyze_file_tokens` - Count tokens in single file or multiple files - Returns per-file breakdown (tokens, characters, lines) - Provides summary statistics - **Use case**: Analyzing CLAUDE.md config files, scripts, etc. --- ## Implementation Details ### Architecture ``` index.ts (MCP Server) ├─ TokenMeasurementEngine.js - Character-based token approximation ├─ IncrementalImpactAnalyzer.js - Full MCP analysis orchestration ├─ MCPConfigurationAnalyzer.js - Config file discovery and parsing ├─ MCPSchemaExtractor.js - MCP server connection and schema extraction └─ ReportGenerator.js - Formatting (not used by MCP server directly) ``` ### Key Changes to index.ts 1. **Added imports** for analysis engines: ```typescript import { TokenMeasurementEngine } from "./TokenMeasurementEngine.js"; import { IncrementalImpactAnalyzer } from "./IncrementalImpactAnalyzer.js"; import { MCPConfigurationAnalyzer } from "./MCPConfigurationAnalyzer.js"; ``` 2. **Initialized engines** at server startup: ```typescript const tokenEngine = new TokenMeasurementEngine(); const impactAnalyzer = new IncrementalImpactAnalyzer(); const configAnalyzer = new MCPConfigurationAnalyzer(); ``` 3. **Replaced mock data** with real analysis calls: - `analyze_mcp_servers`: Calls `impactAnalyzer.performCompleteAnalysis()` - `count_text_tokens`: Calls `tokenEngine.approximateTokenCount(text)` - `analyze_file_tokens`: Reads files and uses `tokenEngine.approximateTokenCount()` 4. **Added proper error handling** with structured error responses ### Build Process - TypeScript source in `src/index.ts` - Compiled output in `dist/index.js` - Build command: `npm run build` (runs `tsc`) - MCP config points to `dist/index.js` --- ## Testing Results ### ✅ Server Startup ```bash node dist/index.js # Successfully starts MCP server ``` ### ✅ Tools List ```bash echo '{"jsonrpc":"2.0","id":1,"method":"tools/list","params":{}}' | node dist/index.js # Returns 6 tools with proper schemas ``` ### ✅ Token Counting (Verified Earlier) Using functional TokenMeasurementEngine to count CLAUDE.md + imports: - **Result**: 7,915 tokens (3.96% of 200k context) - **Method**: Character-based approximation - **Accuracy**: Suitable for technical content analysis --- ## Configuration ### MCP Config Location `~/.claude/config.json` ### Server Entry ```json { "token-analyzer": { "command": "node", "args": ["/home/cordlesssteve/projects/Utility/DEV-TOOLS/mcp-workspace/servers/your-servers/token-analyzer/dist/index.js"], "cwd": "/home/cordlesssteve/projects/Utility/DEV-TOOLS/mcp-workspace/servers/your-servers/token-analyzer", "env": {} } } ``` **Note**: Changed from `src/index.js` (CLI tool) to `dist/index.js` (compiled MCP server) --- ## Usage Examples ### Example 1: Count Tokens in Text ```json { "tool": "count_text_tokens", "arguments": { "text": "This is sample text to count tokens." } } ``` **Response:** ```json { "tokens": 10, "characters": 37, "words": 7, "avg_chars_per_token": "3.70", "method": "character-based approximation (~3.5-4 chars/token)" } ``` ### Example 2: Analyze File Tokens ```json { "tool": "analyze_file_tokens", "arguments": { "files": [ "/home/user/.claude/CLAUDE.md", "/home/user/.claude/config/intellectual-honesty.md" ] } } ``` ### Example 3: Quick MCP Overhead Estimate ```json { "tool": "quick_overhead_estimate", "arguments": {} } ``` **Response:** ```json { "estimate": { "builtin_tools": 2250, "active_servers": 8, "estimated_mcp_tokens": 12000, "total_estimated": 14250, "percentage": "7.1" }, "note": "This is a fast estimate. Use 'analyze_mcp_servers' for accurate measurement." } ``` --- ## Differences from Original Stub | Feature | Stub (Before) | Real Implementation (After) | |---------|---------------|----------------------------| | Token counting | Hardcoded values | Real character-based approximation | | MCP analysis | Mock data | Connects to actual MCP servers | | Tools available | 8 (all mocked) | 6 (all functional) | | File analysis | N/A | Can analyze arbitrary files | | Configuration | N/A | Reads real MCP config | | Error handling | Generic | Structured with stack traces | | Performance | Instant (fake) | Real analysis (~5-10s for full scan) | --- ## Maintenance ### Rebuilding After Changes ```bash cd ~/projects/Utility/DEV-TOOLS/mcp-workspace/servers/your-servers/token-analyzer npm run build ``` ### Testing MCP Server ```bash # List tools echo '{"jsonrpc":"2.0","id":1,"method":"tools/list","params":{}}' | node dist/index.js | jq # Test count_text_tokens echo '{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{"name":"count_text_tokens","arguments":{"text":"Hello world"}}}' | node dist/index.js | jq ``` ### Dependencies All required dependencies already present: - `@modelcontextprotocol/sdk` - MCP protocol implementation - Functional analysis modules (TokenMeasurementEngine, etc.) --- ## Known Limitations 1. **Token counting is approximate** - Uses character-based heuristic, not Anthropic's actual tokenizer 2. **MCP server analysis requires servers to be accessible** - Some servers may timeout or fail to connect 3. **No caching** - Each analysis connects to MCP servers fresh (could be optimized) 4. **TypeScript types** - Used `any` for some complex return types (could be properly typed) --- ## Future Enhancements 1. **Add Anthropic tokenizer** - Use official tokenizer for exact counts 2. **Add caching** - Cache MCP server analysis results 3. **Add session tracking** - Track token usage over time (would require persistence) 4. **Add comparison tools** - Compare token usage across different configurations 5. **Add proper TypeScript types** - Define interfaces for analysis results --- **Status:** ✅ COMPLETE - Real token counting MCP server operational **Next Session:** Will have access to functional token-analyzer via MCP tools

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/cordlesssteve/token-analyzer-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server