Skip to main content
Glama
ingpoc

Token-Efficient MCP Server

by ingpoc

Token-Efficient MCP Server

A project-agnostic MCP (Model Context Protocol) server that provides 95%+ token savings through sandboxed data processing, progressive tool loading, and multi-language code execution.

πŸš€ Key Features

Multi-Language Code Execution

Execute code in sandboxed environment:

  • Python, Bash, Node.js/JavaScript support

  • Critical for agent systems (initializer, coding, tester, verifier)

  • Run commands, tests, and validations with 98% token savings

Progressive Tool Disclosure

Load tools on-demand to reduce context usage:

  • Level 1: Tool names only (~100 tokens)

  • Level 2: Names + summaries (~2K tokens)

  • Level 3: Full definitions (only when needed)

Sandboxed Data Processing

Process data securely before returning to context:

  • CSV filtering and aggregation (99% savings)

  • Log file analysis (95% savings)

  • Code execution with output filtering

  • Token measurement and optimization

Project Agnostic

Works with any project that needs:

  • Multi-language code execution

  • Large dataset processing

  • Log analysis

  • Token optimization

πŸ“¦ Installation

# Clone the repository
git clone https://github.com/your-repo/token-efficient-mcp.git
cd token-efficient-mcp

# Install dependencies
npm install

# Build TypeScript
npm run build

βš™οΈ Configuration

Add to your global ~/.claude.json:

{
  "mcpServers": {
    "token-efficient": {
      "command": "srt",
      "args": [
        "node",
        "/path/to/token-efficient-mcp/dist/index.js"
      ]
    }
  }
}

Note: The srt command provides OS-level sandboxing via sandbox-exec (macOS) or bubblewrap (Linux).

πŸ› οΈ Available Tools

1. execute_code

Execute code in multiple languages with sandboxing.

// Run bash commands
execute_code({
  code: "npm test",
  language: "bash"
})

// Run Python scripts
execute_code({
  code: "import sys; print(sys.version)",
  language: "python"
})

// Run Node.js code
execute_code({
  code: "console.log('Hello from Node')",
  language: "node"
})

// Check health endpoint
execute_code({
  code: "curl -m 3 http://localhost:8000/api/health",
  language: "bash"
})

Supported Languages: python, bash, sh, node, javascript

2. list_token_efficient_tools

Discover available tools with progressive disclosure.

// Level 1: Names only (100 tokens)
list_token_efficient_tools({ level: "names_only" })

// Level 2: Summaries (2K tokens)
list_token_efficient_tools({ level: "summary" })

// Level 3: Full definitions
list_token_efficient_tools({ level: "full" })

4. process_csv

Process CSV files with filtering and aggregation.

// Example: Find expensive stocks
process_csv({
  file_path: "data/stocks.csv",
  filter_expr: "price > 100 and volume > 1000000",
  columns: ["symbol", "price", "volume", "change"],
  limit: 10,
  response_format: "summary"
})

5. process_logs

Filter and analyze log files efficiently.

// Example: Find all errors with context
process_logs({
  file_path: "logs/application.log",
  pattern: "ERROR|CRITICAL",
  context_lines: 2,
  limit: 50,
  response_format: "summary"
})

6. get_token_savings_report

Get optimization tips and savings potential.

πŸ“Š Token Savings Examples

Code Execution

// Without execute_code: Multi-turn conversation
// Agent: "Should I run npm test?" β†’ User: "Yes" β†’ Run β†’ Parse output
// Estimated: 5,000+ tokens across multiple turns

// With execute_code: Single call
execute_code({ code: "npm test", language: "bash" })
// Returns: { success: true, output: "Tests passed", exit_code: 0 }
// Result: 200 tokens (98% savings)

CSV Processing

// Without optimization: 200,000 tokens
// All 10,000 rows returned to context

// With token-efficient MCP: 2,000 tokens (99% savings)
// Only 100 filtered rows returned

Log Analysis

// Without optimization: 500,000 tokens
// All 100,000 log lines returned

// With token-efficient MCP: 5,000 tokens (99% savings)
// Only 500 matching lines with context returned

Tool Loading

// Traditional MCP: 150,000 tokens
// All tool definitions loaded at startup

// Token-efficient MCP: 2,000 tokens (98.7% savings)
// Tools loaded on-demand

πŸ”’ Security

The server uses OS-level sandboxing via srt wrapper:

  • Filesystem isolation: Limited to temp directories for code execution

  • Network restrictions: No outbound connections by default

  • Process monitoring: Timeouts (1-300s) and resource limits

  • Multi-language support: Sandboxed Python, Bash, Node.js execution

πŸ§ͺ Testing

# Build the project
npm run build

# Test execute_code tool
node -e "
const { exec } = require('child_process');
const code = \`echo 'Hello from test'\`;
exec(\`node dist/index.js\`, (err, stdout) => {
  console.log(stdout);
});
"

# Or test directly with MCP
# The server will be loaded by Claude Code via ~/.claude.json config

πŸ“ˆ Performance Metrics

The server tracks and reports:

  • Input tokens: Size of request

  • Output tokens: Size of response

  • Processing efficiency: Items processed per token

  • Estimated savings: Percentage of tokens saved

Example response:

{
  "token_metrics": {
    "input_tokens": 250,
    "output_tokens": 1500,
    "estimated_savings_percent": 98.5
  }
}

🀝 Contributing

  1. Fork the repository

  2. Create a feature branch

  3. Add tests for new functionality

  4. Ensure token efficiency principles are followed

  5. Submit a pull request

πŸ“ License

MIT License - see LICENSE file for details.

Install Server
A
security – no known vulnerabilities
F
license - not found
A
quality - confirmed to work

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ingpoc/token-efficient-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server