claude-cost-mcp
Compares costs across Gemini Flash and Pro 2.0 models in provider comparisons.
Compares costs across GPT-4o, GPT-4o-mini, and o1 models in provider comparisons.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@claude-cost-mcpHow much would a 2,000-token prompt with 800 tokens out cost on Sonnet?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
claude-cost-mcp
MCP server that estimates Claude API token counts, per-model cost, and prompt-caching break-even — right inside Claude Desktop, Claude Code, Cursor, or any MCP client. Korean/CJK token-aware. Fully local, no API key, no network.
Built by claudeguide.io.
Tools
Tool | What it does |
| Token count for any text (Korean/CJK-aware — Korean costs ~30% more than English) |
| Cost across Haiku 4.5 / Sonnet 4.5 / Opus 4.5, with optional prompt-caching ratio and call count |
| Prompt-caching savings + the ~1.28-reuse break-even point |
| Batch API savings (50% off both directions) for async workloads — daily and monthly $ |
| Cost table across 8 models / 3 providers (Claude vs GPT vs Gemini), sorted cheapest-first |
Pricing mirrors the 2026-05 canonical rates: Haiku $1/$5, Sonnet $3/$15, Opus $5/$25 per 1M tokens. Provider comparison uses verified 2026-05 public rates (OpenAI GPT-4o/-mini/o1, Google Gemini Flash/Pro 2.0).
Related MCP server: llm-token-tracker
Install
Claude Desktop
Add to claude_desktop_config.json:
{
"mcpServers": {
"claude-cost": {
"command": "npx",
"args": ["-y", "claude-cost-mcp"]
}
}
}Claude Code
claude mcp add claude-cost -- npx -y claude-cost-mcpThen ask: "How much would a 2,000-token prompt with 800 tokens out cost on each Claude model?" or "How many tokens is this Korean paragraph?"
Run locally
npm install
npm test # 10 tests: pricing logic + live MCP handshake
npm start # stdio serverWhy
Most developers guess at Claude costs and overpay — especially in Korean, where the same prompt uses ~1.3x the tokens. This puts accurate numbers one question away inside the tool you already use. Full benchmark and free calculators at claudeguide.io.
License
MIT
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/firstshout/claude-cost-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server