tokencost-mcp-server

by ankit-aglawe

tokencost_find_cheapest

Compare LLM models and identify the most cost-effective ones, optionally filtering by provider or minimum context window and sorting by input, output, or combined pricing.

Instructions

Find the cheapest LLM models, optionally filtered by provider or minimum context window.

Args:

  • provider (string, optional): Filter by provider (e.g., "OpenAI", "Anthropic", "Google")

  • min_context (number, optional): Minimum context window size in tokens

  • sort_by (string, optional): Sort by "input", "output", or "combined" cost (default: "combined")

  • limit (number, optional): Number of results to return (default: 10, max: 30)

Returns: Ranked list of cheapest models with pricing details.
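Alongside the human-readable ranking, the tool also returns machine-readable `structuredContent`. An illustrative payload is sketched below; the field names follow `modelToJSON` in the implementation, but the model, its prices, and context window are made up:

```typescript
// Illustrative structuredContent payload; "example-model" and its
// numbers are hypothetical, only the shape matches the tool's output.
const structuredContent = {
  models: [
    {
      id: "example-model",
      name: "Example Model",
      provider: "ExampleAI",
      input_per_1m_tokens: 0.1,
      output_per_1m_tokens: 0.4,
      context_window: 200000,
      max_output: 8192,
    },
  ],
  total_matched: 1,
  filters: { provider: null, min_context: 200000, sort_by: "input" },
};
console.log(structuredContent.total_matched);
```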

Examples:

  • {} → Top 10 cheapest models overall

  • { provider: "OpenAI" } → Cheapest OpenAI models

  • { min_context: 200000, sort_by: "input" } → Cheapest 200K+ context models by input price
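Over MCP, argument objects like these travel inside a standard `tools/call` JSON-RPC request. A sketch of the envelope for the last example above (the `id` is arbitrary; only `params.arguments` is specific to this tool):

```typescript
// Sketch of the JSON-RPC request an MCP client would send to invoke
// the tool with { min_context: 200000, sort_by: "input" }.
const request = {
  jsonrpc: "2.0",
  id: 1,
  method: "tools/call",
  params: {
    name: "tokencost_find_cheapest",
    arguments: { min_context: 200000, sort_by: "input" },
  },
};
console.log(JSON.stringify(request));
```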

Input Schema

| Name | Required | Description | Default |
|------|----------|-------------|---------|
| provider | No | Filter by provider name (e.g., 'OpenAI', 'Anthropic') | — |
| min_context | No | Minimum context window size in tokens | — |
| sort_by | No | Sort by input, output, or combined cost | combined |
| limit | No | Number of results to return | 10 |

Implementation Reference

  • The handler function for tokencost_find_cheapest that filters models by provider and min_context, sorts by input/output/combined cost, and returns a ranked list of cheapest models in both markdown table and structured JSON format.
    async ({ provider, min_context, sort_by, limit }) => {
      let filtered = [...models];
    
      if (provider) {
        const p = provider.toLowerCase();
        filtered = filtered.filter(m => m.provider.toLowerCase() === p);
        if (filtered.length === 0) {
          return {
            content: [{ type: "text", text: `No models found for provider "${provider}". Available: ${providers.join(", ")}` }],
          };
        }
      }
    
      if (min_context) {
        filtered = filtered.filter(m => m.contextWindow >= min_context);
        if (filtered.length === 0) {
          return {
            content: [{ type: "text", text: `No models with context window >= ${min_context.toLocaleString()} tokens.` }],
          };
        }
      }
    
      const sortFn = sort_by === "input"
        ? (a: ModelPricing, b: ModelPricing) => a.inputPer1M - b.inputPer1M
        : sort_by === "output"
          ? (a: ModelPricing, b: ModelPricing) => a.outputPer1M - b.outputPer1M
          : (a: ModelPricing, b: ModelPricing) => (a.inputPer1M + a.outputPer1M) - (b.inputPer1M + b.outputPer1M);
    
      filtered.sort(sortFn);
      const top = filtered.slice(0, limit);
    
      const lines = [
        `# Cheapest Models${provider ? ` (${provider})` : ""}${min_context ? ` — ${formatContext(min_context)}+ context` : ""}`,
        `*Sorted by ${sort_by} cost*`,
        "",
        "| # | Model | Provider | Input/1M | Output/1M | Context |",
        "|---|-------|----------|----------|-----------|---------|",
      ];
    
      top.forEach((m, i) => {
        lines.push(`| ${i + 1} | ${m.name} | ${m.provider} | $${m.inputPer1M} | $${m.outputPer1M} | ${formatContext(m.contextWindow)} |`);
      });
    
      lines.push("", `*${filtered.length} total models matched. Data from [TokenCost](https://tokencost.app)*`);
    
      return {
        content: [{ type: "text", text: lines.join("\n") }],
        structuredContent: {
          models: top.map(modelToJSON),
          total_matched: filtered.length,
          filters: { provider: provider ?? null, min_context: min_context ?? null, sort_by },
        },
      };
    }
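The sort selection above can be exercised in isolation. A minimal sketch using illustrative (not real) per-1M-token prices shows why `sort_by` matters: the model with the cheapest input is not necessarily cheapest overall:

```typescript
// Two hypothetical models: model-a has cheaper input, model-b has the
// lower combined (input + output) cost.
type Pricing = { name: string; inputPer1M: number; outputPer1M: number };

const sample: Pricing[] = [
  { name: "model-a", inputPer1M: 0.5, outputPer1M: 4.0 },
  { name: "model-b", inputPer1M: 1.0, outputPer1M: 2.0 },
];

const byInput = [...sample].sort((a, b) => a.inputPer1M - b.inputPer1M);
const byCombined = [...sample].sort(
  (a, b) => (a.inputPer1M + a.outputPer1M) - (b.inputPer1M + b.outputPer1M),
);

console.log(byInput[0].name);    // model-a
console.log(byCombined[0].name); // model-b
```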
  • Tool registration and schema definition for tokencost_find_cheapest, including title, description, inputSchema with provider (optional string), min_context (optional number), sort_by (enum: input/output/combined with default 'combined'), and limit (1-30, default 10).
    server.registerTool(
      "tokencost_find_cheapest",
      {
        title: "Find Cheapest Models",
        description: `Find the cheapest LLM models, optionally filtered by provider or minimum context window.
    
    Args:
      - provider (string, optional): Filter by provider (e.g., "OpenAI", "Anthropic", "Google")
      - min_context (number, optional): Minimum context window size in tokens
      - sort_by (string, optional): Sort by "input", "output", or "combined" cost (default: "combined")
      - limit (number, optional): Number of results to return (default: 10, max: 30)
    
    Returns:
      Ranked list of cheapest models with pricing details.
    
    Examples:
      - {} → Top 10 cheapest models overall
      - { provider: "OpenAI" } → Cheapest OpenAI models
      - { min_context: 200000, sort_by: "input" } → Cheapest 200K+ context models by input price`,
        inputSchema: {
          provider: z.string().optional().describe("Filter by provider name (e.g., 'OpenAI', 'Anthropic')"),
          min_context: z.number().int().min(0).optional().describe("Minimum context window size in tokens"),
          sort_by: z.enum(["input", "output", "combined"]).default("combined").describe("Sort by input, output, or combined cost"),
          limit: z.number().int().min(1).max(30).default(10).describe("Number of results to return"),
        },
        annotations: {
          readOnlyHint: true,
          destructiveHint: false,
          idempotentHint: true,
          openWorldHint: false,
        },
      },
  • formatContext helper function used by the handler to format context window sizes in human-readable format (K/M tokens).
    function formatContext(n: number): string {
      if (n >= 1000000) return `${(n / 1000000).toFixed(1)}M`;
      return `${(n / 1000).toFixed(0)}K`;
    }
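This helper produces the values in the table's Context column. Restating it so the boundary cases can be checked directly:

```typescript
// formatContext as defined above: >= 1M tokens renders as "X.XM",
// anything smaller as whole thousands "NK".
function formatContext(n: number): string {
  if (n >= 1000000) return `${(n / 1000000).toFixed(1)}M`;
  return `${(n / 1000).toFixed(0)}K`;
}

console.log(formatContext(200000));  // "200K"
console.log(formatContext(1000000)); // "1.0M"
```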
  • modelToJSON helper function used by the handler to convert ModelPricing objects to JSON format for structured output.
    function modelToJSON(m: ModelPricing) {
      return {
        id: m.id,
        name: m.name,
        provider: m.provider,
        input_per_1m_tokens: m.inputPer1M,
        output_per_1m_tokens: m.outputPer1M,
        context_window: m.contextWindow,
        max_output: m.maxOutput,
        ...(m.notes ? { notes: m.notes } : {}),
      };
    }
