quelllm-mcp

Name: quelllm-mcp
Author: MGM-FALCON

by MGM-FALCON

Overview Schema Related Servers Score Discussions

Python

Remote

quelllm-mcp

MCP server exposing the quelllm.fr catalog of 190+ open-weights LLMs via Model Context Protocol tools. Use it from Claude Code, Cursor, Continue, or any MCP-compatible client to query models, compare them, estimate VRAM, and compute API vs self-hosted cost.

Tools exposed

Tool	Description
`list_models(filter_origin?, filter_family?, max_params_b?)`	List models with filters (origin code, family, max params in B)
`get_model(model_id)`	Full record for one model (params, vram per quant, context window, family, tags, license, URLs)
`compare(model_a_id, model_b_id)`	Side-by-side comparison with verdict
`estimate_vram(model_id, quant)`	VRAM in GB at chosen quant + recommended GPU/Mac tiers
`estimate_cost(input_tokens_per_month, output_tokens_per_month, ...)`	Cost in EUR — full table API providers vs self-hosted hardware OR a specific id
`search_models(query, limit?)`	Fuzzy search by name, family, tag, author

Related MCP server: HydraMCP

Install

Install from source (not yet on PyPI) :

pip install git+https://github.com/MGM-FALCON/quelllm-mcp.git

Or run without installing, using uv :

uvx --from git+https://github.com/MGM-FALCON/quelllm-mcp.git quelllm-mcp

For local development :

git clone https://github.com/MGM-FALCON/quelllm-mcp.git
cd quelllm-mcp
pip install -e .

Use with Claude Code

Add to ~/.claude.json or a project's .mcp.json. If you installed with pip :

{
  "mcpServers": {
    "quelllm": {
      "command": "quelllm-mcp"
    }
  }
}

Or zero-install with uvx :

{
  "mcpServers": {
    "quelllm": {
      "command": "uvx",
      "args": ["--from", "git+https://github.com/MGM-FALCON/quelllm-mcp.git", "quelllm-mcp"]
    }
  }
}

Use with Claude Desktop

Edit ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) :

{
  "mcpServers": {
    "quelllm": {
      "command": "quelllm-mcp"
    }
  }
}

Use with Cursor / Continue / Cline

Most MCP clients accept the same JSON config :

{
  "command": "quelllm-mcp"
}

Example queries (from your client)

> Quels LLM Mistral peuvent tourner sur RTX 5070 Ti 16GB ?
→ list_models(filter_family='Mistral', max_params_b=24)
→ estimate_vram('mistral-small-24b', 'q4')

> Compare Llama 3.3 70B vs Qwen 2.5 32B
→ compare('llama33-70b', 'qwen25-32b')

> J'utilise 10M tokens input + 2.5M output / mois. Combien je paye chez OpenAI vs DeepSeek ?
→ estimate_cost(10_000_000, 2_500_000)

Data source

All data pulled from quelllm.fr/api/ (CC BY 4.0, no key, CORS-enabled). Cached locally for 1h to avoid rate-limiting.

API pricing data (GPT-5, Claude Opus 4.7, Gemini 2.5, DeepSeek, Mistral) and hardware pricing (RTX 50-series, Mac M4) are hardcoded as of 2026-05 — verify semestrially.

License

MIT — see LICENSE.

Contributing

Source : https://github.com/MGM-FALCON/quelllm-mcp Issues + PRs welcome. Particularly :

API pricing updates (semestrial)
Hardware additions (new GPUs, Mac Mx series)
New tools (e.g. find_alternatives_to(model_id), recommend_gpu(budget_eur))

Tests

A pytest smoke suite lives under tests/. It covers all 6 tools and the v1.1.0 output invariants, never touches the network (local fixture + mocked httpx), and stubs the mcp SDK when it isn't importable — so it also runs on Python 3.9.

pip install -e ".[test]"
pytest

Author

Mohamed Meguedmi — LinkedIn · Hugging Face Founder of La Gazette IA and QuelLLM.fr.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

3wRelease cycle

2Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Related MCP Servers

MCP Server with Local LLM
Code Analysis Text Summarization Language Translation
0xsaju
F
license
-
quality
D
maintenance
Integrates local language models (like Qwen3-8B) with MCP clients, providing tools for chat, code analysis, text generation, translation, and content summarization using your own hardware.
Last updated 2025-09-25
HydraMCP
RAG Systems Agent Orchestration Autonomous Agents
Pickle-Pixel
A
license
A
quality
C
maintenance
An MCP server that enables users to query, compare, and synthesize responses from multiple local and cloud LLMs simultaneously using existing subscriptions. It provides tools for parallel model evaluation, consensus polling with an LLM-as-judge, and response synthesis across different model providers.
Last updated 2026-02-08
8
32
15
MIT
mcp-turboquant
AI & Machine Learning Developer Tools Code Execution
ShipItAndPray
A
license
A
quality
C
maintenance
MCP server for LLM quantization. Compress any HuggingFace model to GGUF, GPTQ, or AWQ format. 6 tools: info, check, recommend, quantize, evaluate, push. Self-contained Python server — no external CLI needed.
Last updated 2026-04-02
6
4
MIT
OpenRouter MCP Server
AI & Machine Learning Search Developer Tools
lumishoang
F
license
A
quality
C
maintenance
An MCP server for discovering and querying over 300 AI models available on OpenRouter. It enables users to list, search, filter, compare, and get detailed information about models with pricing, context limits, and capabilities.
Last updated 2026-04-21
5
1

View all related MCP servers

Related MCP Connectors

mcp-aichat
MCP server for AI dialogue using various LLM models via AceDataCloud
TokenOracle
Hosted MCP server for LLM cost estimation, model comparison, and budget-aware routing.
mcp
MCP server providing access to the Scorecard API to evaluate and optimize LLM systems.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/MGM-FALCON/quelllm-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

quelllm-mcp

Tools exposed

Install

Use with Claude Code

Use with Claude Desktop

Use with Cursor / Continue / Cline

Example queries (from your client)

Data source

License

Contributing

Tests

Author

Maintenance

Resources

Looking for Admin?

Tools

Related MCP Servers

MCP Server with Local LLM

HydraMCP

mcp-turboquant

OpenRouter MCP Server

Related MCP Connectors

Latest Blog Posts

MCP directory API