Which integrations are available for this server?

Allows interaction with local Ollama models through the LiteLLM gateway for single-turn and multi-turn queries.

How do I use litellm-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@litellm-mcp explain the concept of recursion" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

litellm-mcp

by LeonRLewis

Overview Schema Related Servers Score Discussions

Python

Hybrid

litellm-mcp

MCP server that lets Claude Code dispatch tasks to the LiteLLM gateway (192.168.10.175:4000) as worker agents. Unlike ollama-mcp (which reaches only the local Ollama models directly), this shim reaches the full gateway model set — the local Ollama models and the Anthropic-via-gateway models (claude-sonnet-4-6, claude-haiku-4-5).

Tools

list_models() — model ids registered on the gateway.
ask_model(prompt, model=None, system=None) — single-turn worker call. Defaults to qwen2.5:7b-instruct (a reliable non-reasoning local model — a safer headless default than the reasoning model qwen3.5:4b).
chat(messages, model=None) — multi-turn.

Related MCP server: Claude-LMStudio-Bridge

Credentials — per-request header (not at rest on Proteus)

This shim brokers spend on Anthropic, so the gateway master key is sensitive. It is not stored in /etc/mcp/litellm-mcp.env on the shared Proteus host. Instead the key travels as a per-request header from the client's ~/.claude.json:

"litellm": {
    "type": "sse",
    "url": "http://192.168.10.190:8107/sse",
    "headers": { "x-litellm-key": "<LITELLM_MASTER_KEY from Infisical>" }
}

The server reads x-litellm-key (or Authorization: Bearer ...) from each tool call. For local/stdio use, LITELLM_API_KEY env is accepted as a fallback.

Config (non-sensitive, env file on Proteus)

Var	Default
`LITELLM_BASE_URL`	`http://192.168.10.175:4000/v1`
`LITELLM_DEFAULT_MODEL`	`qwen2.5:7b-instruct`
`MCP_TRANSPORT` / `MCP_HOST` / `MCP_PORT`	`sse` / `0.0.0.0` / `8107`

Tests

pip install -e ".[test]"
pytest -q                       # unit (respx-mocked)
LITELLM_API_KEY=... pytest -q   # also runs live integration against the gateway

Integration tests skip automatically when the gateway is unreachable or LITELLM_API_KEY is unset (e.g. in CI on the GitLab runner). Deploy is via GitLab CI → Proteus (/opt/mcp/litellm-mcp/, systemd litellm-mcp.service, port 8107).

This server cannot be installed

license - not found

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/LeonRLewis/litellm-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server