Skip to main content
Glama

Multi-AI Hub

Privacy-first local MCP hub for coordinating multiple AI providers from Claude Code.

The hub runs on your machine, exposes a small Model Context Protocol server, and lets a commander agent ask bounded questions to other providers without handing them your whole workspace. It supports local Ollama seats for private work, cloud seats for non-sensitive critique, and a structured council mode for independent review.

What It Does

  • Exposes MCP tools for Claude Code: ask_openai, ask_gemini, ask_ollama, ask_qwen, ask_llama_fast, council_review, multi_ai_status, and optional cloud seats.

  • Keeps API keys server-side in .env; they are never sent to the browser.

  • Uses local Ollama seats for private review and offline fallback.

  • Blocks obvious sensitive prompts from cloud seats with a fail-closed routing check.

  • Supports optional OpenAI-compatible cloud reviewers: Groq, Cerebras, Mistral, OpenRouter, GLM, and NVIDIA NIM.

  • Provides smoke tests and a secret/private marker scanner for safer sharing.

Related MCP server: MCP-AI-Gateway

Providers

Base providers:

  • Claude via Claude Code CLI / Claude Max, or Anthropic API

  • OpenAI API

  • Gemini API

  • Ollama local models

Council seats:

  • Local: Qwen 30B, fast local seat, manual DeepSeek

  • Cloud: Gemini, Groq, Cerebras, Mistral, OpenRouter, GLM, NVIDIA NIM

Cloud seats are dormant until their matching API key is set.

Quick Start

Requirements:

  • Node.js 20+

  • pnpm

  • Claude Code installed or available through this package

  • Optional: Ollama for local seats

Install and configure:

pnpm install
Copy-Item .env.example .env

Add provider keys to .env as needed:

GEMINI_API_KEY=
OPENAI_API_KEY=
NVIDIA_API_KEY=
OLLAMA_URL=http://127.0.0.1:11434

Start the local HTTP server:

pnpm start

Run the MCP server from Claude Code by pointing your MCP config at:

node node_modules/tsx/dist/cli.mjs src/mcp-server.ts

MCP Tools

Always registered:

  • ask_openai

  • ask_gemini

  • ask_ollama

  • ask_qwen

  • ask_llama_fast

  • ask_deepseek

  • council_review

  • multi_ai_status

Conditionally registered when configured:

  • generate_image

  • ask_groq

  • ask_cerebras

  • ask_mistral

  • ask_openrouter

  • ask_glm

  • ask_nvidia

multi_ai_status reports provider availability, model names, Claude CLI status, and council seat metadata without exposing secrets.

NVIDIA NIM

NVIDIA NIM is supported as an OpenAI-compatible cloud seat. See docs/providers/nvidia.md.

Default values:

NVIDIA_MODEL=meta/llama-3.3-70b-instruct
NVIDIA_BASE_URL=https://integrate.api.nvidia.com/v1

Smoke-test configured cloud seats:

pnpm smoke:cloud-seats

Safety Model

This is a local single-user tool, not a hosted multi-tenant service.

Do not bind it to 0.0.0.0 or expose it publicly without adding authentication, authorization, HTTPS, rate limits, audit logs, and per-user spend controls.

Sensitive prompts are blocked from obvious cloud routes, but the classifier is a guardrail, not a legal or security guarantee. Use local/private seats for identity, legal, financial, medical, credential, or proprietary content.

Before publishing, forking, or sharing a modified copy:

pnpm secret:scan

Development

pnpm typecheck
pnpm test
pnpm audit --prod
pnpm secret:scan

License

MIT

A
license - permissive license
-
quality - not tested
C
maintenance

Maintenance

Maintainers
Response time
Release cycle
Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Star-957/multi-ai-hub'

If you have feedback or need assistance with the MCP directory API, please join our Discord server