Nexus-MCP
Provides tools for managing local Ollama models (list, pull, delete), running inference (generate, chat), evaluating responses, and performing RAG with embeddings, all against a local Ollama instance.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Nexus-MCPchat with llama3.2"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
#Nexus-MCP ( Foundry-Reverse )
Newer Foundry MCP reverse-engineered to work on Ollama
A fully local, open-source alternative to the Azure AI Foundry MCP Server, powered entirely by Ollama.
No Azure subscription, no API keys, no cloud required.
Features
Category | Tools |
Health |
|
Model Management |
|
Inference |
|
Evaluation |
|
Knowledge / RAG |
|
Plus MCP Resources (ollama://models, ollama://running, ollama://indexes) and reusable Prompts (summarize, rag_answer, code_review).
Requirements
Quick Start
# 1. Clone
git clone https://github.com/deadSwank001/Foundry-Reverse.git
cd Foundry-Reverse
# 2. Copy and edit configuration
cp .env.example .env
# 3. Run the MCP server (stdio transport)
uv run foundry-reverseVS Code / Copilot Integration
Copy mcp.json to your VS Code workspace .vscode/mcp.json (or user-level MCP config), then restart VS Code.
{
"mcpServers": {
"foundry-reverse": {
"command": "uv",
"args": ["run", "foundry-reverse"],
"env": {
"OLLAMA_BASE_URL": "http://localhost:11434"
}
}
}
}Configuration
Variable | Default | Description |
|
| Ollama API endpoint |
|
| Request timeout in seconds |
|
| Ollama model used for embeddings |
| (first available) | Ollama model used as evaluator |
|
| Path for the local RAG store |
Development
# Install deps (including dev)
uv sync --all-groups
# Run tests
uv run pytestArchitecture
src/foundry_reverse/
├── __init__.py # Package version
├── server.py # FastMCP server – all tools, resources, prompts
├── ollama_client.py # Async Ollama REST API client
├── evaluation.py # LLM-as-judge evaluation helpers
└── knowledge.py # In-memory vector store (cosine similarity + Ollama embeddings)Comparison with Azure AI Foundry MCP
Feature | Azure Foundry MCP | Foundry-Reverse |
Model catalog | Azure AI model registry | Local Ollama models |
Inference | Azure OpenAI / serverless | Ollama ( |
Embeddings | Azure OpenAI embeddings | Ollama ( |
Vector search | Azure AI Search | In-memory cosine similarity |
Evaluation | Azure AI Evaluation SDK | Local LLM-as-judge |
Auth | Azure Service Principal / keys | None (local only) |
Cost | Pay-per-token | Free |
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/deadSwank001/Nexus-MCP'
If you have feedback or need assistance with the MCP directory API, please join our Discord server