A-Modular-Kingdom
Allows AI agents to perform web searches using DuckDuckGo, providing current events and latest documentation.
Provides vision analysis and text embeddings using local Ollama models, enabling private and offline AI capabilities.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@A-Modular-Kingdomsearch my codebase for authentication logic using v2 RAG"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
๐ฐ A-Modular-Kingdom
High-Performance MCP Foundation for RAG, Scoped Memory, and Agentic Tools
โจ New: Automated Local Agent Setup Harness Integrated โ
๐๏ธ Project Status: Setup Harness Integrated
A-Modular-Kingdom (AMK) is equipped with an automated local setup harness that auto-configures your local coding environments (such as Codex and Claude Code) to run this MCP server.
To connect your local agents instantly, just run:
./scripts/setup_mcp.shRelated MCP server: Tyra Advanced Memory MCP Server
The Solution
A-Modular-Kingdom is the infrastructure layer you're missing. Now any agent (Claude Desktop, Codex, custom chatbots) gets instant access to:
โ Hierarchical Scoped Memory (Global Rules, Project Context, Persona)
โ Advanced V3 RAG (Hybrid Fusion + Cross-Encoder Reranking)
โ 27+ Production Tools (Vision, Code Exec, Web Search, TTS/STT)
๐๏ธ Automated Local Setup Harness
To get up and running immediately, we provide a unified local setup script that handles the configuration path gymnastics and thermal throttling safety checks:
# Clone the repository
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom
# Run the automated setup script
./scripts/setup_mcp.shThis script will automatically:
Register
modular_kingdom_hostin your Codex configuration (~/.codex/config.toml).Register the server with Claude Code (
claude mcp add) if the CLI is active.Use a lightweight Thermal Safety Runner (
scripts/thermal_runner.py) as a wrapper to monitor your CPU core temperatures and throttle execution if the temperature exceeds a safe threshold (85ยฐC), preventing agent processes from causing workspace lag.
๐๏ธ Architecture
๐ Table of Contents
โจ Core Features
MCP Protocol - Standard interface for AI tool access
3 RAG Versions - Choose your retrieval strategy (FAISS, Qdrant, custom)
Scoped Memory - Global rules, preferences, project-specific context
8+ Tools - Vision, code exec, browser, web search, TTS/STT, and more
No Vendor Lock-in - Local Ollama models, open-source stack
Production Ready - Smart reindexing, Unicode support, error handling
๐ Quick Start
Prerequisites
# Required
Python 3.10+
Ollama (for embeddings: ollama pull embeddinggemma)
# Optional
UV package manager (faster than pip)Installation
# Clone the repository
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom
# Install dependencies
uv sync
# or: pip install -e .
# Pull required Ollama model
ollama pull embeddinggemmaStart the MCP Server
# Start host.py MCP server
python src/agent/host.pyConnect Your Agent
Option 1: Claude Desktop
// Add to claude_desktop_config.json
{
"mcpServers": {
"a-modular-kingdom": {
"command": "python",
"args": ["/full/path/to/A-Modular-Kingdom/src/agent/host.py"]
}
}
}Option 2: Interactive Client
# Use the included chat interface
python src/agent/main.pyOption 3: Custom Integration
# Connect via MCP in your own agent
from mcp import StdioServerParameters
server_params = StdioServerParameters(
command="python",
args=["/path/to/host.py"]
)
# Use with ToolCollection.from_mcp(server_params)๐ ๏ธ Available Tools
The MCP server exposes these tools:
Tool | Description | Use Case |
| RAG search (v1/v2/v3) | "How does auth work in this codebase?" |
| Scoped memory storage | Save global rules or project context |
| Semantic memory search | Retrieve past decisions/preferences |
| DuckDuckGo search | Current events, latest docs |
| Safe Python sandbox | Run code in isolated environment |
| TTS (pyttsx3/kokoro) | Generate audio from text |
| Whisper STT | Transcribe audio files |
๐ RAG System
Three implementations with different trade-offs:
V1 - Simple & Fast
Stack: FAISS + BM25
Speed: <1s
Use Case: Small projects, quick prototypes
V2 - Production (Recommended)
Stack: Qdrant + BM25 + CrossEncoder reranking
Speed: <1s with smart caching
Use Case: Production apps, large codebases
Features: Smart reindexing, cloud-ready
V3 - Advanced (Highest Accuracy)
Stack: Qdrant + BM25 + RRF fusion + CrossEncoder reranking
Speed: <1s (cached), 6.7s (cold start)
Use Case: Maximum accuracy, complex queries
Features: Contextual retrieval, hybrid fusion
Benchmark Results (LLM-as-Judge)
Metric | V2 | V3 |
Groundedness | 100% | 100% |
Relevance | 80-98% | 78-88% |
Completeness | 75-95% | 75-98% |
Average | 84-98% | 84-88% |
Evaluated with curated queries on Napoleon.pdf and RAG documentation. Judge: Gemini 2.5 Flash. Results vary based on indexed content.
Usage:
# Via MCP tool
query_knowledge_base(
query="How does authentication work?",
version="v2", # or "v1", "v3"
doc_path="./src" # optional
)Supported Files: .py, .md, .txt, .pdf, .ipynb, .js, .ts
๐ง Memory System
Hierarchical scoped memory with automatic categorization:
Memory Scopes
Scope | Persistence | Use Case |
Global Rules | Forever, all projects | "Always use type hints" |
Global Preferences | Forever, all projects | "Prefer dark mode" |
Global Personas | Forever, all projects | Reusable agent personalities |
Project Context | Current project | Architecture decisions, tech stack |
Project Sessions | Temporary | Current task, recent changes |
Usage
# Save with explicit scope
save_memory(content="Always validate user input", scope="global_rules")
# Or use prefix shortcuts
save_memory(content="#global:rule:Never use eval()")
save_memory(content="#project:context:Uses FastAPI backend")
# Auto-inference from keywords
save_memory(content="User prefers Python 3.12") # โ global_preferences
# Search with priority (global โ project)
search_memories(query="coding standards", top_k=5)Storage: ~/.modular_kingdom/memories/ (global) + project-specific folders
๐ฏ Integration Examples
Claude Desktop
Already using Claude Code? Add A-Modular-Kingdom tools:
{
"mcpServers": {
"a-modular-kingdom": {
"command": "python",
"args": ["/path/to/src/agent/host.py"]
}
}
}Now Claude has access to your codebase RAG, persistent memory, and all tools.
Gemini CLI
// gemini-extension.json
{
"mcpServers": {
"unified_knowledge_agent": {
"command": "python",
"args": ["/path/to/src/agent/host.py"]
}
}
}Custom Agent
from smolagents import ToolCallingAgent, ToolCollection
from mcp import StdioServerParameters
# Connect to MCP server
params = StdioServerParameters(
command="python",
args=["/path/to/host.py"]
)
with ToolCollection.from_mcp(params) as tools:
agent = ToolCallingAgent(tools=list(tools.tools))
result = agent.run("Search the codebase for auth logic")๐ค Example Applications
This repository includes example multi-agent systems built on the foundation:
Council Chamber (Hierarchical)
3-tier agent hierarchy (Queen โ Teacher โ Code Agent)
Validation loops and task delegation
Uses ACP SDK + smolagents
Location:
multiagents/council_chamber/
Gym (Sequential)
Fitness planning workflow (Interview โ Plan โ Nutrition)
CrewAI-powered coordination
Web interface included
Location:
multiagents/gym/
Note: These are demonstration applications, not the core product. The foundation (host.py) is the main offering.
๐๏ธ Architecture
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Your AI Application โ
โ (Agents, Chatbots, Workflows) โ
โโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโ
โ MCP Protocol
โโโโโโโโโโโโโโผโโโโโโโโโโโโโโโโโโโโโโโโโ
โ A-Modular-Kingdom โ
โ โโโโโโโโโโโ โโโโโโโโโโโ โโโโโโโโโโโ
โ โ RAG โ โ Memory โ โ Tools โโ
โ โ V1/V2/V3โ โ Scoped โ โ 8+ โโ
โ โโโโโโโโโโโ โโโโโโโโโโโ โโโโโโโโโโโ
โ host.py (MCP Server) โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ๐งช Testing & Performance
Run Tests
# Run all tests
pytest tests/ -v
# Run specific test suites
pytest tests/test_rag_v2.py -v
pytest tests/test_rag_v3.py -v
pytest tests/test_memory_global.py -v
# Run benchmarks
python tests/benchmark_rag.pyPerformance
Benchmark Results (GPU/CUDA):
Version | Docs | Cold Start | Warm Query |
V2 | 100 | 26.8s | 0.31s |
V3 | 100 | 13.9s | 0.02s (15x faster!) |
Key Features:
โ GPU acceleration (CUDA) for embeddings and reranking
โ Smart caching (warm queries <0.5s)
โ Tested with .py, .md, .txt, .ipynb files
โ Global memory access from any directory
See detailed benchmarks: docs/RAG_PERFORMANCE.md
Docker Testing
Package verified to work in isolation:
docker build -f Dockerfile.test -t rag-mem-test .
docker run --rm rag-mem-test๐ค Contributing
Contributions welcome! Focus areas:
Additional RAG strategies - New retrieval techniques
New tool integrations - Expand MCP tool offerings
Performance optimizations - Speed improvements
Documentation improvements - Tutorials, examples
Development Setup
# Fork and clone
git clone https://github.com/MasihMoafi/A-Modular-Kingdom.git
cd A-Modular-Kingdom
# Create branch
git checkout -b feature/your-feature
# Install dev dependencies
uv sync
# Make changes and test
pytest tests/
# Commit with descriptive message
git commit -m "feat: add new tool"
# Push and create PR
git push origin feature/your-feature๐ License
MIT License - See LICENSE for details
Links
Medium Article: https://medium.com/@masihmoafi12/a-modular-kingdom-fcaa69a6c1f0
Demo Video: https://www.youtube.com/watch?v=hWoQnAr6R_E
A-Modular-Kingdom: The infrastructure layer AI agents deserve ๐ฐ
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/MasihMoafi/A-Modular-Kingdom'
If you have feedback or need assistance with the MCP directory API, please join our Discord server