agentic-mcp-server
Enables agents to interact with local Git repositories through the git MCP server, supporting operations like reading commits, branches, and file history.
Provides integration with Ollama to run local language models (e.g., Llama 3.1, Mistral) for executing agent tasks, with support for runtime model switching.
Allows agents to query and manage local SQLite databases via the sqlite MCP server, providing direct read/write access to database tables and records.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@agentic-mcp-serverrun research workflow on AI trends"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Multi-Agent AI Application (100% Offline)
A fully local, open-source agentic AI system with 45 specialized agents, 50 predefined workflows, and an MCP server — powered by Ollama. No API keys, no cloud services, no internet required after setup.
Table of Contents
Related MCP server: Agentic Control Framework (ACF)
Complete Installation Guide
Prerequisites
Python 3.12+
8GB+ RAM (16GB recommended for 13B models)
~5GB disk space (for model + dependencies)
No GPU required (but speeds up inference)
Step 1: Install Ollama (Local LLM Runtime)
# Linux
curl -fsSL https://ollama.com/install.sh | sh
# macOS
brew install ollama
# Windows — download installer from https://ollama.com/download
# Verify installation
ollama --versionStep 2: Start Ollama Service
# Start the Ollama daemon
ollama serve
# It runs on http://localhost:11434 by default
# Keep this terminal open, or run as a system service:
# sudo systemctl enable ollama && sudo systemctl start ollama (Linux)Step 3: Pull a Local LLM Model
# Recommended: good balance of speed and quality
ollama pull llama3.1:8b
# Alternatives:
ollama pull mistral # Fast, 7B params
ollama pull codellama:13b # Best for code tasks
ollama pull qwen2.5:7b # Good multilingual
ollama pull llama3.1:70b # Best quality (needs 40GB+ RAM)
ollama pull deepseek-coder:6.7b # Specialized for code
ollama pull phi3:mini # Smallest, fastest
# Verify model is available
ollama listStep 4: Set Up the Application
cd multi-agent-app
# Create Python virtual environment
python3 -m venv .venv
# Activate it
source .venv/bin/activate # Linux/macOS
# .venv\Scripts\activate # Windows PowerShell
# .venv\Scripts\activate.bat # Windows CMD
# Install all dependencies
pip install -r requirements.txtStep 5: Configure (Optional)
Edit config.py if you changed defaults:
OLLAMA_BASE_URL = "http://localhost:11434" # Ollama address
MODEL_NAME = "llama3.1:8b" # Model you pulled
MAX_TOKENS = 2048 # Max response lengthStep 6: Run
# Interactive CLI mode
python main.py
# Or as MCP server
python main.py --mcp-serverLocal LLM Setup (Ollama)
Managing Models
# List installed models
ollama list
# Pull a new model
ollama pull <model-name>
# Remove a model
ollama rm <model-name>
# Show model details
ollama show llama3.1:8b
# Test a model directly
ollama run llama3.1:8b "Hello, how are you?"Recommended Models by Use Case
Use Case | Model | RAM Needed | Speed |
General (default) |
| 8GB | Fast |
Code-heavy work |
| 16GB | Medium |
Fast responses |
| 4-8GB | Very fast |
Complex reasoning |
| 40GB+ | Slow |
Multilingual |
| 8GB | Fast |
Code + explanation |
| 8GB | Fast |
Switch Model at Runtime
No restart needed — switch in the CLI:
/model codellama:13b
/model mistralOllama Configuration
# Change Ollama host/port (if needed)
export OLLAMA_HOST=0.0.0.0:11434
# Set GPU layers (for partial GPU offload)
export OLLAMA_NUM_GPU=999
# Set number of threads
export OLLAMA_NUM_THREAD=8Local MCP Server Setup
This App as an MCP Server
Your multi-agent system IS an MCP server. Start it:
# stdio transport (for Claude Desktop, Cursor, etc.)
python main.py --mcp-server
# SSE/HTTP transport (for web clients or remote access on LAN)
python main.py --mcp-server --transport sse --host 0.0.0.0 --port 8080What Gets Exposed via MCP
Type | Count | Description |
Tools | 47 |
|
Resources | 2 |
|
MCP Server CLI Arguments
python main.py --mcp-server [OPTIONS]
Options:
--transport {stdio,sse} Transport protocol (default: stdio)
--host HOST Bind address for SSE (default: 0.0.0.0)
--port PORT Port for SSE (default: 8080)External Local MCP Servers
Your agents can consume tools from OTHER local MCP servers running on your machine.
Install Local MCP Servers
# Install Node.js MCP servers (one-time, cached locally)
npx -y @modelcontextprotocol/server-filesystem /tmp
npx -y @modelcontextprotocol/server-sqlite mydb.sqlite
npx -y @modelcontextprotocol/server-memory
# Or install Python-based MCP servers
pip install mcp-server-fetch
pip install mcp-server-gitConfigure External Servers
Edit config.py:
EXTERNAL_MCP_SERVERS = [
# Filesystem access — agents can read/write local files
{
"name": "filesystem",
"transport": "stdio",
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-filesystem", "/home/jaspal/projects"]
},
# SQLite — agents can query local databases
{
"name": "sqlite",
"transport": "stdio",
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-sqlite", "/home/jaspal/data/app.db"]
},
# Memory/Knowledge base — persistent agent memory
{
"name": "memory",
"transport": "stdio",
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-memory"]
},
# Git — agents can interact with local repos
{
"name": "git",
"transport": "stdio",
"command": "python",
"args": ["-m", "mcp_server_git", "--repo", "/home/jaspal/projects/myapp"]
},
# Custom local MCP server (running on localhost)
{
"name": "custom-tools",
"transport": "sse",
"url": "http://localhost:9090/sse"
},
]How It All Connects (100% Local)
┌─────────────────────────────────────────────────────────┐
│ YOUR MACHINE │
│ │
│ ┌─────────────┐ ┌──────────────────────────────┐ │
│ │ Ollama │ │ Multi-Agent App │ │
│ │ (Local LLM) │◄───►│ 45 agents + supervisor │ │
│ │ :11434 │ │ MCP server (stdio/SSE) │ │
│ └─────────────┘ └──────────┬───────────────────┘ │
│ │ │
│ ┌─────────────┼─────────────┐ │
│ ▼ ▼ ▼ │
│ ┌──────────────┐ ┌───────────┐ ┌───────────────┐ │
│ │ MCP Server: │ │MCP Server:│ │ MCP Server: │ │
│ │ filesystem │ │ sqlite │ │ memory │ │
│ │ (local files)│ │ (local db)│ │ (local store) │ │
│ └──────────────┘ └───────────┘ └───────────────┘ │
│ │
│ ┌──────────────────────────────────────────────────┐ │
│ │ MCP Clients: Claude Desktop / Cursor / VS Code │ │
│ └──────────────────────────────────────────────────┘ │
│ │
│ Network: ZERO external traffic │
└─────────────────────────────────────────────────────────┘Quick Start
# Terminal 1: Start Ollama
ollama serve
# Terminal 2: Run the app
cd multi-agent-app
source .venv/bin/activate
python main.pyThen type:
🧑 You: write a Python REST API with Flask for a todo appThe supervisor automatically routes to the best agent(s) and returns the result.
CLI Commands Reference
Agent Execution
Command | Description | Example |
(just type) | Auto-route via supervisor |
|
| Run specific agent |
|
| Chain agents sequentially |
|
| Run concurrently |
|
| Compare outputs |
|
| Predefined pipeline |
|
| Auto-select workflow |
|
| Iterative refinement |
|
| Batch process |
|
| Stream response |
|
File Context
Command | Description | Example |
| Single file context |
|
| Multiple files |
|
Session Management
Command | Description |
| Save session |
| Load session |
| Export as markdown |
| List sessions |
| Show history |
| Clear history |
Memory
Command | Description |
| Store persistent note |
| Recall notes |
| Clear memory |
System
Command | Description |
| List all 45 agents |
| List all 50 workflows |
| Token usage stats |
| Switch model |
| Check Ollama |
| Re-run last request |
| Show help |
| Exit |
Agents (45)
Agent | Description | Temp |
researcher | Gathers information and provides summaries | 0.7 |
coder | Writes clean, production-quality code | 0.3 |
reviewer | Reviews code/content for quality and correctness | 0.4 |
planner | Breaks down complex tasks into actionable steps | 0.5 |
debugger | Diagnoses errors and suggests targeted fixes | 0.2 |
writer | Writes documentation, emails, and reports | 0.6 |
tester | Writes test cases and testing strategies | 0.3 |
optimizer | Performance optimization and bottleneck analysis | 0.3 |
security | Security analysis and vulnerability detection | 0.2 |
data_analyst | Data analysis, SQL queries, and data modeling | 0.4 |
devops | CI/CD, Docker, Kubernetes, and infrastructure | 0.3 |
translator | Translation and localization between languages | 0.5 |
architect | System architecture design and trade-off analysis | 0.4 |
mentor | Explains concepts and guides learning | 0.6 |
summarizer | Condenses content into key points and summaries | 0.3 |
api_designer | API design, OpenAPI specs, and contracts | 0.3 |
database | Schema design, SQL optimization, and DB architecture | 0.3 |
ux_designer | UI/UX design, wireframes, and accessibility | 0.5 |
refactorer | Code restructuring and maintainability improvements | 0.2 |
explainer | Code walkthroughs and detailed explanations | 0.5 |
validator | Verifies implementations match requirements | 0.2 |
automator | Automation scripts, CLI tools, and workflows | 0.3 |
migrator | Code/database/infrastructure migrations | 0.3 |
prompt_engineer | Crafts and optimizes LLM prompts | 0.4 |
diagrammer | Creates Mermaid/PlantUML system diagrams | 0.3 |
estimator | Effort and time estimation for tasks | 0.4 |
compliance | Regulatory compliance and standards audits | 0.2 |
product_manager | Requirements, user stories, and prioritization | 0.5 |
interviewer | Interview questions and answer evaluation | 0.5 |
git_expert | Git workflows, branching, and conflict resolution | 0.3 |
accessibility | WCAG compliance and inclusive design | 0.3 |
performance_tester | Load testing and scalability analysis | 0.3 |
error_handler | Error handling patterns and resilience | 0.3 |
documentation | API docs, changelogs, and guides | 0.5 |
regex_expert | Crafts and explains regular expressions | 0.2 |
shell_expert | Shell scripting and Unix tools | 0.3 |
ml_engineer | ML pipelines, training, and evaluation | 0.4 |
concurrency | Async, threading, and parallel processing | 0.3 |
config_manager | Configuration, env vars, and feature flags | 0.3 |
code_generator | Boilerplate, scaffolding, and templates | 0.3 |
tech_lead | Technical decisions and team guidance | 0.4 |
seo_expert | SEO optimization and web performance | 0.4 |
monitoring | Observability, alerting, and SRE practices | 0.3 |
networking | DNS, load balancing, and network architecture | 0.3 |
contract_tester | API contract testing and compatibility | 0.2 |
Workflows (50)
Development
Workflow | Pipeline |
| planner → coder → reviewer → tester |
| coder → reviewer → tester |
| debugger → coder → tester |
| explainer → refactorer → reviewer → tester |
| planner → code_generator → coder → tester |
| coder → optimizer → reviewer |
| error_handler → coder → tester → reviewer |
| architect → concurrency → coder → tester → reviewer |
API & Backend
Workflow | Pipeline |
| api_designer → coder → tester → writer |
| api_designer → code_generator → coder → tester → documentation → security |
| api_designer → contract_tester → tester → documentation |
| planner → database → reviewer |
| architect → api_designer → coder → contract_tester → devops |
| planner → architect → api_designer → database → coder → tester |
| data_analyst → coder → tester → devops |
DevOps & Infrastructure
Workflow | Pipeline |
| devops → security → validator |
| coder → error_handler → security → performance_tester → devops |
| tester → security → compliance → documentation → devops |
| monitoring → devops → shell_expert |
| config_manager → devops → validator |
| networking → security → devops → validator |
| git_expert → devops → automator |
| shell_expert → automator → tester |
Security & Quality
Workflow | Pipeline |
| coder → security → compliance |
| security → compliance → validator |
| explainer → reviewer → security → optimizer → accessibility |
| performance_tester → optimizer → reviewer |
Frontend & UX
Workflow | Pipeline |
| ux_designer → coder → accessibility → reviewer |
| ux_designer → accessibility → reviewer |
| seo_expert → coder → performance_tester |
Documentation & Learning
Workflow | Pipeline |
| researcher → writer → reviewer |
| researcher → mentor → summarizer |
| explainer → diagrammer → summarizer |
| documentation → diagrammer → mentor → explainer |
| translator → reviewer → writer |
Planning & Management
Workflow | Pipeline |
| planner → architect → diagrammer |
| planner → estimator → reviewer |
| product_manager → architect → api_designer → estimator |
| researcher → tech_lead → architect → estimator |
| product_manager → planner → coder → tester |
| product_manager → planner → architect → code_generator → coder → tester → devops |
Migration & Modernization
Workflow | Pipeline |
| planner → migrator → tester → reviewer |
| explainer → architect → migrator → coder → tester |
Incident & Operations
Workflow | Pipeline |
| debugger → devops → summarizer |
| debugger → monitoring → devops → summarizer → documentation |
Specialized
Workflow | Pipeline |
| researcher → ml_engineer → coder → tester → documentation |
| regex_expert → tester → explainer |
| prompt_engineer → tester → optimizer |
| researcher → interviewer → mentor |
| explainer → mentor → diagrammer |
Use Cases with Examples
🚀 Build a New Feature
/workflow full_dev implement user authentication with JWT and refresh tokens🐛 Fix a Bug
/workflow bug_fix TypeError: Cannot read property 'map' of undefined in UserList.tsx🏗️ Design a System
/workflow design design a real-time notification system for 100k users📝 Write Documentation
/workflow docs document the payment processing module with API reference🔒 Security Review
/files src/auth.py,src/middleware.py security audit these files⚡ Optimize Performance
/workflow perf_audit our API response time is 2s, analyze and optimize🎯 Direct Agent Call
/ask coder write a Python decorator for caching with TTL
/ask database design a schema for multi-tenant SaaS
/ask devops write a GitHub Actions CI/CD pipeline for a Node.js app
/ask shell_expert write a bash script to backup PostgreSQL daily🔄 Iterative Refinement
/feedback coder reviewer write a thread-safe LRU cache in Python(Coder writes → reviewer critiques → coder improves → until approved)
📊 Compare Approaches
/compare architect,optimizer design a caching strategy for product catalog⚡ Parallel Execution
/parallel security,optimizer,accessibility audit this React component📋 Batch Processing
/batch coder implement stack;;implement queue;;implement linked list;;implement BST🤖 Auto-Routing
/auto our login endpoint is returning 500 errors in production(Automatically selects bug_fix workflow)
📁 Multi-File Analysis
/files src/api.py,src/models.py,src/tests.py review for consistency issues🎓 Learning
/workflow learn explain event-driven architecture with examples
/ask mentor explain the CAP theorem like I'm a junior developer🚢 Production Release
/workflow release prepare v2.0 release for the payment service🏢 Full Startup MVP
/workflow startup_mvp build a SaaS invoicing app with Stripe integrationMCP Client Integration
Claude Desktop
Add to ~/.config/claude/claude_desktop_config.json (Linux) or ~/Library/Application Support/Claude/claude_desktop_config.json (macOS):
{
"mcpServers": {
"multi-agent": {
"command": "/home/jaspal/jscode/js-ai-apps-api/multi-agent-app/.venv/bin/python",
"args": ["/home/jaspal/jscode/js-ai-apps-api/multi-agent-app/main.py", "--mcp-server"]
}
}
}Cursor
Add to Cursor MCP settings:
{
"multi-agent": {
"command": "/home/jaspal/jscode/js-ai-apps-api/multi-agent-app/.venv/bin/python",
"args": ["main.py", "--mcp-server"],
"cwd": "/home/jaspal/jscode/js-ai-apps-api/multi-agent-app"
}
}VS Code (Copilot MCP)
{
"mcp": {
"servers": {
"multi-agent": {
"command": "python",
"args": ["main.py", "--mcp-server"],
"cwd": "/home/jaspal/jscode/js-ai-apps-api/multi-agent-app"
}
}
}
}Any MCP Client (SSE/HTTP)
# Start HTTP server
python main.py --mcp-server --transport sse --port 8080
# Connect from any MCP client to: http://localhost:8080/sseAdvanced Features
🔄 Feedback Loop
Iteratively refines output until approved:
/feedback coder reviewer write a production-ready connection poolAgent writes → reviewer evaluates → agent improves → repeat (max 3 rounds).
🧠 Persistent Memory
Notes that survive across sessions:
/remember project Using PostgreSQL 15 with pgvector extension
/remember style snake_case, type hints, 4-space indent
/recall project
/forget project⚡ Parallel Execution
Multiple agents simultaneously:
/parallel security,optimizer,reviewer analyze this module📦 Batch Processing
Same agent, multiple tasks:
/batch tester write tests for login;;signup;;logout;;password-reset🎯 Auto-Routing
Keyword-based workflow selection:
/auto deploy our app to kubernetes with monitoring📁 Multi-File Context
Cross-file analysis:
/files src/api.py,src/models.py,tests/test_api.py find inconsistencies🔀 Custom Chains
Build pipelines on the fly:
/chain planner|architect|coder|tester|documentation build a rate limiter💾 Session Persistence
/save my-project
/load my-project
/export📊 Token Tracking
/tokens
# Output: Tokens: ~12,450 total (4,200 in / 8,250 out) | Requests: 7Configuration Reference
config.py
Setting | Default | Description |
|
| Ollama API endpoint |
|
| Default model |
|
| Default temperature |
|
| Max response tokens |
|
| MCP server name |
|
| Default transport |
|
| SSE bind address |
|
| SSE port |
|
| Request timeout (seconds) |
|
| External local MCP servers |
Environment Variables (Ollama)
export OLLAMA_HOST=0.0.0.0:11434 # Bind address
export OLLAMA_NUM_GPU=999 # GPU layers
export OLLAMA_NUM_THREAD=8 # CPU threads
export OLLAMA_KEEP_ALIVE=5m # Model keep-alive timeProject Structure
multi-agent-app/
├── agent_registry.py # 45 agent definitions (single source of truth)
├── config.py # All settings + supervisor prompt
├── graph.py # LangGraph supervisor orchestration
├── runners.py # Execution modes + 50 workflows + advanced features
├── session.py # History, tokens, save/load/export
├── main.py # CLI dispatcher + entry point
├── mcp_server.py # FastMCP server (dynamic tool registration)
├── tool_registry.py # External MCP server consumption
├── requirements.txt # Python dependencies
└── sessions/ # Saved sessions + agent memory
├── *.json # Session files
├── *.md # Exported conversations
└── memory.json # Persistent agent memoryExtending the System
Add a New Agent
Add one entry to agent_registry.py:
"my_agent": {
"description": "What it does",
"temperature": 0.3,
"prompt": "You are a ... agent. Your job is to: 1) ... 2) ... 3) ...",
},Automatically available in: supervisor routing, /ask, /chain, MCP tools.
Add a New Workflow
Add to WORKFLOWS in runners.py:
"my_workflow": ["planner", "my_agent", "reviewer", "tester"],Add External MCP Server
Add to EXTERNAL_MCP_SERVERS in config.py:
{"name": "my-server", "transport": "stdio", "command": "python", "args": ["my_server.py"]}Troubleshooting
Ollama not reachable
# Check if running
curl http://localhost:11434/api/tags
# Start it
ollama serveModel not found
# List available models
ollama list
# Pull the model
ollama pull llama3.1:8bSlow responses
Use a smaller model:
/model mistralor/model phi3:miniReduce
MAX_TOKENSin config.pyUse GPU: install CUDA/ROCm drivers
Out of memory
Use smaller model:
llama3.1:8binstead of13b/70bClose other applications
Set
OLLAMA_NUM_GPU=0to use CPU only (slower but less RAM)
Import errors
# Make sure venv is activated
source .venv/bin/activate
# Reinstall dependencies
pip install -r requirements.txtRequirements
Python 3.12+
Ollama (any version)
8GB+ RAM (16GB recommended)
No GPU required
No internet after initial setup
Python Dependencies
langchain>=0.3.0
langchain-ollama>=0.2.0
langgraph>=0.2.0
pydantic>=2.0.0
mcp>=1.0.0
requests>=2.28.0License
MIT
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/jsxtech/agentic-mcp-server'
If you have feedback or need assistance with the MCP directory API, please join our Discord server