iGenius Memory MCP Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@iGenius Memory MCP ServerGive me my current session briefing"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
iGenius Memory — Persistent AI Memory for Any Agent
A structured, AI-powered memory backend that gives any MCP-compatible agent persistent memory via the iGenius Memory service. All AI processing happens server-side — you just need an API key.
3 Ways to Use iGenius
Client | Install | Best For |
🧩 VS Code Extension | Full sidebar UI, memory browser, AI provider settings | |
⚡ MCP Server |
| Any MCP client — VS Code, Claude Desktop, Cursor, Windsurf |
🖥️ Desktop App | Standalone system-tray app, works with any editor |
Get a free API key at igenius-memory.online — all three clients use the same key.
1. VS Code Extension (Marketplace)
Install directly from the VS Code Marketplace — no pip, no config files:
ext install igenius-memory.igenius-memoryOr search "iGenius Memory" in the Extensions panel. Includes sidebar UI, memory browser, status bar indicator, AI provider settings, and auto-warms briefings on a configurable interval.
2. MCP Server (pip)
For any MCP-compatible client (VS Code Copilot, Claude Desktop, Cursor, Windsurf, etc.):
pip install igenius-mcpThen add to your MCP config:
VS Code — ~/.vscode/mcp.json:
{
"servers": {
"igenius-memory": {
"command": "igenius-mcp",
"env": { "IGENIUS_API_KEY": "ig_your_key_here" },
"type": "stdio"
}
}
}Claude Desktop / Cursor / Windsurf — add to your MCP config file:
{
"mcpServers": {
"igenius-memory": {
"command": "python",
"args": ["-m", "igenius_mcp.server"],
"env": { "IGENIUS_API_KEY": "ig_your_key_here" }
}
}
}⚠️ Windows users: If VS Code can't find
igenius-mcp, usepython -m igenius_mcp.serverinstead.
3. Desktop App (Windows)
Standalone system-tray application — works alongside any editor or IDE:
Download Installer (NSIS setup or MSI)
Built with Tauri + Rust — lightweight, native, ~5 MB
System tray with quick access to briefings, search, and memory stats
Configure LLM provider (LM Studio, OpenAI, Anthropic, Google) from the UI
Restart VS Code after installing the extension or adding MCP config — all 17 memory tools become available to Copilot and any MCP-compatible agent.
Related MCP server: Central Intelligence
Available Tools
Tool | Description |
| Session briefing from all memory layers (call FIRST) |
| Ingest user/agent messages for AI extraction |
| Merge accumulated extracts into master briefing |
| Detect trigger words and auto-classify text |
| Direct store to a specific memory layer |
| Natural language search across memories |
| Retrieve all persistent session memories |
| LLM-powered summary of a memory layer |
| Delete a memory by ID |
| Update fields on an existing memory |
| List short-term memories for triage |
| Promote short-term → long-term |
| Pin a fact permanently (user-confirmed, never expires) |
| List trigger words and their layers |
| Add a new trigger word |
| Render URL → screenshot → vision analysis → full UI/UX report (requires |
| Render URL → return base64 PNG (requires |
LLM Requirements
iGenius uses an LLM backend for AI extraction, consolidation, and (optionally) visual analysis. You can use a local or remote LLM provider.
Local Setup (LM Studio, Ollama, etc.)
Requirement | Minimum |
GPU VRAM | 6 GB+ |
Recommended model | Qwen 3.5 4B (non-thinking) or equivalent |
Context window | 3,000+ tokens |
⚠️ IMPORTANT: Do NOT use thinking/reasoning models (e.g. QwQ, DeepSeek R1, o1, o3). Thinking models emit
<think>chains before the actual response, which breaks iGenius's structured JSON extraction pipeline. Only use standard non-thinking (instruct/chat) models.
Why these specs? iGenius sends structured extraction prompts that expect clean JSON output. A 4B-parameter non-thinking model at 3k context is the sweet spot for fast, accurate extraction without hallucination or timeouts. Larger models (8B+) work too — just ensure you have the VRAM headroom and that the model is a non-thinking variant.
Remote Setup (OpenAI, Anthropic, Google, etc.)
No local hardware requirements. Any API-accessible model works — configure the provider, model name, and API key in the VS Code extension settings or environment variables.
Environment Variables
Variable | Required | Default |
| Yes | — |
| No |
|
Visual Tools (Optional)
Give your AI agent eyes — render any URL, take a pixel-perfect screenshot, and get instant UI/UX analysis from a local vision model.
Install
pip install "igenius-mcp[visual]"
python -m playwright install chromiumThen load a vision-capable model in LM Studio (e.g. Qwen 3.5 9B Vision, non-thinking).
⚠️ Do NOT use thinking/reasoning vision models — same restriction as above.
Visual MCP Tools
Tool | Description |
| Render URL → screenshot → vision analysis → full UI/UX report |
| Render URL → return base64-encoded PNG (no analysis) |
Visual Environment Variables
Variable | Default | Description |
|
| Vision model API endpoint |
| auto-detect | Override the vision model name |
| — | API key for vision endpoint (e.g. LM Studio auth token) |
|
| Screenshot viewport width |
|
| Screenshot viewport height |
100% local — screenshots and analysis never leave your machine.
Agent Instructions
For best results, add the iGenius agent instructions to your workspace:
VS Code: Place
igenius.instructions.mdin~/.vscode/prompts/Claude Code: Add to
CLAUDE.mdWorkspace: Add to
.github/copilot-instructions.md
Get the template at igenius-memory.info
How It Works
Agent ←→ MCP (stdio) ←→ igenius-mcp ←→ REST API ←→ iGenius BackendThe memory tools are a thin proxy — they translate MCP tool calls into REST API requests. All AI extraction, LLM summarization, and encryption happens server-side.
The visual tools run locally — Playwright renders URLs on your machine and a local vision model (e.g. LM Studio + Qwen2.5-VL) analyzes the screenshots. Screenshots and analysis never leave your machine.
Plans
Plan | Price | Requests | API Keys | IPs/Key |
Starter | Free | 1,000/week | 1 | 3 |
Pro | $19/mo | 50,000/day | 5 | 10 |
Enterprise | Contact | 500,000/day | 20 | 50 |
Details at igenius-memory.store
Coming Soon
iGenius Context Engine — unlimited effective context for local LLMs through intelligent recursive summarization. Run a 3B model with a 4K context window and handle conversations of any length.
Links
Support the Project
iGenius Memory is built and maintained by NovaMind Labs. If you find it useful, here's how you can help:
Star the repo — it helps more developers discover iGenius
Upgrade to Pro — $19/mo directly funds development → igenius-memory.store
Report bugs & ideas — open an issue
Spread the word — tell your friends, tweet about it, write a blog post
Every user, star, and subscription helps keep iGenius alive and improving. Thank you!
License
MIT
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/vehoelite/igenius-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server