Enables advanced memory optimization through OpenAI's reasoning models (o3/o4-mini) with high reasoning capabilities for analyzing and optimizing distributed memory banks
KuzuMem-MCP
A TypeScript implementation of a distributed memory bank as an MCP (Model Context Protocol) tool, storing memories in a KùzuDB graph database with repository and branch filtering capabilities. Branch isolation is achieved by using a graph-unique identifier for entities, enabling a centralized memory bank while allowing repository-specific and branch-specific views. Fully compliant with MCP specification for seamless integration with IDEs and AI agents.
Key Features
- 🧠 AI-Powered Memory Optimization - Advanced reasoning models (OpenAI o3/o4-mini, Claude 4) with MCP sampling for intelligent memory management
- 🛡️ Production-Ready Safety - Automatic snapshot system with guaranteed rollback capabilities
- 🎯 Context-Aware Intelligence - MCP sampling analyzes actual memory state for adaptive optimization strategies
- 🔧 Unified Tool Architecture - 12 consolidated tools covering all memory bank operations
- 🧵 Thread-Safe Singleton Pattern - Ensures each resource is instantiated only once, with proper thread safety
- 📊 Distributed Graph Structure - Follows the advanced memory bank specification using a KùzuDB graph
- 🌿 Repository & Branch Awareness - All operations are contextualized by repository name and branch
- ⚡ Asynchronous Operations - Uses async/await for better performance
- 🔌 Multiple Access Interfaces - Access via CLI and multiple MCP server implementations
- 💾 KùzuDB Backend - Utilizes KùzuDB for graph-based memory storage and querying
- ✅ Fully MCP Compliant - All tools follow the Model Context Protocol for client integration
- 📡 Progressive Results Streaming - Supports streaming for long-running graph operations
- 🏠 Client Project Root Isolation - Each client project gets its own isolated database instance
- 🧠 High-Reasoning Analysis - Leverages OpenAI HIGH reasoning and Anthropic extended thinking for memory optimization
- 🗑️ Safe Bulk Operations - Advanced bulk deletion with dependency validation and dry-run capabilities
Unified Tools
The system currently broadcasts 12 unified tools that consolidate all memory bank operations:
- memory-bank - Initialize and manage memory bank metadata
- entity - Create, update, delete, and retrieve all entity types (components, decisions, rules, files, tags)
- introspect - Explore graph schema and metadata
- context - Manage work session context
- query - Unified search across contexts, entities, relationships, dependencies, governance, history, and tags
- associate - Create relationships between entities
- analyze - Run graph algorithms (PageRank, K-Core, Louvain, Shortest Path)
- detect - Detect patterns (strongly/weakly connected components)
- bulk-import - Efficient bulk entity import
- search - Full-text search across all entity types with KuzuDB FTS integration
- delete - Safe deletion of entities with dependency validation and bulk operations
- memory-optimizer - 🧠 AI-powered core memory optimization with MCP sampling, snapshots, and rollback
For detailed tool documentation, see Unified Tools Documentation.
Documentation
- Extended Documentation - Architecture and advanced usage patterns
- Graph Schema - Database schema details
Installation
Configuration
Create a .env
file in the root directory (copy from .env.example
):
Core Memory Optimization Setup
The Core Memory Optimization Agent requires API keys for high-reasoning models:
- OpenAI: Get your API key from OpenAI Platform
- Anthropic: Get your API key from Anthropic Console
Supported Models:
- OpenAI:
o3
,o4-mini
(with HIGH reasoning, 32,768 tokens) - Anthropic:
claude-4
(with extended thinking, 2,048 tokens)
For detailed setup instructions, see Core Memory Optimization Setup Guide.
Add to your IDE's MCP configuration:
Quick Start
1. Initialize Memory Bank
2. Create Entities
3. Query Dependencies
4. Run Analysis
🧠 Core Memory Optimization Agent
The Core Memory Optimization Agent provides AI-powered memory graph optimization with advanced reasoning capabilities and production-ready safety features:
Features
- 🧠 High-Reasoning Analysis: Uses OpenAI o3/o4-mini (HIGH reasoning) or Claude (extended thinking) for intelligent memory analysis
- 🎯 MCP Sampling: Context-aware prompts that adapt to actual memory state and project characteristics
- 🛡️ Automatic Snapshots: Production-ready safety with automatic backup before optimization
- 🔄 Guaranteed Rollback: Complete state restoration with transactional safety
- ⚖️ Safe Optimization: Conservative, balanced, and aggressive strategies with safety validation
- 🔍 Stale Entity Detection: Identifies outdated entities based on age and usage patterns
- 🔗 Redundancy Removal: Finds and consolidates duplicate or redundant entities
- 📊 Dependency Optimization: Optimizes relationship chains while preserving integrity
- 👀 Dry-Run Mode: Preview optimizations without making changes
- 📈 Project Intelligence: Automatic project maturity, activity, and complexity analysis
Quick Start
1. Analyze Memory Graph (with MCP Sampling)
2. Preview Optimization (Dry Run)
3. Execute Optimization (with Automatic Snapshot)
4. List Available Snapshots
5. Rollback to Previous State
Optimization Strategies
- Conservative: Max 5 deletions, 6-month stale threshold (recommended for production)
- Balanced: Max 20 deletions, 3-month stale threshold (recommended for development)
- Aggressive: Max 50 deletions, 1-month stale threshold (use with caution)
MCP Sampling Strategies
- Representative: Balanced sample across all entity types (default)
- Problematic: Focus on stale, disconnected, or deprecated entities
- Recent: Sample newly created entities (< 30 days) for safety analysis
- Diverse: Ensure representation from all entity types for complex systems
Safety Features
- 🛡️ Automatic Snapshots: Created before every optimization (unless dry-run)
- 🔄 Transactional Rollback: Complete state restoration with database consistency
- ✅ Validation System: Snapshot integrity checks before rollback operations
- 📊 Context-Aware Safety: Activity-level and complexity-based safety measures
For complete setup and usage instructions, see:
Testing
E2E Test Requirements
For memory optimizer E2E tests, set environment variables:
Note: All core functionality is operational with comprehensive E2E test coverage for both stdio and HTTP stream protocols.
Architecture
KuzuMem-MCP follows official MCP TypeScript SDK patterns with clean architecture:
Key Components
- MCP Servers: Official SDK implementations using
McpServer
with HTTP Stream and Stdio transports - Tool Handlers: Business logic for each MCP tool with simplified context handling
- Memory Service: Core orchestration and repository management
- Repository Layer: Thread-safe singletons for each entity type
- Database Layer: KùzuDB embedded graph database
Official SDK Compliance
✅ Session Management: Uses built-in SDK session handling
✅ Tool Registration: Uses official tool()
method with Zod validation
✅ Transport Handling: Leverages SDK transport implementations
✅ Error Handling: Follows SDK error patterns and best practices
For detailed architecture information, see Extended Documentation.
Agent Development Loop (Rules-Enforced)
When both the repository-level "Always-Applied Workspace Rules" (project_config_updated.md
) and the short-term workflow rules (workflow_state_updated.mdc
) are active, every IDE or AI agent that communicates with KuzuMem-MCP must follow the five-phase finite-state loop below. Each transition is observable via the unified context
tool and is backed by mandatory MCP calls that keep the graph database in sync and governance rules enforced.
- ANALYZE – Pull the latest context, inspect 1-hop neighbourhood, and optionally execute a PageRank analysis. Produce a high-level problem statement.
- BLUEPRINT – Draft a numbered implementation plan and persist it as a
Decision
entity (status: proposed
, tagarchitecture
). Wait for explicit user approval. - CONSTRUCT – Execute plan steps, apply code edits, and immediately mirror changes through
entity
,associate
, andcontext
tool calls while honouring dependency & tagging rules. - VALIDATE – Run the full test & linter suite. If green, update the
Decision
toimplemented
; if red, log context and loop back to CONSTRUCT. - ROLLBACK – Automatically triggered on unrecoverable errors, reverting partial work before returning to ANALYZE.
Phase Diagram
License
Apache-2.0
Contributing
Contributions are welcome! Please ensure:
- All tests pass (or create issues for failing tests)
- Code follows the existing style
- New features include tests
- Documentation is updated
Future Improvements
- Vector Embeddings - Semantic similarity search (pending KuzuDB vector column updates)
- Advanced Graph Algorithms - Additional analysis capabilities
- Graph Schema Updates - Based on how well the automated development loop works, the graph schema may need to be updated to support new features
- Complete Semantic Search - Implementation of the semantic-search tool (currently placeholder - KuzuDB Vector Indexes are immutable and would make developing this feature difficult since updating memories wouldn't update the vector indexes)
MCP Review
This MCP is verified by MCP Review
https://mcpreview.com/mcp-servers/Jakedismo/KuzuMem-MCP
Automatic Codereviews with Codrabbit
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
A distributed memory bank MCP server that stores AI agent memories in a KùzuDB graph database with repository/branch isolation. Features AI-powered memory optimization, dependency tracking, and comprehensive graph analysis capabilities.
Related MCP Servers
- -securityAlicense-qualityAn MCP server that extends AI agents' context window by providing tools to store, retrieve, and search memories, allowing agents to maintain history and context across long interactions.Last updated -9TypeScriptMIT License
- -securityAlicense-qualityA Redis-backed MCP server that enables multiple AI agents to communicate, coordinate, and collaborate while working on parallel development tasks, preventing conflicts in shared codebases.Last updated -15PythonMIT License
- -security-license-qualityA sophisticated MCP server providing advanced memory capabilities with RAG, hallucination detection, and enterprise-grade AI infrastructure for intelligent agent ecosystems.Last updated -Python
- -securityAlicense-qualityAn enhanced MCP server that provides intelligent memory and task management for AI assistants, featuring semantic search, automatic task extraction, and knowledge graphs to help manage development workflows.Last updated -11PythonMIT License