The MCP Memory Service provides semantic memory and persistent storage for Claude Desktop. Key capabilities include:
Store information with optional metadata tags
Retrieve memories using semantic search with similarity scores
Search by tags to find stored memories
Time-based recall using natural language expressions
Find memories via exact content match
Manage duplicates by detecting and removing them
Database operations: optimization, health monitoring, statistics
Memory management: delete specific memories or tagged sets
Data protection: create automatic backups
Debug tools for analyzing retrieval processes
Cross-platform compatibility with hardware-aware optimization
Mentioned as a potential cloud storage option where users should ensure sync is complete before accessing from another device.
MCP Memory Service
Production-ready MCP memory service with zero database locks, hybrid backend (fast local + cloud sync), and intelligent memory search for AI assistants. Features v8.9.0 auto-configuration for multi-client access, 5ms local reads with background Cloudflare sync, Natural Memory Triggers with 85%+ accuracy, and OAuth 2.1 team collaboration. Works with Claude Desktop, VS Code, Cursor, Continue, and 13+ AI applications.
π Quick Start (2 minutes)
π Latest Release: v8.50.1 (Dec 14, 2025)
Critical Bug Fixes - Configuration & Installation
π§ MCP_EMBEDDING_MODEL Environment Variable Fixed - Custom embedding models now properly respected (PR #276, fixes #275)
π¦ Installation Script Backend Support - Added missing cloudflare/hybrid options, removed stale ChromaDB references (fixes #273)
π i18n Quality Analytics Complete - Added 125 translations across 5 languages (Spanish, French, German, Japanese, Korean)
β User-Reported Issues Resolved - Both configuration bugs fixed in patch release
Previous Releases:
v8.50.0 - Fallback Quality Scoring (DeBERTa + MS-MARCO hybrid, technical content rescue, 20/20 tests passing)
v8.49.0 - DeBERTa Quality Classifier (absolute quality assessment, eliminates self-matching bias)
v8.48.4 - Cloudflare D1 Drift Detection Performance (10-100x faster queries, numeric comparison fix)
v8.48.3 - Code Execution Hook Fix - 75% token reduction now working (fixed time_filter parameter, Python warnings, venv detection)
v8.48.2 - HTTP Server Auto-Start & Time Parser Improvements (smart service management, "last N periods" support)
v8.48.1 - Critical Hotfix - Startup Failure Fix (redundant calendar import removed, immediate upgrade required)
v8.48.0 - CSV-Based Metadata Compression (78% size reduction, 100% sync success, metadata validation)
v8.47.1 - ONNX Quality Evaluation Bug Fixes (self-match fix, association pollution, sync queue overflow, realistic distribution)
v8.47.0 - Association-Based Quality Boost (connection-based enhancement, network effect leverage, metadata persistence)
v8.46.3 - Quality Score Persistence Fix (ONNX scores in hybrid backend, metadata normalization)
v8.46.2 - Session-Start Hook Crash Fix + Hook Installer Improvements (client-side tag filtering, isolated version metadata)
v8.46.1 - Windows Hooks Installer Fix + Quality System Integration (UTF-8 console configuration, backend quality scoring)
v8.45.3 - ONNX Ranker Model Export Fix (automatic model export, offline mode support, 7-16ms CPU performance)
v8.45.2 - Dashboard Dark Mode Consistency Fixes (global CSS overrides, Chart.js dark mode support)
v8.45.1 - Quality System Test Infrastructure Fixes (HTTP API router, storage retrieval, async test client)
v8.45.0 - Memory Quality System - AI-Driven Automatic Quality Scoring (ONNX-powered local SLM, multi-tier fallback, quality-based retention)
v8.44.0 - Multi-Language Expansion (Japanese, Korean, German, French, Spanish - 359 keys each, complete i18n coverage)
v8.43.0 - Internationalization & Quality Automation (English/Chinese i18n, Claude branch automation, quality gates)
v8.42.1 - MCP Resource Handler Fix (
AttributeErrorwith Pydantic AnyUrl objects)v8.42.0 - Memory Awareness Enhancements (visible memory injection, quality session summaries, LLM-powered summarization)
v8.41.2 - Hook Installer Utility File Deployment (ALL 14 utilities copied, future-proof glob pattern)
v8.41.1 - Context Formatter Memory Sorting (recency sorting within categories, newest first)
v8.41.0 - Session Start Hook Reliability Improvements (error suppression, clean output, memory filtering, classification fixes)
v8.40.0 - Session Start Version Display (automatic version comparison, PyPI status labels)
v8.39.1 - Dashboard Analytics Bug Fixes: Three critical fixes (top tags filtering, recent activity display, storage report fields)
v8.39.0 - Performance Optimization: Storage-layer date-range filtering (10x faster analytics, 97% data transfer reduction)
v8.38.1 - Critical Hotfix: HTTP MCP JSON-RPC 2.0 compliance fix (Claude Code/Desktop connection failures resolved)
v8.38.0 - Code Quality: Phase 2b COMPLETE (~176-186 lines duplicate code eliminated, 10 consolidations)
v8.37.0 - Code Quality: Phase 2a COMPLETE (5 duplicate high-complexity functions eliminated)
v8.36.1 - Critical Hotfix: HTTP server startup crash fix (forward reference error in analytics.py)
v8.36.0 - Code Quality: Phase 2 COMPLETE (100% of target achieved, -39 complexity points)
v8.35.0 - Code Quality: Phase 2 Batch 1 (install.py, cloudflare.py, -15 complexity points)
v8.34.0 - Code Quality: Phase 2 Complexity Reduction (analytics.py refactored, 11 β 6-7 complexity)
v8.33.0 - Critical Installation Bug Fix + Code Quality Improvements (dead code cleanup, automatic MCP setup)
v8.32.0 - Code Quality Excellence: pyscn Static Analysis Integration (multi-layer QA workflow)
v8.31.0 - Revolutionary Batch Update Performance (21,428x faster memory consolidation)
v8.30.0 - Analytics Intelligence: Adaptive Charts & Critical Data Fixes (accurate trend visualization)
v8.28.1 - Critical HTTP MCP Transport JSON-RPC 2.0 Compliance Fix (Claude Code compatibility)
v8.28.0 - Cloudflare AND/OR Tag Filtering (unified search API, 3-5x faster hybrid sync)
v8.27.1 - Critical Hotfix: Timestamp Regression (created_at preservation during metadata sync)
v8.26.0 - Revolutionary MCP Performance (534,628x faster tools, 90%+ cache hit rate)
v8.25.0 - Hybrid Backend Drift Detection (automatic metadata sync, bidirectional awareness)
v8.24.4 - Code Quality Improvements from Gemini Code Assist (regex sanitization, DOM caching)
v8.24.3 - Test Coverage & Release Agent Improvements (tag+time filtering tests, version history fix)
v8.24.2 - CI/CD Workflow Fixes (bash errexit handling, exit code capture)
v8.24.1 - Test Infrastructure Improvements (27 test failures resolved, 63% β 71% pass rate)
v8.24.0 - PyPI Publishing Enabled (automated package publishing via GitHub Actions)
v8.23.1 - Stale Virtual Environment Prevention System (6-layer developer protection)
v8.23.0 - Consolidation Scheduler via Code Execution API (88% token reduction)
π Full Details: CHANGELOG.md | All Releases
PyPI Installation (Simplest)
Install from PyPI:
Then configure Claude Desktop by adding to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or equivalent:
For advanced configuration with the interactive installer, clone the repo and run python scripts/installation/install.py.
Developer Setup (Contributing)
For development and contributing, use editable install to ensure source code changes take effect immediately:
β οΈ Important: Editable install (-e flag) ensures MCP servers load from source code, not stale site-packages. Without this, source changes won't be reflected until you reinstall the package.
Version Mismatch Check:
See CLAUDE.md for complete development guidelines.
Traditional Setup Options
Universal Installer (Most Compatible):
π Installation Options Explained:
Default (recommended): Lightweight SQLite-vec with ONNX embeddings - fast, works offline, <100MB dependencies
--with-ml: Adds PyTorch + sentence-transformers for advanced ML features - heavier but more capable--storage-backend hybrid: Hybrid backend with SQLite-vec + Cloudflare sync - best for multi-device access
Docker (Fastest):
Smithery (Claude Desktop):
Related MCP server: MCP DuckDB Knowledge Graph Memory Server
β οΈ v6.17.0+ Script Migration Notice
Updating from an older version? Scripts have been reorganized for better maintainability:
Recommended: Use
python -m mcp_memory_service.serverin your Claude Desktop config (no path dependencies!)Alternative 1: Use
uv run memory serverwith UV toolingAlternative 2: Update path from
scripts/run_memory_server.pytoscripts/server/run_memory_server.pyBackward compatible: Old path still works with a migration notice
β οΈ First-Time Setup Expectations
On your first run, you'll see some warnings that are completely normal:
"WARNING: Failed to load from cache: No snapshots directory" - The service is checking for cached models (first-time setup)
"WARNING: Using TRANSFORMERS_CACHE is deprecated" - Informational warning, doesn't affect functionality
Model download in progress - The service automatically downloads a ~25MB embedding model (takes 1-2 minutes)
These warnings disappear after the first successful run. The service is working correctly! For details, see our First-Time Setup Guide.
π Python 3.13 Compatibility Note
sqlite-vec may not have pre-built wheels for Python 3.13 yet. If installation fails:
The installer will automatically try multiple installation methods
Consider using Python 3.12 for the smoothest experience:
brew install python@3.12Alternative: Use Cloudflare backend with
--storage-backend cloudflareSee Troubleshooting Guide for details
π macOS SQLite Extension Support
macOS users may encounter enable_load_extension errors with sqlite-vec:
System Python on macOS lacks SQLite extension support by default
Solution: Use Homebrew Python:
brew install python && rehashAlternative: Use pyenv:
PYTHON_CONFIGURE_OPTS='--enable-loadable-sqlite-extensions' pyenv install 3.12.0Fallback: Use Cloudflare or Hybrid backend:
--storage-backend cloudflareor--storage-backend hybridSee Troubleshooting Guide for details
π― Memory Awareness in Action
Intelligent Context Injection - See how the memory service automatically surfaces relevant information at session start:
What you're seeing:
π§ Automatic memory injection - 8 relevant memories found from 2,526 total
π Smart categorization - Recent Work, Current Problems, Additional Context
π Git-aware analysis - Recent commits and keywords automatically extracted
π― Relevance scoring - Top memories scored at 100% (today), 89% (8d ago), 84% (today)
β‘ Fast retrieval - SQLite-vec backend with 5ms read performance
π Background sync - Hybrid backend syncing to Cloudflare
Result: Claude starts every session with full project context - no manual prompting needed.
π Complete Documentation
π Visit our comprehensive
π§ v7.1.3 Natural Memory Triggers (Latest)
Natural Memory Triggers v7.1.3 Guide - Intelligent automatic memory awareness
β 85%+ trigger accuracy with semantic pattern detection
β Multi-tier performance (50ms instant β 150ms fast β 500ms intensive)
β CLI management system for real-time configuration
β Git-aware context integration for enhanced relevance
β Zero-restart installation with dynamic hook loading
π v7.0.0 OAuth & Team Collaboration
π OAuth 2.1 Setup Guide - NEW! Complete OAuth 2.1 Dynamic Client Registration guide
π Integration Guide - Claude Desktop, Claude Code HTTP transport, VS Code, and more
π‘οΈ Advanced Configuration - Updated! OAuth security, enterprise features
𧬠v8.23.0+ Memory Consolidation
π Memory Consolidation System Guide - NEW! Automated memory maintenance with real-world performance metrics
β Dream-inspired consolidation (decay scoring, association discovery, compression, archival)
β 24/7 automatic scheduling (daily/weekly/monthly via HTTP server)
β Token-efficient Code Execution API (90% token reduction vs MCP tools)
β Real-world performance data (4-6 min for 2,495 memories with hybrid backend)
β Three manual trigger methods (HTTP API, MCP tools, Python API)
π Setup & Installation
π Installation Guide - Complete installation for all platforms and use cases
π₯οΈ Platform Setup Guide - Windows, macOS, and Linux optimizations
β‘ Performance Optimization - Speed up queries, optimize resources, scaling
π§ Advanced Topics
π¨βπ» Development Reference - Claude Code hooks, API reference, debugging
π§ Troubleshooting Guide - Updated! OAuth troubleshooting + common issues
β FAQ - Frequently asked questions
π Examples - Practical code examples and workflows
π Internal Documentation
π Repository Statistics - 10 months of development metrics, activity patterns, and insights
ποΈ Architecture Specs - Search enhancement specifications and design documents
π©βπ» Development Docs - AI agent instructions, release checklist, refactoring notes
π Deployment Guides - Docker, dual-service, and production deployment
π Additional Guides - Storage backends, migration, mDNS discovery
β¨ Key Features
π Production-Ready Reliability π v8.9.0
Hybrid Backend - Fast 5ms local SQLite + background Cloudflare sync (RECOMMENDED default)
Zero user-facing latency for cloud operations
Automatic multi-device synchronization
Graceful offline operation
Zero Database Locks - Concurrent HTTP + MCP server access works flawlessly
Auto-configured SQLite pragmas (
busy_timeout=15000,cache_size=20000)WAL mode with proper multi-client coordination
Tested: 5/5 concurrent writes succeeded with no errors
Auto-Configuration - Installer handles everything
SQLite pragmas for concurrent access
Cloudflare credentials with connection testing
Claude Desktop integration with hybrid backend
Graceful fallback to sqlite_vec if cloud setup fails
π Document Ingestion System v8.6.0
Interactive Web UI - Drag-and-drop document upload with real-time progress
Multiple Formats - PDF, TXT, MD, JSON with intelligent chunking
Document Viewer - Browse chunks, view metadata, search content
Smart Tagging - Automatic tagging with length validation (max 100 chars)
Optional semtools - Enhanced PDF/DOCX/PPTX parsing with LlamaParse
Security Hardened - Path traversal protection, XSS prevention, input validation
7 New Endpoints - Complete REST API for document management
π Enterprise Authentication & Team Collaboration
OAuth 2.1 Dynamic Client Registration - RFC 7591 & RFC 8414 compliant
Claude Code HTTP Transport - Zero-configuration team collaboration
JWT Authentication - Enterprise-grade security with scope validation
Auto-Discovery Endpoints - Seamless client registration and authorization
Multi-Auth Support - OAuth + API keys + optional anonymous access
π§ Intelligent Memory Management
Semantic search with vector embeddings
Natural language time queries ("yesterday", "last week")
Tag-based organization with smart categorization
Memory consolidation with dream-inspired algorithms
Document-aware search - Query across uploaded documents and manual memories
π Universal Compatibility
Claude Desktop - Native MCP integration
Claude Code - HTTP transport + Memory-aware development with hooks
πͺ Windows Support:
/session-startcommand for manual session initialization (workaround for issue #160)π macOS/Linux: Full automatic SessionStart hooks + slash command
VS Code, Cursor, Continue - IDE extensions
13+ AI applications - REST API compatibility
πΎ Flexible Storage
Hybrid π (RECOMMENDED) - Fast local SQLite + background Cloudflare sync (v8.9.0 default)
5ms local reads with zero user-facing latency
Multi-device synchronization
Zero database locks with auto-configured pragmas
Automatic backups and cloud persistence
SQLite-vec - Local-only storage (lightweight ONNX embeddings, 5ms reads)
Good for single-user offline use
No cloud dependencies
Cloudflare - Cloud-only storage (global edge distribution with D1 + Vectorize)
Network-dependent performance
Note: All heavy ML dependencies (PyTorch, sentence-transformers) are now optional to dramatically reduce build times and image sizes. SQLite-vec uses lightweight ONNX embeddings by default. Install with
--with-mlfor full ML capabilities.
π Production Ready
Cross-platform - Windows, macOS, Linux
Service installation - Auto-start background operation
HTTPS/SSL - Secure connections with OAuth 2.1
Docker support - Easy deployment with team collaboration
Interactive Dashboard - Web UI at http://127.0.0.1:8888/ for complete management
π‘ Basic Usage
π Document Ingestion (v8.6.0+)
π Team Collaboration with OAuth (v7.0.0+)
π§ Memory Operations
π§ Configuration
Claude Desktop Integration
Recommended approach - Add to your Claude Desktop config (~/.claude/config.json):
Alternative approaches:
Environment Variables
Hybrid Backend (v8.9.0+ RECOMMENDED):
SQLite-vec Only (Local):
ποΈ Architecture
π οΈ Development
Project Structure
Contributing
Fork the repository
Create a feature branch
Make your changes with tests
Submit a pull request
See CONTRIBUTING.md for detailed guidelines.
π Support
π Documentation: Wiki - Comprehensive guides
π Bug Reports: GitHub Issues
π¬ Discussions: GitHub Discussions
π§ Troubleshooting: Troubleshooting Guide
β Configuration Validator: Run
python scripts/validation/validate_configuration_complete.pyto check your setupπ Backend Sync Tools: See scripts/README.md for CloudflareβSQLite sync
π In Production
Real-world metrics from active deployments:
1700+ memories stored and actively used across teams
5ms local reads with hybrid backend (v8.9.0)
Zero database locks with concurrent HTTP + MCP access (v8.9.0)
Tested: 5/5 concurrent writes succeeded
Auto-configured pragmas prevent lock errors
<500ms response time for semantic search (local & HTTP transport)
65% token reduction in Claude Code sessions with OAuth collaboration
96.7% faster context setup (15min β 30sec)
100% knowledge retention across sessions and team members
Zero-configuration setup success rate: 98.5% (OAuth + hybrid backend)
π Recognition
Verified MCP Server
Featured AI Tool
Production-tested across 13+ AI applications
Community-driven with real-world feedback and improvements
π License
Apache License 2.0 - see LICENSE for details.
Ready to supercharge your AI workflow? π
π Start with our Installation Guide or explore the Wiki for comprehensive documentation.
Transform your AI conversations into persistent, searchable knowledge that grows with you.
Appeared in Searches
- Understanding the concept of 'handle' in different contexts
- A search for information about thinking or related concepts
- How to add documentation to a project or system
- An open-source vector database for similarity search and AI applications
- Why do models always lose memory and have poor memory retention?