MCP Memory Service

Overview Schema Related Servers Score Discussions

mcp-memory-service

CLAUDE.md•18.4 KiB

# CLAUDE.md This file provides guidance to Claude Code (claude.ai/code) when working with this MCP Memory Service repository. > **📝 Personal Customizations**: You can create `CLAUDE.local.md` (gitignored) for personal notes, custom workflows, or environment-specific instructions. This file contains shared project conventions. > **Note**: Comprehensive project context has been stored in memory with tags `claude-code-reference`. Use memory retrieval to access detailed information during development. ## 🔴 Critical Directives **IMPORTANT**: Before working with this project, read: - **`.claude/directives/memory-tagging.md`** - MANDATORY: Always tag memories with `mcp-memory-service` as first tag - **`.claude/directives/README.md`** - Additional topic-specific directives ## Overview MCP Memory Service is a Model Context Protocol server providing semantic memory and persistent storage for Claude Desktop with SQLite-vec, Cloudflare, and Hybrid storage backends. > **🆕 v8.52.0**: **Time-of-Day Emoji Icons** - Visual indicators on all memory timestamps with 8 emoji icons for 3-hour segments (🌙🌅☕💻🍽️⛅🍷🛏️). Features: tooltips, dark mode support, automatic timezone detection, negligible performance impact (~0.1ms per memory). See [CHANGELOG.md](CHANGELOG.md) for full version history. > > **Note**: When releasing new versions, update this line with current version + brief description. Use `.claude/agents/github-release-manager.md` agent for complete release workflow. ## Essential Commands | Category | Command | Description | |----------|---------|-------------| | **Setup** | `python scripts/installation/install.py --storage-backend hybrid` | Install with hybrid backend (recommended) | | | `uv run memory server` | Start server | | | `pytest tests/` | Run tests | | **Memory Ops** | `claude /memory-store "content"` | Store information | | | `claude /memory-recall "query"` | Retrieve information | | | `claude /memory-health` | Check service status | | **Quality System** | `curl http://127.0.0.1:8000/api/quality/distribution` | Get quality analytics (v8.45.0+) | | | `curl -X POST http://127.0.0.1:8000/api/quality/memories/{hash}/rate -d '{"rating":1}'` | Rate memory quality | | | `curl http://127.0.0.1:8000/api/quality/memories/{hash}` | Get quality metrics | | | `export MCP_QUALITY_BOOST_ENABLED=true` | Enable quality-boosted search | | **Validation** | `python scripts/validation/validate_configuration_complete.py` | Comprehensive config validation | | | `python scripts/validation/diagnose_backend_config.py` | Cloudflare diagnostics | | **Maintenance** | `python scripts/maintenance/consolidate_memory_types.py --dry-run` | Preview type consolidation | | | `python scripts/maintenance/find_all_duplicates.py` | Find duplicates | | | `python scripts/sync/check_drift.py` | Check hybrid backend drift (v8.25.0+) | | **Quality** | `bash scripts/pr/quality_gate.sh 123` | Run PR quality checks | | | `bash scripts/pr/quality_gate.sh 123 --with-pyscn` | Comprehensive quality analysis (includes pyscn) | | | `bash scripts/quality/track_pyscn_metrics.sh` | Track quality metrics over time | | | `bash scripts/quality/weekly_quality_review.sh` | Generate weekly quality review | | | `pyscn analyze .` | Run pyscn static analysis | | **Consolidation** | `curl -X POST http://127.0.0.1:8000/api/consolidation/trigger -H "Content-Type: application/json" -d '{"time_horizon":"weekly"}'` | Trigger memory consolidation | | | `curl http://127.0.0.1:8000/api/consolidation/status` | Check scheduler status | | | `curl http://127.0.0.1:8000/api/consolidation/recommendations` | Get consolidation recommendations | | **Backup** | `curl -X POST http://127.0.0.1:8000/api/backup/now` | Trigger manual backup (v8.29.0+) | | | `curl http://127.0.0.1:8000/api/backup/status` | Check backup status and schedule | | | `curl http://127.0.0.1:8000/api/backup/list` | List available backups | | **Sync Controls** | `curl -X POST http://127.0.0.1:8000/api/sync/pause` | Pause hybrid backend sync (v8.29.0+) | | | `curl -X POST http://127.0.0.1:8000/api/sync/resume` | Resume hybrid backend sync | | **Service** | `systemctl --user status mcp-memory-http.service` | Check HTTP service status (Linux) | | | `scripts/service/memory_service_manager.sh status` | Check service status | | **Debug** | `curl http://127.0.0.1:8000/api/health` | Health check | | | `npx @modelcontextprotocol/inspector uv run memory server` | MCP Inspector | See [scripts/README.md](scripts/README.md) for complete command reference. ## Architecture **Core Components:** - **Server Layer**: MCP protocol with async handlers, global caches (`src/mcp_memory_service/server.py:1`) - **Storage Backends**: SQLite-Vec (5ms reads), Cloudflare (edge), Hybrid (local + cloud sync) - **Web Interface**: FastAPI dashboard at `http://127.0.0.1:8000/` with REST API - **Document Ingestion**: PDF, DOCX, PPTX loaders (see [docs/document-ingestion.md](docs/document-ingestion.md)) - **Memory Hooks**: Natural Memory Triggers v7.1.3+ with 85%+ accuracy (see below) **Key Patterns:** - Async/await for I/O, type safety (Python 3.10+), platform hardware optimization (CUDA/MPS/DirectML/ROCm) ## Document Ingestion Supports PDF, DOCX, PPTX, TXT/MD with optional [semtools](https://github.com/run-llama/semtools) for enhanced quality. ```bash claude /memory-ingest document.pdf --tags documentation claude /memory-ingest-dir ./docs --tags knowledge-base ``` See [docs/document-ingestion.md](docs/document-ingestion.md) for full configuration and usage. ## Interactive Dashboard Web interface at `http://127.0.0.1:8000/` with CRUD operations, semantic/tag/time search, real-time updates (SSE), mobile responsive. Performance: 25ms page load, <100ms search. **API Endpoints:** `/api/search`, `/api/search/by-tag`, `/api/search/by-time`, `/api/events`, `/api/quality/*` (v8.45.0+) ## Memory Quality System 🆕 (v8.45.0+) **AI-driven automatic quality scoring** with local-first design for zero-cost, privacy-preserving memory evaluation. ### Quick Overview **Tier 1**: Local SLM via ONNX - `nvidia-quality-classifier-deberta` (v8.49.0+) - Cost: $0 (runs locally), Privacy: ✅ Full, Offline: ✅ Yes - Eliminates self-matching bias, uniform distribution (mean 0.60-0.70) - Performance: 80-150ms (CPU), 20-40ms (GPU) **Tier 2-3**: Groq/Gemini APIs (optional), **Tier 4**: Implicit signals ### Key Features 1. **Automatic Quality Scoring** - Evaluates every retrieved memory (0.0-1.0 score) - Combines AI evaluation + usage patterns - Non-blocking (async background scoring) 2. **Quality-Boosted Search** - Reranks results: `0.7 × semantic + 0.3 × quality` - Over-fetches 3×, returns top N by composite score - Opt-in via `MCP_QUALITY_BOOST_ENABLED=true` 3. **Quality-Based Forgetting** - High quality (≥0.7): Preserved 365 days inactive - Medium (0.5-0.7): Preserved 180 days inactive - Low (<0.5): Archived 30-90 days inactive 4. **Dashboard Integration** - Quality badges on all memory cards (color-coded) - Analytics view with distribution charts - Top/bottom performers lists ### Essential Configuration ```bash # Recommended setup (implicit signals only for technical corpora) export MCP_QUALITY_SYSTEM_ENABLED=true export MCP_QUALITY_AI_PROVIDER=none export MCP_QUALITY_BOOST_ENABLED=false ``` **For complete configuration options, MCP tools, performance metrics, and troubleshooting:** → See [`.claude/directives/quality-system-details.md`](.claude/directives/quality-system-details.md) → Full guide: [docs/guides/memory-quality-guide.md](docs/guides/memory-quality-guide.md) ### Hooks Integration (v8.45.3+) Quality scoring is now integrated with memory awareness hooks: **Phase 1: Backend Quality in Hooks** - `memory-scorer.js` reads `quality_score` from memory metadata - Weight: 20% of hook scoring (reduces contentQuality/contentRelevance) - Graceful fallback to 0.5 if quality_score not available **Phase 2: Async Quality Evaluation** - Session-end hook triggers `/api/quality/memories/{hash}/evaluate` - Non-blocking: 10s timeout, doesn't delay session end - ONNX ranker provides ~355ms evaluation time **Phase 3: Quality-Boosted Retrieval** ```bash # Search with quality boost curl -X POST http://127.0.0.1:8000/api/search \ -H "Content-Type: application/json" \ -d '{"query": "...", "quality_boost": true, "quality_weight": 0.3}' ``` **Complete Flow:** ``` Session End → Store Memory → Trigger /evaluate (async) ↓ ONNX Ranker (355ms) ↓ Update metadata.quality_score ↓ Next Session → Hook Retrieval → backendQuality = 20% weight ``` ### Documentation **Updated for v8.48.3** - See [docs/guides/memory-quality-guide.md](docs/guides/memory-quality-guide.md) for: - Comprehensive user guide - Configuration examples - Troubleshooting - Best practices - Performance benchmarks ## Memory Consolidation System 🆕 **Dream-inspired memory consolidation** with automatic scheduling and Code Execution API (v8.23.0+). ### Quick Overview **Scheduler**: HTTP Server (runs 24/7, independent of Claude Desktop) **Performance**: 5-25s (SQLite), 2-4min (Cloudflare), 4-6min (Hybrid) **Features**: Exponential decay, association discovery, clustering, compression, controlled forgetting ### Essential Commands ```bash # Trigger consolidation curl -X POST http://127.0.0.1:8000/api/consolidation/trigger \ -H "Content-Type: application/json" -d '{"time_horizon": "weekly"}' # Check status curl http://127.0.0.1:8000/api/consolidation/status # Code Execution API (token-efficient) from mcp_memory_service.api import consolidate result = consolidate('weekly') # 90% token reduction vs MCP tools ``` ### Essential Configuration ```bash export MCP_CONSOLIDATION_ENABLED=true export MCP_CONSOLIDATION_QUALITY_BOOST_ENABLED=true # 20% boost for ≥5 connections ``` **For complete details:** → See [`.claude/directives/consolidation-details.md`](.claude/directives/consolidation-details.md) → Full guide: [docs/guides/memory-consolidation-guide.md](docs/guides/memory-consolidation-guide.md) → Wiki: [Memory Consolidation System Guide](https://github.com/doobidoo/mcp-memory-service/wiki/Memory-Consolidation-System-Guide) ## Environment Variables **Essential Configuration:** ```bash # Storage Backend (Hybrid is RECOMMENDED for production) export MCP_MEMORY_STORAGE_BACKEND=hybrid # hybrid|cloudflare|sqlite_vec # Cloudflare Configuration (REQUIRED for hybrid/cloudflare backends) export CLOUDFLARE_API_TOKEN="your-token" # Required for Cloudflare backend export CLOUDFLARE_ACCOUNT_ID="your-account" # Required for Cloudflare backend export CLOUDFLARE_D1_DATABASE_ID="your-d1-id" # Required for Cloudflare backend export CLOUDFLARE_VECTORIZE_INDEX="mcp-memory-index" # Required for Cloudflare backend # Web Interface (Optional) export MCP_HTTP_ENABLED=true # Enable HTTP server export MCP_HTTPS_ENABLED=true # Enable HTTPS (production) export MCP_API_KEY="$(openssl rand -base64 32)" # Generate secure API key ``` **Configuration Precedence:** Environment variables > .env file > Global Claude Config > defaults **✅ Automatic Configuration Loading (v6.16.0+):** The service now automatically loads `.env` files and respects environment variable precedence. CLI defaults no longer override environment configuration. **⚠️ Important:** When using hybrid or cloudflare backends, ensure Cloudflare credentials are properly configured. If health checks show "sqlite-vec" when you expect "cloudflare" or "hybrid", this indicates a configuration issue that needs to be resolved. **Platform Support:** macOS (MPS/CPU), Windows (CUDA/DirectML/CPU), Linux (CUDA/ROCm/CPU) ## Claude Code Hooks Configuration 🆕 > **🚨 CRITICAL - Windows Users**: SessionStart hooks cause Claude Code to hang on Windows (#160). **Workaround**: Use `/session-start` slash command or UserPromptSubmit hooks instead. **Natural Memory Triggers v7.1.3** - 85%+ trigger accuracy, multi-tier processing (50ms → 150ms → 500ms) **Installation:** ```bash cd claude-hooks && python install_hooks.py --natural-triggers # CLI Management node ~/.claude/hooks/memory-mode-controller.js status node ~/.claude/hooks/memory-mode-controller.js profile balanced ``` **Performance Profiles:** - `speed_focused`: <100ms, minimal memory awareness - `balanced`: <200ms, optimal for general development (recommended) - `memory_aware`: <500ms, maximum context awareness → Complete configuration: [`.claude/directives/hooks-configuration.md`](.claude/directives/hooks-configuration.md) ## Storage Backends | Backend | Performance | Use Case | Installation | |---------|-------------|----------|--------------| | **Hybrid** ⚡ | Fast (5ms read) | **🌟 Production (Recommended)** | `--storage-backend hybrid` | | **Cloudflare** ☁️ | Network dependent | Cloud-only deployment | `--storage-backend cloudflare` | | **SQLite-Vec** 🪶 | Fast (5ms read) | Development, single-user | `--storage-backend sqlite_vec` | **Hybrid Backend Benefits:** - 5ms read/write + multi-device sync + graceful offline operation **Database Lock Prevention (v8.9.0+):** - After adding `MCP_MEMORY_SQLITE_PRAGMAS` to `.env`, **restart all servers** - SQLite pragmas are per-connection, not global → Complete details: [`.claude/directives/storage-backends.md`](.claude/directives/storage-backends.md) ## Development Guidelines **CRITICAL**: Before starting development, read these directives: → [`.claude/directives/development-setup.md`](.claude/directives/development-setup.md) - Editable install (MANDATORY) → [`.claude/directives/pr-workflow.md`](.claude/directives/pr-workflow.md) - Pre-PR quality checks (MANDATORY) → [`.claude/directives/version-management.md`](.claude/directives/version-management.md) - Use github-release-manager agent ### Quick Reminders **Development Setup:** - Always `pip install -e .` for development (loads from source, not site-packages) - Verify: `pip show mcp-memory-service | grep Location` (should show `.../src`) - Common symptom of stale install: Code shows v8.23.0 but server reports v8.5.3 **Before Creating PR:** ```bash bash scripts/pr/pre_pr_check.sh # MANDATORY - catches 20+ issues before review gh pr create --fill gh pr comment <PR> --body "/gemini review" ``` **Version Management:** - **ALWAYS** use github-release-manager agent (even for hotfixes) - Four-file bump: `__init__.py` → `pyproject.toml` → `README.md` → `uv lock` - Manual releases miss steps (see v8.20.1 lesson) **Memory & Documentation:** - Use 24 core memory types (see `scripts/maintenance/memory-types.md`) - Capture decisions with `claude /memory-store` during development - Tag manually stored memories with `mcp-memory-service` as first tag **Architecture & Testing:** - Storage backends implement abstract base class - All features require corresponding tests - UI testing: page load <2s, operations <1s ## Code Quality Monitoring **Three-layer strategy:** 1. **Pre-commit** (<5s) - Groq/Gemini complexity + security (blocks: complexity >8, any security issues) 2. **PR Quality Gate** (10-60s) - `quality_gate.sh --with-pyscn` (blocks: security, health <50) 3. **Periodic Review** (weekly) - pyscn analysis + trend tracking **Health Score Thresholds:** - <50: 🔴 Release blocker (cannot merge) - 50-69: 🟡 Action required (refactor within 2 weeks) - 70+: ✅ Continue development → Complete workflow: [`.claude/directives/code-quality-workflow.md`](.claude/directives/code-quality-workflow.md) ## Configuration Management **Quick Validation:** ```bash python scripts/validation/validate_configuration_complete.py # Comprehensive validation python scripts/validation/diagnose_backend_config.py # Cloudflare diagnostics ``` **Configuration Hierarchy:** - Global: `~/.claude.json` (authoritative) - Project: `.env` file (Cloudflare credentials) - **Avoid**: Local `.mcp.json` overrides **Common Issues & Quick Fixes:** | Issue | Quick Fix | |-------|-----------| | Wrong backend showing | `python scripts/validation/diagnose_backend_config.py` | | Port mismatch (hooks timeout) | Verify same port in `~/.claude/hooks/config.json` and server (default: 8000) | | Schema validation errors after PR merge | Run `/mcp` in Claude Code to reconnect with new schema | | Accidental `data/memory.db` | Delete safely: `rm -rf data/` (gitignored) | See [docs/troubleshooting/hooks-quick-reference.md](docs/troubleshooting/hooks-quick-reference.md) for comprehensive troubleshooting. ## Hook Troubleshooting **SessionEnd Hooks:** - Trigger on `/exit`, terminal close (NOT Ctrl+C) - Require 100+ characters, confidence > 0.1 - Memory creation: topics, decisions, insights, code changes **Windows SessionStart Issue (#160):** - CRITICAL: SessionStart hooks hang Claude Code on Windows - Workaround: Use `/session-start` slash command or UserPromptSubmit hooks See [docs/troubleshooting/hooks-quick-reference.md](docs/troubleshooting/hooks-quick-reference.md) for full troubleshooting guide. ## Agent Integrations Workflow automation agents using Gemini CLI, Groq API, and Amp CLI. All agents in `.claude/agents/` directory. | Agent | Tool | Purpose | Priority | Usage | |-------|------|---------|----------|-------| | **github-release-manager** | GitHub CLI | Complete release workflow | Production | Proactive on feature completion | | **amp-bridge** | Amp CLI | Research without Claude credits | Production | File-based prompts | | **code-quality-guard** | Gemini CLI / Groq API | Fast code quality analysis | Active | Pre-commit, pre-PR | | **gemini-pr-automator** | Gemini CLI | Automated PR review loops | Active | Post-PR creation | **Quick Start:** ```bash # Release workflow @agent github-release-manager "Check if we need a release" # Code quality ./scripts/utils/groq "Complexity check: $(cat file.py)" # PR automation bash scripts/pr/auto_review.sh <PR_NUMBER> ``` **Groq Bridge (RECOMMENDED):** 10x faster than Gemini (200-300ms vs 2-3s), no OAuth interruption. → Complete workflows: `.claude/directives/agents.md` > **For detailed troubleshooting, architecture, and deployment guides:** > - **Backend Configuration Issues**: See [Wiki Troubleshooting Guide](https://github.com/doobidoo/mcp-memory-service/wiki/07-TROUBLESHOOTING#backend-configuration-issues) for comprehensive solutions to missing memories, environment variable issues, Cloudflare auth, hooks timeouts, and more > - **Historical Context**: Retrieve memories tagged with `claude-code-reference` > - **Quick Diagnostic**: Run `python scripts/validation/diagnose_backend_config.py`

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/doobidoo/mcp-memory-service'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

CLAUDE.md•18.4 KiB