Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Hippocampus Memory MCP ServerWhat do I know about Python programming?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
π§ Hippocampus Memory MCP Server
Persistent, Semantic Memory for Large Language Models
Features β’ Installation β’ Quick Start β’ Documentation β’ Architecture
π Overview
A Python-based Model Context Protocol (MCP) server that gives LLMs persistent, hippocampus-inspired memory across sessions. Store, retrieve, consolidate, and forget memories using semantic similarity search powered by vector embeddings.
Why Hippocampus? Just like the human brain's hippocampus consolidates short-term memories into long-term storage, this server intelligently manages LLM memory through biological patterns:
π Consolidation - Merge similar memories to reduce redundancy
π§Ή Forgetting - Remove outdated information based on age/importance
π Semantic Retrieval - Find relevant memories through meaning, not keywords
β¨ Features
Feature | Description |
ποΈ Vector Storage | FAISS-powered semantic similarity search |
π― MCP Compliant | Full MCP 1.2.0 spec compliance via FastMCP |
𧬠Bio-Inspired | Hippocampus-style consolidation and forgetting |
π Security | Input validation, rate limiting, injection prevention |
π Semantic Search | Sentence transformer embeddings (CPU-optimized) |
βΎοΈ Unlimited Storage | No memory count limits, only per-item size limits |
π 100% Free | Local embedding model - no API costs |
π Quick Start
5 Core MCP Tools
π¦ Installation
Quick Install (Recommended)
Prerequisites: Python 3.9+ β’ ~200MB disk space (for embedding model)
Claude Desktop Integration
Add to your Claude Desktop config (claude_desktop_config.json):
π That's it! Claude will now have persistent memory across conversations.
Install from Source (Alternative)
π Documentation
Memory Operations via MCP
Once connected to Claude, use natural language:
Direct API Usage
βοΈ Writing Memories
π Reading Memories
π Consolidating Memories
π§Ή Forgetting Memories
Testing
Run the included test suite:
This tests all 5 operations with sample data.
ποΈ Architecture
π Memory Lifecycle
Step | Process | Technology |
π Write | Text β 384-dim vector embedding | Sentence Transformers (CPU) |
πΎ Store | Normalized vector β FAISS index | FAISS IndexFlatIP |
π Search | Query β embedding β top-k similar | Cosine similarity |
π Consolidate | Group similar (>0.85) β merge | Vector clustering |
π§Ή Forget | Filter by age/importance/tags β delete | Metadata filtering |
π Security
Protection | Implementation |
π‘οΈ Injection Prevention | Regex filtering of script tags, eval(), path traversal |
β±οΈ Rate Limiting | 100 requests per 60-second window per client |
π Size Limits | 50KB text, 5KB metadata, 20 tags per memory |
β Input Validation | Pydantic models + custom sanitization |
π Safe Logging | stderr only (prevents JSON-RPC corruption) |
βοΈ Configuration
Environment Variables
Storage Limits
β Unlimited total memories (no count limit)
β οΈ Per-memory limits: 50KB text, 5KB metadata, 20 tags
π Troubleshooting
First run downloads all-MiniLM-L6-v2 (~90MB). Ensure internet connection and ~/.cache/ write permissions.
The model runs on CPU. Ensure 2GB+ free RAM. Reduce top_k in read operations if needed.
π License
MIT License - feel free to use in your projects!
π€ Contributing
PRs welcome! Please:
Follow MCP security guidelines
Add tests for new features
Update documentation
π Resources
Built with π§ for persistent LLM memory