What can you do with this server?

Forgetful is a semantic memory and knowledge management MCP server for AI agents, enabling structured storage and retrieval of information across sessions and multiple agent instances, built on Zettelkasten principles. * Memory Management: Create, search, update, link/unlink, and retire atomic memories (<400 words, one concept per note) with semantic embeddings for natural language retrieval and provenance tracking (source repo, confidence, encoding agent). * Semantic Search & Ranking: Query using natural language with dense + sparse retrieval, Reciprocal Rank Fusion (RRF), and cross-encoder reranking for high-precision recall. * Automatic Knowledge Graph: Memories are auto-linked to semantically similar entries (≥0.7 threshold); manual linking also supported. Includes typed entity-entity relationships (e.g., works_for, owns). * Entity Management: Create and manage real-world entities (people, organizations, teams, devices) with aliases, notes, relationships, and links to memories and projects. * Project Organization: Group memories, entities, documents, and code artifacts under named projects for scoped retrieval. * Document Storage: Store long-form content (>300 words) such as guides, analysis, and reports, with the ability to extract and link atomic memories from them. * Code Artifact Storage: Save reusable code snippets with language tagging, descriptions, and project/memory associations. * Procedural Skill Storage: Store step-by-step instructions and agent capabilities with semantic search and import/export in Agent Skills SKILL.md format. * Token Budget Management: Query results are automatically prioritized by importance then recency and truncated to fit a configurable token budget (default 8K tokens). * Meta-Tool Pattern: Only 3 tools are exposed to MCP clients (discover_forgetful_tools, how_to_use_forgetful_tool, execute_forgetful_tool), preserving context window space while providing access to 42+ underlying tools. * Cross-Agent Knowledge Sharing: Acts as a shared knowledge base accessible by multiple AI agents simultaneously. * CLI Access: Full terminal client (forgetful command) for managing memories, projects, entities, and tools directly from the command line. * Flexible Deployment & Auth: Supports STDIO or HTTP transport, SQLite or PostgreSQL backends, Docker deployment, and multiple auth flows including OAuth and bearer tokens.

How do I use Forgetful?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Forgetful search for memories about Zettelkasten method" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

Forgetful

by ScottRBK

Overview Schema Related Servers Score Discussions

Python

Hybrid

Forgetful is a semantic memory and knowledge management MCP server for AI agents, enabling structured storage and retrieval of information across sessions and multiple agent instances, built on Zettelkasten principles.

Memory Management: Create, search, update, link/unlink, and retire atomic memories (<400 words, one concept per note) with semantic embeddings for natural language retrieval and provenance tracking (source repo, confidence, encoding agent).
Semantic Search & Ranking: Query using natural language with dense + sparse retrieval, Reciprocal Rank Fusion (RRF), and cross-encoder reranking for high-precision recall.
Automatic Knowledge Graph: Memories are auto-linked to semantically similar entries (≥0.7 threshold); manual linking also supported. Includes typed entity-entity relationships (e.g., works_for, owns).
Entity Management: Create and manage real-world entities (people, organizations, teams, devices) with aliases, notes, relationships, and links to memories and projects.
Project Organization: Group memories, entities, documents, and code artifacts under named projects for scoped retrieval.
Document Storage: Store long-form content (>300 words) such as guides, analysis, and reports, with the ability to extract and link atomic memories from them.
Code Artifact Storage: Save reusable code snippets with language tagging, descriptions, and project/memory associations.
Procedural Skill Storage: Store step-by-step instructions and agent capabilities with semantic search and import/export in Agent Skills SKILL.md format.
Token Budget Management: Query results are automatically prioritized by importance then recency and truncated to fit a configurable token budget (default 8K tokens).
Meta-Tool Pattern: Only 3 tools are exposed to MCP clients (discover_forgetful_tools, how_to_use_forgetful_tool, execute_forgetful_tool), preserving context window space while providing access to 42+ underlying tools.
Cross-Agent Knowledge Sharing: Acts as a shared knowledge base accessible by multiple AI agents simultaneously.
CLI Access: Full terminal client (forgetful command) for managing memories, projects, entities, and tools directly from the command line.
Flexible Deployment & Auth: Supports STDIO or HTTP transport, SQLite or PostgreSQL backends, Docker deployment, and multiple auth flows including OAuth and bearer tokens.

Forgetful

Python License MCP FastMCP FastEmbed Discord

Forgetful is a storage and retrieval tool for AI Agents. Designed as a Model Context Protocol (MCP) server built using the FastMCP framework. Once connected to this service, MCP clients such as Coding Agents, Chat Bots or your own custom built Agents can store and retrieve information from the same knowledge base.

Banner

Related MCP server: local-memory-mcp

Overview

A lot of us are using AI Agents now, especially in the realm of software development. The pace at which work and decisions are made can make it difficult for you to keep up from a notes and context persistence perspective.

So if you are following something like the BMAD Method for example and you want to take your brain storming session you've just had with Claude on your desktop/mobile and use it for the basis of your next Claude Code session, then having a shared knowledge base across the two agents can help with this.

This is just one example use case to illustrate the point, more and more agentic applications are going to surface and the use cases for sharing data across them is going to increase.

Knowledge bases are going to become a key infrastructure component for your interactions with AIs. There are many excellent knowledge base solutions available (many for free on github) and I would encourage you to check them out and find one that works for you (even if Forgetful doesn't) as I found from personal experience that interactions with my agents got easier and more rewarding once they knew more about me, my work and previous interactions that I had had with them or other AI systems.

What makes Forgetful different from other Memory based MCP services is that it is a rather opinionated view on how AI Agents such store and retrieve data.

Forgetful imposes the Zettelkasten principle when clients wish to record memories, that is each memory must be atomic (one concept per note). Along with the note (title and content), we also ask the client / agent to provide context around what it was doing when creating the note, along with keywords and tags. With this information we create semantic embeddings and store these to aid with later retrieval and in addition to this we also automatically link the memory to existing memories that have a particular similarity score, allowing for the automatic construction of a knowledge graph.

In this sense Forgetful becomes a little bit like Obsidian for AI Agents, where the auto linking nudges them in building up a graph of the knowledge.

We find, as do others (A-MEM: Agentic Memory or LLM Agents), all this helps in ensuring that when the agent requires relevant information from the memory system later, the correct information is returned.

In addition to just memories, Forgetful also has the concept of entities (think organisation, people, products), projects, documents, code artifacts, skills (procedural knowledge following the Agent Skills standard), and plans with tasks for multi-agent coordination, all of which can be associated with one or more memories.

Architecture

Features

Configure either STDIO or HTTP transport mechanism (or stand up two services to support both)
Multiple Authentication supported, flows see FastMCP docs for full list
Meta Tool Discovery, only three tools exposed to client application to preserve context window.
Flexible Storage– SQLite (default, zero-config) or PostgreSQL (for scale and production deployments)
Stores memories as vectors and allowing memories to be retrieved from natural language queries from AI.
Cross Encoder reranking to improve recall and precision of memory retrieval.
Flexible ranking (embedding and cross encoder) providers, run everything locally without calls to the cloud thanks to FastEmbed
Automatic linking of semantically similar memories, automating the creation of the knowledge graph.
Plans and Tasks for multi-agent coordination -- structure work into plans with tasks that have acceptance criteria, state management with optimistic locking, and dependency tracking with cycle detection.
Skills for procedural memory -- store step-by-step instructions and agent capabilities with semantic search, import/export in Agent Skills SKILL.md format, and cross-referencing with memories.

For the complete roadmap, see Features Roadmap.

Quick Start

Option 1: PyPI (Recommended)

# Run directly with uvx (no installation needed)
uvx forgetful-ai

# Or install globally
uv tool install forgetful-ai
forgetful

Data stored in platform-appropriate locations (~/.local/share/forgetful on Linux/Mac, AppData on Windows).

By default, runs with stdio transport for MCP clients. For HTTP:

uvx forgetful-ai --transport http --port 8020

Option 2: From Source

git clone https://github.com/ScottRBK/forgetful.git
cd forgetful

# Install dependencies with uv
uv sync

# Run the server (uses SQLite by default)
uv run main.py

The server starts with stdio transport. For HTTP: uv run main.py --transport http

Option 3: Docker Deployment (Production/Scale)

Forgetful provides two Docker deployment options:

SQLite with Docker (Simpler, Single-Container)

See docker-compose.sqlite.yml

cd docker
cp .env.example .env
# Edit .env: Set DATABASE=SQLite and SQLITE_PATH=data/forgetful.db
docker compose -f docker-compose.sqlite.yml up -d

The SQLite database persists in the ./data directory on the host.

PostgreSQL with Docker (Recommended for multitenant)

See docker-compose.postgres.yml and .env.example

cd docker
cp .env.example .env
# Edit .env: Set DATABASE=Postgres and configure POSTGRES_* settings
docker compose -f docker-compose.postgres.yml up -d

Note: If no .env file exists, the application uses defaults from app/config/settings.py. For all configuration options, see Configuration Guide.

Connecting to An Agent

For detailed connection guides (Claude Code, Claude Desktop, other clients that support MCP), see Connectivity Guide.

Claude Code
VS Code
Copilot CLI (includes custom agents and skills)
Cursor
Codex
Gemini CLI (includes custom commands)
Opencode (includes custom commands and skills)

Add Forgetful to your MCP client configuration:

stdio transport (recommended for local use):

{
  "mcpServers": {
    "forgetful": {
      "type": "stdio",
      "command": "uvx",
      "args": ["forgetful-ai"]
    }
  }
}

HTTP transport (for Docker/remote):

{
  "mcpServers": {
    "forgetful": {
      "type": "http",
      "url": "http://localhost:8020/mcp"
    }
  }
}

CLI

The forgetful command is also a full terminal client over the same tool registry the MCP meta-tools use - against your local database by default, or a remote deployment after auth login.

uv tool install forgetful-ai

# Curated verbs for daily use
forgetful memory save "Set generateResolvConf false to fix WSL2 DNS" \
    --title "WSL2 DNS fix" --importance 7
forgetful memory search "wsl dns" -c "wsl networking" -n 5
forgetful memory get 812
forgetful memory recent -n 10 -p my-project
forgetful project list

# Generic passthrough to any of the 42 tools
forgetful tools list --category memory
forgetful tools info query_memory
forgetful call create_project --args '{"name": "Homelab", "description": "...", "project_type": "personal"}'

# Remote deployment (browser OAuth; saves FORGETFUL_SERVER to ~/.config/forgetful/.env)
forgetful auth login --server https://forgetful.example.com
forgetful auth status
forgetful memory search "wsl dns" -c "wsl networking"          # now runs remotely
forgetful memory search "wsl dns" -c "wsl networking" --local  # force local mode per invocation

# Scripting: --json emits machine-readable output
forgetful memory search "wsl dns" -c "wsl networking" --json | jq '.primary_memories[0].id'

forgetful serve is the canonical way to run the MCP server (forgetful serve --transport http --port 8020); the bare forgetful / uvx forgetful-ai invocation and the legacy --transport/--re-embed flags keep working indefinitely, so existing MCP client configurations are unaffected. Headless environments can set FORGETFUL_TOKEN (bearer) instead of the OAuth flow. See the Configuration Guide for precedence rules, or walk through the forgetful-cli-setup skill for install, auth, and verification steps end-to-end.

Usage Examples

Forgetful exposes tools through a meta-tools pattern - only 3 tools visible to your MCP client, with 42 tools accessible via execute_forgetful_tool. See Complete Tool Reference for all tools.

Example 1: Project-Scoped Memory

Create a memory linked to a project for better organization and scoped retrieval.

# Create project for organizing related knowledge
project = execute_forgetful_tool(
    "create_project",
    {
        "name": "E-Commerce Platform Redesign",
        "project_type": "work",
        "status": "active"
    }
)

# Create memory linked to project
memory = execute_forgetful_tool(
    "create_memory",
    {
        "title": "Payment gateway: Stripe chosen over PayPal",
        "content": "Selected Stripe for better API docs, lower fees, and built-in fraud detection. PayPal lacks webhooks for subscription management.",
        "importance": 9,
        "tags": ["payment", "stripe", "decision"],
        "project_id": project["project_id"]
    }
)

# Later, query within project scope
results = execute_forgetful_tool(
    "query_memory",
    {
        "query": "payment processing implementation",
        "project_id": project["project_id"]
    }
)
# Returns: Stripe decision + auto-linked related memories

Example 2: Knowledge Graph with Entities

Track people, organizations, and relationships - perfect for team and infrastructure management.

# New engineer joins your company
new_hire = execute_forgetful_tool(
    "create_entity",
    {
        "name": "Jordan Taylor",
        "entity_type": "Individual",
        "description": "Backend Engineer - Payments Team",
        "tags": ["engineering", "backend", "payments"]
    }
)

# Get company entity (create if needed)
company = execute_forgetful_tool(
    "create_entity",
    {
        "name": "TechFlow Systems",
        "entity_type": "Organization",
        "description": "SaaS platform company"
    }
)

# Create employment relationship
execute_forgetful_tool(
    "create_entity_relationship",
    {
        "from_entity_id": new_hire["entity_id"],
        "to_entity_id": company["entity_id"],
        "relationship_type": "works_for",
        "metadata": {
            "role": "Backend Engineer II",
            "department": "Payments",
            "start_date": "2025-01-20"
        }
    }
)

# Create memory about hiring
hire_memory = execute_forgetful_tool(
    "create_memory",
    {
        "title": "Jordan Taylor hired - payments focus",
        "content": "Jordan joins to build Stripe integration and handle PCI compliance. Previous experience with payment systems at FinanceApp Corp.",
        "importance": 7,
        "tags": ["team", "hiring", "payments"]
    }
)

# Link person to memory
execute_forgetful_tool(
    "link_entity_to_memory",
    {
        "entity_id": new_hire["entity_id"],
        "memory_id": hire_memory["memory_id"]
    }
)

# Query Jordan's related knowledge
results = execute_forgetful_tool(
    "query_memory",
    {"query": "Jordan payment implementation"}
)
# Returns: Hiring memory + linked entity + relationship context

Tool Categories

Forgetful provides tools across 7 categories:

Memory Tools (7) – create, query, update, link, mark obsolete
Project Tools (5) – organize knowledge by context/scope
Entity Tools (15) – track people, orgs, devices; build knowledge graphs
Code Artifact Tools (5) – store reusable code snippets
Document Tools (5) – store long-form content (>400 words)
Skill Tools (10) – store procedural knowledge with semantic search and SKILL.md import/export
User Tools (2) – profile and authentication

For complete documentation with extensive examples, see Complete Tool Reference.

How It Works

Atomic Memory Principle

Inspired by Zettelkasten, each memory stores one concept in ~300-400 words:

Easily titled – Forces clarity (200 char limit)
Self-contained – Understandable without external context
Linkable – Small units enable precise knowledge graphs

For detailed content, use Documents and extract 3-7 atomic memories that link to the parent document.

Automatic Knowledge Graph

When you create a memory:

Embedding generated – FastEmbed converts content to 384-dimensional vector
Similarity search – Finds top semantically-related memories (≥0.7 threshold)
Auto-linking – Creates bidirectional links to top 3-5 matches (configurable)
Graph traversal – Queries return primary results + 1-hop linked memories

Entities and Knowledge Graphs

Entities represent concrete, real-world things (people, organizations, teams, devices) that can be linked to memories:

Typed entities – Organizations, Individuals, Teams, Devices, Systems, or custom types
Relationships – Directional connections (e.g., "Person works_at Organization") with strength and metadata
Memory linking – Associate entities with relevant memories for context
Knowledge graph – Build networks showing how entities relate to each other and your knowledge base

Use entities for concrete things (Sarah Chen, TechFlow Systems, Cache Server 01) and memories for abstract concepts (architectural patterns, decisions, learnings).

Token Budget Management

Prevents context window overflow:

Configurable budget (default 8K tokens)
Results prioritized by importance (9-10 first) → recency (newest first)
Truncates gracefully if over budget
Respects max memory count (default 20)

This ensures agents get the most relevant context without overwhelming the LLM.

For deep dive on search architecture (dense → sparse → RRF → cross-encoder), see Search Documentation.

Configuration

No configuration required – Forgetful uses sensible defaults out of the box.

Key Settings (Optional)

MEMORY_TOKEN_BUDGET – Max tokens for query results (default: 8000)
EMBEDDING_MODEL – Embedding model (default: BAAI/bge-small-en-v1.5)
MEMORY_NUM_AUTO_LINK – Auto-link count (default: 3, set 0 to disable)
SERVER_PORT – HTTP server port (default: 8020)
MAX_GRAPH_LIMIT – Upper bound for /api/v1/graph ?limit and /api/v1/graph/subgraph ?max_nodes (default: 2000)

For all 40+ environment variables with detailed explanations, see Configuration Guide.

Documentation

Guides

Core Concepts – Memories vs Entities vs Documents explained
Complete Tool Reference – All 42 tools with extensive examples
REST API Reference – HTTP endpoints for web UI integration
Configuration Guide – All environment variables explained
Connectivity Guide – Connect Claude and other MCP clients
Self-Hosting Guide – Deploy on a VPS with Docker
Search Documentation – Embedding pipeline and retrieval architecture
Embedding Migration – Switch embedding providers safely
Features Roadmap – Planned features and priorities

External Resources

MCP Protocol Specification – Model Context Protocol docs
pgvector – PostgreSQL vector extension
FastEmbed – Local embedding generation
Zettelkasten Principle – Atomic note-taking method

Contributing

We welcome contributions! Forgetful uses integration + E2E testing with Docker Compose orchestration.

See Contributors Guide for:

Testing workflows (integration tests, E2E tests, GitHub Actions)
Development setup (local vs Docker)
CI/CD pipeline details
Release process

License

MIT License - see LICENSE for details.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

8dResponse time

1wRelease cycle

29Releases (12mo)

Issues opened vs closed

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ScottRBK/forgetful'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

Forgetful

Table of Contents

Overview

Features

Quick Start

Option 1: PyPI (Recommended)

Option 2: From Source

Option 3: Docker Deployment (Production/Scale)

SQLite with Docker (Simpler, Single-Container)

PostgreSQL with Docker (Recommended for multitenant)

Connecting to An Agent

CLI

Usage Examples

Example 1: Project-Scoped Memory

Example 2: Knowledge Graph with Entities

Tool Categories

How It Works

Atomic Memory Principle

Automatic Knowledge Graph

Entities and Knowledge Graphs

Token Budget Management

Configuration

Key Settings (Optional)

Documentation

Guides

External Resources

Contributing

License

Maintenance

Resources

Looking for Admin?

Tools

Latest Blog Posts

MCP directory API