
Which integrations are available for this server?

Dropbox: Allows syncing memory archives via Dropbox for portable memory transfer between machines.
Git: Allows syncing memory archives via Git repositories for portable memory transfer between machines.
iCloud: Allows syncing memory archives via iCloud for portable memory transfer between machines.
Syncthing: Allows syncing memory archives via Syncthing for portable memory transfer between machines.
JetBrains: Provides memory integration for JetBrains MCP clients, enabling agents to use local cognitive memory.
Xcode: Provides memory integration for Xcode MCP clients, enabling agents to use local cognitive memory.
OpenAI-compatible endpoints: Supported for the Sanhedrin verification hook (an optional post-response verifier).

How do I use Vestige?

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Vestige remember that I prefer dark mode in all my tools".

That's it! The server will respond to your query, and you can continue using it as needed.

Vestige

by samvallad33


Rust

Local

Vestige is a local, neuroscience-inspired cognitive memory server for MCP-compatible AI agents — no cloud dependency. It goes far beyond simple storage:

Core Memory Operations

Store (smart_ingest): Ingest single or batch memories with Prediction Error Gating (auto-decides CREATE/UPDATE/SUPERSEDE)
Search (search): 7-stage pipeline — HyDE expansion, keyword, semantic, reranking, temporal, competition, spreading activation
Retrieve/Manage (memory): Get, edit, promote, demote, or purge individual memories
Session Init (session_context): One-call session start bundling search, intentions, status, and predictions
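
The CREATE/UPDATE/SUPERSEDE decision described for smart_ingest can be pictured with a small sketch. This is an illustration only, not Vestige's implementation: the similarity threshold, the `Stored` type, and the caller-supplied `contradicts` predicate are all assumptions made for the example.

```python
from dataclasses import dataclass
from math import sqrt


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class Stored:
    text: str
    vector: list[float]


def gate(new_vec, store, contradicts, threshold=0.8):
    """Return (action, matched_memory) for an incoming memory vector.

    threshold is a hypothetical cut-off; contradicts is a hypothetical
    predicate that says whether the new claim disagrees with a stored one.
    """
    if not store:
        return "CREATE", None
    best = max(store, key=lambda m: cosine(new_vec, m.vector))
    if cosine(new_vec, best.vector) < threshold:
        return "CREATE", None       # nothing similar enough: novel memory
    if contradicts(best):
        return "SUPERSEDE", best    # similar topic, conflicting content
    return "UPDATE", best           # similar and compatible: merge
```

The point of the shape is that novelty is judged against what is already stored, so a redundant write never becomes a second copy.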

Cognitive Reasoning

Deep reasoning (deep_reference): 8-stage pipeline with FSRS-6 trust scoring, contradiction analysis, and temporal supersession
Contradiction detection (contradictions): Scan for trust-weighted disagreements on a topic
Dreaming (dream): Replay recent memories to discover hidden connections and synthesize insights
Graph exploration (explore_connections): Build reasoning chains and find bridging memories via spreading activation
Proactive prediction (predict): Suggest relevant memories based on current context
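
Spreading activation, which several of the tools above rely on, is easy to show on a toy graph. The sketch below is a generic version of the idea from Collins & Loftus, not Vestige's code; the graph layout, `decay`, and `hops` parameters are assumptions for illustration.

```python
from collections import defaultdict


def spread(graph, seeds, decay=0.5, hops=2):
    """Propagate activation outward from seed nodes.

    graph: dict mapping node -> list of (neighbor, edge_weight) pairs.
    seeds: dict mapping node -> initial activation.
    Each hop passes energy * edge_weight * decay to every neighbor.
    """
    activation = defaultdict(float, seeds)
    frontier = dict(seeds)
    for _ in range(hops):
        next_frontier = defaultdict(float)
        for node, energy in frontier.items():
            for neighbor, weight in graph.get(node, []):
                next_frontier[neighbor] += energy * weight * decay
        for node, energy in next_frontier.items():
            activation[node] += energy
        frontier = next_frontier
    return dict(activation)


# Retrieving "a" also partially activates "b" and, more weakly, "c".
graph = {"a": [("b", 1.0)], "b": [("c", 1.0)]}
print(spread(graph, {"a": 1.0}))
```

A memory two links away from the query still receives some activation, which is how a bridging memory can surface without matching the query text.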

Active Forgetting

Suppress (suppress): Inhibitory control (reversible within 24h) that cascades decay to related memories — distinct from deletion

Memory Health & Maintenance

System status (system_status): Combined health, stats, FSRS preview, and recommendations
Memory health (memory_health): Retention distribution, trends, and recommendations
Consolidation (consolidate): Run FSRS-6 spaced-repetition decay cycle
Garbage collection (gc): Remove stale low-retention memories (dry-run by default)
Timeline & audit (memory_timeline, memory_changelog): Browse memories chronologically or view per-memory state transitions
Importance scoring (importance_score): 4-channel neuroscience model (novelty, arousal, reward, attention)

Deduplication & Merging

Detect duplicates (find_duplicates, merge_candidates), preview plans (plan_merge, plan_supersede), apply or undo them (apply_plan, merge_undo)
Protect memories from auto-merge/GC (protect); configure per-project thresholds (merge_policy)

Specialized Memory Types

Codebase (codebase): Store and retrieve code patterns and architectural decisions per project
Intentions (intention): Prospective memory triggers by time, context, or event — set, check, update, snooze

External System Sync

Source sync (source_sync): Index GitHub Issues or Redmine into a semantically-searchable offline index with incremental updates and tombstoning

Visualization & Graph Export

Memory graph (memory_graph): Export subgraphs for 3D force-directed visualization
Composed graph (composed_graph): Browse composition events, label outcomes, explore research lanes
3D Dashboard: Real-time WebSocket-powered Three.js graph of the memory network

Data Management

Backup (backup), export as JSON/JSONL/archive (export), and restore with optional merge (restore)

Vestige

Local-first long-term memory for AI agents, delivered over MCP. Vestige remembers your decisions, catches contradictions before they cost you, and traces a failure back to the older memory that actually caused it. One 25MB Rust binary. No cloud. Your data never leaves your machine.


What it is · Install · First interaction · vs RAG · Backward reach · Benchmark · Science · Tools · Dashboard · Integrations · Pro · Docs

What Vestige is

Hi, I'm Sam. I built Vestige because my agents kept re-learning the same lessons. They would recommend a change I had already tested and rejected, re-derive a fix that was already written down, and treat every session as if the last one never happened.

Vestige is the memory layer that fixes that. It runs locally as an MCP server, so any MCP-capable agent (Claude Code, Claude Desktop, Codex, Cursor, and others) can write memories during a session and retrieve them later. Your data lives in a SQLite file on your own machine. After a one-time model download it works fully offline, with no API keys and no telemetry.

The part that makes it more than a note store: Vestige models memory on real cognitive science. It merges what is redundant, supersedes what is contradicted, keeps what you actually use, and lets unused memories fade. Most importantly, when a failure hits it can reach backward to the earlier decision that caused it, even when the cause and the symptom share no vocabulary. The cause never looks like the bug.


Install

Three steps. You need Node.js installed (for the npm command) and nothing else.

1. Install the server

No Docker, no API key, no signup.

npm install -g vestige-mcp-server@latest

This installs the vestige-mcp command. Prebuilt binaries ship for macOS (Apple Silicon and Intel), Linux x86_64, and Windows x86_64, so there is no compile step.

2. Connect it to your agent

Vestige speaks MCP, so it works with any MCP-capable agent. Most MCP clients accept a config of this shape. Add it to your client's MCP settings:

{
  "mcpServers": {
    "vestige": {
      "command": "vestige-mcp"
    }
  }
}

If you prefer the CLI, use the one-line shortcut for your agent:

Agent	Setup
Claude Code	`claude mcp add vestige vestige-mcp -s user`
Codex	`codex mcp add vestige -- vestige-mcp`
Cursor / VS Code / Windsurf	add the JSON above to the editor's MCP settings, or see docs/integrations/
Cline / Continue / Zed / Goose	add the JSON above to that client's MCP config
Claude Desktop	docs/CONFIGURATION.md#claude-desktop-macos
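
If you would rather script the JSON edit than paste it by hand, a minimal sketch follows. It assumes your client reads a JSON file with a top-level `mcpServers` object; the file path is whatever your client uses, and `add_vestige` is a helper name invented for this example.

```python
import json
from pathlib import Path


def add_vestige(config_path: Path) -> dict:
    """Add the vestige entry to an MCP config file, keeping other servers."""
    config = {}
    if config_path.exists():
        raw = config_path.read_text()
        if raw.strip():
            config = json.loads(raw)
    # Create the mcpServers object if the file did not have one yet.
    servers = config.setdefault("mcpServers", {})
    servers["vestige"] = {"command": "vestige-mcp"}
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

Call it as `add_vestige(Path("path/to/your/client/config.json"))`, substituting the real location from your client's documentation. Existing server entries are left untouched.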

3. Verify

On first run, Vestige downloads its embedding model once (about 130MB). After that it never needs the network again. To confirm the server is healthy, open the dashboard:

vestige dashboard

Then visit http://localhost:3927/dashboard. If you see the graph, you are connected. For a fuller walkthrough see docs/GETTING-STARTED.md.
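
The same check can be scripted. The sketch below only tests that something answers at the dashboard URL given above; treating any 2xx response as "healthy" is an assumption, and `dashboard_is_up` is a helper name made up for this example.

```python
from urllib.error import URLError
from urllib.request import urlopen

DASHBOARD_URL = "http://localhost:3927/dashboard"


def dashboard_is_up(url=DASHBOARD_URL, timeout=2.0, opener=urlopen) -> bool:
    """Return True if the dashboard URL answers with a 2xx status."""
    try:
        with opener(url, timeout=timeout) as response:
            return 200 <= response.status < 300
    except (URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    print("dashboard reachable" if dashboard_is_up() else "dashboard not reachable")
```

The `opener` parameter exists only so the function can be exercised without a running server.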

Your first real interaction

Memories go in as you work. The interesting behavior shows up when a new claim conflicts with something you already stored.

Say your agent recorded this earlier:

We use Postgres for the primary datastore. Decided against MySQL for the JSONB support.

Later, someone tells the agent the opposite:

Our primary datastore is MySQL.

When the agent tries to store that, Vestige does not silently append it. The engine returns a claim_contradicts_memory status and surfaces the older, conflicting memory, so the agent can resolve the conflict instead of quietly holding two incompatible facts.
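
On the agent side, handling that status might look like the sketch below. Only the `claim_contradicts_memory` status string comes from this README; the shape of the result dictionary, the `conflicting_memory` and `content` keys, and the `next_step` helper are hypothetical and stand in for whatever the real tool response contains.

```python
def next_step(result: dict) -> str:
    """Decide what the agent should do after an ingest attempt.

    result is a hypothetical, simplified view of the tool response.
    """
    if result.get("status") == "claim_contradicts_memory":
        older = result.get("conflicting_memory", {}).get("content", "<unknown>")
        # Surface the conflict instead of silently storing both facts.
        return f"resolve: new claim conflicts with stored memory: {older}"
    return "continue"


example = {
    "status": "claim_contradicts_memory",
    "conflicting_memory": {"content": "We use Postgres for the primary datastore."},
}
print(next_step(example))
```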

The other command you will reach for is backfill. When something breaks, run:

vestige backfill --contrast

This walks backward from the failure to the earlier memory that most plausibly caused it, and shows you the contrast between what you believed then and what went wrong now. That backward reach is the feature the rest of this README builds up to.

How it differs from RAG

RAG retrieves text that resembles your query. That is the right tool when the answer looks like the question. It is the wrong tool when the cause of a problem looks nothing like the symptom.

	Plain RAG / vector search	Vestige
Retrieval basis	Text similarity to the query	Causal and temporal links, plus similarity
Finding a root cause	Cannot, because the cause does not resemble the bug	Reaches backward to the root-cause memory
Contradictions	Stored side by side, both returned	Detected and flagged (`claim_contradicts_memory`)
Redundant writes	Accumulate as duplicates	Merged on write via prediction-error gating
Unused memories	Persist at full weight	Fade over time (FSRS-6 spaced repetition)
Where it runs	Usually a cloud service	Local single binary, offline after setup
Your data	Leaves your machine	Never leaves your machine

The distinction is not marketing. DeepMind proved that single-vector retrieval is mathematically incapable of representing certain relevance patterns (arXiv:2508.21038, ICLR 2026). That theorem is about the limits of the vector-only approach. The measured gap on the task below is my own.
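
To make the "cause does not resemble the bug" point concrete, here is a toy lexical ranker. It is a textbook BM25 scorer written for this illustration, not the benchmark's baseline script, and the two documents and the query are invented. The causal note shares no words with the error, so it scores zero while the look-alike note ranks first.

```python
import math
from collections import Counter


def tokenize(text: str) -> list[str]:
    return text.lower().split()


def bm25_scores(query: str, docs: list[str], k1: float = 1.5, b: float = 0.75) -> list[float]:
    """Score each document against the query with Okapi BM25."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    avg_len = sum(len(d) for d in tokenized) / n
    # Document frequency: in how many documents each term appears.
    df = Counter(term for d in tokenized for term in set(d))
    scores = []
    for doc in tokenized:
        tf = Counter(doc)
        score = 0.0
        for term in set(tokenize(query)):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores.append(score)
    return scores


docs = [
    "pinned the auth library to version 2 during the migration",       # the cause
    "jwt signature verification failed in the login test last month",  # a look-alike
]
query = "login test fails with jwt signature verification error"
for doc, score in zip(docs, bm25_scores(query, docs)):
    print(f"{score:.3f}  {doc}")
```

Dense embeddings soften this effect but do not remove it: they still rank by resemblance to the query.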

Backward reach: the backfill feature

Most memory systems only look forward: you ask a question, they return similar text. Vestige also looks backward.

When a failure lands, the useful memory is rarely the one that resembles the error message. It is an older decision, made in different words, that set the failure up. A config choice from three weeks ago. A library pin. An assumption nobody wrote down as risky at the time.

Vestige implements Retroactive Salience Backfill (Zaki, Cai et al., Nature 2024, 637:145-155, DOI 10.1038/s41586-024-08168-4). When a memory turns out to matter, the system reaches backward and raises the salience of the earlier memories that led to it, so the causal chain becomes retrievable even though the surface text never matched.

In practice you run vestige backfill --contrast. Vestige returns the earlier memory that most plausibly caused the current failure, alongside the contradiction between then and now. It finds the cause you would not have thought to search for.

Silent Rotation: a reproducible benchmark

The claim above is testable, and the test ships with every transcript it produced.

Silent Rotation lives at benchmarks/silent-rotation/. Three coding agents fix one failing end-to-end test in a TypeScript monorepo. The fix needs the currently live signing key id, which is randomized per trial from a 50-key keyring and appears in no file the agents can read. It exists only in the memory layer.

Reproduce the central result in two seconds. Python standard library only, no API keys, no network:

git clone -b benchmark/silent-rotation --depth 1 https://github.com/samvallad33/vestige.git
cd vestige/benchmarks/silent-rotation
python3 tests/bm25_baseline.py results/runA-trial-1/corpus-export.json --no-dense

What it measures. A fleet either converges on the correct key, converges on a planted decoy, or splits and fails to merge. The second outcome is the dangerous one: tests pass, the merge is clean, and production breaks.

The numbers. 6 models, 25 trials, 246 published agent transcripts.

Arm	Converged correct	Converged wrong	Split
No memory	0/25	21/25	4/25
Dense cosine RAG	4/23	12/23	7/23
Vestige	20/23	0/23	3/23
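
For readers who prefer rates to raw counts, the table above converts as follows. The counts are copied from the table; nothing else is assumed.

```python
# (converged correct, converged wrong, split) per arm, from the table above.
results = {
    "No memory": (0, 21, 4),
    "Dense cosine RAG": (4, 12, 7),
    "Vestige": (20, 0, 3),
}


def rates(counts: tuple[int, int, int]) -> tuple[float, ...]:
    """Convert outcome counts to percentages of that arm's trials."""
    total = sum(counts)
    return tuple(round(100 * c / total, 1) for c in counts)


for arm, counts in results.items():
    correct, wrong, split = rates(counts)
    print(f"{arm} (n={sum(counts)}): {correct}% correct, {wrong}% wrong, {split}% split")
```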

Two separate claims, kept separate on purpose:

The theorem (DeepMind). Single-vector retrieval is mathematically incapable of representing certain relevance patterns (arXiv:2508.21038, ICLR 2026). This is a fundamental limit of vector-only search.
The measurement (mine). On the verbatim queries the agents actually typed, the causal memory ranks 7th of 8 under both dense cosine and BM25, while the decoy ranks 1st.

The caveats are published alongside the results, including the trials where a plain cosine baseline ties Vestige and the trial Vestige loses.

The science

Every mechanism below is a cited result, implemented in Rust, running locally. None of it calls a cloud model to sound smart. Full write-up in docs/SCIENCE.md.

Mechanism	What it does	Source
Prediction-Error Gating	Stores only what is novel: merges redundant, supersedes contradictory	Hippocampal novelty gating
FSRS-6 spaced repetition	21-parameter schedule so used memories persist and unused ones fade	Modern spaced-repetition research
Retroactive Salience Backfill	Reaches backward to a failure's root-cause memory	Zaki, Cai et al. 2024, Nature 637:145-155, 10.1038/s41586-024-08168-4
Synaptic Tagging	Marks memories for later consolidation	Frey & Morris 1997, 10.1038/385533a0
Spreading Activation	Retrieving one memory activates related ones through the graph	Collins & Loftus 1975, 10.1037/0033-295X.82.6.407
Dual-Strength	Separates how well something is stored from how easily it is retrieved	Bjork & Bjork 1992
Memory Dreaming	Sleep-like consolidation that replays and synthesizes memories	Sleep consolidation and replay
Active Forgetting	Top-down inhibition that suppresses a memory, cascades to neighbors, reversible for 24 hours	Anderson 2025, Davis 2020
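
The "unused ones fade" behaviour in the FSRS row can be illustrated with the power-law forgetting curve used by FSRS-family schedulers. This is a sketch of the general curve shape, not Vestige's parameters: the default `decay` of -0.5 is the fixed value from earlier FSRS versions, while FSRS-6 treats the exponent as a trainable parameter whose value here is not stated in this README.

```python
def retrievability(elapsed_days: float, stability: float, decay: float = -0.5) -> float:
    """Probability of recall after elapsed_days, given memory stability.

    The factor is chosen so that retrievability is exactly 0.9 when
    elapsed_days equals stability, whatever the decay exponent is.
    """
    factor = 0.9 ** (1 / decay) - 1
    return (1 + factor * elapsed_days / stability) ** decay


# A memory with 10 days of stability, checked at several ages.
for days in (0, 10, 30, 90):
    print(days, round(retrievability(days, stability=10), 3))
```

Each successful recall raises stability, which stretches the curve; a memory that is never retrieved keeps sliding down it.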

The 13 tools

Vestige exposes exactly 13 MCP tools. Your agent calls them; you rarely call them by hand.

Tool	Purpose
`recall`	Retrieve memories relevant to the current context
`backfill`	Reach backward from a failure to its root-cause memory
`smart_ingest`	Store a fact, with gating for novelty and contradiction
`memory`	Read, inspect, promote, or demote individual memories
`graph`	Explore the memory graph and its links
`maintain`	Run consolidation and lifecycle maintenance
`dedup`	Find and merge duplicate memories
`suppress`	Actively forget a memory (reversible for 24h)
`memory_status`	Report health, counts, and model readiness
`codebase`	Index and query codebase-scoped memory
`intention`	Track goals and open intentions across sessions
`source_sync`	Sync memories from external connected sources
`session_start`	Prime the agent with relevant context at session start
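
Under the hood, an agent invokes any of these tools with an MCP `tools/call` request over JSON-RPC 2.0. The sketch below builds such a message. The envelope follows the MCP specification, but the `content` argument name is a placeholder: this README does not document the tools' input schemas.

```python
import json
from itertools import count

_request_ids = count(1)


def tool_call(name: str, arguments: dict) -> str:
    """Build a JSON-RPC 2.0 'tools/call' request for an MCP server."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    })


# "content" is a hypothetical argument name used only for illustration.
print(tool_call("smart_ingest", {"content": "We use Postgres for the primary datastore."}))
```

In normal use your MCP client builds and sends these messages for you.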

The dashboard

vestige dashboard

Open http://localhost:3927/dashboard to watch your memory as a live 3D graph.

It is built with SvelteKit 2 and Svelte 5 and renders with WebGPU and Three.js, with bloom. A live WebSocket feed drives the graph, which holds 1000+ nodes at 60fps. Memories appear, link, strengthen, and fade in real time as your agent works. It installs as a PWA if you want it as a standalone app.

Works with every agent

Vestige is a standard MCP server, so it works with any MCP-capable client. The universal config is all most agents need:

{
  "mcpServers": {
    "vestige": {
      "command": "vestige-mcp"
    }
  }
}

Client	Setup
Claude Code	`claude mcp add vestige vestige-mcp -s user`
Codex	`codex mcp add vestige -- vestige-mcp`
Cursor	docs/integrations/cursor.md
VS Code	docs/integrations/vscode.md
Windsurf	docs/integrations/windsurf.md
Claude Desktop	docs/CONFIGURATION.md#claude-desktop-macos
Cline / Continue / Zed / Goose	add the universal config above

Full configuration reference: docs/CONFIGURATION.md. Intel Mac notes: docs/INSTALL-INTEL-MAC.md.

Optional: make the agent use memory automatically

By default your agent calls the tools when it decides to. If you want memory to be a standing habit (recall at the start of a task, save durable facts as they land), give the agent a short protocol.

General agent memory protocol: docs/AGENT-MEMORY-PROTOCOL.md
Claude-specific setup and templates: docs/CLAUDE-SETUP.md

This is opt-in. Vestige works fine with no protocol at all.

Vestige Pro

Everything above is free forever and never metered. The engine runs on your machine, with no account, no quota, and no upsell inside the product.

Vestige Pro is for when that memory needs to follow you. It is managed, end-to-end encrypted continuity of your memory graph and your accountability history (Black Box traces, receipts, memory PRs) across every machine you work on. You record a decision on the laptop, and the agent on the desktop already knows it.

	Detail
Price	$19/month
What syncs	Your memory graph plus your accountability history
Encryption	XChaCha20-Poly1305, applied on your machine before anything is uploaded
Key derivation	Argon2id over a passphrase you choose
What the server holds	Ciphertext only

Zero-knowledge is the design, not a setting. You pick one passphrase, you use the same one on every device, and it never leaves your machine. The server stores bytes it cannot read, and the client refuses to sync anything in plaintext. If you lose that passphrase, the encrypted data is unrecoverable, by me and by anyone else. That is the property you are paying for, not a gap in it.

Availability. Checkout is not open yet, so there is nothing to buy today and no payment link here pretending otherwise. The client half already ships in this release, which is why vestige sync --cloud exists and tells you what it needs. Subscriptions open shortly. To catch the announcement, watch Releases or follow Discussions.

Under the hood

Vestige is a single Rust binary. No sidecar services, no external database, no cloud dependency.

Component	Detail
Language	Rust 2024 edition, about 96,000 lines
Distribution	Single 25MB binary, prebuilt for all platforms
Embeddings	Nomic Embed Text v1.5 (768d reduced to 256d via Matryoshka, 8192-token context)
Reranker	Qwen3 reranker, optional
Vector search	USearch HNSW
Storage	SQLite with FTS5, optional SQLCipher encryption
First run	Downloads about 130MB embedding model once, then fully offline forever
Platforms	macOS (ARM + Intel), Linux x86_64, Windows x86_64, all prebuilt
Quality	1,550 tests passing, clippy clean with `-D warnings`
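
The "768d reduced to 256d via Matryoshka" entry refers to a simple operation: keep the leading dimensions and renormalise. A minimal sketch, assuming plain Python lists as vectors; `truncate_embedding` is a name invented for this example, and the model card for Nomic Embed also describes a layer-normalisation step before truncation that is left out here for brevity.

```python
from math import sqrt


def truncate_embedding(vector: list[float], dims: int = 256) -> list[float]:
    """Keep the first dims components and rescale to unit length."""
    head = vector[:dims]
    norm = sqrt(sum(x * x for x in head))
    return [x / norm for x in head] if norm else head


full = [0.01 * (i % 7 + 1) for i in range(768)]  # stand-in for a real embedding
small = truncate_embedding(full)
print(len(full), "->", len(small))
```

Matryoshka-trained models concentrate the most useful information in the leading dimensions, which is why the shorter vector stays usable for search while taking a third of the space.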

Storage internals and encryption: docs/STORAGE.md.

Go deeper

Doc	What's in it
Getting Started	Full first-run walkthrough
FAQ	Common questions
The Science	Every mechanism with its citation
Configuration	All options and per-agent setup
Storage	Storage format and encryption
Agent Memory Protocol	Teaching an agent to use memory automatically
Intel Mac install	Notes for older Macs
Silent Rotation	The reproducible benchmark
Changelog	Release history

If Vestige saves you from one repeated mistake, that is the whole point: never solve the same problem twice. If it earns a place in your setup, star it on GitHub. It genuinely helps me keep building.

Built by Sam. Licensed under AGPL-3.0.
