How do I use agent-memory?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@agent-memory remember that my favorite language is Python" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

agent-memory

by tahasiddiquii

Overview Schema Related Servers Score Discussions

Python

Local

agent-memory

Long-term memory for agents, done properly. A memory service that keeps per-user memories, ranks recall by relevance, recency, and importance together, consolidates repeated events into semantic facts, and forgets low-value memories over time. Retrieval is always scoped, so one user's memory can never surface for another. Exposed as an MCP server so an agent can remember and recall over the protocol. Fully offline and keyless.

python license offline

An agent that forgets everything between sessions cannot help you twice. But naive memory is worse than none: dump every message into a vector store and recall returns the most recent chatter instead of the fact you need, or worse, one user's data leaks into another's session. This service treats memory as a ranking and governance problem, built from my retrieval and evaluation work.

What this demonstrates

Capability	Where
Scoped memory: retrieval never crosses users	memory.py
Ranking by relevance, recency, and importance together	memory.py
Grounded recall with a refusal path	recall.py
Consolidation of repeats into semantic facts	memory.py
Decay and forgetting of low-value memories	memory.py
Exposed as an MCP server	server.py
Recall quality and isolation gated in CI	evals.py

Related MCP server: Recall

Architecture

flowchart LR
    E[events per user] --> S[(scoped store)]
    S --> C[consolidate repeats to facts]
    S --> D[decay and forget low value]
    Q[question + user scope] --> R{{relevance + recency + importance}}
    S --> R
    R --> A[grounded recall or refuse]

Quickstart

make dev            # venv + install -e ".[dev]"

amem demo           # ranked recall vs a most-recent baseline, plus consolidation and decay
amem recall "what is my favorite programming language" --scope alice
amem eval           # the recall and isolation gate
amem serve          # live MCP server: remember / recall / consolidate

No keys, no network. Embeddings are a deterministic hashing vectorizer; set a real embedder behind the same interface in production.

The gate that matters

amem eval answers questions whose facts were introduced in earlier sessions (report):

metric	value	gate
full_recall	1.000	>= 0.90
naive_recall (most-recent)	0.200	< full
precision_at_k	1.000	>= 0.80
cross_scope_leaks	0	= 0
consolidation_merged	2	reported
forgotten_after_decay	3	reported

The comparison is the point. Ranked recall answers every question; a most-recent baseline answers one in five, because the fact you asked about is usually an older memory buried under recent chatter. Scoped retrieval leaks nothing across users even though two users have a memory about the same topic. CI fails if recall drops, if the baseline is not beaten, or if a single cross-scope leak appears.

What it does

Ranks, not just stores. Recall combines relevance, recency, and importance, so a salient old fact beats a trivial recent one. Naive most-recent recall gets this wrong, which the eval measures directly.
Keeps users apart. Two users each say "my favorite language is ..."; each only ever recalls their own. Isolation is structural, not best-effort.
Consolidates. Repeated events are folded into a single semantic fact with boosted importance, so the store does not bloat with duplicates.
Forgets on purpose. Low-importance, old memories decay below a threshold and are pruned, while important facts survive. Forgetting is a feature, and the eval reports what was dropped.
Refuses. With nothing relevant in scope, recall says it does not remember rather than returning noise.

Design decisions

Scope is the first filter. Retrieval is restricted to the caller's scope before ranking, so cross-user leakage is impossible by construction.
One score, three signals. Relevance alone recalls stale facts; recency alone recalls chatter; importance alone ignores the query. Combining them is what makes recall useful.
Decay is query-free. Forgetting uses recency and importance only, so the store sheds low-value memories independent of any particular question.

Layout

src/agent_memory/  embed · memory · recall · server · evals · cli
data/  memories.jsonl · questions.jsonl
reports/  memory_report_example.md

Part of a portfolio on production ML and LLM engineering:

hybrid-graph-rag: hybrid and graph retrieval benchmark
mcp-knowledge-server: permission-aware knowledge MCP server
llm-eval-observability: RAG evaluation and observability
ai-harness: multi-stage agent harness
agent-memory: this repo.

License

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/tahasiddiquii/agent-memory'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

agent-memory

What this demonstrates

Architecture

Quickstart

The gate that matters

What it does

Design decisions

Layout

Related repositories

License

Maintenance

Resources

Looking for Admin?

Latest Blog Posts

MCP directory API