Mnemos is a persistent memory engine for AI coding agents that stores, retrieves, and manages project knowledge across sessions using SQLite with full-text and semantic search.
Core MCP Tools:
mnemos_store: Save memories with optional metadata — type (short_term, long_term, episodic, semantic, working), tags, category, summary, source, and project scope. Automatically deduplicates and indexes content.
mnemos_search: Hybrid FTS5 + semantic search with configurable mode (text, semantic, or hybrid) and RRF ranking, optionally scoped to a project.
mnemos_context: Assemble the most relevant memories for a query within a token budget — ideal for context injection at session start.
mnemos_get: Fetch a specific memory by ID.
mnemos_update: Modify an existing memory's content, summary, or tags (PATCH semantics).
mnemos_delete: Soft-delete a memory by ID (recoverable).
mnemos_relate: Create typed relationships between memories with optional strength, building a knowledge graph.
mnemos_maintain: Run decay scoring, archival, and garbage collection to keep the memory store efficient.
Deployment & Integration:
Runs as an MCP server (stdio) or optional REST API server
One-command autopilot setup for Claude Code, Kiro, Cursor, and Windsurf
Single Go binary with embedded SQLite — no cloud, no runtime dependencies
Optional semantic search via Ollama or OpenAI embeddings
Optional Markdown mirror for human-readable memory export
Sub-60ms operations regardless of dataset size
mnemos
Your AI agent has the memory of a goldfish. Mnemos fixes that.
A persistent memory engine for AI coding agents.
Mnemos gives Claude Code, Kiro, Cursor, Windsurf, and other MCP clients a memory that survives across sessions: architecture decisions, bug root causes, project conventions, and non-obvious implementation details.
Single Go binary. Embedded SQLite. Zero runtime dependencies. No Docker. No cloud. No Python. No Node.
Agent (Claude Code / Kiro / Cursor / Windsurf / ...)
↓ MCP stdio
mnemos serve
↓
SQLite + FTS5 (~/.mnemos/mnemos.db)

What does it actually do?
Every time your agent learns something worth keeping, it stores it in Mnemos. Next session, it can pull that context back before it starts coding.
That means:
fewer repeated explanations
less re-discovery of old bugs and decisions
more continuity across sessions
better context for long-running projects
No more re-explaining your project structure every Monday morning. No more rediscovering the same environment quirk three times in one week.
The memory lifecycle:
Agent finishes something meaningful (fixed a bug, made a decision, learned a pattern)
Agent calls mnemos_store with the content
Mnemos deduplicates, classifies, and indexes it
Next session, mnemos_context assembles relevant memories within a token budget
Agent picks up right where it left off
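At its core, context assembly is a packing problem: fit the highest-value memories into a fixed token budget. Below is a deliberately simplified Python sketch of that idea. Mnemos actually uses MMR for diversity (see the performance table), and the 4-characters-per-token estimate here is a rough assumption, not the real tokenizer:

```python
def assemble_context(candidates, budget_tokens):
    """Greedily pack the best-scoring memories into a token budget.

    candidates: (relevance_score, text) pairs. Token cost is estimated
    crudely as ~1 token per 4 characters (an assumption for illustration).
    """
    picked, used = [], 0
    for score, text in sorted(candidates, reverse=True):
        cost = max(1, len(text) // 4)
        if used + cost <= budget_tokens:
            picked.append(text)
            used += cost
    return picked

memories = [
    (0.9, "Auth uses JWT RS256; tokens expire after 1h."),
    (0.8, "CI runs golangci-lint before tests."),
    (0.3, "Old note: " + "x" * 800),  # low score and far too big for the budget
]
context = assemble_context(memories, budget_tokens=40)
```

The real implementation also has to balance relevance against redundancy (that is what MMR adds); this sketch only shows the budget constraint.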
Why it feels different
Mnemos is built for real coding workflows, not just generic note storage.
MCP-native: designed to be called directly by coding agents
Fast to install: one binary, one local database
Actually useful retrieval: FTS + optional semantic search + context assembly
Lifecycle aware: deduplication, relevance decay, archive/GC
Readable by humans too: optional Markdown mirror
Autopilot ready: one setup command wires hooks + steering for Claude, Kiro, or Cursor
Quick Start
# install
curl -fsSL https://raw.githubusercontent.com/mnemos-dev/mnemos/main/install.sh | bash
# first-time setup
mnemos init
# run as MCP server
mnemos serve

Then wire it to your AI client — or use autopilot setup to make it fully automatic.
Autopilot Setup
Mnemos is an MCP server, not an agent controller. By default, the agent only uses memory if it's instructed to. Autopilot closes that gap: one command injects hook config, steering files, and MCP config so the agent uses memory automatically on every session — no reminding needed.
# Claude Code
mnemos setup claude
# Kiro
mnemos setup kiro
# Cursor
mnemos setup cursor

That's it. From that point:
Session start — Mnemos automatically loads relevant context into the agent's window
During work — Mnemos searches memory when the topic changes
Session end — Mnemos verifies memory was captured and cleans up state
Use --global to install for all projects instead of just the current one:
mnemos setup claude --global

Use --force to overwrite existing config files without prompting:

mnemos setup claude --force

What mnemos setup writes
Claude Code (mnemos setup claude):
| File | Purpose |
| --- | --- |
| CLAUDE.md | Steering instructions — tells Claude when and what to store |
| .claude/hooks.json | Hook config — wires session-start, prompt-submit, session-end |
| .mcp.json | MCP server config — registers the mnemos server |
Kiro (mnemos setup kiro):
| File | Purpose |
| --- | --- |
| .kiro/steering/mnemos.md | Steering file — auto-loaded by Kiro on every session |
| .kiro/settings/mcp.json | MCP server config |
Cursor (mnemos setup cursor):
| File | Purpose |
| --- | --- |
| .cursorrules | Steering instructions for Cursor |
| | MCP server config |
How autopilot works under the hood
Autopilot uses two complementary systems:
Hooks (deterministic) — mnemos hook session-start/prompt-submit/session-end
These are short-lived processes called by the AI client at specific lifecycle events. They run in InitLight mode (no background workers, cold start < 50ms) and always exit 0 — they never interrupt the agent session.
session-start: assembles relevant context from Mnemos and injects it into the agent's window
prompt-submit: detects topic changes using Jaccard similarity; searches memory when the topic shifts; respects a 5-minute cooldown per topic to avoid noise
session-end: counts memories stored during the session; optionally leaves a breadcrumb; cleans up session state
Steering (LLM-guided) — CLAUDE.md / .kiro/steering/mnemos.md / .cursorrules
These files instruct the agent on what to store and when. The agent makes the semantic judgment — hooks handle the mechanical retrieval.
Session start → hook injects context → agent reads it
During work → hook searches on topic change → agent receives results
→ agent discovers durable learning → agent calls mnemos_store via MCP
Session end → hook verifies coverage → state cleanup

Session state
Each session gets its own state file at .mnemos/sessions/session-<id>.json (falls back to ~/.mnemos/sessions/ if no local .mnemos/ dir exists). Stale and orphaned sessions are cleaned up automatically.
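The topic-change detection that prompt-submit relies on can be sketched as word-set overlap. The tokenization and the cutoff value below are illustrative assumptions; only the use of Jaccard similarity and the 5-minute cooldown are stated behavior:

```python
def jaccard(a, b):
    """Jaccard similarity of two token sets: |A ∩ B| / |A ∪ B|."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if (a | b) else 1.0

def topic_changed(prev_prompt, new_prompt, threshold=0.2):
    # Low word overlap between consecutive prompts suggests a topic shift,
    # which would trigger a memory search (subject to the cooldown).
    sim = jaccard(prev_prompt.lower().split(), new_prompt.lower().split())
    return sim < threshold

print(topic_changed("fix the jwt refresh bug",
                    "why does the jwt refresh fail"))   # same topic: False
print(topic_changed("fix the jwt refresh bug",
                    "update the readme install steps")) # new topic: True
```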
Why mnemos?
| | claude-mem | engram | neural-memory | mnemos |
| --- | --- | --- | --- | --- |
| MCP native | ✅ | ✅ | ✅ | ✅ |
| Single binary / zero install | ❌ | ✅ | ❌ (pip) | ✅ |
| Zero config to start | ✅ | ✅ | ❌ | ✅ |
| Hybrid search (FTS + semantic RRF) | ❌ | ❌ | ❌ | ✅ |
| Memory decay / lifecycle | ❌ | ❌ | ✅ | ✅ |
| Deduplication | ❌ | ❌ | ❌ | ✅ (3-tier) |
| Relation graph | ❌ | ❌ | ✅ (spreading activation) | ✅ |
| Token-budget context assembly | ❌ | partial | ❌ | ✅ |
| Human-readable Markdown mirror | ✅ | ❌ | ❌ | ✅ |
| Works with Kiro / Cursor / Windsurf | ❌ | ✅ | ✅ | ✅ |
| No Python / Node runtime required | ✅ | ✅ | ❌ | ✅ |
| One-command autopilot setup | ❌ | ❌ | ❌ | ✅ |
| Written in Go | ❌ | ✅ | ❌ (Python) | ✅ |
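The 3-tier deduplication could plausibly look like the sketch below: an exact content-hash check first, then a fuzzy string match against the config's fuzzy_threshold (0.85). This is an illustration, not Mnemos's actual code; the third tier, an embedding comparison against semantic_threshold (0.92), is omitted because it needs an embedding provider:

```python
import hashlib
from difflib import SequenceMatcher

def dedup_tier(new_text, existing, fuzzy_threshold=0.85):
    """Return which tier flagged the new memory as a duplicate, or None."""
    new_hash = hashlib.sha256(new_text.encode()).hexdigest()
    for text in existing:
        # Tier 1: byte-identical content, caught by a hash comparison.
        if hashlib.sha256(text.encode()).hexdigest() == new_hash:
            return "exact"
        # Tier 2: near-identical wording, caught by a similarity ratio.
        if SequenceMatcher(None, new_text, text).ratio() >= fuzzy_threshold:
            return "fuzzy"
    return None  # no duplicate found; safe to store

print(dedup_tier("JWT tokens expire in 1h",
                 ["JWT tokens expire in 1h."]))  # "fuzzy"
```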
Install
# curl (macOS / Linux)
curl -fsSL https://raw.githubusercontent.com/mnemos-dev/mnemos/main/install.sh | bash
# Homebrew
brew install s60yucca/tap/mnemos
# Build from source (requires Go 1.23+)
git clone https://github.com/mnemos-dev/mnemos
cd mnemos && make build
# binary at: bin/mnemos

Initialize on first run:
mnemos init
# Creates ~/.mnemos/mnemos.db and ~/.mnemos/config.yaml

Use with Claude Code
Autopilot (recommended):
mnemos setup claude

This writes CLAUDE.md, .claude/hooks.json, and .mcp.json in one shot. Restart Claude Code and memory is fully automatic.
Manual setup:
Add to .mcp.json in your project root:
{
"mcpServers": {
"mnemos": {
"command": "mnemos",
"args": ["serve"],
"env": {
"MNEMOS_PROJECT_ID": "my-project"
}
}
}
}

Then add a session instruction based on templates/claude/CLAUDE.md.
Use with Kiro
Autopilot (recommended):
mnemos setup kiro

This writes .kiro/steering/mnemos.md and .kiro/settings/mcp.json. Kiro picks up the steering file automatically on every session.
Manual setup:
Add to .kiro/settings/mcp.json:
{
"mcpServers": {
"mnemos": {
"command": "mnemos",
"args": ["serve"],
"env": {
"MNEMOS_PROJECT_ID": "my-project"
},
"disabled": false,
"autoApprove": ["mnemos_search", "mnemos_get", "mnemos_context"]
}
}
}

Copy templates/kiro/steering/mnemos.md to .kiro/steering/mnemos.md.
Use with Cursor / Windsurf / any MCP client
Autopilot:
mnemos setup cursor

Manual: Same JSON MCP config as above — mnemos speaks standard MCP over stdio.
MCP Tools
| Tool | What it does |
| --- | --- |
| mnemos_store | Store a memory with optional type, tags, project scope |
| mnemos_search | Hybrid FTS + semantic search with RRF ranking |
| mnemos_get | Fetch a memory by ID |
| mnemos_update | Update content, summary, or tags |
| mnemos_delete | Soft-delete (recoverable via maintain) |
| mnemos_relate | Link two memories with a typed relation |
| mnemos_context | Assemble relevant memories within a token budget |
| mnemos_maintain | Run decay, archival, and garbage collection |
Resources: mnemos://memories/{project_id}, mnemos://stats
Prompts: load_context (session start), save_session (session end)
CLI
mnemos init # first-time setup
mnemos store "JWT uses RS256, tokens expire in 1h" # store a memory
mnemos search "authentication" # hybrid search
mnemos search "auth" --mode text # text-only search
mnemos list --project myapp # list memories
mnemos get <id> # fetch by id
mnemos update <id> --content "updated text" # update
mnemos delete <id> # soft delete
mnemos delete <id> --hard # permanent delete
mnemos relate <src-id> <tgt-id> --type depends_on # create relation
mnemos stats --project myapp # storage stats
mnemos maintain # decay + GC
mnemos serve # start MCP server (stdio)
mnemos serve --rest --port 8080 # start REST server
mnemos version # print version
# Autopilot setup
mnemos setup claude # setup for Claude Code
mnemos setup kiro # setup for Kiro
mnemos setup cursor # setup for Cursor
mnemos setup claude --global # install globally (home dir)
mnemos setup claude --force # overwrite without prompting
# Hook subcommands (called automatically by AI clients — not for manual use)
mnemos hook session-start
mnemos hook prompt-submit
mnemos hook session-end

Global flags: --project <id>, --config <path>, --log-level debug|info|warn|error
Memory Types
Mnemos auto-classifies memories based on content. You can override manually.
| Type | Decay rate | Use for |
| --- | --- | --- |
| short_term | fast (~1 day) | todos, temp notes, WIP |
| episodic | medium (~1 month) | session events, bug fixes |
| long_term | slow (~6 months) | architecture decisions |
| semantic | very slow | facts, definitions, knowledge |
| working | fast | active task context |
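One plausible reading of these decay rates is an exponential relevance score with a per-type half-life. The formula and half-life values below are assumptions for illustration, not Mnemos's actual scoring; what is grounded in the default config is that memories fall out of active rotation once their score drops under archive_threshold (0.1):

```python
# Hypothetical half-lives, loosely matching the decay rates in the table above.
HALF_LIFE_DAYS = {
    "short_term": 1,
    "working": 1,
    "episodic": 30,
    "long_term": 180,
    "semantic": 3650,  # "very slow"
}

def relevance(memory_type, age_days):
    # Exponential decay: the score halves once per half-life.
    return 0.5 ** (age_days / HALF_LIFE_DAYS[memory_type])

print(relevance("episodic", 30))   # 0.5 after one month
print(relevance("short_term", 7))  # well below a 0.1 archive threshold
```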
How search works
Mnemos uses Reciprocal Rank Fusion (RRF) to combine two search signals:
FTS5 — SQLite full-text search with BM25 ranking. Fast, offline, no setup.
Semantic — vector cosine similarity via embeddings. Optional, requires Ollama or OpenAI.
With only FTS5 (default), search is keyword-based but still very good. Enable embeddings to find memories by meaning — e.g. query "token expiry" finds a memory about "JWT RS256 1h lifetime".
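The fusion step can be sketched in a few lines of Python. Note that k=60 is the conventional RRF constant; Mnemos's actual constant and any per-signal weighting are not specified here:

```python
def rrf(rankings, k=60):
    """Reciprocal Rank Fusion: merge several ranked lists of memory IDs.

    Each memory scores sum(1 / (k + rank)) over the lists it appears in,
    so items ranked well by both FTS5 and the semantic search rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, mem_id in enumerate(ranking, start=1):
            scores[mem_id] = scores.get(mem_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

fts_hits = ["m3", "m1", "m7"]        # BM25 order from FTS5
vector_hits = ["m1", "m9", "m3"]     # cosine-similarity order from embeddings
print(rrf([fts_hits, vector_hits]))  # m1 first: strong in both lists
```

RRF needs only ranks, not raw scores, which is why it can fuse BM25 values and cosine similarities without normalizing them onto a common scale.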
Configuration
~/.mnemos/config.yaml:
data_dir: ~/.mnemos
log_level: info
log_format: text   # text or json

embeddings:
  provider: noop   # noop (default) | ollama | openai
  base_url: http://localhost:11434
  model: nomic-embed-text
  dims: 384
  api_key: ""

dedup:
  fuzzy_threshold: 0.85
  semantic_threshold: 0.92

lifecycle:
  decay_interval: 24h
  gc_retention_days: 30
  archive_threshold: 0.1

mirror:
  enabled: false   # set true to write human-readable Markdown files
  base_dir: ~/.mnemos/mirror

hook:
  enabled: true
  search_cooldown: 5m
  session_start_max_tokens: 2000
  prompt_search_limit: 5
  stale_timeout: 1h
  log_level: warn

Environment variables override config — prefix with MNEMOS_:
MNEMOS_PROJECT_ID=myapp # scope memories to a project
MNEMOS_LOG_LEVEL=debug
# Only needed if using semantic embeddings (optional):
MNEMOS_EMBEDDINGS_PROVIDER=ollama
MNEMOS_EMBEDDINGS_API_KEY=sk-...

Embedding Providers
Embeddings are optional. By default mnemos uses noop — pure FTS5 text search, zero config, works fully offline.
Enable embeddings only if you want semantic similarity search (find memories by meaning, not just keywords).
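"Finding by meaning" reduces to cosine similarity between the embedding vectors the provider returns. A minimal sketch of the comparison step (the vectors here are toy 3-dim values; real embeddings have the dims configured below, e.g. 384, 768, or 1536):

```python
import math

def cosine(a, b):
    """Cosine similarity: 1.0 for parallel vectors, 0.0 for orthogonal ones."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

query_vec = [0.9, 0.1, 0.0]   # e.g. embedding of "token expiry"
memory_vec = [0.8, 0.2, 0.1]  # e.g. embedding of "JWT RS256 1h lifetime"
print(cosine(query_vec, memory_vec))  # close to 1.0: a semantic match
```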
Ollama (local, free, no API key):
embeddings:
  provider: ollama
  base_url: http://localhost:11434
  model: nomic-embed-text
  dims: 768

OpenAI:
embeddings:
  provider: openai
  model: text-embedding-3-small
  dims: 1536
  api_key: sk-...

Performance
Benchmarked on macOS (Apple M-series), SQLite WAL mode, embeddings disabled (noop), cold process start per operation.
| Operation | 350 memories | 1500 memories | Notes |
| --- | --- | --- | --- |
| store | 57 ms | 24 ms | includes dedup check |
| store (duplicate) | 55 ms | 22 ms | hash match, no write |
| search (text) | 60 ms | 54 ms | BM25 ranking |
| search (hybrid) | 42 ms | 39 ms | FTS + noop vector |
| list | 34 ms | 26 ms | sorted by created_at |
| stats | 27 ms | 108 ms | full table scan |
| context assembly (MMR, 20 candidates) | ~147 µs | — | in-process, token budget 2000 |
| hook session-start | < 200 ms | — | cold start, InitLight |
| hook prompt-submit | < 100 ms | — | cold start, InitLight |
| hook session-end | < 100 ms | — | cold start, InitLight |
| binary size | 12 MB | — | single static binary |
| startup time | ~50 ms | — | cold start |
Most operations stay under 60 ms regardless of dataset size. Hook subcommands use InitLight mode — no background workers start, keeping latency low enough to not interrupt agent sessions.
REST API
mnemos serve --rest --port 8080

POST   /memories             store
GET    /memories/{id}        get
PATCH  /memories/{id}        update
DELETE /memories/{id}        soft-delete
GET    /memories             list
POST   /memories/search      search
POST   /memories/{id}/relate relate
GET    /stats                stats
POST   /maintain             maintenance

Build
make build # → bin/mnemos
make test # all tests
make lint # golangci-lint
make release   # goreleaser snapshot

License
MIT