
Muninn

"Muninn flies each day over the world to bring Odin knowledge of what happens." — Prose Edda

Local-first persistent memory infrastructure for coding agents and MCP-compatible tools.

Muninn provides deterministic, explainable memory retrieval with robust transport behavior and production-grade operational controls. Designed for long-running development workflows where continuity, auditability, and measurable quality matter — across sessions, across assistants, and across projects.


🚩 Status

Current Version: v3.24.0 (Phase 26 COMPLETE) · Stability: Production Beta · Test Suite: 1422+ passing, 0 failing

What's New in v3.24.0

  • Cognitive Architecture (CoALA): Integration of a proactive reasoning loop bridging memory with active decision-making.

  • Knowledge Distillation: Background synthesis of episodic memories into structured semantic manuals for long-term wisdom.

  • Epistemic Foraging: Active inference-driven search to resolve ambiguities and fill information gaps autonomously.

  • Omission Filtering: Automated detection of missing context required for successful task execution.

  • Elo-Rated SNIPS Governance: Dynamic memory retention system mapping retrieval success to Elo ratings for usage-driven decay.

Previous Milestones

| Version | Phase | Key Feature |
|---------|-------|-------------|
| v3.24.0 | 26 | Cognitive Architecture Complete |
| v3.23.0 | 23 | Elo-Rated SNIPS Governance |
| v3.22.0 | 22 | Temporal Knowledge Graph |
| v3.19.0 | 20 | Multimodal Hive Mind Operations |
| v3.18.3 | 19 | Bulk legacy import, NLI conflict detection, uncapped discovery |
| v3.18.1 | 19 | Scout synthesis, hunt mode |

🚀 Features

Core Memory Engine

  • Local-First: Zero cloud dependency — all data stays on your machine

  • Multimodal: Native support for Text, Image, Audio, Video, and Sensor data

  • 5-Signal Hybrid Retrieval: Dense vector · BM25 lexical · Graph traversal · Temporal relevance · Goal relevance

  • Explainable Recall Traces: Per-signal score attribution on every search result

  • Bi-Temporal Reasoning: Support for "Valid Time" vs "Transaction Time" via Temporal Knowledge Graph

  • Project Isolation: scope="project" memories never cross repo boundaries; scope="global" memories are always available

  • Cross-Session Continuity: Memories survive session ends, assistant switches, and tool restarts

  • Bi-Temporal Records: created_at (real-world event time) vs ingested_at (system intake time)
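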

Memory Lifecycle

  • Elo-Rated Governance: Dynamic retention driven by retrieval feedback (SNIPS) and usage statistics

  • Consolidation Daemon: Background process for decay, deduplication, promotion, and shadowing — inspired by sleep consolidation

  • Zero-Trust Ingestion: Isolated subprocess parsing for PDF/DOCX to neutralize document-based exploits

  • ColBERT Multi-Vector: Native Qdrant multi-vector storage for MaxSim scoring

  • NL Temporal Query Expansion: Natural-language time phrases ("last week", "before the refactor") parsed into structured time ranges

  • Goal Compass: Retrieval signal for project objectives and constraint drift

  • NLI Conflict Detection: Transformer-based contradiction detection (cross-encoder/nli-deberta-v3-small) for memory integrity

  • Bulk Legacy Import: One-click ingestion of all discovered legacy sources (batched, error-isolated) via dashboard or API
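The Elo-rated governance idea can be sketched with the standard Elo update, treating a useful retrieval as a "win" against a fixed baseline opponent. The K-factor and baseline rating here are assumptions, not Muninn's tuned values:

```python
# Sketch of an Elo-style retention update: a memory's rating rises when it is
# retrieved and proves useful, and sinks otherwise. K-factor and the fixed
# baseline opponent rating are assumptions, not Muninn's tuned values.
def elo_update(rating: float, retrieval_success: bool,
               k: float = 24.0, baseline: float = 1200.0) -> float:
    # Expected score against the baseline (standard Elo logistic curve).
    expected = 1.0 / (1.0 + 10 ** ((baseline - rating) / 400.0))
    score = 1.0 if retrieval_success else 0.0
    return rating + k * (score - expected)

# Two useful retrievals and one miss leave the memory above its starting
# rating, so usage-driven decay would deprioritize it more slowly.
rating = 1200.0
for outcome in (True, True, False):
    rating = elo_update(rating, outcome)
```

A decay pass can then rank memories by rating, shadowing or pruning the long tail of low-rated entries first.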

Operational Controls

  • MCP Transport Hardening: Framed + line JSON-RPC, timeout-window guardrails, protocol negotiation

  • Runtime Profile Control: get_model_profiles / set_model_profiles for dynamic model routing

  • Profile Audit Log: Immutable event ledger for profile policy mutations

  • Browser Control Center: Web UI for search, ingestion, consolidation, and admin at http://localhost:42069

  • OpenTelemetry: GenAI semantic convention tracing (feature-gated via MUNINN_OTEL_ENABLED)

Multi-Assistant Interop

  • Handoff Bundles: Export/import memory checkpoints with checksum verification and idempotent replay

  • Legacy Migration: Discover and import memories from prior assistant sessions (JSONL chat history, SQLite state) — uncapped provider limits

  • Bulk Import: POST /ingest/legacy/import-all ingests all discovered sources in batches of 50 with per-batch error isolation

  • Hive Mind Federation: Push-based low-latency memory synchronization across assistant runtimes

  • MCP 2025-11 Compliant: Full protocol negotiation, lifecycle gating, schema annotations
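The checksum verification and idempotent replay of handoff bundles can be sketched like this. The bundle layout (`memories`, `checksum`) and id-based dedup are illustrative assumptions, not Muninn's actual wire format:

```python
# Sketch of checksum verification plus idempotent replay for a handoff bundle.
# The bundle layout ("memories", "checksum") and id-based dedup are
# illustrative assumptions, not Muninn's actual wire format.
import hashlib
import json

def bundle_checksum(memories: list) -> str:
    # Canonical JSON so the same memories always hash identically.
    canonical = json.dumps(memories, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode()).hexdigest()

def import_bundle(bundle: dict, store: dict) -> int:
    """Verify integrity, then insert records keyed by id; replaying is a no-op."""
    if bundle_checksum(bundle["memories"]) != bundle["checksum"]:
        raise ValueError("checksum mismatch: refusing corrupted bundle")
    imported = 0
    for mem in bundle["memories"]:
        if mem["id"] not in store:  # idempotency: already-imported ids are skipped
            store[mem["id"]] = mem
            imported += 1
    return imported
```

Idempotent replay is what makes a handoff safe to retry after a partial import: re-running the same bundle changes nothing.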


Quick Start

git clone https://github.com/wjohns989/Muninn.git
cd Muninn
pip install -e .

Set the auth token (shared between server and MCP wrapper):

# Windows (persists across sessions)
setx MUNINN_AUTH_TOKEN "your-token-here"

# Linux/macOS
export MUNINN_AUTH_TOKEN="your-token-here"

Start the backend:

python server.py

Verify it's running:

curl http://localhost:42069/health
# {"status":"ok","memory_count":0,...,"backend":"muninn-native"}

Runtime Modes

| Mode | Command | Description |
|------|---------|-------------|
| Muninn MCP | `python mcp_wrapper.py` | stdio MCP server for active assistant/IDE sessions |
| Huginn Standalone | `python muninn_standalone.py` | Browser-first UX for direct ingestion/search/admin |
| REST API | `python server.py` | FastAPI backend at http://localhost:42069 |
| Packaged App | `python scripts/build_standalone.py` | PyInstaller executable (Huginn Control Center) |
All modes use the same memory engine and data directory.


MCP Client Configuration

Claude Code (recommended — bakes auth token into registration):

claude mcp add -s user muninn \
  -e MUNINN_AUTH_TOKEN="your-token-here" \
  -- python /absolute/path/to/mcp_wrapper.py

Generic MCP client (claude_desktop_config.json or equivalent):

{
  "mcpServers": {
    "muninn": {
      "command": "python",
      "args": ["/absolute/path/to/mcp_wrapper.py"],
      "env": {
        "MUNINN_AUTH_TOKEN": "your-token-here"
      }
    }
  }
}

Important: Both server.py and mcp_wrapper.py must share the same MUNINN_AUTH_TOKEN. If either process generates a random token (when the env var is unset), all MCP tool calls fail with 401.


MCP Tools

| Tool | Description |
|------|-------------|
| `add_memory` | Store a memory with optional `scope`, `project`, `namespace`, `media_type` |
| `search_memory` | Hybrid 5-signal search with `media_type` filtering and recall traces |
| `get_all_memories` | Paginated memory listing with filters |
| `update_memory` | Update content or metadata of an existing memory |
| `delete_memory` | Remove a memory by ID |
| `set_project_goal` | Set the current project's objective and constraints |
| `get_project_goal` | Retrieve the active project goal |
| `set_project_instruction` | Store a project-scoped rule (`scope="project"` by default) |
| `get_model_profiles` | Get active model routing profiles |
| `set_model_profiles` | Update model routing profiles |
| `get_model_profile_events` | Audit log for profile policy changes |
| `export_handoff` | Export a memory handoff bundle |
| `import_handoff` | Import a handoff bundle (idempotent) |
| `ingest_sources` | Ingest files/folders into memory |
| `discover_legacy_sources` | Find prior assistant session files for migration |
| `ingest_legacy_sources` | Import discovered legacy memories |
| `record_retrieval_feedback` | Submit outcome signal for adaptive calibration |

Python SDK

from muninn import Memory

# Sync client
client = Memory(base_url="http://127.0.0.1:42069", auth_token="your-token-here")
client.add(
    content="Always use typed Pydantic models for API payloads",
    metadata={"project": "muninn", "scope": "project"}
)

results = client.search("API payload patterns", limit=5)
for r in results:
    print(r.content, r.recall_trace)

Async client:

import asyncio

from muninn import AsyncMemory

async def main():
    async with AsyncMemory(base_url="http://127.0.0.1:42069", auth_token="your-token-here") as client:
        await client.add(content="...", metadata={})
        results = await client.search("...", limit=5)

asyncio.run(main())

REST API

| Method | Path | Description |
|--------|------|-------------|
| GET | `/health` | Server health + memory/vector/graph counts |
| POST | `/add` | Add a memory (supports `media_type`) |
| POST | `/search` | Hybrid search (supports `media_type` filtering) |
| GET | `/get_all` | Paginated memory listing |
| PUT | `/update` | Update a memory |
| DELETE | `/delete/{memory_id}` | Delete a memory |
| POST | `/ingest` | Ingest files/folders |
| POST | `/ingest/legacy/discover` | Discover legacy session files |
| POST | `/ingest/legacy/import` | Import selected legacy memories |
| POST | `/ingest/legacy/import-all` | Discover and import ALL legacy sources (batched) |
| GET | `/ingest/legacy/status` | Legacy discovery scheduler status |
| GET | `/ingest/legacy/catalog` | Paginated cached catalog of discovered sources |
| GET | `/profiles/model` | Get model routing profiles |
| POST | `/profiles/model` | Set model routing profiles |
| GET | `/profiles/model/events` | Profile audit log |
| GET | `/profile/user/get` | Get user profile |
| POST | `/profile/user/set` | Update user profile |
| POST | `/handoff/export` | Export handoff bundle |
| POST | `/handoff/import` | Import handoff bundle |
| POST | `/feedback/retrieval` | Submit retrieval feedback |
| GET | `/goal/get` | Get project goal |
| POST | `/goal/set` | Set project goal |
Auth: Authorization: Bearer <MUNINN_AUTH_TOKEN> required on all non-health endpoints.
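A hypothetical client-side sketch of calling these endpoints with the required bearer header, using only the standard library. The paths and auth header follow the table above; the payload field names mirror the Python SDK and are otherwise assumptions:

```python
# Hypothetical REST client sketch. Paths and the bearer-token header follow
# the endpoint table; payload field names mirror the Python SDK and are
# otherwise assumptions.
import json
import os
import urllib.request

BASE_URL = "http://localhost:42069"
TOKEN = os.environ.get("MUNINN_AUTH_TOKEN", "your-token-here")

def build_request(path: str, payload: dict) -> urllib.request.Request:
    """Authenticated JSON POST request for the Muninn backend."""
    return urllib.request.Request(
        BASE_URL + path,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {TOKEN}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

def post(path: str, payload: dict) -> dict:
    """Send the request and decode the JSON body (requires a running server)."""
    with urllib.request.urlopen(build_request(path, payload)) as resp:
        return json.load(resp)

# Example calls (with the server running):
# post("/add", {"content": "Prefer typed Pydantic models", "metadata": {"scope": "project"}})
# post("/search", {"query": "Pydantic", "limit": 5})
```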


Configuration

Key environment variables:

| Variable | Default | Description |
|----------|---------|-------------|
| `MUNINN_AUTH_TOKEN` | random | Shared secret between server and MCP wrapper |
| `MUNINN_SERVER_URL` | `http://localhost:42069` | Backend URL for MCP wrapper |
| `MUNINN_PROJECT_SCOPE_STRICT` | off | `=1` disables cross-project fallback entirely |
| `MUNINN_MCP_SEARCH_PROJECT_FALLBACK` | off | `=1` enables global-scope fallback on empty results |
| `MUNINN_OPERATOR_MODEL_PROFILE` | `balanced` | Default model routing profile |
| `MUNINN_OTEL_ENABLED` | off | `=1` enables OpenTelemetry tracing |
| `MUNINN_OTEL_ENDPOINT` | `http://localhost:4318` | OTLP HTTP endpoint for trace export |
| `MUNINN_CHAINS_ENABLED` | off | `=1` enables graph memory chain detection (PRECEDES/CAUSES edges) |
| `MUNINN_COLBERT_MULTIVEC` | off | `=1` enables native ColBERT multi-vector storage |
| `MUNINN_FEDERATION_ENABLED` | off | `=1` enables P2P memory synchronization |
| `MUNINN_FEDERATION_PEERS` | - | Comma-separated list of peer base URLs |
| `MUNINN_FEDERATION_SYNC_ON_ADD` | off | `=1` enables real-time push-on-add to peers |
| `MUNINN_TEMPORAL_QUERY_EXPANSION` | off | `=1` enables NL time-phrase parsing in search |

Evaluation & Quality Gates

Muninn includes an evaluation toolchain for measurable quality enforcement:

# Run full benchmark dev-cycle
python -m eval.ollama_local_benchmark dev-cycle

# Check phase hygiene gates
python -m eval.phase_hygiene

# Emit SOTA+ signed verdict artifact
python -m eval.ollama_local_benchmark sota-verdict \
  --longmemeval-report path/to/lme_report.json \
  --min-longmemeval-ndcg 0.60 \
  --min-longmemeval-recall 0.65 \
  --signing-key "$SOTA_SIGNING_KEY"

# Run LongMemEval adapter selftest (no server needed)
python eval/longmemeval_adapter.py --selftest

# Run StructMemEval adapter selftest (no server needed)
python eval/structmemeval_adapter.py --selftest

# Run StructMemEval against a live server
python eval/structmemeval_adapter.py \
  --dataset path/to/structmemeval.jsonl \
  --server-url http://localhost:42069 \
  --auth-token "$MUNINN_AUTH_TOKEN"

Metrics tracked: nDCG@k, Recall@k, MRR@k, Exact Match, token-F1, p50/p95 latency, significance testing (Bonferroni/BH correction), effect-size analysis.

The sota-verdict command emits a signed JSON artifact with commit_sha, SHA256 file hashes, and HMAC-SHA256 promotion_signature — enabling auditable, commit-pinned SOTA+ evidence.
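The promotion_signature scheme can be illustrated with a generic HMAC-SHA256 sign/verify pair. The canonical-JSON payload shape below is an assumption; the field names (commit_sha, promotion_signature) follow the text:

```python
# Generic HMAC-SHA256 sign/verify pair illustrating how a promotion_signature
# could gate verdict artifacts. The canonical-JSON payload shape is an
# assumption; field names (commit_sha, promotion_signature) follow the text.
import hashlib
import hmac
import json

def sign_verdict(verdict: dict, key: bytes) -> dict:
    payload = json.dumps(verdict, sort_keys=True, separators=(",", ":")).encode()
    signed = dict(verdict)
    signed["promotion_signature"] = hmac.new(key, payload, hashlib.sha256).hexdigest()
    return signed

def verify_verdict(signed: dict, key: bytes) -> bool:
    body = {k: v for k, v in signed.items() if k != "promotion_signature"}
    payload = json.dumps(body, sort_keys=True, separators=(",", ":")).encode()
    expected = hmac.new(key, payload, hashlib.sha256).hexdigest()
    # Constant-time comparison guards against timing attacks on the signature.
    return hmac.compare_digest(expected, signed["promotion_signature"])
```

Because the signature covers the canonical payload, editing any field of a signed verdict (metrics, hashes, commit_sha) invalidates it.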


Data & Security

  • Default data dir: ~/.local/share/AntigravityLabs/muninn/ (Linux/macOS) · %LOCALAPPDATA%\AntigravityLabs\muninn\ (Windows)

  • Storage: SQLite (metadata) + Qdrant (vectors) + KuzuDB (memory chains graph)

  • No cloud dependency: All data local by default

  • Auth: Bearer token required on all API calls; token shared via env var

  • Namespace isolation: user_id + namespace + project boundaries enforced at every retrieval layer


Documentation Index

| Document | Description |
|----------|-------------|
| SOTA_PLUS_PLAN.md | Active development phases and roadmap |
| HANDOFF.md | Operational setup, auth flow, known issues |
| docs/ARCHITECTURE.md | System architecture deep-dive |
| docs/MUNINN_COMPREHENSIVE_ROADMAP.md | Full feature roadmap (v3.1→v3.3+) |
| docs/AGENT_CONTINUATION_RUNBOOK.md | How to resume development across sessions |
| docs/PYTHON_SDK.md | Python SDK reference |
| docs/INGESTION_PIPELINE.md | Ingestion pipeline internals |
| docs/OTEL_GENAI_OBSERVABILITY.md | OpenTelemetry integration guide |
| docs/PLAN_GAP_EVALUATION.md | Gap analysis against SOTA memory systems |

Licensing

  • Code: Apache License 2.0 (LICENSE)

  • Third-party dependency licenses remain with their respective owners

  • Attribution: See NOTICE
