Skip to main content
Glama
jumpino27

RLM Memory MCP Server

by jumpino27

RLM Memory MCP Server

Persistent memory + semantic file discovery for ANY AI coding agent.

Works with Claude Code, OpenAI Codex, Gemini CLI, Cursor, Windsurf β€” anything that speaks MCP.

What is this?

AI agents forget everything between sessions. This MCP server fixes that:

  • 🧠 Memory β€” after every task, the agent records what it changed and why. Next session, it remembers.

  • πŸ—ΊοΈ File map β€” a semantic index of your codebase ("this file is the login form", "this is the checkout API"), so the agent finds the right files without grepping the whole repo.

  • πŸ”„ Bi-directional β€” the agent asks the MCP ("user wants to fix the submit button β€” which files?") and the MCP answers with files, history, and suggestions.

The core idea (Recursive Large Model): the agent stays intentionally "blind" to the filesystem and uses the MCP as its eyes and memory β€” making it faster, cheaper, and more focused.

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  AI Agent (Claude Code, Codex, Gemini CLI, Cursor...)  β”‚
β”‚                                                        β”‚
β”‚  "User wants to fix login" ──► rlm_query               β”‚
β”‚                            ◄── relevant files+history  β”‚
β”‚  [does the work]                                       β”‚
β”‚  "Here's what I changed"   ──► rlm_smart_memory        β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                          β”‚ stdio (MCP)
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  RLM Memory MCP Server                                 β”‚
β”‚  β€’ JSON storage per project (projects/<name>/.rlm/)    β”‚
β”‚  β€’ AI-powered matching via OpenRouter or Gemini        β”‚
β”‚  β€’ Web UI for you at http://localhost:3848             β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Related MCP server: Heimdall MCP Server

Setup in 3 Steps

1. Install & build

git clone https://github.com/jumpino27/RLM-Memory-MCP-Server.git
cd RLM-Memory-MCP-Server
npm install
npm run build

2. Add an API key

Copy .env.example to .env and set one key:

# Option A (recommended): OpenRouter β€” one key, any model
# https://openrouter.ai/keys
OPENROUTER_API_KEY="sk-or-..."

# Option B: Google Gemini direct β€” https://aistudio.google.com/
# GEMINI_API_KEY="..."
  • With OpenRouter the server uses google/gemini-3.5-flash by default β€” fast, cheap, near-Pro quality.

  • With Gemini direct it uses gemini-3.5-flash.

  • No key at all? Everything still works using keyword matching (just less smart).

Want a different model? Set LLM_MODEL (e.g. anthropic/claude-haiku-4.5 or openai/gpt-4o-mini on OpenRouter). See .env.example for all options.

3. Connect your AI agent

Replace C:\\path\\to with where you cloned the repo.

Claude Code (one command):

claude mcp add rlm-memory -- node C:\\path\\to\\RLM-Memory-MCP-Server\\dist\\index.js

OpenAI Codex CLI β€” add to ~/.codex/config.toml:

[mcp_servers.rlm-memory]
command = "node"
args = ["C:\\path\\to\\RLM-Memory-MCP-Server\\dist\\index.js"]

Gemini CLI β€” add to ~/.gemini/settings.json:

{
  "mcpServers": {
    "rlm-memory": {
      "command": "node",
      "args": ["C:\\path\\to\\RLM-Memory-MCP-Server\\dist\\index.js"]
    }
  }
}

Any other MCP client: launch node dist/index.js over stdio.

πŸ’‘ Tell your agent how to use it: copy the rules from example_agents.md into your agent's instructions file (CLAUDE.md, AGENTS.md, .cursorrules, …).


The Workflow

First time on a project β€” index it once:

rlm_init β†’ rlm_index_codebase β†’ rlm_verify_index

Every task after that β€” three steps:

1. rlm_query          "User wants X β€” which files?"   β†’ files + history + tips
2. (agent does the actual work)
3. rlm_smart_memory   "Here's what I changed"         β†’ remembered forever

That's it. The more the agent works, the smarter the memory gets.


The Tools

Daily drivers

Tool

What it does

rlm_query

⭐ Start every task here. Ask about the user's request β†’ get relevant files, past memories, and suggestions

rlm_smart_memory

⭐ End every task here. Record changes with rich metadata (component types, feature areas, edit history)

Project setup

Tool

What it does

rlm_init

Register a project for memory tracking

rlm_index_codebase

Scan a codebase and build the semantic file map

rlm_verify_index

Post-index check: "Is this everything?" β€” shows breakdown + gaps

Maintenance & extras

Tool

What it does

rlm_manage_sitemap

Keep the file map in sync when files are deleted/moved/renamed

rlm_status

Project statistics

rlm_list_projects

All tracked projects

rlm_recall_memory

Simple keyword memory search (legacy β€” prefer rlm_query)

rlm_find_files_by_intent

Semantic file search (legacy β€” prefer rlm_query)

rlm_create_memory

Basic memory creation (legacy β€” prefer rlm_smart_memory)

Example: rlm_query

{
  "project_name": "my-app",
  "user_request": "The user wants to fix the submit button color on the login form"
}

Returns:

{
  "relevant_files": [
    { "path": "src/components/LoginForm.tsx", "description": "Login form with submit button",
      "component_type": "form", "feature_area": "auth",
      "recent_changes": ["Added hover state to submit button"] }
  ],
  "relevant_memories": [
    { "summary": "Changed submit button to theme primary color", "date": "..." }
  ],
  "ai_analysis": "The submit button lives in LoginForm.tsx and uses theme.ts colors...",
  "suggestions": ["Check theme.ts for the color tokens"]
}

Example: rlm_smart_memory

{
  "project_name": "my-app",
  "user_prompt": "Fix the submit button color",
  "changes_context": "Changed the submit button in LoginForm to use the primary theme color instead of hardcoded blue. Added hover state.",
  "files_modified": [
    { "path": "src/components/LoginForm.tsx", "change_type": "modified",
      "change_summary": "Button color now uses theme.primary, added hover state" }
  ],
  "affected_areas": ["auth", "ui"]
}

The Web UI (for you, the human)

npm start   # β†’ http://localhost:3848
  • Browse all projects, memories, and the semantic file map

  • Test every MCP tool from the browser

  • Delete stale memories / file entries

  • See live AI provider status (e.g. openrouter Β· google/gemini-3.5-flash)


Configuration Reference

All settings live in .env (see .env.example):

Variable

Default

Description

OPENROUTER_API_KEY

β€”

OpenRouter key (recommended) β€” get one

GEMINI_API_KEY

β€”

Google Gemini key β€” get one

LLM_PROVIDER

auto

auto / openrouter / gemini. Auto prefers OpenRouter

LLM_MODEL

google/gemini-3.5-flash (OpenRouter) / gemini-3.5-flash (direct)

Any model your provider offers

LLM_REASONING_EFFORT

low

minimal / low / medium / high β€” thinking depth for helper calls

LLM_MAX_TOKENS

4096

Max output tokens per call

LLM_TIMEOUT_MS

60000

Per-request timeout

UI_PORT

3848

Web UI port

RLM_DATA_DIR

<install dir>/projects

Where project memories are stored (set it to keep data outside the install tree)

Scripts: npm start (web UI) Β· npm run mcp (MCP server directly) Β· npm run build Β· npm test (end-to-end smoke test) Β· npm run dev (UI with auto-reload) Β· npm run typecheck


How data is stored

Everything is plain JSON β€” no database needed:

RLM-Memory-MCP-Server/
└── projects/
    └── my-app/.rlm/
        β”œβ”€β”€ config.json       # project info
        β”œβ”€β”€ memory_log.json   # every recorded task
        └── file_map.json     # the semantic file index

Back up by copying projects/. Inspect with any text editor or the web UI.


FAQ

Does this work without an API key? Yes β€” all tools fall back to weighted keyword matching. AI matching is just smarter.

Why store data centrally instead of in each repo? One place to back up, browsable across projects in the UI, no .rlm clutter in your repos, survives repo deletion.

Which agent works best? Any MCP-capable agent. The tool descriptions teach the agent how to use them, and example_agents.md has drop-in instructions.

rlm_query vs rlm_recall_memory? rlm_query searches files + memories + edit history and adds AI analysis. rlm_recall_memory only searches memories by keyword. Use rlm_query.

How much does the AI cost? Helper calls are small and run at low reasoning effort. With google/gemini-3.5-flash ($1.50/M input, $9/M output) typical queries cost fractions of a cent.


Project structure (for contributors)

src/
β”œβ”€β”€ index.ts            # MCP server entry (stdio) β€” registers all tools
β”œβ”€β”€ ui/server.ts        # Web UI (Express) at localhost:3848
β”œβ”€β”€ services/
β”‚   β”œβ”€β”€ llm.ts          # Multi-provider AI layer (OpenRouter / Gemini + fallbacks)
β”‚   └── database.ts     # JSON file storage
β”œβ”€β”€ tools/              # MCP tool implementations
β”œβ”€β”€ schemas/index.ts    # Zod input validation
β”œβ”€β”€ types.ts            # Shared types
└── constants.ts        # Paths, models, limits

License

MIT

F
license - not found
-
quality - not tested
C
maintenance

Maintenance

–Maintainers
–Response time
–Release cycle
–Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/jumpino27/RLM-Memory-MCP-Server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server