Which integrations are available for this server?

Provides optional LLM-based memory extraction using Google Gemini models (requires gemini extra).

How do I use mem0-local-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@mem0-local-mcp Remember that I like hiking." That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

mem0-local-mcp

by sanchezvivi

Overview Schema Related Servers Score Discussions

Python

Local

mem0-local-mcp

A fully local, self-hosted mem0 memory server for MCP clients (Claude Code, Claude Desktop, Cursor, …).

Local embeddings via fastembed (ONNX, no PyTorch, no GPU) — BAAI/bge-small-en-v1.5, 384-dim. Can be used with different LLM via API key.
Local vector store via Qdrant on localhost.
Switchable extraction LLM — Anthropic Claude (default) or Google Gemini, chosen by one env var.
No API key needed for normal use. Reads and verbatim writes (infer=False) run entirely on your machine. A provider key is used only when you opt into LLM fact-extraction (infer=True).

Tools

Tool	Purpose
`add_memory(content, user_id, agent_id?, run_id?, infer=False)`	Store a memory. `infer=False` stores verbatim (local, no LLM); `infer=True` extracts/dedupes via the LLM.
`search_memories(query, user_id, agent_id?, run_id?, top_k=5)`	Semantic search within a scope.
`list_memories(user_id, agent_id?, run_id?)`	List a scope without ranking.
`delete_memory(memory_id)`	Delete a single memory.

Memories are scoped by any of user_id / agent_id / run_id.

Related MCP server: mcp-recall

Instructions for your agent

Copy the block below and paste it to your coding agent (Claude Code, Cursor, …) — it will install and verify everything for you:

Set up the mem0-local-mcp memory server on this machine for me:

1. Ensure a local Qdrant is running on port 6333: run `curl -s http://localhost:6333/`.
   If it is not running, start it with Docker:
   docker run -d --name qdrant --restart always -p 6333:6333 -v ~/.qdrant_storage:/qdrant/storage qdrant/qdrant
   If Docker is unavailable, download the Qdrant binary for my OS from
   https://github.com/qdrant/qdrant/releases and run it.
2. Ensure `uv` is installed (`which uv`); if not, install it from https://docs.astral.sh/uv/.
3. Register the MCP server at user scope:
   claude mcp add mem0 --scope user -- uvx --from git+https://github.com/sanchezvivi/mem0-local-mcp.git mem0-local-mcp
4. Verify with `claude mcp list` that `mem0` shows Connected. The first run downloads the
   embedding model (~130 MB), so allow up to a minute.
5. Reads and infer=False writes need no API key. Only if I want infer=True (LLM) extraction,
   confirm ANTHROPIC_API_KEY (or GOOGLE_API_KEY with MEM0_LLM_PROVIDER=gemini) is set in my
   environment before launching the client. Then tell me it is ready.

Prerequisites

1. A running Qdrant (pick one):

# Docker
docker run -d --name qdrant -p 6333:6333 -v ~/.qdrant_storage:/qdrant/storage qdrant/qdrant

# or the binary (macOS/Linux release from https://github.com/qdrant/qdrant/releases)
./qdrant

2. uv to run the server (uvx), or pip install mem0-local-mcp.

The embedding model (~130 MB) downloads automatically on first use.

Install in Claude Code

claude mcp add mem0 --scope user -- \
  uvx --from git+https://github.com/sanchezvivi/mem0-local-mcp.git mem0-local-mcp

For Gemini extraction, install the extra and select the provider:

claude mcp add mem0 --scope user \
  --env MEM0_LLM_PROVIDER=gemini \
  -- uvx --from "git+https://github.com/sanchezvivi/mem0-local-mcp.git#egg=mem0-local-mcp[gemini]" mem0-local-mcp

Other MCP clients

Any stdio MCP client can launch it. Example config entry:

{
  "mcpServers": {
    "mem0": {
      "command": "uvx",
      "args": ["--from", "git+https://github.com/sanchezvivi/mem0-local-mcp.git", "mem0-local-mcp"]
    }
  }
}

Configuration

All optional, with defaults:

Variable	Default	Notes
`MEM0_USER_ID`	`default_user`	Default scope when a tool call omits `user_id`.
`MEM0_LLM_PROVIDER`	`anthropic`	`anthropic` or `gemini`. Only used for `infer=True`.
`MEM0_ANTHROPIC_MODEL`	`claude-haiku-4-5`
`MEM0_GEMINI_MODEL`	`gemini-2.5-flash`	Requires the `gemini` extra.
`MEM0_EMBED_MODEL`	`BAAI/bge-small-en-v1.5`	Any fastembed model.
`MEM0_EMBED_DIMS`	`384`	Must match the embed model.
`MEM0_QDRANT_HOST`	`localhost`
`MEM0_QDRANT_PORT`	`6333`
`MEM0_COLLECTION`	`mem0`
`ANTHROPIC_API_KEY`	—	Only needed for `infer=True` with Anthropic.
`GOOGLE_API_KEY`	—	Only needed for `infer=True` with Gemini.

Keys are read from the process environment; nothing is written to disk by this server.

Local development

git clone https://github.com/sanchezvivi/mem0-local-mcp.git
cd mem0-local-mcp
uv venv && uv pip install -e ".[gemini]"
.venv/bin/mem0-local-mcp        # stdio server

License

MIT — see LICENSE.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/sanchezvivi/mem0-local-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server