second-brain-mcp
Allows fetching full papers from arXiv: auto-upgrades abstract URLs to full HTML and converts them to Markdown, including all figures.
Provides synchronization of the pure Markdown vault with Google Drive for storage and backup.
Provides synchronization of the pure Markdown vault with iCloud for storage and backup.
Enables fetching and converting PubMed papers into the knowledge database as Markdown notes.
second-brain MCP Server
A self-maintaining personal knowledge database — powered by MCP, DuckDB, and biological memory models.
For anyone who saves more papers, notes, and figures than they could ever re-read. second-brain turns everything you capture into a database that maintains itself — auto-linking related notes, compressing what you stop reading, and keeping every figure searchable by its content. What you saved a year ago is still one query away, at a fraction of the token cost.
Why Does This Exist?
Problem | Solution |
📄 You save dozens of papers but can never find the right figure |
|
📑 arXiv gives you the abstract; you need the full paper | Auto-upgrades |
🗂 Notes pile up; older ones never get cleaned up | Vault Sleep: low-access notes compress automatically every Sunday while you sleep (60–90% token reduction) |
🔗 New notes stay isolated; you forget what's connected | Auto-wikilinks: every saved note is automatically linked to semantically related notes already in your vault |
🔎 Semantic search needs a cloud API or Docker stack | Self-hosted |
🔒 Every AI memory tool locks you into their format | Pure Markdown vault — sync with Google Drive, iCloud, or git; switch agents anytime |
🖼 Figure context is lost when you read a paper | Every figure is downloaded, OCR'd by Claude Vision, and stored in DuckDB — searchable by gene name, p-value, axis label |
The One-Command Demo
save_article("https://arxiv.org/abs/2405.01234")
↓
• /abs/ auto-upgraded to /html/ — full paper, not just abstract
• Full text converted to Markdown
• All figures downloaded + OCR'd by Claude Vision
• Semantic embeddings computed
• Auto-linked to related notes already in your vault ← auto-wikilinks
• Stored in 30-resources/ — queryable immediately
search_figures("UMAP cluster batch correction")
↓
• Returns the exact figure from the exact paper
• Works across your entire saved literature libraryWhat Makes It Different
flowchart LR
subgraph input["📥 Any Content Source"]
A1["arXiv / PubMed paper"]
A2["Web article / blog"]
A3["Local PDF / DOCX"]
A4["Personal note"]
end
subgraph core["⚙️ second-brain-mcp"]
B1["Markdown note<br/>30-resources/"]
B2["Figure OCR<br/>+ VLM description"]
B3["Semantic embedding<br/>+ auto-wikilinks"]
B4["Ebbinghaus score<br/>ranking"]
B5["PNG snapshots<br/>60–90% token reduction"]
end
subgraph query["🔍 Queryable Knowledge"]
C1["search_figures<br/>'UMAP melanocyte'"]
C2["search_notes<br/>'batch correction scRNA'"]
C3["get_context<br/>top-20 relevant notes"]
end
input --> core
B1 --> B2
B1 --> B3
B3 --> B4
B4 --> B5
B2 --> C1
B3 --> C2
B4 --> C3Eight things most self-hosted memory tools can't do — combined in one:
Most memory tools… | second-brain |
Save a link or PDF, then leave you to read and tag it | 🔬 One command builds the database — |
Store the arXiv abstract you pasted | 📑 Full text, not abstracts — |
Leave new notes isolated until you tag them | 🔗 The knowledge graph builds itself — every note is auto-linked to semantically related notes already in your vault |
Cost the same whether a note is read daily or never | 🧠 Memory that forgets like a brain — Ebbinghaus score ranks by recency × frequency; stale notes compress while you sleep |
Search documents, not what's inside the figures | 🖼 Figure-level search across your whole library — |
Forget your project decisions between sessions | 📋 The AI learns your rules — hot notes auto-extract constraints into |
Grow more expensive as the vault grows | 📉 Token cost shrinks with age — PNG snapshots replace old text at 60–90% compression; frequently-read papers stay full-fidelity |
Lock you into their database format | 🔓 Zero lock-in — pure Markdown, any MCP agent, sync via any cloud drive or git |
Cross-Session Continuity — Pick Up Where You Left Off
Every project you work on can be resumed in a new session with full context — no re-explaining, no lost progress.
flowchart LR
A["🟢 Session Start<br/>get_context()"] --> B["AI receives:<br/>• goals.md — current priorities<br/>• Top-20 recent notes<br/>• Extracted rules"]
B --> C["Work on project<br/>new_note / search / read"]
C --> D["🔴 Before ending session<br/>update_goals(...)"]
D --> E["New session<br/>get_context() again"]
E --> BHow It Works in Practice
End of session — tell the agent to save state:
Update goals: currently working on the scRNA batch correction pipeline.
Completed: harmony integration. Blocked on: choosing n_components for PCA.
Next session: start from the PCA parameter sweep in 20-areas/research/harmony-notes.mdThe agent calls update_goals() and optionally new_note("project", ...) for detailed progress.
Start of next session — just say:
Get context and continue where we left off.The agent calls get_context() and immediately sees:
goals.mdwith the state you savedThe harmony-notes.md surfaced at the top (recently accessed, high Ebbinghaus score)
Rules auto-extracted from that note, e.g.:
RULE: use n_components=30 for this dataset — tested 20/30/50, 30 minimises batch effect without losing resolution
RULE: exclude sample CRC_04 — library size outlier confirmed by QCThese rules live in memory/rules.md and are injected at every get_context() call — the AI carries your hard-won decisions forward automatically, without you having to repeat them.
What Gets Persisted
What | Where | Always in context? |
Current priorities / blocked items |
| ✅ every session |
Project progress notes |
| ✅ if recently accessed |
Decisions and rationale |
| via |
Extracted rules from notes |
| ✅ every session |
Saved papers and figures |
| via |
This works across any project — bioinformatics analysis, coding, writing, research. Save state with one sentence at the end of a session; resume instantly at the start of the next.
Example Queries
# Resume a project from last session
get_context() # → goals + recent notes + rules loaded automatically
# Find a specific figure panel across all saved papers
search_figures("p < 0.001 UMAP cluster")
# Semantic search across all notes
search_notes("single cell integration batch correction")
# Decision records for a specific project
get_decisions("MyProject")Memory Architecture — Biological Analogy
Biological Brain | This System |
Hippocampal consolidation during sleep | Vault Sleep: weekly LLM-compression of old low-access notes |
Ebbinghaus forgetting curve | Score-based ranking: |
Visual long-term memory | PNG snapshots — resolution degrades gracefully with age |
Associative recall | Semantic search + auto-generated |
Sleep-dependent consolidation | launchd cron, runs Sunday 02:00 while you sleep |
Token Efficiency
Memory that gets cheaper over time — unlike flat-file systems where old notes cost the same forever.
Note age → fresh (0–3 mo) 3–6 months 6–12 months 1 year+
────────────── ────────── ─────────── ───────
token cost: ██████████████ ██████ ████ ██
~1,000 tokens ~400 tokens ~256 tokens ~100 tokens
▼ 60% ▼ 74% ▼ 90%Tier assigned by score × age (adaptive). Frequently-accessed notes stay full-text regardless of age.
Search Performance
Measured on Apple Silicon MacBook (20-rep average, BM25-only mode).
Vault BM25-only p50 Hybrid BM25+semantic p50
────── ───────────────── ────────────────────────
10 n ████░░░░░ 21 ms ████████████ 37 ms
50 n ██████░░░ 25 ms █████████████ 39 ms
100 n ███████░░ 27 ms ██████████████ 45 msVault Size | BM25 p50 | Hybrid p50 | Recall@1 | Recall@5 | MRR |
10 notes | 21 ms | 37 ms | 30% | 60% | 0.42 |
50 notes | 25 ms | 39 ms | 70% | 90% | 0.78 |
100 notes | 27 ms | 45 ms | 70% | 80% | 0.73 |
Hybrid mode adds ~18 ms for embedding lookup. Both modes scale sub-linearly with vault size.
Recall figures at this scale (10–100 notes) carry high sample variance — a single ambiguous query shifts Recall@1 by 10%. Treat them as directional, not as benchmarks against large corpora; the takeaway is that hybrid consistently beats BM25-only on relevance for a fixed query set.
System Architecture
┌─────────────────────────────────────────────────────┐
│ AI Agent Layer │
│ Claude Code · Gemini CLI · Any MCP │
└──────────────────────┬──────────────────────────────┘
│ MCP Protocol (27 tools)
┌──────────────────────▼──────────────────────────────┐
│ Layer 2 — MCP Server │
│ server.py │
│ get_context · search_notes · save_article · … (27) │
└──────┬───────────────┬────────────────┬─────────────┘
│ │ │
┌──────▼──────┐ ┌──────▼──────┐ ┌──────▼──────┐
│ vault_sleep│ │ vault_db │ │ figures │
│ compress │ │ DuckDB FTS │ │ PNG snap │
│ Phase 3–9 │ │ + semantic │ │ OCR · VLM │
└──────┬──────┘ └──────┬──────┘ └─────────────┘
│ │
┌──────▼───────────────▼──────────────────────────────┐
│ Layer 0 — Markdown Vault │
│ 00-inbox · 10-projects · 20-areas · 30-resources │
│ 40-archive · decisions · memory · templates │
│ (syncs via Google Drive / iCloud / git) │
└─────────────────────────────────────────────────────┘Vault Sleep — Auto-compression Flow
Every Sunday 02:00 (launchd, no interaction needed)
│
▼
sync_index + embeddings
│
▼ age > 90d AND Ebbinghaus score ≤ 0.5
┌──────────────────────────────────────┐
│ Adaptive Tier Selection │
│ score > 1.5 → text (keep full) │ ← frequently-read: never compressed
│ score > 0.8 → large ~400 tokens │
│ score > 0.3 → base ~256 tokens │
│ otherwise → small ~100 tokens │
└────────────────┬─────────────────────┘
│
Gemini CLI → Claude CLI → naive (auto-fallback, no LLM required)
│
compressed → vault / original → 40-archive/ / snapshot → .pngMCP Tools (27 total)
Tool | Description |
| Session start: goals + top-20 Ebbinghaus-ranked notes + auto-rules |
| Fetch URL/PDF → Markdown + auto-extract figures |
| Hybrid BM25 + semantic search across all notes |
| Search figure OCR text / VLM descriptions |
| Search cnews morning briefs and financial news by keyword |
| Manually trigger figure extraction for a saved article |
| Read note + record access (updates Ebbinghaus score) |
| Return PNG snapshot for token-efficient reading |
| Create note with correct template and folder by type |
| List ADR decision records, optionally filtered by project |
| Update |
| Rebuild DuckDB index from vault files |
| Show note counts by type |
| Compress old low-activity notes (dry_run=True by default) |
| Show compression candidates without acting |
| Render note to PNG at chosen resolution tier |
| Extract L3 rules from frequently-accessed notes |
| Merge semantically similar notes into one abstract note |
| Refresh auto-generated |
| Delete archived originals that have a PNG snapshot |
| Find semantically related notes by cosine similarity (finance & knowledge management) |
| Hybrid search returning knowledge notes + cnyes morning briefs in one call |
| Rank notes by Ebbinghaus score or recency — find your most-engaged knowledge nodes |
| Overwrite an existing note with new content (auto-reindexes) |
| Append content to end of an existing note — safe, never loses existing text |
| Create or repair vault directory structure and templates |
| Return full AGENTS.md operating manual — call at remote session start to learn vault SOP |
Test Results
tests/test_figures.py 19 passed (OCR, snapshots, VLM)
tests/test_server.py 13 passed (MCP tools, path safety)
tests/test_vault_db.py 39 passed (FTS, semantic search, embeddings)
tests/test_vault_sleep.py 44 passed (compression, consolidation, rules, prune)
────────────────────────────────────────
115 passed in 3.37sInstallation
Prerequisites
Dependency | Required | Notes |
Python 3.11+ | ✅ | |
✅ | PNG snapshot rendering | |
Dev only | Only needed for | |
Optional | Enables semantic search; BM25 fallback if absent | |
Optional | ~300 MB — needed for llama-server path only | |
| Optional | Better compression quality in vault_sleep; naive fallback if absent |
Vault structure is auto-created on first server start — no manual
mkdirneeded.
macOS / Linux — Quick Start
Step 1 — Install
pip install mcp-second-brain
playwright install chromiumStep 2 — Register with your AI agent
Claude Code (CLI) — global, works in any project:
claude mcp add --scope user second-brain \
--env SECOND_BRAIN_PATH=~/second-brain \
-- python -m mcp_second_brainClaude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"second-brain": {
"command": "python",
"args": ["-m", "mcp_second_brain"],
"env": { "SECOND_BRAIN_PATH": "/Users/yourname/second-brain" }
}
}
}Step 3 — First run
Start your agent and say:
init_vaultThe server auto-creates all directories and templates on startup. Call init_vault explicitly to verify or repair the structure.
Windows — Quick Start
Step 1 — Install Python and the package
# Install Python 3.11+ from python.org, or via winget:
winget install Python.Python.3.11
pip install mcp-second-brain
playwright install chromiumStep 2 — Choose a vault location
# Local folder:
$vault = "C:\Users\$env:USERNAME\second-brain"
# Or a cloud-synced folder (Google Drive, OneDrive, etc.):
$vault = "G:\My Drive\second-brain"The vault directories and templates are created automatically when the server first starts. No manual setup needed.
Step 3 — Register with your AI agent
Claude Code (VSCode extension) — create .mcp.json in your vault folder:
{
"mcpServers": {
"second-brain": {
"command": "python",
"args": ["-m", "mcp_second_brain"],
"env": { "SECOND_BRAIN_PATH": "C:\\Users\\YourName\\second-brain" }
}
}
}Claude Desktop — add to %APPDATA%\Claude\claude_desktop_config.json:
{
"mcpServers": {
"second-brain": {
"command": "python",
"args": ["-m", "mcp_second_brain"],
"env": { "SECOND_BRAIN_PATH": "C:\\Users\\YourName\\second-brain" }
}
}
}Gemini CLI / other IDEs — edit %USERPROFILE%\.gemini\mcp_config.json (or IDE-specific config):
{
"mcpServers": {
"second-brain": {
"command": "C:\\Users\\YourName\\.venvs\\mcp-second-brain\\Scripts\\python.exe",
"args": ["-m", "mcp_second_brain"],
"env": { "SECOND_BRAIN_PATH": "C:\\Users\\YourName\\second-brain" }
}
}
}Tip: Use a dedicated venv at
C:\Users\YourName\.venvs\mcp-second-brain\(local SSD) rather than a venv on a network drive. Python loads thousands of small files at startup — on a cloud drive this causes 15–30 s delays andcontext deadline exceedederrors.
Step 4 — Semantic search (optional, Windows)
Ollama is the easiest path on Windows:
winget install Ollama.Ollama
ollama pull nomic-embed-textThen add to the env block of your MCP config:
"EMBED_URL": "http://localhost:11434/v1/embeddings",
"EMBED_PORT": "11434"Ollama starts automatically with Windows. No extra configuration needed.
Alternatively, build llama.cpp for Windows and register it as a scheduled task:
# Register llama-server as a login-triggered scheduled task:
$exe = "C:\Users\$env:USERNAME\llama.cpp\build\bin\llama-server.exe"
$model = "C:\Users\$env:USERNAME\nomic-embed-text-v1.5.Q8_0.gguf"
$args = "-m `"$model`" --port 11435 --embedding --pooling mean -np 4 -c 2048 --log-disable"
$action = New-ScheduledTaskAction -Execute $exe -Argument $args
$trigger = New-ScheduledTaskTrigger -AtLogOn
$settings = New-ScheduledTaskSettingsSet -RestartCount 5 -RestartInterval (New-TimeSpan -Minutes 1)
Register-ScheduledTask -TaskName "llama-embed" -Action $action -Trigger $trigger `
-Settings $settings -RunLevel Highest -ForceStep 5 — Weekly vault maintenance (optional, Windows)
# Run vault_sleep every Sunday at 02:00
$pythonExe = "C:\Users\$env:USERNAME\.venvs\mcp-second-brain\Scripts\python.exe"
$serverPy = "C:\path\to\second-brain-mcp\run_sleep.py"
$vaultPath = "C:\Users\$env:USERNAME\second-brain"
$action = New-ScheduledTaskAction -Execute $pythonExe -Argument "`"$serverPy`"" `
-WorkingDirectory (Split-Path $serverPy)
$trigger = New-ScheduledTaskTrigger -Weekly -DaysOfWeek Sunday -At 2am
$env_var = [System.Environment]::SetEnvironmentVariable(
"SECOND_BRAIN_PATH", $vaultPath, "Machine")
Register-ScheduledTask -TaskName "vault-sleep" -Action $action -Trigger $trigger -ForceDevelopment Install (clone)
git clone https://github.com/ddmanyes/second-brain-mcp
cd second-brain-mcp
uv sync
uv run playwright install chromiumRegister with Claude Code:
# macOS / Linux
claude mcp add --scope user second-brain \
--env SECOND_BRAIN_PATH=~/second-brain \
-- uv run --project /path/to/second-brain-mcp python server.py
# Windows (PowerShell)
claude mcp add --scope user second-brain `
--env SECOND_BRAIN_PATH="C:\Users\$env:USERNAME\second-brain" `
-- uv run --project C:\path\to\second-brain-mcp python server.pyEnvironment Variables
Variable | Default | Description |
|
| Path to your vault directory |
|
| Embedding server endpoint |
|
| Embedding model name |
|
| llama-server port (use |
Auto-start (macOS, optional)
# Embedding server — always on, restarts on crash
cp examples/launchd/com.yourname.llama-embed.plist ~/Library/LaunchAgents/
# Edit paths inside the file, then:
launchctl load ~/Library/LaunchAgents/com.yourname.llama-embed.plist
# Weekly vault maintenance — every Sunday 02:00
cp examples/launchd/com.yourname.vault-sleep.plist ~/Library/LaunchAgents/
launchctl load ~/Library/LaunchAgents/com.yourname.vault-sleep.plistTroubleshooting
Symptom | Likely cause | Fix |
| Templates missing from vault | Run |
Semantic search silently falls back to BM25 | Embedding server not running | Start Ollama ( |
| Playwright chromium not installed |
|
| No | Export |
Agent sees no notes / empty results | Index not built | Run |
Notes land in the wrong place |
| Set it in your MCP config |
Tools unavailable when working in other project folders | Installed as local config instead of user scope | Re-register with |
Windows: MCP server times out on connect ( | Python venv is on a network/cloud drive | Move the venv to local SSD (e.g. |
Windows: Semantic search returns no results after | Ollama not running |
|
Vault Structure
vault/
├── 00-inbox/ # Unprocessed captures — clear daily
├── 10-projects/ # Active projects
├── 20-areas/
│ ├── research/ # Ongoing research domains
│ ├── coding/ # Dev tools and workflows
│ └── consolidated/ # Auto-merged similar notes (Phase 8)
├── 30-resources/ # ← Papers and articles (save_article writes here)
├── 40-archive/ # Compressed originals (auto-managed by vault_sleep)
├── decisions/ # Architecture Decision Records (ADR format)
├── memory/
│ ├── goals.md # Current priorities — injected at every session start
│ ├── index.md # Vault map
│ └── rules.md # Auto-extracted L3 rules — injected at every session start
└── templates/ # Note templates (note, decision, project, research)Running Tests
uv run pytest tests/ -v
uv run python benchmark.py --quick --markdown # search latency + accuracy reportReferences & Acknowledgements
Papers That Directly Inspired This Project
Paper | Where Used |
Do Language Models Need Sleep? Offline Recurrence for Improved Online Inference (2026) | Phase 3 Vault Sleep — hippocampal replay as batch memory consolidation |
Experience Compression Spectrum: Unifying Memory, Skills, and Rules in LLM Agents (2026) | Phase 9 adaptive tier — score × age dual-axis; addresses the "missing diagonal" in existing systems |
Phase 4 PNG tiers — image as compressed medium, 10× compression at 97% fidelity | |
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning (2026) | Phase 4 vision API — Playwright render → VLM reading pipeline |
Active Context Compression: Autonomous Memory Management in LLM Agents (2026) | Phase 3 design comparison — session-level vs. nightly batch consolidation |
Phase 8 consolidation — 3-stage semantic compression, 30× token reduction | |
Memory for Autonomous LLM Agents: Mechanisms, Evaluation, and Emerging Frontiers (2026) | Architecture positioning — mechanisms, evaluation, and frontiers |
Cognitive Science Foundations
Ebbinghaus, H. (1885). Über das Gedächtnis. — forgetting curve; basis for
access_count / ln(age_days + 1)Stickgold, R. (2005). Nature, 437, 1272–1278. — sleep-dependent memory consolidation
Built With
MarkItDown · DuckDB · llama.cpp · nomic-embed-text · FastMCP · Playwright · Anthropic Claude API
Known Issues & Fixes
WAL corruption (Failure while replaying WAL file)
Cause: DuckDB write was interrupted mid-transaction (IDE restart, pkill -9, or machine sleep).
Symptom: MCP server crashes on startup; every DB operation fails.
Fix:
rm -f ~/.second-brain/vault.db ~/.second-brain/vault.db.walRestart the server — it will rebuild a clean DB. Run sync_index to re-index your vault.
Note: If running inside a sandboxed IDE (e.g. Antigravity), the agent cannot delete
~/.second-brain/. Run the command in your local Terminal, or ask Claude Code (VSCode extension) which has full shell access.
vault.db created in the wrong directory
Cause: server.py was launched with a non-home working directory; DuckDB created vault.db relative to cwd.
Symptom: ~/.second-brain/vault.db is tiny (< 1 MB) but a large vault.db exists elsewhere.
Fix:
# Find the real vault.db
find ~ -name "vault.db" -size +1M 2>/dev/null
# Move it to the correct location
cp /path/to/found/vault.db ~/.second-brain/vault.dbIDE reports no such file or directory for .venv/bin/python
Root cause: The venv was created inside the Google Drive folder. Google Drive sync breaks symlinks — bin/python points to an absolute path on another machine, which doesn't exist locally.
Permanent fix — use a local venv (macOS):
# Create venv on local machine (once per machine)
python3 -m venv ~/.venvs/second-brain
~/.venvs/second-brain/bin/pip install -r /path/to/second-brain/requirements.txt
# Register MCP with Claude Code using the local venv
claude mcp add --scope user second-brain \
~/.venvs/second-brain/bin/python \
/path/to/second-brain/server.py \
-e PYTHONPATH=/path/to/second-brain \
-e SECOND_BRAIN_PATH=/path/to/vaultSource code (server.py, etc.) stays on Google Drive and syncs normally. Only the venv lives on the local machine.
Do not create the venv inside the Google Drive folder — it will break on every other machine that syncs it.
Contributing
PRs and Issues welcome. Please open an issue first to discuss significant changes.
License
MIT License — © 2026 Chan Chi Ru. See LICENSE.
This server cannot be installed
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/ddmanyes/second-brain-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server