MCP-RAG
Your Personal NotebookLM for Claude Desktop
Universal RAG (Retrieval-Augmented Generation) MCP server for Claude Desktop. Index documents via CLI, search them in Claude Desktop with 0% hallucination.
What is MCP-RAG?
Think of it as NotebookLM for Claude Desktop:
Index any documents: PDF, Word, PowerPoint, Excel, HWP (Hangul), TXT, MD
Natural language search: Ask questions in Claude Desktop
0% Hallucination: Answers based ONLY on your documents
100% Local: All data stays on your computer (ChromaDB)
Simple workflow: CLI for indexing → Claude Desktop for searching
Architecture
Two-Part System:
CLI = Document management (add, delete, list)
Claude Desktop = Search and Q&A
Quick Start
1. Install
2. Start ChromaDB Server
Keep this running in a separate terminal:
3. Add Documents (CLI)
Supported formats:
Documents: PDF, DOCX, HWP, TXT, MD
Presentations: PPTX
Spreadsheets: XLSX, XLS
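As a rough illustration of the format list above, the sketch below filters a list of file names down to the supported extensions. The helper names `extensionOf` and `filterSupported` are hypothetical and are not part of the MCP-RAG CLI; only the extension list comes from this README.

```javascript
// Extensions taken from the "Supported formats" list above.
const SUPPORTED_EXTENSIONS = new Set([
  "pdf", "docx", "hwp", "txt", "md", // documents
  "pptx",                            // presentations
  "xlsx", "xls",                     // spreadsheets
]);

// Hypothetical helper: returns the lowercase extension without the dot,
// or "" when the name has no extension (e.g. "README" or ".gitignore").
function extensionOf(fileName) {
  const dot = fileName.lastIndexOf(".");
  return dot > 0 ? fileName.slice(dot + 1).toLowerCase() : "";
}

// Hypothetical helper: keeps only the names whose extension is supported.
function filterSupported(fileNames) {
  return fileNames.filter((name) => SUPPORTED_EXTENSIONS.has(extensionOf(name)));
}

console.log(filterSupported(["notes.md", "slides.PPTX", "photo.png", "README"]));
// prints [ 'notes.md', 'slides.PPTX' ]
```

The comparison is case-insensitive, so `slides.PPTX` and `slides.pptx` are treated the same way.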
4. Configure Claude Desktop
Windows: %APPDATA%\Claude\claude_desktop_config.json
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Add this:
Important: Use your actual installation path!
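To make the absolute-path requirement concrete, here is a minimal sketch that builds an `mcpServers` entry and rejects relative paths. The server name `mcp-rag` and the entry point `/path/to/mcp-rag/src/index.js` are placeholders assumed for illustration; substitute the values from your own installation.

```javascript
// Hypothetical helper: true for POSIX ("/opt/x") and Windows ("C:\\x") absolute paths.
function isAbsolutePath(p) {
  return p.startsWith("/") || /^[A-Za-z]:[\\/]/.test(p);
}

// Builds a fragment to merge into claude_desktop_config.json.
// The default server name and the entry point passed in are assumptions,
// not values confirmed by this README.
function buildMcpConfig(entryPoint, serverName = "mcp-rag") {
  if (!isAbsolutePath(entryPoint)) {
    throw new Error(`Expected an absolute path, got: ${entryPoint}`);
  }
  return {
    mcpServers: {
      [serverName]: { command: "node", args: [entryPoint] },
    },
  };
}

// Placeholder path: replace with your actual installation path.
console.log(JSON.stringify(buildMcpConfig("/path/to/mcp-rag/src/index.js"), null, 2));
```

Generating the fragment with `JSON.stringify` also takes care of escaping backslashes in Windows paths, which is a common source of syntax errors when the file is edited by hand.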
5. Restart Claude Desktop
6. Ask Questions!
In Claude Desktop:
CLI Commands
Examples
MCP Tools (Claude Desktop)
When you ask questions in Claude Desktop, these tools are automatically used:
Tool | Description |
| Search in specific collection or all collections |
| List all available collections |
| Get details about a collection |
Note: Document addition is CLI-only, not available in Claude Desktop.
How It Works
Indexing (CLI)
Searching (Claude Desktop)
Use Cases
Students
→ "Explain the concept of derivatives from my math collection"
Professionals
→ "What's our vacation policy?"
Researchers
→ "Summarize the methodology from the literature collection"
Features
Multi-collection support - Organize by topic
Semantic search - ChromaDB vector embeddings
Source attribution - See which document/chunk
Relevance scoring - Know how confident the match is
Multiple file formats - PDF, DOCX, PPTX, XLSX, HWP, TXT, MD
100% local - No cloud, all on your machine
0% hallucination - Only document-based answers
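To give a feel for how semantic search and relevance scoring fit together, the sketch below ranks chunks by cosine similarity against a query vector. It is an illustration only: this README does not state which distance metric MCP-RAG configures in ChromaDB, the 3-dimensional vectors are toy stand-ins for real embeddings, and `cosineSimilarity` and `rankChunks` are hypothetical names.

```javascript
// Cosine similarity between two equal-length numeric vectors.
// Returns 0 when either vector has zero length to avoid dividing by zero.
function cosineSimilarity(a, b) {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Scores every chunk against the query and returns the topK best matches,
// keeping the source name so the answer can be attributed to a document.
function rankChunks(queryVector, chunks, topK = 3) {
  return chunks
    .map((c) => ({ source: c.source, text: c.text, score: cosineSimilarity(queryVector, c.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK);
}

// Toy data: real embeddings have hundreds of dimensions.
const chunks = [
  { source: "handbook.pdf", text: "Vacation policy ...", vector: [0.9, 0.1, 0] },
  { source: "math.md", text: "Derivatives ...", vector: [0, 1, 0] },
];
console.log(rankChunks([1, 0, 0], chunks, 1)[0].source); // prints handbook.pdf
```

A higher score means the chunk points in nearly the same direction as the query in embedding space, which is what a relevance score is meant to convey.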
Comparison
Feature | NotebookLM | MCP-RAG |
Platform | Google Cloud | Local |
AI Model | Gemini | Claude |
Privacy | Cloud | 100% Local |
Multi-collection | No | Yes |
CLI | No | Yes |
Cost | Free (limited) | Free (unlimited) |
Troubleshooting
ChromaDB Connection Error
Problem: Cannot connect to ChromaDB
Solution:
Keep this terminal open!
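If you are unsure whether the server is actually up, a small reachability check can help. The sketch below assumes ChromaDB is listening on `http://localhost:8000` and exposes a heartbeat route at `/api/v2/heartbeat`; older ChromaDB releases used `/api/v1/heartbeat` instead, so both values are parameters. `isChromaReachable` is a hypothetical helper, not part of MCP-RAG.

```javascript
// Assumptions: default host/port and heartbeat path as described above.
// fetchImpl is injectable so the check can be exercised without a live server.
async function isChromaReachable({
  baseUrl = "http://localhost:8000",
  heartbeatPath = "/api/v2/heartbeat",
  fetchImpl = fetch, // global fetch, built into Node.js 18+
} = {}) {
  try {
    const response = await fetchImpl(new URL(heartbeatPath, baseUrl));
    return response.ok; // true for any 2xx status
  } catch {
    return false; // connection refused, DNS failure, and similar errors
  }
}
```

Calling `isChromaReachable()` resolves to `true` when the server answers with a success status and to `false` otherwise, so it can be used as a quick pre-flight check before indexing.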
Claude Desktop: MCP Server Not Showing
Check claude_desktop_config.json syntax
Use an absolute path (not a relative one)
Restart Claude Desktop completely
Check ChromaDB is running
No Search Results
Advanced
Batch Add Files
PowerShell:
Bash:
Custom Chunk Size
Edit src/indexer.js:
Larger chunks = more context, fewer chunks
Smaller chunks = more precise, more chunks
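The trade-off can be seen with a minimal character-based chunker. This is a sketch under stated assumptions, not the code in src/indexer.js: the function name `chunkText`, the use of an overlap, and the default values of 1000 and 200 characters are all hypothetical.

```javascript
// Hypothetical helper: splits text into fixed-size chunks where each chunk
// shares `overlap` characters with the previous one.
function chunkText(text, chunkSize = 1000, overlap = 200) {
  if (chunkSize <= 0) throw new RangeError("chunkSize must be positive");
  if (overlap < 0 || overlap >= chunkSize) {
    throw new RangeError("overlap must be >= 0 and smaller than chunkSize");
  }
  const chunks = [];
  const step = chunkSize - overlap; // how far the window advances each time
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // this chunk reached the end
  }
  return chunks;
}

const sample = "a".repeat(2500);
console.log(chunkText(sample, 1000, 200).length); // prints 3
console.log(chunkText(sample, 500, 100).length);  // prints 6
```

Halving the chunk size here doubles the number of chunks for the same document, which means more embeddings to store and search but tighter matches per result.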
Project Structure
Requirements
Node.js 18+
Python 3.8+ (for ChromaDB)
Claude Desktop (latest version)
Contributing
Contributions welcome! This is a universal tool that can benefit many users.
License
MIT License - see LICENSE
Credits
Built with:
Model Context Protocol (MCP) - Anthropic
ChromaDB - Vector database
pdf-parse - PDF extraction
mammoth - DOCX extraction
officeparser - PPTX extraction
xlsx - Excel extraction
node-hwp - HWP (Hangul) extraction
MCP-RAG - Your documents, Claude's intelligence, zero hallucination.