ickyMCP
RAG MCP Server for Document Search. Built for legal professionals and business users who need to search across large document collections.
Features
Semantic Search: Find relevant content based on meaning, not just keywords
Document Support: PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx), Markdown, Text
4K Token Chunks: Large chunks preserve context for legal and business documents
Incremental Indexing: Only re-index changed files
Local Embeddings: Uses nomic-embed-text-v1.5 (no API costs)
SQLite Storage: Single portable database file
Installation
Configuration
Environment Variables
Variable | Default | Description |
| 4000 | Tokens per chunk |
| 500 | Overlap between chunks |
|
| Path to SQLite database |
|
| Embedding model |
Claude Code Configuration
Add to your claude_desktop_config.json or MCP settings:
Usage
Tools Available
index
Index documents from a file or directory.
search
Semantic search across indexed documents.
similar
Find chunks similar to a given text.
refresh
Re-index only files that have changed.
list
List all indexed documents.
delete
Remove documents from the index.
status
Get server status and statistics.
How It Works
Indexing: Documents are parsed, split into 4K token chunks with 500 token overlap
Embedding: Each chunk is embedded using nomic-embed-text-v1.5 (768 dimensions)
Storage: Embeddings stored in SQLite with sqlite-vec for fast vector search
Search: Query is embedded, compared against all chunks using cosine similarity
Results: Top-K most similar chunks returned with full text and metadata
System Requirements
Python 3.10+
4GB RAM (2GB for model + headroom)
~1GB disk space (model + database)
License
MIT