Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@KB-MCP Serversearch for information about local-first knowledge bases"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
KB-MCP Server
A local-first Knowledge Base with Model Context Protocol (MCP) support. Give your AI a reliable memory. Run it locally. Stream answers in real time.
What is This?
A Knowledge Base (KB) is a structured collection of facts, documents, and embeddings stored in machine-readable form, with interfaces to:
Add knowledge
Query knowledge (semantic + keyword search)
Update/Delete knowledge
This MCP server exposes your KB to any MCP-compatible AI client (Claude, custom agents, etc.).
Why Local-First?
Benefit | Description |
Privacy | No cloud leaks — your data stays on your machine |
Zero latency | No network round-trips |
Offline support | Works without internet |
Full control | You own the data and the logic |
No vendor lock-in | Swap components freely |
Quick Start
Installation
npm install
npm run buildRun the Server
npm startOr for development:
npm run devConfigure with Claude Desktop
Add to your claude_desktop_config.json:
{
"mcpServers": {
"knowledge-base": {
"command": "node",
"args": ["/path/to/kb-mcp-server/dist/index.js"],
"env": {
"KB_DATA_DIR": "/path/to/your/data"
}
}
}
}Available Tools
Tool | Description |
| Add a document with title, content, and metadata |
| Semantic search across all documents |
| List documents with pagination |
| Get full document by ID |
| Update existing document |
| Remove document from KB |
| Get knowledge base statistics |
How It Works
1. User asks a question
↓
2. AI sends MCP query → KB-MCP Server
↓
3. KB retrieves relevant facts (semantic search)
↓
4. AI grounds the answer with real data
↓
5. Response streams to user
↓
6. (Optional) New insights stored backResult: AI answers correctly. Knowledge compounds. No hallucinations.
Architecture
┌─────────────────┐
│ AI Client │
│ (Claude, Agent) │
└────────┬────────┘
│ MCP Protocol
↓
┌─────────────────┐
│ KB-MCP Server │ ← stdio transport
│ ┌───────────┐ │
│ │ Tools │ │ ingest | query | list | delete
│ └─────┬─────┘ │
│ ↓ │
│ ┌───────────┐ │
│ │ Engine │ │ Embeddings + Similarity Search
│ └─────┬─────┘ │
│ ↓ │
│ ┌───────────┐ │
│ │ Store │ │ JSON file (swap with Chroma/pgvector)
│ └───────────┘ │
└─────────────────┘Configuration
Environment Variable | Default | Description |
|
| Directory for storing knowledge base data |
Production Enhancements
For production use, consider:
Real embeddings: Replace hash-based embeddings with OpenAI, Cohere, or local models (Ollama)
Vector database: Swap JSON store with Chroma, Qdrant, or pgvector
Chunking: Split large documents into chunks for better retrieval
Hybrid search: Combine semantic + BM25 keyword search
Access control: Add authentication for multi-user setups
License
MIT — Use freely.
Author
Matrix Agent