MCP-server-ragdocs

Node.js Package NPM Downloads Version codecov License: MIT

An MCP server implementation that provides tools for retrieving and processing documentation through vector search, enabling AI assistants to augment their responses with relevant documentation context.

Usage

The RAG Documentation tool is designed for:

Enhancing AI responses with relevant documentation
Building documentation-aware AI assistants
Creating context-aware tooling for developers
Implementing semantic documentation search
Augmenting existing knowledge bases

Features

Vector-based documentation search and retrieval
Support for multiple documentation sources
Support for local (Ollama) embeddings generation or OPENAI
Semantic search capabilities
Automated documentation processing
Real-time context augmentation for LLMs

Configuration

{ "mcpServers": { "rag-docs": { "command": "npx", "args": ["-y", "@sanderkooger/mcp-server-ragdocs"], "env": { "EMBEDDINGS_PROVIDER": "ollama", "QDRANT_URL": "your-qdrant-url", "QDRANT_API_KEY": "your-qdrant-key" # if applicable } } } }

Usage with Claude Desktop

Add this to your claude_desktop_config.json:

OpenAI Configuration

{ "mcpServers": { "rag-docs-openai": { "command": "npx", "args": ["-y", "@sanderkooger/mcp-server-ragdocs"], "env": { "EMBEDDINGS_PROVIDER": "openai", "OPENAI_API_KEY": "your-openai-key-here", "QDRANT_URL": "your-qdrant-url", "QDRANT_API_KEY": "your-qdrant-key" } } } }

Ollama Configuration

{ "mcpServers": { "rag-docs-ollama": { "command": "npx", "args": ["-y", "@sanderkooger/mcp-server-ragdocs"], "env": { "EMBEDDINGS_PROVIDER": "ollama", "OLLAMA_BASE_URL": "http://localhost:11434", "QDRANT_URL": "your-qdrant-url", "QDRANT_API_KEY": "your-qdrant-key" } } } }

Ollama run from this codebase

"ragdocs-mcp": { "command": "node", "args": [ "/home/sander/code/mcp-server-ragdocs/build/index.js" ], "env": { "QDRANT_URL": "http://127.0.0.1:6333", "EMBEDDINGS_PROVIDER": "ollama", "OLLAMA_URL": "http://localhost:11434" }, "alwaysAllow": [ "run_queue", "list_queue", "list_sources", "search_documentation", "clear_queue", "remove_documentation", "extract_urls" ], "timeout": 3600 }

Environment Variables Reference

Variable	Required For	Default	remarks
`EMBEDDINGS_PROVIDER`	All	`ollama`	"openai" or "ollama"
`OPENAI_API_KEY`	OpenAI	-	Obtain from OpenAI dashboard
`OLLAMA_BASE_URL`	Ollama	`http://localhost:11434`	Local Ollama server URL
`QDRANT_URL`	All	`http://localhost:6333`	Qdrant endpoint URL
`QDRANT_API_KEY`	Cloud Qdrant	-	From Qdrant Cloud console

Local Deployment

The repository includes Docker Compose configuration for local development:

Docker Compose Download

docker compose up -d

This starts:

Qdrant vector database on port 6333
Ollama LLM service on port 11434

Access endpoints:

Qdrant: http://localhost:6333
Ollama: http://localhost:11434

Cloud Deployment

For production deployments:

Use hosted Qdrant Cloud service
Set these environment variables:

QDRANT_URL=your-cloud-cluster-url QDRANT_API_KEY=your-cloud-api-key

Tools

search_documentation

Search through stored documentation using natural language queries. Returns matching excerpts with context, ranked by relevance.

Inputs:

query (string): The text to search for in the documentation. Can be a natural language query, specific terms, or code snippets.
limit (number, optional): Maximum number of results to return (1-20, default: 5). Higher limits provide more comprehensive results but may take longer to process.

list_sources

List all documentation sources currently stored in the system. Returns a comprehensive list of all indexed documentation including source URLs, titles, and last update times. Use this to understand what documentation is available for searching or to verify if specific sources have been indexed.

extract_urls

Extract and analyze all URLs from a given web page. This tool crawls the specified webpage, identifies all hyperlinks, and optionally adds them to the processing queue.

Inputs:

url (string): The complete URL of the webpage to analyze (must include protocol, e.g., https://). The page must be publicly accessible.
add_to_queue (boolean, optional): If true, automatically add extracted URLs to the processing queue for later indexing. Use with caution on large sites to avoid excessive queuing.

remove_documentation

Remove specific documentation sources from the system by their URLs. The removal is permanent and will affect future search results.

Inputs:

urls (string[]): Array of URLs to remove from the database. Each URL must exactly match the URL used when the documentation was added.

list_queue

List all URLs currently waiting in the documentation processing queue. Shows pending documentation sources that will be processed when run_queue is called. Use this to monitor queue status, verify URLs were added correctly, or check processing backlog.

run_queue

Process and index all URLs currently in the documentation queue. Each URL is processed sequentially, with proper error handling and retry logic. Progress updates are provided as processing occurs. Long-running operations will process until the queue is empty or an unrecoverable error occurs.

clear_queue

Remove all pending URLs from the documentation processing queue. Use this to reset the queue when you want to start fresh, remove unwanted URLs, or cancel pending processing. This operation is immediate and permanent - URLs will need to be re-added if you want to process them later.

Project Structure

The package follows a modular architecture with clear separation between core components and MCP protocol handlers. See ARCHITECTURE.md for detailed structural documentation and design decisions.

Using Ollama Embeddings without docker

Install Ollama:

curl -fsSL https://ollama.com/install.sh | sh

Download the nomic-embed-text model:

ollama pull nomic-embed-text

Verify installation:

ollama list

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Contributing

We welcome contributions! Please see our CONTRIBUTING.md for detailed guidelines, but here are the basics:

Fork the repository
Install dependencies: npm install
Create a feature branch: git checkout -b feat/your-feature
Commit changes with npm run commit to ensure compliance with Conventional Commits
Push to your fork and open a PR

Forkception Acknowledgments

This project is based on a fork of hannesrudolph/mcp-ragdocs, which itself was forked from the original work by qpd-v/mcp-ragdocs. The original project provided the foundation for this implementation.

This server cannot be installed

-

security - not tested

A

license - permissive license

-

quality - not tested

How are these scores calculated?

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

An MCP server implementation that provides tools for retrieving and processing documentation through vector search, enabling AI assistants to augment their responses with relevant documentation context.

Uses Ollama or OpenAI to generate embeddings.

Docker files included

Related Resources

Reddit Discussion about this server

Related MCP Servers

mcp-ragdocs
hannesrudolph
A
security
A
license
A
quality
An MCP server implementation that provides tools for retrieving and processing documentation through vector search, enabling AI assistants to augment their responses with relevant documentation context
Last updated -
22
229
MIT License
Rust Docs MCP Server
laptou
-
security
F
license
-
quality
An MCP server that provides AI tools with access to Rust documentation from docs.rs, enabling search for crates, documentation, type information, feature flags, version information, and source code.
Last updated -
28
Documentation Retrieval MCP Server (DOCRET)
Sreedeep-SS
-
security
A
license
-
quality
An MCP server that enables AI assistants to access up-to-date documentation for Python libraries like LangChain, LlamaIndex, and OpenAI through dynamic fetching from official sources.
Last updated -
1
MIT License
API Docs MCP Server
ShotaNagafuchi
-
security
F
license
-
quality
An MCP server that crawls API documentation websites and exposes their content to AI models, enabling them to search, browse, and reference API specifications.
Last updated -

View all related MCP servers

@sanderkooger/mcp-server-ragdocs

MCP-server-ragdocs

Table of Contents

Usage

Features

Configuration

Usage with Claude Desktop

OpenAI Configuration

Ollama Configuration

Ollama run from this codebase

Environment Variables Reference

Local Deployment

Cloud Deployment

Tools

search_documentation

list_sources

extract_urls

remove_documentation

list_queue

run_queue

clear_queue

Project Structure

Using Ollama Embeddings without docker

License

Contributing

Forkception Acknowledgments

Related Resources

Related MCP Servers

mcp-ragdocs

Rust Docs MCP Server

Documentation Retrieval MCP Server (DOCRET)

API Docs MCP Server

Appeared in Searches

New MCP Servers

MCP directory API