Converts Docusaurus documentation sites into searchable knowledge bases with tools for content navigation and information retrieval
Transforms GitBook documentation sites into AI-accessible knowledge bases with tools for searching content, retrieving specific pages, and extracting code blocks
Works with Next.js documentation sites to provide AI-accessible knowledge retrieval with automatic domain detection and content extraction
Supports React documentation sites with auto-detection to create appropriately named tools for content search and retrieval
Mentioned as a compatible example for scraping and providing AI-accessible tools for Stripe's documentation content
Mentioned as a compatible example for API documentation, enabling AI to search and retrieve information from Twilio docs
Enables AI access to Vercel-hosted documentation through search tools, page retrieval, and section navigation capabilities
Universal Documentation MCP Server
A high-performance MCP (Model Context Protocol) server that transforms any documentation website into an AI-accessible knowledge base. Originally built for GitBook but works with Vercel docs, Next.js sites, Docusaurus, and many other documentation platforms. Features instant startup, intelligent caching, and automatic domain detection.
✨ Key Features
⚡ Instant Startup - SQLite storage with sub-second server initialization
🔍 Advanced Search - FTS5 full-text search with fuzzy matching and ranking
🧠 Smart Auto-Detection - Automatically detects domain, keywords, and branding
📝 Markdown Perfect - Preserves formatting with syntax-highlighted code blocks
🔄 Background Updates - Non-blocking change detection and cache refresh
🌐 Universal Support - Works with GitBook, Vercel docs, Next.js sites, and many other documentation platforms
📡 Dual Interface - Both MCP tools and REST API endpoints
🚀 Production Ready - Rate limiting, error handling, and robust caching
Related MCP server: RAG Documentation MCP Server
🚀 Quick Start
💡 Recommended: Use the interactive creator for the best experience!
🎨 Web UI Management Dashboard
The Web UI provides:
🚀 Visual Server Creation - Step-by-step wizard with live URL validation
📊 Server Management - Start/stop/delete servers with real-time status
📋 Claude Desktop Integration - One-click copy config or add via CLI
🖥️ Live Terminal - Real-time feedback and command execution
⚠️ Safety Features - Confirmation dialogs and cancel functionality
⭐ One-Command Setup
That's it! 🎉 The interactive wizard will:
✨ Guide you through setup with smart defaults
🔍 Auto-detect domain/keywords from your documentation site
📦 Install in organized directories (
~/.config/mcpbooks/servers/[name])🌍 Optionally install globally (accessible as
your-server-namecommand)🤖 Auto-configure Claude Desktop (optional)
🚀 Pre-cache everything for instant server startup
🛠️ Manual Setup (Advanced Users)
Install and Configure
npm install echo "GITBOOK_URL=https://docs.yoursite.com" > .envBuild with Auto-Detection
npm run build # Automatically detects and configures your domainStart Server
npm start # Instant startup with SQLite cacheTest with MCP Inspector
npx @modelcontextprotocol/inspector node dist/index.js
📦 Installation Options
Option 1: Local Development
Option 2: Global Installation
Option 3: Claude Desktop Integration
Config file locations:
macOS:
~/Library/Application Support/Claude/claude_desktop_config.jsonWindows:
%APPDATA%\\Claude\\claude_desktop_config.json
Option 4: StreamableHTTP Transport
Option 5: REST API Server
🌐 Platform Compatibility
While originally designed for GitBook, this MCP server has proven compatibility with many documentation platforms:
✅ Guaranteed to Work
GitBook (original target platform)
Custom GitBook instances
🎯 Successfully Tested
Vercel-hosted docs (
docs.vercel.com,aptos.dev)Next.js documentation sites
Static site generators with consistent navigation
Most HTML-based documentation platforms
🔧 How It Works
The scraper intelligently:
Discovers navigation through link crawling
Extracts content from any HTML structure
Adapts to different layouts automatically
Handles various authentication and routing patterns
💡 Pro Tip: If a site has consistent navigation and accessible content, our scraper will likely work! The auto-detection feature adapts to different site structures automatically.
⚙️ Configuration
Auto-Detection (Recommended)
The server will automatically:
Generate domain-specific tool names (
stripe_docs_search,api_docs_get_page)Extract relevant keywords from content
Create contextual descriptions for better AI integration
Manual Configuration
Configuration Examples
API Documentation:
→ Results in: api_search_content, api_get_page, etc.
Product Documentation:
→ Results in: help_search_content, help_get_page, etc.
🛠️ Available Tools
The server exposes 7 MCP tools with automatic prefixing:
Core Tools
Tool | Description | Parameters |
| Advanced search with ranking |
: Search terms |
| Get specific page content |
: Page path (e.g., "/api/auth") |
| Get table of contents | None |
| Get all pages in section |
: Section name |
| Force cache refresh | None |
| Extract code with syntax highlighting |
: Page path |
| Get formatted markdown |
: Page path |
MCP Prompts
explain_section- Generate comprehensive tutorialssummarize_page- Create concise summariescompare_sections- Compare documentation sectionsapi_reference- Format as API documentationquick_start_guide- Generate quick start guides
🌐 HTTP Interfaces
The server supports both MCP StreamableHTTP and traditional REST API:
StreamableHTTP MCP Protocol:
REST API (separate server):
🎯 Usage Examples
Auto-Detection Results
docs.stripe.com→stripe_search_content,stripe_get_pagedocs.react.dev→react_search_content,react_get_pageapi.yourcompany.com→api_search_content,api_get_pageGeneric sites →
docs_search_content,docs_get_page
MCP Tool Usage
🏗️ Architecture
SQLite Storage - Fast startup with FTS5 full-text search
Background Updates - Non-blocking change detection
Auto-Detection - Domain and keyword extraction
Parallel Scraping - Configurable concurrency
Smart Caching - Only updates changed content
Key Components
GitBookScraper- Web scraping and content extractionSQLiteStore- High-performance storage with FTS5 searchDomainDetector- Automatic domain and keyword detectionGitBookMCPServer- MCP server with tool handlersGitBookRestAPI- HTTP endpoints for web integration
🔧 Development
🌍 Universal GitBook Support
Works with any public GitBook, including:
API Documentation - Stripe, Twilio, etc.
Framework Docs - React, Vue, Angular
Product Guides - Help centers and tutorials
Developer Resources - SDKs and references
Company Wikis - Internal documentation
⚡ Performance
Instant Startup: Sub-second initialization with SQLite cache
Background Updates: Non-blocking change detection
Smart Indexing: FTS5 full-text search with ranking
Efficient Storage: SQLite replaces slow JSON parsing
Memory Optimized: On-demand loading instead of full memory cache
🚧 Limitations
Public GitBooks Only - Requires publicly accessible sites
Static Content - Scrapes published HTML, not API-based
Manual Refresh - No real-time updates (use refresh tool)
Text-Focused - Extracts text content, not interactive elements
📄 License
MIT
Need help? Check the MCP documentation or open an issue.