Markdown RAG Documentation

integration.md•28.5 KiB

# Integration Guide This document describes how to integrate mcp-markdown-ragdocs with MCP clients including VS Code, Claude Desktop, and other compatible applications. ## Transport Methods This server supports two transport methods: 1. **Stdio Transport (Recommended)**: Used by VS Code, Claude Desktop, and most MCP clients. Server communicates via stdin/stdout. 2. **HTTP Transport**: REST API for development, testing, or custom integrations. | Transport | Command | Use Case | |-----------|---------|----------| | Stdio | `mcp-markdown-ragdocs mcp` | VS Code, Claude Desktop, MCP client integrations | | HTTP | `mcp-markdown-ragdocs run` | Development, testing, direct HTTP API access | ## MCP Protocol Overview The Model Context Protocol (MCP) enables LLMs to access external tools and data sources. This server exposes a `query_documents` tool that allows AI assistants to search and retrieve information from local Markdown documentation. ## Query Document Tools ### `query_documents` Tool Parameters The `query_documents` tool supports flexible result filtering through the `uniqueness_mode` parameter: ```typescript { query: string; // Required: Search query top_n?: number; // Optional: Max results (default: 5, max: 100) uniqueness_mode?: string; // Optional: Result uniqueness strategy (default: "allow_multiple") min_score?: number; // Optional: Min relevance score (default: 0.3) show_stats?: boolean; // Optional: Show compression stats (default: false) similarity_threshold?: number; // Optional: Dedup threshold (default: 0.85) excluded_files?: string[]; // Optional: Files to exclude from results } ``` #### Uniqueness Mode Controls whether multiple chunks from the same document can appear in results: - **`"allow_multiple"` (default)**: Returns multiple chunks per document for depth within files - Use when you need comprehensive coverage of a topic that may span multiple sections - Example: Understanding all aspects of authentication in a single guide - Results may include several sections from the same file if they're all relevant - **`"one_per_document"`**: Returns at most one chunk per document for breadth across files - Use when you want to discover which files are relevant across the codebase - Example: Finding all files that mention a particular API or concept - Each result comes from a different file, maximizing document diversity **Usage Examples:** ```typescript // Depth mode: Get comprehensive info from relevant sections { "query": "JWT authentication implementation", "top_n": 10, "uniqueness_mode": "allow_multiple" } // May return: auth.md (3 sections), api.md (2 sections), config.md (1 section) ``` ```typescript // Breadth mode: Discover which files discuss JWT { "query": "JWT authentication implementation", "top_n": 10, "uniqueness_mode": "one_per_document" } // Returns: auth.md (best section), api.md (best section), config.md (best section), security.md (best section)... ``` #### Migration from query_unique_documents **`query_unique_documents` has been removed.** Use `query_documents` with `uniqueness_mode="one_per_document"` instead: ```typescript // Old approach (removed) { "tool": "query_unique_documents", "query": "authentication", "top_n": 5 } // New approach { "tool": "query_documents", "query": "authentication", "top_n": 5, "uniqueness_mode": "one_per_document" } ``` ## Integration with VS Code ### Prerequisites - VS Code with MCP extension support - Python 3.13+ - mcp-markdown-ragdocs installed via `uv` ### Configuration The server uses **stdio transport**, not HTTP. VS Code manages the server lifecycle automatically. **File:** `.vscode/settings.json` (workspace) or `~/.config/Code/User/mcp.json` (user-global) ```json { "mcpServers": { "markdown-docs": { "command": "uv", "args": [ "--directory", "/absolute/path/to/mcp-markdown-ragdocs", "run", "mcp-markdown-ragdocs", "mcp" ], "type": "stdio" } } } ``` **Path Resolution:** Replace `/absolute/path/to/mcp-markdown-ragdocs` with the actual installation directory: ```zsh cd /path/to/mcp-markdown-ragdocs pwd # Copy this path ``` **For installed package (not recommended for development):** ```json { "mcpServers": { "markdown-docs": { "command": "mcp-markdown-ragdocs", "args": ["mcp"], "type": "stdio" } } } ``` ### Multi-Project Configuration To specify which project to activate: ```json { "mcpServers": { "markdown-docs": { "command": "uv", "args": [ "--directory", "/absolute/path/to/mcp-markdown-ragdocs", "run", "mcp-markdown-ragdocs", "mcp", "--project", "my-project" ], "type": "stdio" } } } ``` The `--project` flag accepts: - Project name from global config (e.g., `"my-project"`) - Absolute path to project root (e.g., `"/home/user/project"`) ### Verification After configuration: 1. Restart VS Code or reload window 2. Open MCP extension panel 3. Verify `markdown-docs` server appears as connected 4. Check available tools list includes `query_documents` ### Usage in Copilot Chat Query your documentation from Copilot Chat: ``` @markdown-docs How do I configure authentication? ``` Or let Copilot automatically invoke the tool based on context: ``` I need to implement OAuth2 authentication according to our project docs ``` The tool returns ranked document chunks. Use the file paths and sections to locate and read the full documentation. ## Integration with Claude Desktop ### Prerequisites - Claude Desktop app with MCP support - Python 3.13+ - mcp-markdown-ragdocs installed ### Configuration Claude Desktop uses **stdio transport** for MCP servers. **File:** `~/Library/Application Support/Claude/claude_desktop_config.json` (macOS) **File:** `%APPDATA%\Claude\claude_desktop_config.json` (Windows) **File:** `~/.config/Claude/claude_desktop_config.json` (Linux) ```json { "mcpServers": { "markdown-docs": { "command": "uv", "args": [ "--directory", "/absolute/path/to/mcp-markdown-ragdocs", "run", "mcp-markdown-ragdocs", "mcp" ] } } } ``` **With project override:** ```json { "mcpServers": { "markdown-docs": { "command": "uv", "args": [ "--directory", "/absolute/path/to/mcp-markdown-ragdocs", "run", "mcp-markdown-ragdocs", "mcp", "--project", "my-project" ] } } } ``` ### Verification 1. Close and reopen Claude Desktop 2. Start a new conversation 3. Type a documentation-related question 4. Claude will automatically invoke `query_documents` when appropriate ### Usage Claude will implicitly use the tool based on conversation context: ``` What does the authentication guide say about OAuth tokens? ``` The tool returns ranked document chunks. Review the results to identify relevant sections, then use file reading capabilities to access full context. ## HTTP API Integration (Alternative) For custom integrations or testing, use the HTTP API. ### Start HTTP Server ```zsh uv run mcp-markdown-ragdocs run --host 127.0.0.1 --port 8000 ``` ### Generic HTTP MCP Client Configuration pattern for HTTP-based MCP clients: ```json { "server_url": "http://127.0.0.1:8000", "tools": [ { "endpoint": "/query_documents", "method": "POST", "name": "query_documents", "parameters": { "query": "string", "top_n": "integer" } } ] } ``` ### Direct API Testing Test the endpoint with curl: ```zsh curl -X POST http://127.0.0.1:8000/query_documents \ -H "Content-Type: application/json" \ -d '{"query": "How do I configure the search weights?"}' ``` Response: ```json { "results": [ { "chunk_id": "configuration_search_0", "content": "To configure search weights, edit the [search] section in config.toml...", "file_path": "docs/configuration.md", "header_path": ["Configuration Sections", "search"], "score": 0.92 } ] } ``` **Note:** HTTP endpoint uses full JSON format. MCP stdio transport uses compact text format: `[N] file § section (score)\ncontent`. ``` ## Configuration for Different Use Cases ### Use Case 1: Project-Local Documentation Run the server from your project root with project-local configuration: **File:** `./config.toml` ```toml [indexing] documents_path = "./docs" index_path = ".ragdocs-index/" ``` **MCP Configuration:** ```json { "url": "http://127.0.0.1:8000" } ``` ### Use Case 2: Shared Team Documentation Run the server as a shared service on a team server: **File:** `/opt/ragdocs/config.toml` ```toml [server] host = "0.0.0.0" port = 8080 [indexing] documents_path = "/opt/team-docs" index_path = "/var/lib/ragdocs-indices" ``` **MCP Configuration (each team member):** ```json { "url": "http://docs-server.local:8080" } ``` ### Use Case 3: Personal Knowledge Base (Obsidian) Run the server pointing at your Obsidian vault: **File:** `~/.config/mcp-markdown-ragdocs/config.toml` ```toml [indexing] documents_path = "~/Documents/ObsidianVault" index_path = "~/.cache/ragdocs-obsidian" [search] semantic_weight = 1.2 recency_bias = 0.7 # Prioritize recent notes ``` **MCP Configuration:** ```json { "url": "http://127.0.0.1:8000" } ``` ## Server Management ### Running as a Background Service #### systemd (Linux) Create a systemd service file: **File:** `/etc/systemd/system/mcp-ragdocs.service` ```ini [Unit] Description=MCP Markdown RAG Documentation Server After=network.target [Service] Type=simple User=youruser WorkingDirectory=/opt/ragdocs ExecStart=/usr/local/bin/uv run mcp-markdown-ragdocs run Restart=on-failure RestartSec=10 [Install] WantedBy=multi-user.target ``` Enable and start: ```zsh sudo systemctl enable mcp-ragdocs sudo systemctl start mcp-ragdocs ``` #### Docker Create a Dockerfile: ```dockerfile FROM python:3.13-slim WORKDIR /app # Install uv RUN pip install uv # Copy project files COPY . . # Install dependencies RUN uv sync # Expose port EXPOSE 8000 # Start server CMD ["uv", "run", "mcp-markdown-ragdocs", "run", "--host", "0.0.0.0"] ``` Build and run: ```zsh docker build -t mcp-ragdocs . docker run -d -p 8000:8000 -v /path/to/docs:/docs \ -e INDEX_PATH=/app/.index_data \ mcp-ragdocs ``` ### Monitoring Server Status Check health: ```zsh curl http://127.0.0.1:8000/health ``` Response: ```json { "status": "ok" } ``` Check detailed status: ```zsh curl http://127.0.0.1:8000/status ``` Response: ```json { "server_status": "running", "indexing_service": { "pending_queue_size": 0, "last_sync_time": "2025-12-22T10:30:45.123Z", "failed_files": [] }, "indices": { "document_count": 147, "index_version": "1.0.0" } } ``` ## Response Format Differences ### HTTP vs MCP Stdio The server uses different response formats for HTTP and MCP stdio transport: **HTTP JSON (Full Format):** ```json { "results": [ { "chunk_id": "auth_0", "content": "Authentication uses JWT tokens...", "file_path": "docs/auth.md", "header_path": ["Authentication", "Tokens"], "score": 0.95 } ] } ``` **MCP Stdio (Compact Format, 60% less overhead):** ``` [1] docs/auth.md § Authentication > Tokens (0.95) Authentication uses JWT tokens... ``` ### Query-Aware Truncation Factual queries (configuration, commands, syntax) return truncated content to reduce token usage: **Detection Patterns:** - camelCase or snake_case identifiers - Backticks or code-like terms - Keywords: "what is", "configure", "syntax", "command" **Example:** Query: `getUserById function` Response: ``` [1] api.md § Functions (0.92) getUserById(id: string): User | null Retrieves user by ID. Returns null if not found... ``` Conceptual queries (explanations, guides) return full content: Query: `why use JWT authentication?` Response includes complete section content. ### Compression Statistics Default compression enabled (`min_confidence=0.3`, `max_chunks_per_doc=2`, `dedup_enabled=true`). Set `show_stats=true` to see filtering results: ``` Compression Stats: - Original results: 47 - After score filter (≥0.3): 23 - After deduplication: 12 - After document limit (2 per doc): 8 - Clusters merged: 11 ``` **Token Savings:** Compression reduces context usage by 40-60% depending on query and corpus. ## Troubleshooting ### Problem: VS Code Cannot Start MCP Server **Symptoms:** - Server not appearing in MCP extension - "Failed to start server" error - Tool not available **Solutions:** 1. **Verify uv installation:** ```zsh which uv uv --version ``` 2. **Check absolute path in config:** ```zsh cd /path/to/mcp-markdown-ragdocs pwd # Use this exact path in config ``` 3. **Test server manually:** ```zsh cd /path/to/mcp-markdown-ragdocs uv run mcp-markdown-ragdocs mcp ``` Server should start without errors. Press Ctrl+C to stop. 4. **Check VS Code logs:** - Open Output panel - Select "MCP" from dropdown - Look for startup errors 5. **Verify file permissions:** ```zsh ls -la /path/to/mcp-markdown-ragdocs ``` ### Problem: Query Returns Empty Results **Symptoms:** - `query_documents` returns no answer or "No results found" **Solutions:** 1. **Check indices exist:** ```zsh ls ~/.local/share/mcp-markdown-ragdocs/ # Multi-project mode ls .index_data/ # Local mode ``` 2. **Verify project detection:** ```zsh cd /your/project uv run mcp-markdown-ragdocs check-config ``` Should show detected project and document count. 3. **Rebuild index manually:** ```zsh cd /your/project uv run mcp-markdown-ragdocs rebuild-index ``` 4. **Check document paths in config:** ```toml [indexing] documents_path = "/correct/path/to/docs" ``` ### Problem: Wrong Project Detected **Symptoms:** - Queries search incorrect project - Multi-project setup not working **Solutions:** 1. **Explicitly specify project:** ```json { "args": [ "mcp", "--project", "correct-project-name" ] } ``` 2. **Verify project configuration:** ```zsh cat ~/.config/mcp-markdown-ragdocs/config.toml ``` Check `[[projects]]` paths are absolute and correct. 3. **Test detection:** ```zsh cd /project/directory uv run mcp-markdown-ragdocs check-config ``` ### Problem: Claude Desktop Not Finding Tool **Symptoms:** - Claude does not use documentation tool - No indication tool is available **Solutions:** 1. **Verify config file location:** ```zsh # macOS cat ~/Library/Application\ Support/Claude/claude_desktop_config.json # Linux cat ~/.config/Claude/claude_desktop_config.json ``` 2. **Check JSON syntax:** Use `jq` to validate: ```zsh cat path/to/config.json | jq ``` 3. **Restart Claude Desktop completely:** - Quit application (not just close window) - Reopen 4. **Test server manually:** ```zsh uv run mcp-markdown-ragdocs mcp ``` ## Security Considerations ### Network Exposure By default, the server binds to `127.0.0.1` (localhost only). To expose on a network: ```toml [server] host = "0.0.0.0" # Listen on all interfaces ``` **Warning:** This exposes the server to any client on your network. Consider: - Running behind a reverse proxy with authentication - Using firewall rules to restrict access - Deploying in a trusted network only ### File System Access The server has read access to all files under `documents_path`. Avoid pointing to: - System directories (`/`, `/etc`, `/usr`) - Home directory root (`$HOME` without subdirectory) - Directories with sensitive files (credentials, keys) **Recommended:** Scope to specific documentation directories. ### Resource Limits No built-in resource limits. Consider: - Using systemd resource limits (CPUQuota, MemoryLimit) - Docker resource constraints (--memory, --cpus) - Rate limiting via reverse proxy ## Git History Search The `search_git_history` tool provides semantic search over git commit history. ### From VS Code Configure in settings (same as `query_documents` tool). Query from Copilot Chat: ``` @markdown-docs show me commits that fixed authentication bugs in the last month ``` The tool automatically discovers `.git` directories in your `documents_path` and indexes commit history. ### From Claude Desktop Configure in claude_desktop_config.json (same as above). Query from conversation: ``` Search the git history for commits related to the JWT authentication refactor ``` Claude automatically invokes `search_git_history` when commit history context is useful. ### MCP Tool Parameters **`search_git_history`:** ```typescript { query: string; // Required: Natural language query top_n?: number; // Optional: Max results (default: 5, max: 100) min_score?: number; // Optional: Relevance threshold (default: 0.0) file_pattern?: string; // Optional: Glob filter (e.g., "src/**/*.py") author?: string; // Optional: Filter by author name/email after?: number; // Optional: Unix timestamp (commits after) before?: number; // Optional: Unix timestamp (commits before) } ``` ### Example Queries **Basic semantic search:** ```json { "query": "authentication improvements", "top_n": 10 } ``` Returns commits with "authentication" in title/message/delta, ranked by semantic similarity. **Filter by files changed:** ```json { "query": "API changes", "file_pattern": "src/api/**/*.py", "top_n": 5 } ``` Returns commits matching query where changed files match glob pattern. **Filter by author:** ```json { "query": "performance optimization", "author": "alice@example.com", "top_n": 5 } ``` Returns commits from specific author. **Temporal filtering:** ```json { "query": "security fixes", "after": 1704067200, "before": 1706745600, "top_n": 10 } ``` Returns commits within date range (Jan 1 - Feb 1, 2024). **Combined filters:** ```json { "query": "refactor database layer", "file_pattern": "src/db/**", "author": "bob", "after": 1704067200, "min_score": 0.3, "top_n": 5 } ``` All filters applied: query match, file pattern, author contains "bob", after date, score threshold. ### Configuration Enable/disable git indexing in config.toml: ```toml [git_indexing] enabled = true exclude_patterns = [".venv", "node_modules", "build"] max_delta_lines = 200 ``` See [Configuration Reference](configuration.md#git_indexing) for all options. ### Incremental Indexing Git commit indexing uses **incremental updates** by default: **On Startup:** - Queries last indexed timestamp per repository - Fetches only commits added since last index - Skips indexing if no new commits found - Logs incremental vs first-time indexing **State Persistence:** - Last indexed timestamp stored in SQLite per repository - Repository paths normalized (absolute, no `.git` suffix, no trailing slashes) - Ensures consistent timestamp tracking across restarts **Full Rebuild:** - Use `rebuild-index` CLI command to force full reindex - Clears commit database and reindexes all commits - Incremental indexing resumes after rebuild **Zero-Commit Optimization:** - Repositories with no new commits skipped during startup - Reduces unnecessary git operations for static repositories ### Performance - **Indexing:** 60 commits/sec (includes git operations, parsing, embedding) - **Incremental startup:** Near-instant when no new commits - **Full rebuild:** Scales with commit count (10k commits ≈ 3 minutes) - **Query:** 5ms average for 10k commits - **Storage:** ~2KB per commit (metadata + embedding + truncated delta) ### Troubleshooting **No commits indexed:** Check git binary availability: ```zsh which git ``` If git not found, install or add to PATH. **Commits reindexed on every startup:** This issue was fixed in recent versions. If experiencing: 1. Update to latest version 2. Run `rebuild-index` once to reset state with normalized paths 3. Verify logs show "Last indexed at..." rather than "First-time indexing" on subsequent startups **Verifying incremental indexing:** Check server logs on startup: ``` Repository /path/to/repo: Last indexed at 2026-01-15T10:30:00, found 5 new commits ``` "Last indexed at" confirms incremental indexing is working. "First-time indexing" indicates state was reset or not found. **Slow initial indexing:** Large repositories (10k+ commits) take several minutes for first-time indexing. Progress logged at INFO level. Consider excluding archived branches. Subsequent startups with incremental indexing are near-instant when no new commits exist. **High memory usage:** Delta truncation limits embedding size. Reduce `max_delta_lines` if memory constrained. ## HyDE Search (Hypothesis-Driven) The `search_with_hypothesis` tool enables hypothesis-driven document embeddings for vague queries. ### Concept Standard search requires well-formed queries with specific terms. HyDE inverts this: instead of searching with a question, search with a hypothesis describing what the answer documentation looks like. The hypothesis is embedded directly and matched against document embeddings. ### When to Use - Vague queries where exact terminology is unknown - AI assistants that can describe expected documentation content - Exploratory searches where the answer format is known but content is not ### MCP Tool Parameters **`search_with_hypothesis`:** ```typescript { hypothesis: string; // Required: Description of expected documentation top_n?: number; // Optional: Max results (default: 5, max: 100) excluded_files?: string[]; // Optional: Files to exclude from results } ``` ### Usage Example **Problem:** User asks "How do I add a new feature to the server?" **Standard Search:** The query "add new feature server" may not match documentation that uses different terminology. **HyDE Approach:** AI generates hypothesis: ``` To add a new feature, I would modify src/mcp_server.py and add a Tool definition in the list_tools() method. The Tool should include name, description, and inputSchema with parameter definitions. ``` **Example Call:** ```json { "hypothesis": "To add a new MCP tool, modify src/mcp_server.py. Add a Tool instance in list_tools() with name, description, and inputSchema. Implement a handler method _handle_<tool_name>() and register it in call_tool().", "top_n": 5 } ``` **Response:** ``` # HyDE Search Results [1] src/mcp_server.py § Tool Registration (0.89) The server exposes tools via list_tools()... [2] docs/development.md § Adding Tools (0.82) To add a new tool, create a Tool definition... ``` ### From VS Code Query using Copilot Chat with a hypothesis: ``` I think the configuration documentation describes a TOML file with [server], [indexing], and [search] sections. It probably includes host, port, documents_path, and search weight settings. ``` Copilot invokes `search_with_hypothesis` with this hypothesis. ### Configuration Enable/disable HyDE in config.toml: ```toml [search.advanced] hyde_enabled = true ``` When disabled, `search_with_hypothesis` falls back to standard `query_documents`. ## Memory Management Tools The server exposes tools for managing persistent AI memories when `memory.enabled = true`. ### Available Tools | Tool | Description | |------|-------------| | `create_memory` | Create a new memory file with frontmatter | | `read_memory` | Read full content of a memory file | | `update_memory` | Replace memory content | | `append_memory` | Append content to existing memory | | `delete_memory` | Move memory to trash (soft delete) | | `search_memories` | Hybrid search with recency boost | | `search_linked_memories` | Find memories linking to a document | | `get_memory_stats` | Memory bank statistics | | `merge_memories` | Consolidate multiple memories | | `get_memory_relationships` | Query memory relationships (versions/dependencies/contradictions) | ### Tool Parameters **`create_memory`:** ```typescript { filename: string; // Required: Filename (e.g., "project-notes.md") content: string; // Required: Memory body content (NO frontmatter - auto-generated) tags: string[]; // Required: List of tags (added to auto-generated frontmatter) memory_type?: string; // Optional: "journal" | "plan" | "fact" | "observation" | "reflection" } ``` **IMPORTANT:** The system automatically generates YAML frontmatter with `type`, `status`, `tags`, and `created_at`. The `content` parameter should contain ONLY the body text - do NOT include frontmatter. **`search_memories`:** ```typescript { query: string; // Required: Natural language query limit?: number; // Optional: Max results (default: 5) filter_tags?: string[]; // Optional: Filter by tags (OR logic) filter_type?: string; // Optional: Filter by memory type after_timestamp?: number; // Optional: Unix timestamp (memories after, inclusive) before_timestamp?: number; // Optional: Unix timestamp (memories before, exclusive) relative_days?: number; // Optional: Last N days (overrides absolute timestamps, ≥ 0) } ``` **`search_linked_memories`:** ```typescript { query: string; // Required: Query to rank results target_document: string; // Required: Document path (e.g., "src/auth.py") limit?: number; // Optional: Max results (default: 5) } ``` ### Usage Examples **Create a memory:** ```json { "filename": "auth-refactor-plan.md", "content": "Plan to refactor [[src/auth.py]] to use JWT tokens.\n\n## Goals\n- Remove session-based auth\n- Add token refresh", "tags": ["refactor", "auth"], "memory_type": "plan" } ``` The system will automatically generate frontmatter: ```yaml --- type: "plan" status: "active" tags: ["refactor", "auth"] created_at: "2025-01-11T12:00:00.000000+00:00" --- ``` **Search memories:** ```json { "query": "authentication improvements", "filter_type": "plan", "limit": 10 } ``` **Find memories linking to a file:** ```json { "query": "refactor plans", "target_document": "src/auth.py" } ``` Returns memories containing `[[src/auth.py]]` with anchor context showing the surrounding text. **Search with time range (last 7 days):** ```json { "query": "bug fixes", "relative_days": 7 } ``` **Search with absolute time range:** ```json { "query": "quarterly planning", "after_timestamp": 1704067200, "before_timestamp": 1706745600, "filter_tags": ["planning"] } ``` **Time Filtering Notes:** - `relative_days` overrides `after_timestamp` and `before_timestamp` - Times use `created_at` from frontmatter, fall back to file modification time - All timestamps are Unix timestamps (seconds since epoch) and normalized to UTC - Validation: `after_timestamp < before_timestamp`, `relative_days ≥ 0` ### Configuration Enable memory management in config.toml: ```toml [memory] enabled = true storage_strategy = "project" # or "user" for shared memory recency_boost_days = 7 recency_boost_factor = 1.2 ``` See [Memory Management Guide](memory.md) for complete documentation. ## API Reference ### POST /query_documents Query the documentation index. **Request Parameters:** | Parameter | Type | Default | Description | |-----------|------|---------|-------------| | `query` | string | (required) | Natural language search query | | `top_n` | integer | 5 | Maximum results to return (max: 100) | | `uniqueness_mode` | string | `"allow_multiple"` | `"allow_multiple"` or `"one_per_document"` | | `min_score` | float | 0.3 | Minimum relevance score threshold (0-1) | | `show_stats` | boolean | false | Include compression statistics | | `similarity_threshold` | float | 0.85 | Deduplication similarity threshold (0.5-1) | | `excluded_files` | string[] | [] | Files to exclude (filename, relative, or absolute path) | **Request (Depth - multiple chunks per document):** ```json { "query": "How do I configure the server?", "top_n": 5, "uniqueness_mode": "allow_multiple" } ``` **Request (Breadth - one chunk per document):** ```json { "query": "How do I configure the server?", "top_n": 5, "uniqueness_mode": "one_per_document" } ``` **Response:** ```json { "results": [ { "chunk_id": "configuration_0", "content": "To configure the server, create a config.toml file...", "file_path": "docs/configuration.md", "header_path": ["Configuration Reference"], "score": 0.95 } ] } ``` ### GET /health Health check endpoint. **Response:** ```json { "status": "ok" } ``` ### GET /status Detailed server status. **Response:** ```json { "server_status": "running", "indexing_service": { "pending_queue_size": 0, "last_sync_time": "2025-12-22T10:30:45.123Z", "failed_files": [] }, "indices": { "document_count": 147, "index_version": "1.0.0" } } ```

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/andnp/ragdocs-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

integration.md•28.5 KiB