mcp-ragdocs

by hannesrudolph
Verified

extract_urls

Extract and analyze all URLs from a given web page. This tool crawls the specified page, identifies all hyperlinks, and optionally adds them to the processing queue. It is useful for discovering related documentation pages and API references, or for building a documentation graph. It handles various URL formats and validates links before extraction.
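
The following is a minimal sketch of what link extraction of this kind can look like, using only the Python standard library. It is not the tool's actual implementation: the names `LinkCollector` and `extract_links` are hypothetical, and the choices to keep only http(s) links, drop fragments, and de-duplicate are assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects raw href values from <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html, base_url):
    """Return absolute, de-duplicated http(s) links in document order."""
    parser = LinkCollector()
    parser.feed(html)
    seen = set()
    links = []
    for href in parser.hrefs:
        # Resolve relative links against the page URL, then drop any #fragment.
        absolute, _fragment = urldefrag(urljoin(base_url, href.strip()))
        parts = urlparse(absolute)
        # Assumption: only http(s) links with a host count as valid.
        if parts.scheme not in ("http", "https") or not parts.netloc:
            continue
        if absolute not in seen:
            seen.add(absolute)
            links.append(absolute)
    return links


sample = (
    '<a href="/docs/api">API</a> '
    '<a href="guide.html#intro">Guide</a> '
    '<a href="mailto:x@example.com">Mail</a> '
    '<a href="/docs/api">API again</a>'
)
print(extract_links(sample, "https://example.com/docs/index.html"))
```

Here the relative and root-relative links are resolved against the page URL, the `mailto:` link is skipped, and the repeated link appears only once.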

Input Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| add_to_queue | No | If true, automatically add extracted URLs to the processing queue for later indexing. This enables recursive documentation discovery. Use with caution on large sites to avoid excessive queuing. | false |
| url | Yes | The complete URL of the webpage to analyze (must include protocol, e.g., https://). The page must be publicly accessible. | |
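
As a rough illustration of the constraints described for these parameters, a caller might check its inputs before invoking the tool. This is a sketch only: `build_arguments` is a hypothetical client-side helper, and restricting the protocol to http or https is an assumption (the description only says the URL must include a protocol).

```python
from urllib.parse import urlparse


def build_arguments(url, add_to_queue=False):
    """Validate inputs and return the arguments dict for extract_urls."""
    if not isinstance(url, str):
        raise TypeError("url must be a string")
    parts = urlparse(url)
    # Assumption: "complete URL with protocol" means http(s) plus a host.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError("url must be complete and include a protocol, e.g. https://")
    if not isinstance(add_to_queue, bool):
        raise TypeError("add_to_queue must be a boolean")
    # add_to_queue is left at False unless the caller opts in explicitly.
    return {"url": url, "add_to_queue": add_to_queue}


print(build_arguments("https://example.com/docs"))
```

Keeping `add_to_queue` off unless requested follows the caution above about excessive queuing on large sites.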

Input Schema (JSON Schema)

{
  "properties": {
    "add_to_queue": {
      "default": false,
      "description": "If true, automatically add extracted URLs to the processing queue for later indexing. This enables recursive documentation discovery. Use with caution on large sites to avoid excessive queuing.",
      "type": "boolean"
    },
    "url": {
      "description": "The complete URL of the webpage to analyze (must include protocol, e.g., https://). The page must be publicly accessible.",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}
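
For reference, a tool with this schema is typically invoked through an MCP `tools/call` request. The sketch below only builds the JSON-RPC message: `make_tool_call` is a hypothetical helper, the example URL is a placeholder, and how the message is sent depends on the client and transport in use.

```python
import json


def make_tool_call(request_id, url, add_to_queue=False):
    """Build a JSON-RPC 2.0 request that calls the extract_urls tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "extract_urls",
            # Keys match the input schema; only "url" is required.
            "arguments": {"url": url, "add_to_queue": add_to_queue},
        },
    }


request = make_tool_call(1, "https://example.com/docs")
print(json.dumps(request, indent=2))
```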