extract_urls

Crawl a webpage to extract, validate, and optionally queue all hyperlinks for processing, aiding in documentation discovery and graph building.

Instructions

Extract and analyze all URLs from a given web page. This tool crawls the specified webpage, identifies all hyperlinks, and optionally adds them to the processing queue. Useful for discovering related documentation pages, API references, or building a documentation graph. Handles various URL formats and validates links before extraction.

Input Schema

Name	Required	Description	Default
`add_to_queue`	No	If true, automatically add extracted URLs to the processing queue for later indexing. This enables recursive documentation discovery. Use with caution on large sites to avoid excessive queuing.
`url`	Yes	The complete URL of the webpage to analyze (must include protocol, e.g., https://). The page must be publicly accessible.

Input Schema (JSON Schema)

{
  "properties": {
    "add_to_queue": {
      "default": false,
      "description": "If true, automatically add extracted URLs to the processing queue for later indexing. This enables recursive documentation discovery. Use with caution on large sites to avoid excessive queuing.",
      "type": "boolean"
    },
    "url": {
      "description": "The complete URL of the webpage to analyze (must include protocol, e.g., https://). The page must be publicly accessible.",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}

mcp-ragdocs

extract_urls

Instructions

Input Schema

Input Schema (JSON Schema)

Other Tools from mcp-ragdocs

Related Tools

MCP directory API