
MCP Webscan Server

by bsmi021

extract-links

Extract and analyze hyperlinks from web pages, organizing URLs, anchor text, and contextual information into a structured format. Supports site mapping, SEO analysis, broken link checking, and targeted crawling preparation. Handles relative and absolute URLs with optional base URL and output limits.

Instructions

Extract and analyze all hyperlinks from a web page, organizing them into a structured format with URLs, anchor text, and contextual information. Performance-optimized with stream processing and worker threads for efficient handling of large pages. Works with either a direct URL or raw HTML content. Handles relative and absolute URLs properly by supporting an optional base URL parameter. Results can be limited to prevent overwhelming output for link-dense pages. Returns a comprehensive link inventory that includes destination URLs, link text, titles (if available), and whether links are internal or external to the source domain. Useful for site mapping, content analysis, broken link checking, SEO analysis, and as a preparatory step for targeted crawling operations.
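The behaviour described above (resolving relative links, optionally filtering by a base URL, capping the result count, and marking links as internal or external) can be sketched in a few lines. This is only an illustrative approximation built on the Python standard library, not the server's actual implementation; the names `LinkCollector` and `extract_links` and the output keys are hypothetical, and it does not attempt the stream processing or worker threads mentioned above.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects href, title, and anchor text for each <a> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._current = None  # the link currently being read, if any

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            attr = dict(attrs)
            if attr.get("href"):
                self._current = {"href": attr["href"], "title": attr.get("title"), "text": ""}

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            # Collapse runs of whitespace inside the anchor text.
            self._current["text"] = " ".join(self._current["text"].split())
            self.links.append(self._current)
            self._current = None


def extract_links(html, page_url, base_url=None, limit=100):
    """Hypothetical helper: returns at most `limit` link records."""
    parser = LinkCollector()
    parser.feed(html)
    source_host = urlparse(page_url).netloc
    results = []
    for link in parser.links:
        # Resolve relative hrefs against the base URL, or the page URL if none is given.
        absolute = urljoin(base_url or page_url, link["href"])
        # Assumed reading of the baseUrl filter: keep only links under the base URL.
        if base_url and not absolute.startswith(base_url):
            continue
        results.append({
            "url": absolute,
            "text": link["text"],
            "title": link["title"],
            "internal": urlparse(absolute).netloc == source_host,
        })
        if len(results) >= limit:
            break
    return results
```

Calling `extract_links(html, "https://example.com/index.html", base_url="https://example.com/")` would, under these assumptions, drop links that point to other hosts and return the rest as absolute URLs.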

Input Schema

Name | Required | Description | Default
baseUrl | No | Optional base URL to resolve relative links against. If provided, only links starting with this base URL will be returned. Useful for focusing on internal links. | (none)
limit | No | Maximum number of links to return. Defaults to 100. Max allowed is 5000. | 100
url | Yes | The fully qualified URL of the web page from which to extract links. Must be a valid HTTP or HTTPS URL. | (none)

Input Schema (JSON Schema)

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "additionalProperties": false,
  "properties": {
    "baseUrl": {
      "description": "Optional base URL to resolve relative links against. If provided, only links starting with this base URL will be returned. Useful for focusing on internal links.",
      "format": "uri",
      "type": "string"
    },
    "limit": {
      "default": 100,
      "description": "Maximum number of links to return. Defaults to 100. Max allowed is 5000.",
      "maximum": 5000,
      "minimum": 1,
      "type": "integer"
    },
    "url": {
      "description": "The fully qualified URL of the web page from which to extract links. Must be a valid HTTP or HTTPS URL.",
      "format": "uri",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}
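To show how arguments matching this schema might be sent, here is a minimal sketch that checks the constraints by hand and wraps the arguments in an MCP `tools/call` JSON-RPC request. The helper name `build_extract_links_call` is hypothetical, and the checks cover only the `url` scheme and the `limit` range from the schema above; how the request is transported to the server (stdio, HTTP, or otherwise) is left out.

```python
import json
from urllib.parse import urlparse


def build_extract_links_call(url, base_url=None, limit=None, request_id=1):
    """Hypothetical helper: validate arguments, then build a tools/call request."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("url must be a fully qualified HTTP or HTTPS URL")

    arguments = {"url": url}
    if base_url is not None:
        arguments["baseUrl"] = base_url
    if limit is not None:
        # The schema requires an integer between 1 and 5000; bool is excluded explicitly
        # because Python treats it as a subclass of int.
        if isinstance(limit, bool) or not isinstance(limit, int) or not 1 <= limit <= 5000:
            raise ValueError("limit must be an integer between 1 and 5000")
        arguments["limit"] = limit

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "extract-links", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_extract_links_call("https://example.com/", limit=50)
    print(json.dumps(request, indent=2))
```

Omitting `limit` leaves it out of the arguments entirely, so the server-side default of 100 from the schema would apply.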

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/bsmi021/mcp-server-webscan'
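The same request can be made from a script. The sketch below uses only the Python standard library; the helper names `server_request` and `fetch_server` are hypothetical, and it assumes the endpoint responds with JSON, which this page does not state. No particular response fields are assumed.

```python
import json
import urllib.request

API_ROOT = "https://glama.ai/api/mcp/v1/servers"


def server_request(owner, name):
    """Build the GET request for one server entry (hypothetical helper)."""
    return urllib.request.Request(
        f"{API_ROOT}/{owner}/{name}",
        method="GET",
        headers={"Accept": "application/json"},
    )


def fetch_server(owner, name, timeout=10):
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(server_request(owner, name), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    info = fetch_server("bsmi021", "mcp-server-webscan")
    print(json.dumps(info, indent=2))
```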

If you have feedback or need assistance with the MCP directory API, please join our Discord server.