web-scrape-links
Extract all hyperlinks from any public webpage. Returns absolute URLs with visible link text, with options to filter by domain, deduplicate, or include anchors.
Instructions
Extracts all hyperlinks from any public webpage. Each href is normalized to an absolute URL and returned with its visible link text. Links that use the javascript:, mailto:, or data: schemes are filtered out. Optionally, results can be restricted to same-domain links, deduplicated, or extended to include #anchor links. Useful for crawlers, sitemap builders, link graph analysis, and content audits.
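The behavior described above can be sketched with Python's standard library. This is an illustration under stated assumptions, not the tool's actual implementation: the function name `extract_links`, the result keys `url` and `text`, and the choice to treat "same domain" as an exact, case-insensitive host match are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Schemes the description says are filtered out.
SKIPPED_SCHEMES = {"javascript", "mailto", "data"}


class _LinkCollector(HTMLParser):
    """Collects (href, visible text) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace in the visible text.
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def extract_links(html, base_url, same_domain_only=False,
                  include_anchors=False, deduplicate=True, limit=200):
    """Return a list of {"url", "text"} dicts (hypothetical output shape)."""
    limit = max(1, min(limit, 500))  # documented maximum is 500
    collector = _LinkCollector()
    collector.feed(html)
    collector.close()

    base_host = urlparse(base_url).netloc.lower()
    seen, results = set(), []
    for href, text in collector.links:
        href = href.strip()
        if not href:
            continue
        if href.startswith("#") and not include_anchors:
            continue  # anchor-only link such as "#section"
        if urlparse(href).scheme.lower() in SKIPPED_SCHEMES:
            continue
        absolute = urljoin(base_url, href)  # resolve relative hrefs
        # Assumption: "same domain" means the host matches exactly.
        if same_domain_only and urlparse(absolute).netloc.lower() != base_host:
            continue
        if deduplicate:
            if absolute in seen:
                continue
            seen.add(absolute)
        results.append({"url": absolute, "text": text})
        if len(results) >= limit:
            break
    return results
```

The sketch takes HTML that has already been fetched, so the filtering and normalization logic stays separate from any network access.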
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | No | URL of the webpage to extract links from (http or https). | |
| same_domain_only | No | Pass "true" to return only links pointing to the same domain as the input URL. | false |
| include_anchors | No | Pass "true" to include anchor-only links (#section). | false |
| deduplicate | No | Pass "false" to allow duplicate hrefs. By default, each unique URL is returned once. | true |
| limit | No | Maximum number of links to return (max 500). | 200 |
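As a rough illustration of the schema, the helper below builds an argument object for the tool. The boolean options are passed as the strings "true" and "false", as the table describes. The helper name `build_arguments`, sending `limit` as an integer, and the lower bound of 1 are assumptions; how the arguments are delivered to the tool depends on the client and is not shown.

```python
from urllib.parse import urlparse


def build_arguments(url, same_domain_only=False, include_anchors=False,
                    deduplicate=True, limit=200):
    """Build an argument dict for web-scrape-links (hypothetical helper)."""
    if urlparse(url).scheme not in ("http", "https"):
        raise ValueError("url must use http or https")
    # Assumption: limit is an integer between 1 and the documented max of 500.
    if not 1 <= limit <= 500:
        raise ValueError("limit must be between 1 and 500")

    def flag(value):
        # The schema expects the strings "true" / "false", not booleans.
        return "true" if value else "false"

    return {
        "url": url,
        "same_domain_only": flag(same_domain_only),
        "include_anchors": flag(include_anchors),
        "deduplicate": flag(deduplicate),
        "limit": limit,
    }


# Example: same-domain links only, at most 50 results.
arguments = build_arguments("https://example.com/", same_domain_only=True, limit=50)
```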