check_site
Scan entire websites to identify broken internal and external links. Recursively crawls pages to check link validity and provides detailed status reports.
Instructions
Recursively crawl and check all links across an entire website. This will scan multiple pages and check all internal and external links found. Use with caution on large sites as it may take significant time.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
excludeExternalLinks | No | If true, only check internal links (default: false) | |
honorRobotExclusions | No | If true, respect robots.txt and meta robots tags (default: true) | |
maxSocketsPerHost | No | Maximum concurrent requests per host (default: 1) | |
url | Yes | The starting URL of the site to check |
Input Schema (JSON Schema)
{
"properties": {
"excludeExternalLinks": {
"default": false,
"description": "If true, only check internal links (default: false)",
"type": "boolean"
},
"honorRobotExclusions": {
"default": true,
"description": "If true, respect robots.txt and meta robots tags (default: true)",
"type": "boolean"
},
"maxSocketsPerHost": {
"default": 1,
"description": "Maximum concurrent requests per host (default: 1)",
"type": "number"
},
"url": {
"description": "The starting URL of the site to check",
"type": "string"
}
},
"required": [
"url"
],
"type": "object"
}