generate-site-map
Crawl a website from a given URL to a specified depth and generate an XML sitemap with discovered URLs, up to a defined limit, for improved site navigation and indexing.
Instructions
Crawls a website starting from a given URL up to a specified depth and generates an XML sitemap containing the discovered URLs (up to a specified limit).
Input Schema
Name | Required | Description | Default |
---|---|---|---|
limit | No | Maximum number of URLs to include in the generated sitemap XML. Defaults to 1000. Max allowed is 5000. | |
maxDepth | No | The maximum depth to crawl relative to the starting URL to discover pages for the sitemap. 0 means only the starting URL. Max allowed depth is 5. Defaults to 2. | |
url | Yes | The starting URL for the crawl to generate the sitemap. Must be a valid HTTP or HTTPS URL. |
Input Schema (JSON Schema)
{
"$schema": "http://json-schema.org/draft-07/schema#",
"additionalProperties": false,
"properties": {
"limit": {
"default": 1000,
"description": "Maximum number of URLs to include in the generated sitemap XML. Defaults to 1000. Max allowed is 5000.",
"maximum": 5000,
"minimum": 1,
"type": "integer"
},
"maxDepth": {
"default": 2,
"description": "The maximum depth to crawl relative to the starting URL to discover pages for the sitemap. 0 means only the starting URL. Max allowed depth is 5. Defaults to 2.",
"maximum": 5,
"minimum": 0,
"type": "integer"
},
"url": {
"description": "The starting URL for the crawl to generate the sitemap. Must be a valid HTTP or HTTPS URL.",
"format": "uri",
"type": "string"
}
},
"required": [
"url"
],
"type": "object"
}