sitemap_parse
Fetch and parse an XML sitemap to get structured JSON of URLs or child sitemaps. Use to enumerate site pages, find recent updates, or build a crawl queue.
Instructions
Fetch and parse an XML sitemap (urlset or sitemapindex). Returns structured JSON: for urlsets — array of URLs with loc, lastmod, changefreq, priority, plus image/news sitemap extensions; for sitemapindex — list of child sitemap URLs. Use to enumerate all pages of a site, find recently updated content, or build a crawl queue — without loading raw XML into context.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Sitemap URL (http:// or https://). Accepts sitemap.xml, sitemap_index.xml, or any XML sitemap. | |
| maxUrls | No | Max URLs to return from a urlset (default 100, max 1000). Has no effect on sitemapindex (all child sitemaps are always returned). |