plan_audit
Analyze sitemaps to create intelligent sampling strategies for large websites, identifying route patterns and recommending pages for SEO audit analysis.
Instructions
RECOMMENDED FIRST STEP - Analyze sitemaps and create an intelligent sampling strategy for large sites.
This tool is most useful for job boards and other large sites with 100k+ pages. Instead of crawling every page, it:
Discovers and validates all sitemaps (robots.txt + common locations)
Identifies distinct route patterns (job pages, category pages, location pages, etc.)
Estimates total pages per route type
Generates a smart sampling strategy
Recommends which pages to analyze with Lighthouse
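
The pattern identification and sampling steps listed above could be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the names `route_pattern` and `sampling_plan`, the `{id}` and `{slug}` placeholders, and the proportional allocation rule are all assumptions made for the example.

```python
import re
from collections import Counter
from urllib.parse import urlparse


def route_pattern(url: str) -> str:
    """Collapse a URL path into a coarse pattern (hypothetical heuristic)."""
    segments = [s for s in urlparse(url).path.split("/") if s]
    parts = []
    for seg in segments:
        if seg.isdigit():
            parts.append("{id}")      # purely numeric segment
        elif re.search(r"\d", seg):
            parts.append("{slug}")    # mixed segment such as "senior-dev-42"
        else:
            parts.append(seg)         # keep literal segments such as "jobs"
    return "/" + "/".join(parts)


def sampling_plan(urls, budget=10):
    """Split a page budget across patterns in proportion to their URL counts.

    Every pattern gets at least one page, so the total can exceed `budget`
    when there are many small patterns.
    """
    counts = Counter(route_pattern(u) for u in urls)
    total = sum(counts.values())
    plan = {}
    for pattern, n in counts.items():
        share = round(budget * n / total)
        plan[pattern] = min(n, max(1, share))
    return plan


example_urls = [f"https://example.com/jobs/{i}" for i in range(8)] + [
    "https://example.com/category/engineering",
    "https://example.com/category/sales",
]
example_plan = sampling_plan(example_urls, budget=5)
```

With these example URLs, the eight job pages collapse into a single `/jobs/{id}` pattern, while the two category pages stay separate because this heuristic only generalizes segments that contain digits.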
Returns:
Sitemap validation (URL limits, lastmod coverage, compression)
Route pattern classification with estimated counts
Sampling strategy (how many pages to sample per type)
Issues, warnings, and recommendations
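
To make the sitemap validation items more concrete, here is a hedged sketch of two of the checks named above: the URL limit and lastmod coverage. The 50,000-URL limit comes from the sitemaps.org protocol; the function name `validate_sitemap` and the keys of the returned dictionary are hypothetical and do not describe the tool's real output format. The sketch handles a plain `urlset` document only, not a sitemap index or a compressed file.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}
MAX_URLS_PER_SITEMAP = 50_000  # per-file limit in the sitemaps.org protocol


def validate_sitemap(xml_text: str) -> dict:
    """Count URLs and measure how many carry a <lastmod> element."""
    root = ET.fromstring(xml_text)
    urls = root.findall("sm:url", SITEMAP_NS)
    with_lastmod = sum(
        1 for u in urls if u.find("sm:lastmod", SITEMAP_NS) is not None
    )
    return {
        "url_count": len(urls),
        "within_url_limit": len(urls) <= MAX_URLS_PER_SITEMAP,
        # Fraction of URLs with a lastmod value; 0.0 for an empty sitemap.
        "lastmod_coverage": with_lastmod / len(urls) if urls else 0.0,
    }


SAMPLE = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>https://example.com/a</loc><lastmod>2024-01-01</lastmod></url>"
    "<url><loc>https://example.com/b</loc></url>"
    "</urlset>"
)
report = validate_sitemap(SAMPLE)
```

For the two-URL sample, only one entry has a `lastmod` element, so the reported coverage is one half.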
Use this BEFORE crawl_site or sample_pages to understand site structure.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| baseUrl | Yes | The base URL of the site (e.g., https://talent.com) | |
| maxSitemapsToProcess | No | Maximum number of sitemaps to analyze | 20 |
| maxUrlsPerSitemap | No | Maximum number of URLs to process per sitemap for pattern analysis | 5000 |
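
As an illustration of the schema above, the arguments for a `plan_audit` call might be assembled like this. The parameter names and default values come from the table; the helper `build_plan_audit_args` and its validation rules are assumptions made for this sketch, and how the arguments are actually sent depends on the client in use.

```python
# Defaults as documented in the Input Schema table.
DEFAULTS = {"maxSitemapsToProcess": 20, "maxUrlsPerSitemap": 5000}


def build_plan_audit_args(base_url: str, **overrides: int) -> dict:
    """Build an arguments dict for plan_audit (hypothetical helper)."""
    # Assumption: baseUrl should be an absolute http(s) URL.
    if not base_url.startswith(("http://", "https://")):
        raise ValueError("baseUrl must be an absolute http(s) URL")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    # Explicit overrides win over the documented defaults.
    return {"baseUrl": base_url, **DEFAULTS, **overrides}


args = build_plan_audit_args("https://talent.com", maxSitemapsToProcess=5)
```

Only `baseUrl` is required, so a call that passes nothing else falls back to the documented defaults for both optional parameters.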