novada_map
Discover all URLs on a site by checking sitemap.xml first, then falling back to BFS crawl. Returns a list of URLs only, no content, for site structure discovery.
Instructions
Use when you need to know what URLs exist on a site before deciding what to read. Tries sitemap.xml first (fast), falls back to BFS crawl. Returns URL list only — no content.
Best for: Site structure discovery, or finding the correct subpage URL after you extracted the wrong page.
Not for: Reading page content (follow up with novada_extract or novada_crawl).
Note: Results may be limited on JavaScript single-page apps (SPAs); the output flags this when it happens.
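The sitemap-first, BFS-fallback strategy described above can be sketched roughly as follows. This is a minimal illustration, not novada_map's actual implementation: the names `discover_urls`, `urls_from_sitemap`, and the injected `fetch` callable are hypothetical, and the sketch ignores sitemap index files, robots.txt, and subdomain handling.

```python
from collections import deque
from html.parser import HTMLParser
from typing import Callable, Optional
from urllib.parse import urldefrag, urljoin, urlparse
import xml.etree.ElementTree as ET


class _LinkParser(HTMLParser):
    """Collects href values from <a> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def urls_from_sitemap(xml_text: str) -> list[str]:
    """Return every <loc> value in a sitemap, ignoring XML namespaces."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return []
    return [
        el.text.strip()
        for el in root.iter()
        if el.tag.rsplit("}", 1)[-1] == "loc" and el.text and el.text.strip()
    ]


def discover_urls(
    root_url: str,
    fetch: Callable[[str], Optional[str]],  # hypothetical: returns body text, or None on failure
    limit: int = 100,
    max_depth: int = 2,
) -> list[str]:
    # Fast path: use the sitemap if it exists and lists any URLs.
    sitemap = fetch(urljoin(root_url, "/sitemap.xml"))
    if sitemap:
        found = urls_from_sitemap(sitemap)
        if found:
            return found[:limit]

    # Fallback: breadth-first crawl of same-host links, up to max_depth hops.
    host = urlparse(root_url).netloc
    seen = {root_url}
    order = [root_url]
    queue = deque([(root_url, 0)])
    while queue and len(order) < limit:
        url, depth = queue.popleft()
        if depth >= max_depth:
            continue  # do not follow links beyond the hop budget
        html = fetch(url)
        if not html:
            continue
        parser = _LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            link = urldefrag(urljoin(url, href)).url  # resolve and drop #fragment
            if urlparse(link).netloc != host or link in seen:
                continue
            seen.add(link)
            order.append(link)
            queue.append((link, depth + 1))
            if len(order) >= limit:
                break
    return order
```

Passing `fetch` in as a parameter keeps the sketch independent of any HTTP library; in practice it could wrap whatever client is available. Only URLs are collected, never page bodies, which matches the "URL list only" behavior described above.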
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | | |
| search | No | | |
| limit | Yes | | |
| include_subdomains | Yes | | |
| max_depth | Yes | Link hops from the root to follow. Higher values find more pages but are slower. | 2 |