create_crawl
Discover and scrape an entire website by following links from a starting URL, extracting each page's content in the chosen output format.
Instructions
Autonomously discover and scrape entire websites by following links from a start URL.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| start_url | Yes | Starting URL for the crawl. | |
| max_pages | No | Maximum number of pages to crawl. | |
| follow_links | No | Whether to follow links found on pages. | |
| output_format | No | Format for scraped content. | markdown |
| country | No | Optional country code for location-specific crawling. | |
| parser | No | Optional parser ID for specialized content extraction. | |
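
As an illustration of the schema above, here is a minimal Python sketch that assembles an arguments object for `create_crawl`. The helper name `build_create_crawl_args` is hypothetical, and the validation rules (an absolute http(s) `start_url`, a positive integer `max_pages`, a boolean `follow_links`) are assumptions, since the table does not state parameter types or limits. Only the parameter names and the `markdown` default come from the table. How the arguments are sent to the tool depends on your client and is not shown.

```python
import json
from typing import Any, Optional
from urllib.parse import urlparse


def build_create_crawl_args(
    start_url: str,
    max_pages: Optional[int] = None,
    follow_links: Optional[bool] = None,
    output_format: str = "markdown",
    country: Optional[str] = None,
    parser: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble an arguments object for create_crawl (hypothetical helper)."""
    parsed = urlparse(start_url)
    # Assumption: the crawler expects an absolute http(s) URL.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("start_url must be an absolute http(s) URL")
    # Assumption: max_pages, when given, is a positive integer.
    if max_pages is not None and max_pages < 1:
        raise ValueError("max_pages must be at least 1")

    # start_url is the only required field; output_format defaults to "markdown".
    args: dict[str, Any] = {"start_url": start_url, "output_format": output_format}
    optional = {
        "max_pages": max_pages,
        "follow_links": follow_links,
        "country": country,
        "parser": parser,
    }
    # Leave out unset optional fields so the tool can apply its own defaults.
    args.update({key: value for key, value in optional.items() if value is not None})
    return args


example_args = build_create_crawl_args(
    "https://example.com", max_pages=25, follow_links=True
)
print(json.dumps(example_args, indent=2))
```

Omitting unset optional fields, rather than sending nulls, keeps the request limited to what the caller actually chose. The table lists no defaults for `max_pages`, `follow_links`, `country`, or `parser`, so the sketch does not guess any.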