invoke_firecrawl_crawlhtml
Start a web crawl job to extract HTML content from URLs and upload results to S3 storage for processing.
Instructions
Start an asynchronous web crawl job using Firecrawl to retrieve HTML content.
Args:
url: URL to crawl
s3_uri: S3 URI where results will be uploaded
limit: Maximum number of pages to crawl (default: 100)
Returns:
Dictionary with crawl job information, including the job ID
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to crawl | |
| s3_uri | Yes | S3 URI where results will be uploaded | |
| limit | No | Maximum number of pages to crawl | 100 |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes | Dictionary with crawl job information, including the job ID | |
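
As a rough illustration of the input schema, the sketch below assembles and sanity-checks the three arguments before they would be passed to `invoke_firecrawl_crawlhtml`. The helper name `build_crawl_args`, the validation rules, and the example URL and bucket are assumptions made for illustration; they are not part of the tool itself, and the tool may accept or reject inputs differently.

```python
from urllib.parse import urlparse

DEFAULT_LIMIT = 100  # default page limit stated in the tool description


def build_crawl_args(url, s3_uri, limit=DEFAULT_LIMIT):
    """Hypothetical helper: validate and assemble the tool arguments.

    The checks here are assumptions about what sensible input looks like,
    not rules documented by the tool.
    """
    parsed_url = urlparse(url)
    if parsed_url.scheme not in ("http", "https") or not parsed_url.netloc:
        raise ValueError(f"url must be an http(s) URL, got {url!r}")

    parsed_s3 = urlparse(s3_uri)
    if parsed_s3.scheme != "s3" or not parsed_s3.netloc:
        raise ValueError(f"s3_uri must look like s3://bucket/prefix, got {s3_uri!r}")

    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(limit, bool) or not isinstance(limit, int) or limit < 1:
        raise ValueError(f"limit must be a positive integer, got {limit!r}")

    return {"url": url, "s3_uri": s3_uri, "limit": limit}


# Example values are placeholders, not real endpoints.
args = build_crawl_args("https://example.com", "s3://example-bucket/crawls/")
print(args)
```

Because the job is asynchronous, the returned dictionary only identifies the crawl job; the crawled HTML is expected to appear later under the given S3 URI.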