invoke_firecrawl_crawlhtml
Initiate an asynchronous web crawl to extract HTML content from a specified URL. Results are stored in an S3 bucket, with control over the maximum number of pages to crawl.
Instructions
Start an asynchronous web crawl job using Firecrawl to retrieve HTML content.
Copy
Input Schema
Name | Required | Description | Default |
---|---|---|---|
limit | No | ||
s3_uri | Yes | ||
url | Yes |
Input Schema (JSON Schema)
You must be authenticated.
Other Tools from Unstructured API MCP Server
- cancel_crawlhtml_job
- cancel_job
- check_crawlhtml_status
- check_llmtxt_status
- create_astradb_destination
- create_azure_source
- create_gdrive_source
- create_neo4j_destination
- create_s3_destination
- create_s3_source
- create_weaviate_destination
- create_workflow
- delete_astradb_destination
- delete_azure_source
- delete_gdrive_source
- delete_neo4j_destination
- delete_s3_destination
- delete_s3_source
- delete_weaviate_destination
- delete_workflow
- get_destination_info
- get_job_info
- get_source_info
- get_workflow_info
- invoke_firecrawl_crawlhtml
- invoke_firecrawl_llmtxt
- list_destinations
- list_jobs
- list_sources
- list_workflows
- run_workflow
- update_astradb_destination
- update_azure_source
- update_gdrive_source
- update_neo4j_destination
- update_s3_destination
- update_s3_source
- update_weaviate_destination
- update_workflow
Related Tools
- @josemartinrodriguezmortaloni/webSearch-Tools
- @everford/fetcher-mcp
- @apify/mcp-server-rag-web-browser
- @ztobs/cline-browser-use-mcp
- @jae-jae/fetcher-mcp