extract
Scrape webpages and convert content to structured JSON data using AI, automatically bypassing bot detection and CAPTCHA protection.
Instructions
Scrape a webpage and extract structured data as JSON. First scrapes the page as markdown, then uses AI sampling to convert it to structured JSON format. This tool can unlock any webpage even if it uses bot detection or CAPTCHA.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
extraction_prompt | No | Custom prompt to guide the extraction process. If not provided, will extract general structured data from the page. | |
url | Yes |
Input Schema (JSON Schema)
{
"properties": {
"extraction_prompt": {
"description": "Custom prompt to guide the extraction process. If not provided, will extract general structured data from the page.",
"type": "string"
},
"url": {
"format": "uri",
"type": "string"
}
},
"required": [
"url"
],
"type": "object"
}