Scrape Web Page
scraping.spider.scrapeScrape any web page and extract clean content as markdown, plain text, or raw HTML. Handles JavaScript rendering, anti-bot bypass, and proxy rotation. Returns LLM-ready output for AI applications.
Instructions
Scrape any web page and get clean content — markdown (default), plain text, or raw HTML. Handles JavaScript rendering, anti-bot bypass, proxy rotation. Returns LLM-ready output. Cheapest web scraper with PAYG pricing (Spider.cloud)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to scrape (e.g. "https://example.com/page") | |
| format | No | Output format: markdown (default, best for LLMs), text (plain), raw (HTML), commonmark | markdown |
| readability | No | Enable readability mode — pre-processes page for LLM consumption | |
| wait_for | No | Wait N ms for JS to render before scraping (0-30000) |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | No | Tool response payload. Shape varies per tool — consult the tool description and inputSchema. May be an object, array, string, or number depending on the upstream provider response. | |
| error | No | Present only when the call failed. Includes error code, message, request_id, and any provider-specific extras. |