Submit an async extraction job
extract_asyncSubmit an asynchronous extraction job for heavy, slow, or anti-bot-protected pages. Returns a job ID immediately for polling until success or failure.
Instructions
Submit an asynchronous extraction job for a heavy, slow, or anti-bot-protected page. Returns a job id immediately; poll it with get_job until status is "success" or "failed". Use this instead of extract when a page is large or likely to need stealth rendering.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The public HTTPS URL to extract from. | |
| schema | Yes | An object mapping each field name to a type. A type is one of the literals "string", "number", "boolean", "array", "object"; OR a one-element array for a list (e.g. ["string"] for a list of strings, or [{ "title": "string", "price": "number" }] for a list of objects); OR a nested object (e.g. { "author": { "name": "string" } }). Use the most specific shape you can. Example: { "title": "string", "price": "number", "tags": ["string"], "reviews": [{ "user": "string", "stars": "number" }] }. | |
| examples | No | Up to 3 few-shot examples. | |
| webhookUrl | No | Optional URL to receive a signed callback when the job completes. | |
| idempotencyKey | No | Optional key; a retry with the same key replays the original job. |