extract
Define a JSON schema to pull structured fields from any URL. Returns JSON Lines, with optional headless browser support for dynamic content.
Instructions
Extract structured fields from a URL using a JSON schema. The schema mirrors the WebReaper Schema shape: { field, children: [ { field, selector, type, is_list }, ... ] }. Returns the extracted record(s) as JSON Lines.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to extract from. | |
| browser | No | Use the headless browser. Auto-spawns a system Chrome / Chromium / Edge via WebReaper.Cdp; install a Chromium-family browser on the MCP host first. Default false. | |
| schemaJson | Yes | The extraction schema as JSON. See the WebReaper docs for the shape. |