extract_inferred
Extract structured data from a web page without writing a schema. An LLM infers the schema from the content, optionally guided by a goal, for deterministic extraction.
Instructions
Extract structured data from a URL WITHOUT writing a schema: an LLM infers the schema from the page (optionally steered by a goal), then WebReaper extracts deterministically. Cheaper and more consistent than extract_with_prompt across similarly shaped pages. Requires an OpenAI-compatible LLM endpoint on the host: WEBREAPER_LLM_MODEL + WEBREAPER_LLM_BASE_URL, key in WEBREAPER_LLM_API_KEY (or OPENAI_API_KEY). Returns the extracted record(s) as JSON Lines.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to extract from. | |
| goal | No | Optional goal to steer the inferred schema (e.g. "product name and price"). | |
| model | No | Optional model id, overriding WEBREAPER_LLM_MODEL for this call. | |
| browser | No | Use the headless browser (for JS-rendered pages). Default false. |