extract_structured
Extract structured data from repeating HTML containers using CSS selectors and attributes. Capture text, links, or other attributes from each container.
Instructions
Extract structured data from repeating containers.
fields: [{name: "title", selector: ".job-title", attribute: "text|href|src|..."}]
For "text", only the element's direct text nodes are captured, which prevents text from child elements (and therefore from other fields) being mixed in.
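The direct-text rule above can be illustrated with a small sketch. This is not the tool's implementation: it uses Python's standard `xml.etree.ElementTree` on a well-formed snippet, and the helper `direct_text`, the sample markup, and the whitespace handling are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def direct_text(element):
    """Join only the text nodes that are immediate children of `element`.

    Hypothetical helper: text inside child elements is skipped.
    """
    parts = [element.text or ""]
    for child in element:
        # `tail` is the text that follows a child but still belongs to the parent.
        parts.append(child.tail or "")
    return " ".join(p.strip() for p in parts if p.strip())


# Sample markup (assumed): the title element contains a nested badge.
snippet = (
    '<div class="job">'
    '<h2 class="job-title">Engineer <span class="badge">New</span></h2>'
    "</div>"
)
root = ET.fromstring(snippet)
title = root.find(".//h2[@class='job-title']")

print(direct_text(title))         # direct text only: Engineer
print("".join(title.itertext()))  # all descendant text: Engineer New
```

With direct-text capture, a separate field targeting `.badge` could return "New" without that text also appearing in the title field.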
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| container_selector | Yes | | |
| fields | Yes | | |
| limit | No | | |
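As a rough sketch of how the three inputs fit together, the arguments might be assembled as below. The parameter names come from the table; the selector values, the second field, and the assumption that `limit` is an integer are illustrative only, and how the arguments are actually sent to the tool depends on the client.

```python
import json

# Hypothetical argument payload; selector values are made up for illustration.
arguments = {
    "container_selector": ".job-card",  # assumed selector for one repeating container
    "fields": [
        {"name": "title", "selector": ".job-title", "attribute": "text"},
        {"name": "url", "selector": "a.job-link", "attribute": "href"},
    ],
    "limit": 10,  # optional; integer type is an assumption
}

# Check the two parameters the schema marks as required.
REQUIRED = ("container_selector", "fields")
missing = [key for key in REQUIRED if key not in arguments]
if missing:
    raise ValueError(f"missing required arguments: {missing}")

print(json.dumps(arguments, indent=2))
```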
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes | | |