extract_structured
Extract structured data from a URL by providing a JSON Schema. Parses semantic markup such as JSON-LD, Open Graph, and Schema.org microdata and returns a JSON object, with no LLM involved.
Instructions
Fetch a URL and extract structured data matching a JSON Schema, for example title, author, date, price, description, rating, and image. Reads JSON-LD, Open Graph, Twitter Cards, and Schema.org microdata embedded in the page and returns only the extracted JSON object. No LLM required: extraction is deterministic. Ideal for articles, products, recipes, events, and any other page that uses semantic markup.
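To illustrate what deterministic extraction from semantic markup can look like, here is a minimal sketch using only the Python standard library. It is not the tool's actual implementation: the names `SemanticMarkupParser`, `extract`, `SAMPLE_HTML`, and `SCHEMA` are hypothetical, it handles only JSON-LD and Open Graph, and it assumes a schema property matches a markup key of the same name, checking JSON-LD before Open Graph. How the real tool maps fields is not documented here.

```python
import json
from html.parser import HTMLParser


class SemanticMarkupParser(HTMLParser):
    """Collects JSON-LD objects and Open Graph <meta> tags from an HTML page."""

    def __init__(self):
        super().__init__()
        self.json_ld = []      # parsed JSON-LD objects, in document order
        self.open_graph = {}   # "og:title" is stored under the key "title"
        self._in_json_ld = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "script" and attrs.get("type") == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []
        elif tag == "meta" and (attrs.get("property") or "").startswith("og:"):
            self.open_graph[attrs["property"][3:]] = attrs.get("content")

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self._in_json_ld = False
            try:
                parsed = json.loads("".join(self._buffer))
            except json.JSONDecodeError:
                return  # skip malformed JSON-LD blocks
            self.json_ld.extend(parsed if isinstance(parsed, list) else [parsed])


def extract(html, schema):
    """Return the schema's properties that are found in the page's markup.

    Assumption of this sketch: fields that are not found are simply omitted.
    """
    parser = SemanticMarkupParser()
    parser.feed(html)
    result = {}
    for field in schema.get("properties", {}):
        # JSON-LD objects are consulted first, then Open Graph tags.
        for source in (*parser.json_ld, parser.open_graph):
            if isinstance(source, dict) and source.get(field) is not None:
                result[field] = source[field]
                break
    return result


SAMPLE_HTML = """
<html><head>
<meta property="og:title" content="Example Widget">
<script type="application/ld+json">
{"@type": "Product", "name": "Example Widget", "description": "A widget."}
</script>
</head><body></body></html>
"""

SCHEMA = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "description": {"type": "string"},
        "price": {"type": "number"},
    },
}

print(extract(SAMPLE_HTML, SCHEMA))
# → {'title': 'Example Widget', 'description': 'A widget.'}
```

Because the result depends only on the markup present in the page, the same HTML and schema always produce the same object. In this sketch `price` is absent from the output because the sample page does not declare it.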
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to fetch and extract data from. | |
| schema | Yes | JSON Schema (as a JSON string) describing the fields to extract. E.g. `{"type":"object","properties":{"title":{"type":"string"},"price":{"type":"number"}}}` | |
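Since `schema` must be passed as a JSON string and not as a nested object, it can help to build the schema as a native structure and serialize it. The sketch below shows only how the two arguments might be assembled in Python. The URL is a placeholder, and how the arguments are sent to the tool depends on your client, which is not covered here.

```python
import json

# Describe the fields to extract as an ordinary dict first.
schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "price": {"type": "number"},
    },
}

# Arguments for extract_structured: "schema" is serialized to a JSON string.
arguments = {
    "url": "https://example.com/product/123",  # placeholder URL
    "schema": json.dumps(schema),
}

print(arguments["schema"])
```

Serializing with `json.dumps` avoids hand-written quoting mistakes in the schema string.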