extract_webpage
Fetch any webpage and return only its main readable content, stripping HTML, ads, and navigation to save AI tokens.
Instructions
Fetch a webpage and return only its main readable content, with no HTML, scripts, navigation, ads, or cookie banners. This saves 85–95% of tokens compared with raw HTML.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to fetch. | |
| max_tokens | No | Truncate output to this many tokens if exceeded. | |
| include_links | No | If True, preserve hyperlinks as [text](url). | |
| include_metadata | No | If True, prepend title/author/date when available. | |
| model | No | Used for token counting. | gpt-4o |
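
A minimal sketch of how a call to this tool might be assembled, using the parameter names from the table above. The helper names `build_arguments` and `build_request`, the example URL, and the MCP-style `tools/call` JSON-RPC envelope are assumptions for illustration; the source does not state how the tool is transported, so adapt the wrapper to whatever client you use.

```python
import json

# Parameter names come from the input schema table; the helper names and
# the JSON-RPC envelope below are illustrative assumptions.
OPTIONAL_PARAMS = ("max_tokens", "include_links", "include_metadata", "model")


def build_arguments(url, **options):
    """Return an arguments dict, omitting optional parameters left unset."""
    if not url:
        raise ValueError("url is required")
    unknown = set(options) - set(OPTIONAL_PARAMS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    args = {"url": url}
    # Skip None values so the tool can apply its own defaults (e.g. model).
    args.update({k: v for k, v in options.items() if v is not None})
    return args


def build_request(arguments, request_id=1):
    """Wrap the arguments in an MCP-style tools/call request (assumed transport)."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "extract_webpage", "arguments": arguments},
    }


args = build_arguments(
    "https://example.com/article",  # hypothetical URL
    max_tokens=2000,
    include_links=True,
)
print(json.dumps(build_request(args), indent=2))
```

Leaving `model` unset here relies on the documented default of `gpt-4o` for token counting; only `url` must always be supplied.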
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |