fetch
Fetch web pages with full JavaScript execution and extract readable content. Use CSS selectors to target specific sections or access the accessibility tree with bounding boxes.
Instructions
Fetch a URL and extract readable content using the Servo browser engine (JS execution + CSS layout). Navbars, sidebars, and footers are stripped automatically. Use selector to extract a specific CSS-selected section instead of full-page Readability extraction. Set format to accessibility_tree to get the page's accessibility tree with bounding boxes. Long content is truncated at max_length; use start_index to paginate.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to fetch (http/https only) | |
| format | No | Output format: markdown (default), json, html, text, or accessibility_tree | |
| max_length | No | Max characters to return. Default: 5000 | |
| start_index | No | Character offset for pagination. Default: 0 | |
| timeout | No | Page load timeout in seconds. Default: 30 | |
| settle_ms | No | Extra wait in ms after the `load` event, for SPAs that keep hydrating. Default: 0. Max: 10000. | |
| selector | No | CSS selector to extract a specific section instead of full-page Readability extraction |