extract_content
Extract clean text and markdown content from web pages using hybrid extraction strategies, with optional JavaScript rendering support for dynamic sites.
Instructions
Extract clean text or Markdown content from a URL. The tool uses trafilatura for fast extraction and can optionally fall back to Playwright for pages that render their content with JavaScript.
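The fast-path-plus-fallback strategy described here could look roughly like the sketch below. It is an illustration under stated assumptions, not the tool's actual implementation: the function names, the `render_js` flag, and the 200-character threshold for deciding that the fast path failed are all hypothetical, and it assumes a trafilatura version that accepts `output_format="markdown"`. Both third-party libraries are imported lazily so the fallback stays optional.

```python
import urllib.request


def needs_fallback(text, min_chars=200):
    """Return True when fast extraction produced too little text to trust.

    The 200-character threshold is an assumed heuristic, not a documented value.
    """
    return text is None or len(text.strip()) < min_chars


def fetch_static_html(url, timeout=10):
    """Download the raw HTML with a plain HTTP request (no JavaScript runs)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def fetch_rendered_html(url, timeout=10):
    """Load the page in headless Chromium so client-side scripts can run."""
    # Imported lazily: Playwright is only needed when the fallback is used.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        try:
            page = browser.new_page()
            page.goto(url, timeout=timeout * 1000)  # Playwright expects milliseconds
            return page.content()
        finally:
            browser.close()


def extract_content(url, timeout=10, render_js=True):
    """Try trafilatura on static HTML first, then on rendered HTML if needed."""
    import trafilatura  # imported lazily so the helpers above work without it

    text = trafilatura.extract(
        fetch_static_html(url, timeout), output_format="markdown"
    )
    if render_js and needs_fallback(text):
        rendered = trafilatura.extract(
            fetch_rendered_html(url, timeout), output_format="markdown"
        )
        text = rendered or text  # keep the fast-path result if rendering adds nothing
    return text
```

Fetching the static HTML first keeps the common case cheap; the browser is launched only when the first pass returns little or no text.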
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to extract content from. | |
| timeout | No | Fetch timeout in seconds. | 10 |
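As a rough illustration of the schema above, a caller might assemble and check the arguments before invoking the tool. The helper name `build_arguments` is hypothetical, and the checks (an http or https URL, a positive timeout) are assumptions about sensible input rather than documented validation rules.

```python
from urllib.parse import urlparse


def build_arguments(url, timeout=10):
    """Build the argument object for extract_content.

    `url` is required; `timeout` is optional and defaults to 10 seconds.
    """
    parsed = urlparse(url)
    # Assumed rule: only absolute http(s) URLs make sense for a web fetch.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"url must be an absolute http(s) URL, got {url!r}")
    # Assumed rule: a timeout of zero or less is treated as a caller error.
    if timeout <= 0:
        raise ValueError("timeout must be a positive number of seconds")
    return {"url": url, "timeout": timeout}


# Only `url` is supplied, so `timeout` falls back to its default.
args = build_arguments("https://example.com/article")
```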