web_scrape
Extract content from dynamic and single-page applications using a browser that supports JavaScript. Load any URL and retrieve rendered HTML.
Instructions
Use Playwright (browser) to load a page with JavaScript support and extract content. Best for dynamic/SPA sites.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to scrape | |
| waitForSelector | No | CSS selector to wait for before extracting (e.g. 'main', '.article-body') | |
| timeout | No | Page load timeout in ms (default 30000) | |
| maxContentLength | No | Maximum content length in characters (default 15000) | |
| takeScreenshot | No | Take a screenshot after loading (default false) |