playwright_get_visible_html
Extract visible HTML content from web pages with options to remove scripts, styles, comments, and meta tags for cleaner output.
Instructions
Get the HTML content of the current page. By default, all tags are removed from the output unless removeScripts is explicitly set to false.
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| selector | No | CSS selector to limit the HTML to a specific container | |
| removeScripts | No | Remove all script tags from the HTML (default: true) | |
| removeComments | No | Remove all HTML comments (default: false) | |
| removeStyles | No | Remove all style tags from the HTML (default: false) | |
| removeMeta | No | Remove all meta tags from the HTML (default: false) | |
| cleanHtml | No | Perform comprehensive HTML cleaning (default: false) | |
| minify | No | Minify the HTML output (default: false) | |
| maxLength | No | Maximum number of characters to return (default: 20000) |