playwright_get_visible_html
Extract and clean visible HTML content from a web page, allowing selective removal of scripts, styles, comments, and meta tags. Use CSS selectors to target specific elements and customize output with minification or length limits.
Instructions
Get the HTML content of the current page. By default, all <script> tags are removed from the output unless removeScripts is explicitly set to false.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
cleanHtml | No | Perform comprehensive HTML cleaning (default: false) | |
maxLength | No | Maximum number of characters to return (default: 20000) | |
minify | No | Minify the HTML output (default: false) | |
removeComments | No | Remove all HTML comments (default: false) | |
removeMeta | No | Remove all meta tags from the HTML (default: false) | |
removeScripts | No | Remove all script tags from the HTML (default: true) | |
removeStyles | No | Remove all style tags from the HTML (default: false) | |
selector | No | CSS selector to limit the HTML to a specific container |