playwright_get_visible_html
Extract visible HTML from web pages, optionally removing scripts, comments, styles, or meta tags. Supports minification, HTML cleaning, and character limits for precise content extraction.
Instructions
Get the HTML content of the current page. By default, all <script> tags are removed from the output unless removeScripts is explicitly set to false.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
cleanHtml | No | Perform comprehensive HTML cleaning (default: false) | |
maxLength | No | Maximum number of characters to return (default: 20000) | |
minify | No | Minify the HTML output (default: false) | |
removeComments | No | Remove all HTML comments (default: false) | |
removeMeta | No | Remove all meta tags from the HTML (default: false) | |
removeScripts | No | Remove all script tags from the HTML (default: true) | |
removeStyles | No | Remove all style tags from the HTML (default: false) | |
selector | No | CSS selector to limit the HTML to a specific container |