text_clean
Extract and return clean, readable text from a web page by removing scripts, styles, navigation, headers, footers, and other non-content elements. Collapses whitespace and strips JSON.
Instructions
Return chrome-stripped, JSON-stripped, whitespace-collapsed text from the element matched by a selector or, if no selector is given, from the best content root. Drops script/style/noscript/svg elements and page chrome (nav/header/footer/aside), plus obvious hidden widgets and repeated boilerplate.
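As a rough illustration of the behavior described above, the following sketch uses only the Python standard library. It is not the tool's actual implementation: the names `clean_text`, `SKIP_TAGS`, and `_TextCollector` are hypothetical, and treating "JSON-stripped" as "drop any text node that parses as a JSON object or array" is an assumption. Selector scoping, content-root detection, hidden-widget detection, and boilerplate deduplication are left out.

```python
import json
import re
from html.parser import HTMLParser

# Elements whose whole subtree is dropped, taken from the description above.
SKIP_TAGS = {"script", "style", "noscript", "svg", "nav", "header", "footer", "aside"}


class _TextCollector(HTMLParser):
    """Collects text nodes that sit outside any dropped element."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0  # > 0 while inside a dropped subtree

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth:
            return
        stripped = data.strip()
        # Assumption: "strips JSON" means discarding text nodes that are
        # themselves a JSON object or array.
        if stripped[:1] in ("{", "["):
            try:
                json.loads(stripped)
                return
            except ValueError:
                pass
        self.parts.append(data)


def clean_text(html, max_chars=None):
    """Return whitespace-collapsed text, optionally truncated to max_chars."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    # Joining with a space keeps block elements apart, at the cost of
    # splitting words that span inline tags.
    text = re.sub(r"\s+", " ", " ".join(collector.parts)).strip()
    return text if max_chars is None else text[:max_chars]
```

Calling `clean_text(page_html, max_chars=2000)` on a fetched page would return at most 2000 characters of body text under these assumptions.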
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| max_chars | No | Maximum number of characters to return. | |
| selector | No | Optional CSS selector to scope extraction. | Best content root |
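Since both parameters are optional, a caller can omit either one. The helper below is a minimal sketch of assembling an arguments mapping for this tool. The function name `build_text_clean_args` is hypothetical, and the check that `max_chars` is a positive integer is an assumption, since the schema above does not state a type or range.

```python
def build_text_clean_args(selector=None, max_chars=None):
    """Build an arguments mapping for text_clean, omitting unset fields."""
    args = {}
    if selector is not None:
        args["selector"] = selector
    if max_chars is not None:
        # Assumption: max_chars is a positive integer.
        if not isinstance(max_chars, int) or max_chars <= 0:
            raise ValueError("max_chars must be a positive integer")
        args["max_chars"] = max_chars
    return args
```

For example, `build_text_clean_args(selector="article", max_chars=2000)` scopes extraction to the `article` element and caps the result, while `build_text_clean_args()` leaves both choices to the tool's defaults.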