text_clean
Strips scripts, styles, page chrome, and repeated boilerplate from web pages and returns clean, whitespace-collapsed text. Supports a CSS selector or automatic content-root detection.
Instructions
Returns chrome-stripped, JSON-stripped, whitespace-collapsed text from the element matched by a selector, or from the best content root when no selector is given. Drops script/style/noscript/svg elements and page chrome (nav/header/footer/aside), plus obvious hidden widgets and repeated boilerplate.
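The tag-dropping and whitespace-collapsing steps described above can be illustrated with a minimal sketch using only Python's standard library. This is not the tool's actual implementation: the names `SKIP_TAGS`, `_TextCollector`, and `clean_text_sketch` are hypothetical, and selector scoping, content-root detection, hidden-widget detection, JSON stripping, and boilerplate de-duplication are all left out.

```python
from html.parser import HTMLParser
import re

# Tags whose entire subtree is dropped (list taken from the description above).
SKIP_TAGS = {"script", "style", "noscript", "svg", "nav", "header", "footer", "aside"}


class _TextCollector(HTMLParser):
    """Collects text nodes that are not inside any skipped element."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside at least one skipped element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def clean_text_sketch(html, max_chars=None):
    """Return visible text with skipped subtrees removed and whitespace collapsed."""
    parser = _TextCollector()
    parser.feed(html)
    parser.close()
    # Joining with a space keeps adjacent blocks apart; it can also split a
    # word that spans inline tags, which a real implementation would handle.
    text = re.sub(r"\s+", " ", " ".join(parser.chunks)).strip()
    return text if max_chars is None else text[:max_chars]
```

For example, calling `clean_text_sketch` on a page containing a `nav` bar, a paragraph, and a `script` block keeps only the paragraph text. Truncating with a plain slice is an assumption about how `max_chars` behaves; the tool may cut at a word or sentence boundary instead.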
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| max_chars | No | Optional max characters to return. | |
| selector | No | Optional CSS selector to scope extraction. | Best content root |
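Since both parameters are optional, a caller can send an empty argument object and rely on the defaults. The helper below is a hedged sketch of assembling such arguments. `build_text_clean_args` is a hypothetical name, and the check that `max_chars` is a positive integer is an assumption, not something the schema states.

```python
import json


def build_text_clean_args(selector=None, max_chars=None):
    """Build an arguments dict for text_clean, omitting unset optional fields."""
    args = {}
    if selector is not None:
        args["selector"] = selector
    if max_chars is not None:
        # Assumption: a non-positive limit is treated as a caller error here.
        if not isinstance(max_chars, int) or max_chars <= 0:
            raise ValueError("max_chars must be a positive integer")
        args["max_chars"] = max_chars
    return args


# Scope extraction to the main article and cap the output length.
payload = json.dumps(build_text_clean_args(selector="article", max_chars=2000))
```

Omitting a field rather than sending `null` leaves the default (best content root, no length cap) in effect without relying on how the tool interprets null values.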