extract_text_only
Extract clean text content from web pages by removing HTML formatting and extra whitespace to obtain plain text for analysis or processing.
Instructions
Extract plain text content without any formatting or HTML
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to extract text from | |
| removeWhitespace | No | Whether to remove extra whitespace (default: true) | |
| useCache | No | Whether to use cached content if available (default: true) |