clean_text_pipeline
Clean text by removing HTML, URLs, emails, numbers, punctuation, stopwords, and whitespace while converting to lowercase using configurable pipeline steps.
Instructions
Configurable cleaning pipeline. Steps: html, urls, emails, numbers, punctuation, stopwords, whitespace, lowercase.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | ||
| steps | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |