chunk_content
Splits clean text into overlapping chunks with unique IDs for vector embedding or search indexing.
Instructions
Split clean text into overlapping chunks of about 1,000 characters each, with a 200-character overlap between consecutive chunks. Each chunk gets a unique, deterministic ID derived from the source URL and the chunk's position. Useful for preparing text for vector embedding or search indexing.
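The chunking behavior described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the tool's actual implementation: the function name `chunk_text`, the output field names, and the choice of a truncated SHA-256 over `url#index` as the ID are all hypothetical. The source only says the ID is deterministic and derived from the URL and position.

```python
import hashlib


def chunk_text(text, url="", title="", size=1000, overlap=200):
    """Split text into overlapping chunks with deterministic IDs.

    Hypothetical sketch: the ID scheme and output fields are assumptions.
    """
    if size <= overlap:
        raise ValueError("size must be larger than overlap")

    step = size - overlap  # each chunk starts this far after the previous one
    chunks = []
    for index, start in enumerate(range(0, len(text), step)):
        body = text[start:start + size]
        # Assumed ID scheme: hash of the URL plus the chunk position.
        digest = hashlib.sha256(f"{url}#{index}".encode("utf-8")).hexdigest()
        chunks.append(
            {
                "id": digest[:16],
                "index": index,
                "url": url,
                "title": title,
                "text": body,
            }
        )
        # Stop once this chunk reaches the end of the text, so no trailing
        # chunk consists only of already-covered overlap.
        if start + size >= len(text):
            break
    return chunks
```

With the default sizes, each chunk starts 800 characters after the previous one, so the last 200 characters of one chunk reappear as the first 200 of the next. Hashing the URL together with the position means that re-running the split on the same page yields the same IDs, which keeps repeated indexing runs idempotent.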
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | No | Source URL to embed in each chunk. | |
| text | Yes | Clean plain text to split. | |
| title | No | Page title to embed in each chunk. | |
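As a rough illustration of the schema, the arguments for a call could be assembled like this. The helper `build_arguments` is hypothetical, and how the arguments are actually sent to the tool is not specified here. Rejecting an empty `text` is also an assumption; the schema only marks the field as required.

```python
def build_arguments(text, url=None, title=None):
    """Build an argument dict matching the input schema (hypothetical helper)."""
    # `text` is the only required field; treating "" as invalid is an assumption.
    if not isinstance(text, str) or not text:
        raise ValueError("text is required and must be a non-empty string")

    arguments = {"text": text}
    # Optional fields are left out entirely when not provided.
    if url is not None:
        arguments["url"] = url
    if title is not None:
        arguments["title"] = title
    return arguments


example = build_arguments(
    "Clean plain text to split.",
    url="https://example.com/page",
    title="Example page",
)
```

Omitting `url` and `title` produces a dict with only the `text` key, which mirrors the "Required" column of the table.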