scan_document
Scans document text, source, and type to validate and prepare content for retrieval-augmented generation (RAG) ingestion.
Instructions
Pre-ingestion scanning for RAG documents
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| source | No | Document source | |
| content | Yes | Document text to scan | |
| document_type | No | Document type (pdf, html, txt) |