read_document
Extract text and metadata from DOCX, DOC, TXT, MD, HTML, and other document formats for processing or analysis.
Instructions
Read various document formats (DOCX, DOC, TXT, MD, HTML, etc.)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| filePath | Yes | Document path to read | |
| extractMetadata | No | Extract document metadata | |
| preserveFormatting | No | Preserve formatting (HTML output) | |
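
As an illustration of the schema above, the following sketch builds an arguments object for a read_document call. The parameter names come from the table. Treating extractMetadata and preserveFormatting as booleans is an assumption, since the table does not state their types. The helper build_read_document_args and the path reports/summary.docx are hypothetical and used only for this example. How the arguments are sent to the tool depends on the client and is not shown.

```python
import json


def build_read_document_args(file_path, extract_metadata=None, preserve_formatting=None):
    """Build an arguments dict for read_document (hypothetical helper).

    Optional fields are omitted when not set, so the tool's own defaults apply.
    """
    if not file_path:
        # filePath is the only parameter marked as required in the table.
        raise ValueError("filePath is required")

    args = {"filePath": file_path}
    # Assumption: both optional flags are booleans; the table does not say.
    if extract_metadata is not None:
        args["extractMetadata"] = bool(extract_metadata)
    if preserve_formatting is not None:
        args["preserveFormatting"] = bool(preserve_formatting)
    return args


# Hypothetical path, for illustration only.
example = build_read_document_args("reports/summary.docx", extract_metadata=True)
print(json.dumps(example, indent=2))
```

Leaving unset optional fields out of the payload, rather than sending explicit false values, avoids guessing at defaults the table leaves blank.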