read_document
Read text content from PDF, TXT, MD, DOCX, or CSV files. Supports page ranges and auto-OCR for scanned PDFs.
Instructions
Read the text content of a document, with optional page selection. Use this when you need the raw text of a PDF, TXT, MD, CSV, or DOCX file. Page ranges are supported (e.g. "1-5", "1-3,7,10-12"), and scanned PDFs fall back to OCR automatically. Read-only.
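To make the page-range syntax concrete, here is a minimal Python sketch of how a spec such as "1-3,7,10-12" could be expanded into individual 1-based page numbers. The helper name `parse_page_ranges` is hypothetical, and the sorting, de-duplication, and error handling are assumptions made for illustration; the tool's actual parser may behave differently.

```python
def parse_page_ranges(spec: str) -> list[int]:
    """Expand a spec like "1-3,7,10-12" into sorted, unique 1-based page numbers.

    Hypothetical helper: illustrates the documented syntax only,
    not the tool's real implementation.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas such as "1-3,,7"
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        # Assumption: pages are 1-based and ranges must be ascending.
        if start < 1 or end < start:
            raise ValueError(f"invalid page range: {part!r}")
        pages.update(range(start, end + 1))
    return sorted(pages)


print(parse_page_ranges("1-3,7,10-12"))
```

Validating a spec like this on the client side before calling the tool can catch malformed ranges early, though the tool may accept or reject inputs differently.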
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| filename | Yes | The document filename to read | |
| page | No | Optional single page number to read (1-based). If omitted, returns full text. | |
| pages | No | Optional page ranges to read, e.g. "1-5", "10", "1-3,7,10-12". Overrides 'page' if both provided. | |
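The schema above can be illustrated with a small Python sketch that assembles the argument object for a read_document call. The function `build_read_document_args` is a hypothetical client-side helper, not part of the tool; it only mirrors the documented rules (filename is required, page is 1-based, and pages takes precedence over page). How the arguments are actually sent depends on the client in use.

```python
from typing import Optional


def build_read_document_args(
    filename: str,
    page: Optional[int] = None,
    pages: Optional[str] = None,
) -> dict:
    """Build the argument dict for read_document (hypothetical helper)."""
    if not filename:
        raise ValueError("filename is required")
    args: dict = {"filename": filename}
    if pages is not None:
        # 'pages' overrides 'page' per the schema, so 'page' is dropped here.
        args["pages"] = pages
    elif page is not None:
        if page < 1:
            raise ValueError("page is 1-based and must be >= 1")
        args["page"] = page
    # With neither 'page' nor 'pages', the tool returns the full text.
    return args


print(build_read_document_args("report.pdf", pages="1-3,7,10-12"))
print(build_read_document_args("notes.md"))
```

Dropping page when pages is present is a design choice in this sketch: it keeps the request unambiguous rather than relying on the tool to resolve the conflict.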