download_text
Retrieve OCR text from a Gallica document using its ARK identifier. The text is saved to a local cache so that specific portions can be read without loading the full text.
Instructions
Download OCR text from a Gallica document and save to cache in plain text format.
Args: identifier: Gallica ARK identifier (e.g., 'ark:/12148/bpt6k5619759j')
Returns: Path to the cached text file (as string)
IMPORTANT: The downloaded files are VERY LARGE (typically 100KB-1MB+ of text). DO NOT attempt to read the entire file into context. Use read tools with offset/limit parameters to read specific portions. Reading the full file will waste tokens and may cause performance issues.
Example: path = download_text("ark:/12148/bpt6k5619759j")
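The warning above about reading only portions of the file can be sketched as follows. This is a minimal illustration, assuming the returned path points to a UTF-8 plain text file on the local filesystem; `read_lines` is a hypothetical helper written for this example and is not part of the tool.

```python
from itertools import islice


def read_lines(path, offset=0, limit=50):
    """Return up to `limit` lines starting at zero-based line `offset`.

    Hypothetical helper: streams the file and stops as soon as the
    requested window has been read, so the full text is never held in memory.
    """
    # errors="replace" is an assumption: OCR output may contain odd bytes.
    with open(path, encoding="utf-8", errors="replace") as fh:
        return [line.rstrip("\n") for line in islice(fh, offset, offset + limit)]


# Usage sketch (the path would come from download_text):
# path = download_text("ark:/12148/bpt6k5619759j")
# first_page = read_lines(path, offset=0, limit=40)
```

Streaming with `islice` keeps memory use proportional to the window size rather than the file size, which matches the intent of the offset/limit advice.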
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| identifier | Yes | Gallica ARK identifier (e.g., 'ark:/12148/bpt6k5619759j') | |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes | Path to the cached text file (as string) | |