export_dataset
Export a dataset of per-page text with source and page number to a Parquet or JSONL file. Filter by source filename to limit export to specific data.
Instructions
Write the per-page {source, page, text} dataset to a file, without vectors.
``fmt`` is "parquet" or "jsonl"; if empty, the format is inferred from the output file suffix.
``source`` limits the export to one source filename. No embedding is performed.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
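| output | Yes | Path of the file to write. Its suffix determines the format when `fmt` is empty. | |
| fmt | No | Export format: "parquet" or "jsonl". If empty, the format is inferred from the `output` suffix. | |
| source | No | Limits the export to one source filename. | |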
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |