parse_pdf_structure
Parses PDF documents to extract structural elements like tables, figures, and sections. Supports OCR, page ranges, and asynchronous processing.
Instructions
Create a background Marker parse job for a PDF.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| pdf_path | Yes | ||
| output_dir | No | ||
| async_mode | No | ||
| ocr_enabled | No | ||
| ocr_language | No | eng | |
| rotate_pages | No | ||
| deskew | No | ||
| marker_max_pages_per_chunk | No | ||
| extract_figures | No | ||
| page_ranges | No | ||
| ctx | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |