doc2x_parse_pdf_wait_text
Extract text from PDF documents by waiting for parsing completion, returning merged content with configurable output limits for processing.
Instructions
Wait for a PDF parse task until success and return merged text. Prefer passing uid (no re-submit). If only pdf_path is provided, it will (a) reuse an in-process cached uid if available, otherwise (b) submit a new task then wait.
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| uid | No | Doc2x parse task uid returned by doc2x_parse_pdf_submit. | |
| pdf_path | No | Absolute path to a local PDF file. If uid is not provided, this tool will reuse cached uid (if any) or submit a new task. | |
| poll_interval_ms | No | ||
| max_wait_ms | No | ||
| join_with | No | ||
| max_output_chars | No | Max characters of returned text (0 = unlimited). Useful to avoid LLM context overflow. Default can be set via env DOC2X_PARSE_PDF_MAX_OUTPUT_CHARS. | |
| max_output_pages | No | Max pages to merge into returned text (0 = unlimited). Default can be set via env DOC2X_PARSE_PDF_MAX_OUTPUT_PAGES. |