Extract positioned text runs
extract_entities
Retrieve every text run from a PDF along with its position, font size, and font name. Use it for layout-aware tasks such as table reconstruction and positional lookup.
Instructions
Extract every text run of a PDF together with its layout geometry.
Returns JSON {path, entities, entity_count, page_count} where each entity is {text, page (0-based), x, y, font_size, font_name}. Coordinates are in PDF points with the origin at the bottom-left of the page. Read-only.
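Because the origin is at the bottom-left, a larger y means higher on the page. The sketch below shows one way to put entities into top-to-bottom, left-to-right order. The sample payload is hypothetical: it follows the documented field names, but the values are invented for illustration and `reading_order` is not part of the tool.

```python
import json

# Hypothetical payload in the documented shape; all values are invented.
sample = json.loads("""
{
  "path": "report.pdf",
  "entities": [
    {"text": "Total", "page": 0, "x": 72.0, "y": 500.0, "font_size": 10.0, "font_name": "Helvetica"},
    {"text": "Invoice", "page": 0, "x": 72.0, "y": 720.0, "font_size": 18.0, "font_name": "Helvetica-Bold"},
    {"text": "42.00", "page": 0, "x": 300.0, "y": 500.0, "font_size": 10.0, "font_name": "Helvetica"},
    {"text": "Notes", "page": 1, "x": 72.0, "y": 700.0, "font_size": 12.0, "font_name": "Helvetica"}
  ],
  "entity_count": 4,
  "page_count": 2
}
""")


def reading_order(entities):
    """Sort by page, then top-to-bottom, then left-to-right."""
    # Bottom-left origin: negate y so that higher runs come first.
    return sorted(entities, key=lambda e: (e["page"], -e["y"], e["x"]))


ordered_texts = [e["text"] for e in reading_order(sample["entities"])]
print(ordered_texts)
```

This simple sort assumes runs on the same visual line share exactly the same y; real extractions often need a tolerance.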
Use this for layout-aware tasks (table reconstruction, positional lookup, locating a label on the page). If you only need the reading text without coordinates, use extract_text; for Markdown or RAG chunks use convert_pdf.
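As a rough illustration of table reconstruction, entities can be clustered into rows by their y coordinate and then ordered by x within each row. This is a minimal sketch, not the tool's own logic: `group_rows`, the `y_tolerance` default of 2 points, and the sample entities are all assumptions made for the example.

```python
def group_rows(entities, y_tolerance=2.0):
    """Cluster entities into rows by y (per page), returning cell texts."""
    rows = []
    # Page ascending, y descending: top of the page first (bottom-left origin).
    for e in sorted(entities, key=lambda e: (e["page"], -e["y"])):
        last = rows[-1] if rows else None
        same_row = (
            last is not None
            and last["page"] == e["page"]
            and abs(last["y"] - e["y"]) <= y_tolerance
        )
        if same_row:
            last["cells"].append(e)
        else:
            # Start a new row anchored at this entity's y.
            rows.append({"page": e["page"], "y": e["y"], "cells": [e]})
    # Within each row, order cells left to right.
    return [
        [c["text"] for c in sorted(r["cells"], key=lambda c: c["x"])]
        for r in rows
    ]


# Hypothetical entities in the documented shape; values are invented.
entities = [
    {"text": "Qty", "page": 0, "x": 200.0, "y": 700.5, "font_size": 10.0, "font_name": "Helvetica"},
    {"text": "Item", "page": 0, "x": 72.0, "y": 700.0, "font_size": 10.0, "font_name": "Helvetica"},
    {"text": "3", "page": 0, "x": 200.0, "y": 684.2, "font_size": 10.0, "font_name": "Helvetica"},
    {"text": "Widget", "page": 0, "x": 72.0, "y": 685.0, "font_size": 10.0, "font_name": "Helvetica"},
]

table = group_rows(entities)
print(table)
```

A fixed tolerance is the simplest choice; scaling it by font_size may work better for documents that mix text sizes.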
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| path | Yes | Path to the PDF file, relative to the configured workspace. | |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes | | |