parse_document
Parse documents into structured formats (markdown, JSON, text, HTML). Extract text, tables, charts, formulas, and code blocks from PDF, DOCX, PPTX, HTML, MD, XLSX, images.
Instructions
Parse a document (PDF, DOCX, PPTX, HTML, MD) into structured output using Docling.
Docling is IBM's document understanding library that extracts text, tables, charts, formulas, and code blocks from multi-format documents.
Args: file_path: Absolute or relative path to the document file. Supports: PDF, DOCX, PPTX, HTML, Markdown, XLSX, Images. output_format: Output format - 'markdown' (default), 'json', 'text', or 'html'. extract_tables: If True, extract and structure tables (default True). extract_images: If True, extract embedded images (default False).
Returns: Dict with parsed content, metadata, and optionally tables/images.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file_path | Yes | ||
| output_format | No | markdown | |
| extract_images | No | ||
| extract_tables | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||