extract_pdf_text
Extract text from PDF documents, optionally including document metadata and preserving text formatting and structure. Use the pages parameter to restrict extraction to specific pages or page ranges.
Instructions
Extract text content from PDF documents with optional metadata and formatting preservation
Input Schema
Name | Required | Description | Default |
---|---|---|---|
file_path | Yes | Path to the PDF file to extract text from | |
include_metadata | No | Whether to include document metadata in the response | |
pages | No | Page range to extract (e.g., "1-5", "1,3,5", or "all") | all |
preserve_formatting | No | Whether to preserve text formatting and structure | |
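
The pages parameter accepts three shapes: a range ("1-5"), a comma-separated list ("1,3,5"), or "all". The sketch below shows one way such a value could be expanded into page numbers, followed by an example argument set for the tool. It is illustrative only: the helper name `parse_pages`, the 1-based numbering, the handling of mixed values such as "1-3,5", and the file name `report.pdf` are assumptions, not documented behavior of extract_pdf_text.

```python
def parse_pages(spec: str, page_count: int) -> list[int]:
    """Expand a page spec ("1-5", "1,3,5", or "all") into sorted page numbers.

    Hypothetical helper: assumes pages are numbered from 1 and that ranges
    are inclusive at both ends.
    """
    spec = spec.strip().lower()
    if spec == "all":
        return list(range(1, page_count + 1))

    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas such as "1,,3"
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        if start < 1 or end > page_count or start > end:
            raise ValueError(f"invalid page range: {part!r}")
        pages.update(range(start, end + 1))
    return sorted(pages)


# Example arguments matching the input schema above.
# "report.pdf" is a placeholder path, not a file the tool provides.
arguments = {
    "file_path": "report.pdf",
    "include_metadata": True,
    "pages": "1-5",
    "preserve_formatting": False,
}

if __name__ == "__main__":
    # Assume a 10-page document for the demonstration.
    print(parse_pages(arguments["pages"], page_count=10))
```

Only file_path is required; the other three keys can be omitted, in which case pages falls back to its documented default of all.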