read-pdf
Extract text content from PDF files with options for page filtering, text cleaning, and metadata inclusion. Use this tool to retrieve specific information from PDF documents without manual reading.
Instructions
Extract text from a PDF file. Returns the full text content of the PDF with optional page filtering and text cleaning.
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file | Yes | Path to the PDF file to extract text from | |
| pages | No | Page range (e.g., '1-5', '1,3,5', 'all'). Default: 'all' | |
| clean_text | No | Clean and normalize extracted text. Default: false | |
| include_metadata | No | Include PDF metadata in output. Default: true |