read_pdf_text
Extract text content from PDF files, including optional table data and specific page ranges, for document analysis and processing.
Instructions
读取PDF文件并提取文本内容
Args:
file_path: PDF文件绝对路径(必须是绝对路径)
page_numbers: 要提取的页面号列表,如果为None则提取所有页面
extract_tables: 是否提取表格数据
Returns:
包含文本内容、页面信息等的字典
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file_path | Yes | ||
| page_numbers | No | ||
| extract_tables | No |