ocr-pdf-auto-legacy
Extract text from PDFs using OCR with language detection and page selection options to convert scanned documents into editable text.
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file | Yes | PDF 文件路径 | |
| pages | No | 页码范围('first', 'last', 'all', '1-3', '1,3,5'),默认 'first' | |
| language_type | No | 语言类型,默认 CHN_ENG | |
| detect_direction | No | 是否检测图像方向,默认 false | |
| detect_language | No | 是否检测语言,默认 false |