ocr_image
Extract text from image files (PNG, JPG, TIFF, BMP) using multiple OCR engines and preprocessing options.
Instructions
OCR a single image file (PNG/JPG/TIFF/BMP).
Args: path: absolute path to the image. engine: 'auto' (RapidOCR), 'rapidocr', 'tesseract', or 'finereader'. lang: ISO 639-1 code ('en','de','fr','ro',...). Mapped per-engine. preprocess: if true, apply grayscale/denoise/deskew first (needs opencv).
Returns JSON: {engine, ok, text, mean_confidence, line_count, low_confidence_count, lines:[{text,confidence,bbox}], warnings}.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| path | Yes | ||
| engine | No | auto | |
| lang | No | en | |
| preprocess | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |