ocr_image
Extract visible text from images using OCR. Supports optional language hints and preserves reading order and layout.
Instructions
【仅限 GLM/DeepSeek 系列模型调用】Extract visible text from an image with optional language and layout hints.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image | Yes | ||
| language | No | Optional language hint for visible text. | |
| preserveLayout | No | Preserve rough reading order and layout where possible. | |
| detail | No | How much visual detail to request from the vision model. | medium |
| maxTokens | No | ||
| model | No | Optional Claude model override. Defaults to VISIONTOOL_MODEL or claude-opus-4-8. | |
| _caller_model | No | 【限制项】调用方模型名。仅限 GLM / DeepSeek 系列模型调用此工具,其他模型将被拒绝。 |