Extract Text from Image (OCR)
ai.ocr.extractExtract text from images and PDFs using OCR. Supports 20+ languages including English, Chinese, Japanese, Korean. Accepts PNG, JPG, GIF, BMP, PDF, TIFF.
Instructions
Extract text from any image or PDF URL using OCR — supports 20+ languages including English, Russian, Chinese, Japanese, Korean, Arabic. Returns recognized text. Handles PNG, JPG, GIF, BMP, PDF, TIFF (OCR.space)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL of the image or PDF to extract text from (PNG, JPG, GIF, BMP, PDF, TIFF supported) | |
| language | No | OCR language: "eng" (English, default), "rus" (Russian), "ger" (German), "fre" (French), "spa" (Spanish), "jpn" (Japanese), "kor" (Korean), "chs" (Chinese Simplified) | |
| filetype | No | File type hint — set if URL has no extension or content-type is wrong | |
| detect_orientation | No | Auto-detect and correct image orientation (default false) |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | No | Tool response payload. Shape varies per tool — consult the tool description and inputSchema. May be an object, array, string, or number depending on the upstream provider response. | |
| error | No | Present only when the call failed. Includes error code, message, request_id, and any provider-specific extras. |