Extract Text From Screenshot
extract_text_from_screenshot
Extract text from screenshots using OCR. Recognizes code, terminal output, logs, documentation, and UI copy from images.
Instructions
Extract and recognize text from screenshots, including code, terminal output, logs, documentation, and UI copy.
Use when the user needs faithful OCR or text reconstruction from an image.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | Instructions for text extraction and formatting. | |
| image_detail | No | Optional detail level for the Codex Responses input_image. Defaults to CODEX_VISION_IMAGE_DETAIL if set, otherwise high. | |
| image_source | Yes | Local file path, remote URL, or data:image URL to the screenshot. | |
| programming_language | No | Optional programming language hint if the screenshot contains code. | |
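
As a rough illustration of the input schema above, the following Python sketch assembles an arguments object for extract_text_from_screenshot. The helper name build_arguments is hypothetical and not part of the tool. How the arguments are actually sent depends on the client in use and is not shown. The sketch only assumes that optional fields may be omitted so the documented defaults apply.

```python
from typing import Optional


def build_arguments(
    prompt: str,
    image_source: str,
    image_detail: Optional[str] = None,
    programming_language: Optional[str] = None,
) -> dict:
    """Build an arguments dict matching the input schema (hypothetical helper)."""
    # prompt and image_source are the two required fields.
    if not prompt:
        raise ValueError("prompt is required")
    if not image_source:
        raise ValueError("image_source is required")

    args = {"prompt": prompt, "image_source": image_source}

    # Optional fields are left out when unset, so the tool's defaults apply.
    if image_detail is not None:
        args["image_detail"] = image_detail
    if programming_language is not None:
        args["programming_language"] = programming_language
    return args


# Example: a local screenshot containing Python code (the path is illustrative).
example = build_arguments(
    prompt="Transcribe the code exactly, preserving indentation.",
    image_source="./screenshot.png",
    programming_language="python",
)
```
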
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| data | No | | |
| error | No | | |
| success | Yes | | |
| metadata | No | | |
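
The output schema gives no descriptions for its fields, so the Python sketch below rests on assumptions: that success is a boolean, that data carries the extracted text when success is true, and that error describes the failure otherwise. The helper name unwrap_result is hypothetical.

```python
def unwrap_result(result: dict):
    """Return the data field on success, raise on failure (hypothetical helper)."""
    # success is the only required output field.
    if "success" not in result:
        raise KeyError("result is missing the required 'success' field")

    if result["success"]:
        # Assumption: data holds the extracted text; it may be absent.
        return result.get("data")

    # Assumption: error describes what went wrong; it may be absent.
    raise RuntimeError(f"text extraction failed: {result.get('error')}")
```

A caller would typically wrap unwrap_result in a try/except block so that a failed extraction can be reported to the user or retried with a different prompt.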