analyze_image
Analyze images using vision AI. Provide a file path, URL, or base64 image and get descriptions, OCR, or answers based on a custom prompt.
Instructions
Analyze an image with a vision LLM (OpenAI-compatible chat/completions). Provide exactly one of path (local file), url (http/https), or base64. Optionally pass a custom prompt to steer the analysis (OCR, table extraction, captioning, Q&A, etc).
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | No | Public http(s) URL of the image. | |
| path | No | Absolute or relative local file path to the image. | |
| model | No | Override the vision model. Defaults to env VISION_MODEL (gpt-4o). | |
| base64 | No | Raw base64 string (with or without data: prefix). | |
| detail | No | Optional image detail hint passed to the gateway. | |
| prompt | No | What you want the model to do with the image. Defaults to a detailed description. | |
| system | No | Optional system message. | |
| mime_type | No | Override MIME type for base64 input. Auto-detected if omitted. | |
| max_tokens | No | ||
| temperature | No |