Analyze Image
analyze_imageExtract text-based information from one or more images using Gemini's multimodal analysis. Returns a text summary without generating images.
Instructions
Analyze and extract information from one or more images using Gemini multimodal understanding. Returns a text analysis — no image is generated. Default model: gemini-3-pro-preview.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| images | Yes | One or more images to analyze | |
| prompt | Yes | What to analyze or extract from the image(s) | |
| model | No | Model to use (default: gemini-3-pro-preview) | |
| max_tokens | No | Maximum tokens in response (default 16384, up to 64K output limit) | |
| global_media_resolution | No | Global image quality for cost optimization. MEDIUM recommended for PDFs (50% savings). |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| content | Yes | ||
| success | Yes |