Analyze image with Gemini
analyze_imageAnalyze local image files or URLs with Google's Gemini vision models to extract text descriptions or answer questions about screenshots, diagrams, and UI states without loading raw image bytes.
Instructions
Analyze a local image file (or image URL) with Google's Gemini vision models and return a text answer. Use this to read screenshots, diagrams, charts, or UI states without loading raw image bytes into the calling agent's context.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image | Yes | Absolute path to a local image file, or an http(s) URL. | |
| prompt | No | Question/instruction about the image. Defaults to a detailed description. | |
| model | No | Override the Gemini model (e.g. gemini-pro-latest for harder visual reasoning). |