analyze_image_with_gemini
Analyze local image files or image URLs using Google Gemini Vision when the current model lacks native vision. Requires explicit user consent before sending data to Gemini.
Instructions
Analyze a local image file or image URL using Google Gemini Vision.
PRIVACY RULES:
Use this tool ONLY when the current main model cannot natively understand images.
If the current main model has native vision capability, do NOT use this tool unless the user explicitly asks to use Gemini.
Before calling this tool with confirm_send_to_gemini=true, you MUST ask the user for consent.
The image bytes and the prompt will be sent to the Google Gemini API.
If image_source is a URL, this MCP server first downloads the image on your machine, then sends the bytes to Gemini.
If the user has not explicitly agreed, omit confirm_send_to_gemini or set it to false; the tool then shows only a confirmation notice.
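The privacy rules above can be sketched as a small gate on the caller's side. This is only an illustration: `should_use_gemini`, `build_arguments`, `model_has_vision`, `user_requested_gemini`, and `user_consented` are hypothetical names that are not part of this tool, and how consent is actually collected is left to the client.

```python
from typing import Any, Optional


def should_use_gemini(model_has_vision: bool, user_requested_gemini: bool) -> bool:
    """Use the tool only if the main model lacks vision or the user asked for Gemini."""
    return (not model_has_vision) or user_requested_gemini


def build_arguments(
    image_source: str,
    user_consented: bool,
    prompt: Optional[str] = None,
) -> dict[str, Any]:
    """Build tool arguments, adding the confirmation flag only after explicit consent."""
    args: dict[str, Any] = {"image_source": image_source}
    if prompt is not None:
        args["prompt"] = prompt
    if user_consented:
        # Set only when the user has explicitly agreed to send the image to Gemini.
        args["confirm_send_to_gemini"] = True
    # Without consent the flag is omitted, so the tool shows a confirmation notice only.
    return args
```

Omitting the flag rather than passing false keeps the no-consent path identical to the tool's default behavior.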
Supported inputs:
Local file path, e.g. C:/Users/me/Desktop/image.png
HTTP/HTTPS image URL, e.g. https://example.com/image.png
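A caller may want to know in advance which of the two input kinds it is passing, for example to tell the user that a URL will be downloaded locally first. The following is a minimal sketch of one way to make that distinction; `classify_image_source` is a hypothetical helper, and the server's own detection logic may differ.

```python
from urllib.parse import urlparse


def classify_image_source(image_source: str) -> str:
    """Return "url" for HTTP/HTTPS sources, otherwise "local_path"."""
    scheme = urlparse(image_source).scheme.lower()
    # A Windows path such as C:/Users/... parses with scheme "c",
    # so only http and https are treated as URLs.
    if scheme in ("http", "https"):
        return "url"
    return "local_path"
```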
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image_source | Yes | Local image path or HTTP/HTTPS image URL. Examples: C:/Users/me/Desktop/a.png or https://example.com/a.png | |
| prompt | No | What to analyze. Example: 'Extract all visible text and summarize the screenshot.' | |
| model | No | Override the Gemini model for this call. Examples: gemini-2.5-flash, gemini-2.5-pro. Defaults to env GEMINI_VISION_MODEL. | |
| confirm_send_to_gemini | No | Set to true only after the user explicitly agrees to send this image to Gemini. | false |
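For reference, a call using these parameters might look like the following MCP `tools/call` request. This is a sketch assuming a client that builds JSON-RPC messages by hand; most MCP clients construct the envelope for you. `make_tool_call` is a hypothetical helper, and the argument values are taken from the examples in the table.

```python
import json
from typing import Any


def make_tool_call(request_id: int, arguments: dict[str, Any]) -> dict[str, Any]:
    """Wrap tool arguments in an MCP tools/call JSON-RPC request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "analyze_image_with_gemini",
            "arguments": arguments,
        },
    }


request = make_tool_call(
    1,
    {
        "image_source": "https://example.com/a.png",
        "prompt": "Extract all visible text and summarize the screenshot.",
        "model": "gemini-2.5-flash",
        # Only include this after the user has explicitly agreed.
        "confirm_send_to_gemini": True,
    },
)
print(json.dumps(request, indent=2))
```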