gemini-analyze-image
Analyze images in JPG, PNG, WebP, and other formats to extract summaries, objects, text, or detailed insights using Gemini's multimodal vision capabilities. Supports context-based enhancements for specific use cases.
Instructions
Analyze images using Gemini's multimodal vision capabilities (with learned user preferences)
Input Schema
Name | Required | Description | Default |
---|---|---|---|
analysis_type | No | Type of analysis to perform: "summary", "objects", "text", "detailed", or "custom" | |
context | No | Optional context for intelligent enhancement (e.g., "medical", "architectural", "nature") | |
file_path | Yes | Path to the image file to analyze (supports JPEG, PNG, WebP, HEIC, HEIF, BMP, GIF) |