analyze_image
Analyze local image files with vision AI. Submit a prompt to extract visual elements, layout, colors, or content from screenshots, mockups, or design files. Supports multiple image formats.
Instructions
Analyze one or more local image files using MiniMax vision. No browser involved. Use for screenshots, mockups, design files, or any image. Supports PNG, JPEG, GIF, WebP, BMP.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image_path | Yes | Path to a single image file, or an array of paths for multi-image analysis (e.g. before/after comparison). Absolute or relative paths accepted. | |
| prompt | Yes | The question or instruction for analyzing the image(s). Be specific about what visual elements, layout, colors, or content you want extracted. | |
| response_format | No | Optional. Set to 'json_object' to get structured JSON output. Default is 'text'. | |
| temperature | No | Optional. Controls randomness (0-2). Lower = more consistent. Default: 0.3. | |
| max_tokens | No | Optional. Maximum tokens in the response. Default: 2000. |