analyze_image
Analyze images using vision-language AI to answer questions about content, identify objects, extract text, or describe scenes from publicly accessible URLs.
Instructions
Analyze an image using Qwen2.5-VL vision-language model.
The image must be accessible via a public URL.
Examples:
"What's in this image?"
"Describe the scene"
"What text is visible?"
"Identify any people or objects"
"What is the mood or atmosphere?"
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image_url | Yes | URL of the image to analyze (must be publicly accessible) | |
| question | No | Question or prompt about the image | Describe this image in detail. |
| max_tokens | No | Maximum tokens in response |