doubao_vision_understand
Analyze images using Doubao vision model by providing an image source and a query prompt. Supports local files and URLs with adjustable detail level and parameters.
Instructions
Analyze an image using Doubao vision model via Volcengine Ark API. Supports both preset inference (model name) and custom inference (ep-xxxxx endpoint ID). Configure via DOUBAO_MODEL environment variable.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image | Yes | Image source: local file path (e.g. C:/path/to/screenshot.png) or URL (https://...) | |
| prompt | Yes | What to ask about the image. Be specific for best results. | |
| detail | No | Image detail level. 'high' for fine-grained analysis | auto |
| max_tokens | No | Maximum output tokens | |
| temperature | No | Sampling temperature (0-2) |