describe_image
Describes an image supplied as a local file path, base64 data, or a URL, and returns a structured text description from a multimodal model for use by text-only agents.
Instructions
[For GLM/DeepSeek series models only] Describe an image for a text-only agent. Accepts a local path, base64 image data, or URL.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image | Yes | The image to describe: a local path, base64 image data, or URL. | |
| instruction | No | Optional extra instruction for what to describe. | |
| focus | No | Optional visual area or topic to focus on. | |
| detail | No | How much visual detail to request from the vision model. | medium |
| maxTokens | No | ||
| model | No | Optional Claude model override. Defaults to VISIONTOOL_MODEL or claude-opus-4-8. | |
| _caller_model | No | [Restriction] Name of the calling model. Only GLM / DeepSeek series models may call this tool; calls from other models are rejected. | |
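
The parameters above can be assembled into an arguments object roughly as follows. This is a minimal sketch, not the tool's own client code: the helper `build_describe_image_args`, the example URL, and the caller name `glm-4` are illustrative assumptions, and how the resulting object is sent to the tool depends on the host application. Only the parameter names are taken from the table.

```python
import json

# Parameter names are taken from the Input Schema table; the rest is illustrative.
ALLOWED_KEYS = {"image", "instruction", "focus", "detail",
                "maxTokens", "model", "_caller_model"}


def build_describe_image_args(image, caller_model=None, **optional):
    """Assemble an arguments object for describe_image (hypothetical helper)."""
    if not image:
        raise ValueError("'image' is required: a local path, base64 image data, or URL")
    unknown = set(optional) - ALLOWED_KEYS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    args = {"image": image}
    # Leave unset optional values out so the tool's own defaults apply
    # (for example, detail falls back to "medium").
    args.update({k: v for k, v in optional.items() if v is not None})
    if caller_model is not None:
        # The exact model-name strings the tool accepts are not documented here.
        args["_caller_model"] = caller_model
    return args


args = build_describe_image_args(
    "https://example.com/diagram.png",   # placeholder URL
    caller_model="glm-4",                # assumed example caller name
    instruction="List the labelled components.",
    focus="top-left legend",
)
print(json.dumps(args, indent=2, ensure_ascii=False))
```

Omitting `detail`, `maxTokens`, and `model` keeps the request small and defers to whatever defaults the tool applies on its side.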