describe_image
Generates a natural language description of an image, with adjustable detail and model modes for both general and document-focused analysis.
Instructions
Describe an image in natural language using Florence-2.
Args: image_path: Absolute or relative path to the image file (supports PNG, JPEG, SVG). detail_level: 'normal' for a brief caption, 'high' for a detailed one. model_mode: 'fast' for Florence-2 (default), 'deep' for MiniCPM-V 4.6 (better document understanding).
Returns: Dict with description, model name, and prompt used.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| image_path | Yes | ||
| model_mode | No | fast | |
| detail_level | No | normal |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||