object-detection-by-text
Detect and count objects in an image using a text prompt, providing detailed descriptions and 2D coordinates for precise visual analysis.
Instructions
Analyze an image based on a text prompt to identify and count specific objects, and return detailed descriptions of the objects and their 2D coordinates.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
imageFileUri | Yes | URI of the input image. Preferred for remote or local files. Must start with "https://" or "file://". | |
includeDescription | Yes | Whether to return a description of the objects detected in the image, but will take longer to process. | |
textPrompt | Yes | Nouns of target objects (English only, avoid adjectives). Use periods to separate multiple categories (e.g., 'person.car.traffic light'). |