extract_region
Crop and analyze a specific region of an image to focus on error popups, charts, or UI components, saving tokens and producing targeted results.
Instructions
Extract and analyze a specific region of an image. Use this when a coding agent needs to focus on a particular area of a screenshot, diagram, or UI — such as an error popup, a specific chart, a navigation bar, or a single UI component. Specify the region as pixel coordinates (x, y, width, height). The region is cropped from the original image before being sent to the vision provider, saving tokens and producing more focused results.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | general | |
| prompt | No | ||
| region | Yes | ||
| image_url | No | ||
| image_path | No | ||
| detail_level | No | standard |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| graph | No | ||
| tables | No | ||
| mermaid | No | ||
| summary | Yes | ||
| provider | Yes | ||
| inferences | No | ||
| observations | No | ||
| uncertainties | No | ||
| security_notes | No | ||
| recommended_next_steps | No |