analyze_webpage_screenshot
Need to understand a webpage from a screenshot? Analyze it to extract content, layout, and interactive elements, with optional focus on specific areas and accessibility analysis.
Instructions
Specialized tool for analyzing webpage screenshots. Extracts content, layout information, and interactive elements from web pages.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| data | Yes | The webpage screenshot data (base64 string, file path, or URL) | |
| type | Yes | The type of image input | |
| format | No | Output format (default: json for structured webpage analysis) | |
| mimeType | No | MIME type of the image (required for base64 input) | |
| focusArea | No | Specific area to focus on (optional) | |
| maxTokens | No | Maximum tokens in response (default: 4000) | |
| includeAccessibility | No | Include accessibility analysis (default: true) |