analyze_webpage_screenshot
Analyze webpage screenshots to extract content, layout, and interactive elements for structured analysis of web page structure and components.
Instructions
Specialized tool for analyzing webpage screenshots. Extracts content, layout information, and interactive elements from web pages.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
data | Yes | The webpage screenshot data (base64 string, file path, or URL) | |
focusArea | No | Specific area to focus on (optional) | |
format | No | Output format (default: json for structured webpage analysis) | |
includeAccessibility | No | Include accessibility analysis (default: true) | |
maxTokens | No | Maximum tokens in response (default: 4000) | |
mimeType | No | MIME type of the image (required for base64 input) | |
type | Yes | The type of image input |
Input Schema (JSON Schema)
{
"properties": {
"data": {
"description": "The webpage screenshot data (base64 string, file path, or URL)",
"type": "string"
},
"focusArea": {
"description": "Specific area to focus on (optional)",
"enum": [
"layout",
"content",
"navigation",
"forms",
"interactive",
"accessibility"
],
"type": "string"
},
"format": {
"description": "Output format (default: json for structured webpage analysis)",
"enum": [
"text",
"json"
],
"type": "string"
},
"includeAccessibility": {
"description": "Include accessibility analysis (default: true)",
"type": "boolean"
},
"maxTokens": {
"description": "Maximum tokens in response (default: 4000)",
"type": "number"
},
"mimeType": {
"description": "MIME type of the image (required for base64 input)",
"type": "string"
},
"type": {
"description": "The type of image input",
"enum": [
"base64",
"file",
"url"
],
"type": "string"
}
},
"required": [
"type",
"data"
],
"type": "object"
}