analyze_screenshot

Analyze page screenshots with custom prompts to extract specific information or identify elements, optionally focusing on a defined region for targeted insights.

Instructions

AI-powered analysis of page screenshots with custom prompts. Can focus on specific regions and provide contextual insights.

Input Schema

| Name | Required | Description | Default |
|---|---|---|---|
| page_id | Yes | Unique identifier of the page whose screenshot to analyze | |
| analysis_prompt | Yes | Specific prompt for screenshot analysis (e.g., 'identify all clickable buttons', 'find the search form') | |
| focus_region | No | Optional region of screenshot to focus analysis on | |
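
As a rough illustration of how the three parameters fit together, the sketch below builds an MCP `tools/call` request for this tool. The helper name `build_analyze_screenshot_call` and the example `page_id` value are hypothetical. The listing does not document the shape of `focus_region`, so the sketch passes it through unchanged rather than guessing its fields.

```python
import json

# Parameters the schema above marks as required.
REQUIRED_ARGS = ("page_id", "analysis_prompt")


def build_analyze_screenshot_call(page_id, analysis_prompt, focus_region=None, request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` request for analyze_screenshot.

    Hypothetical helper: it only assembles the request body and does not
    contact any server.
    """
    arguments = {"page_id": page_id, "analysis_prompt": analysis_prompt}

    # Reject empty values for the required parameters before sending anything.
    for name in REQUIRED_ARGS:
        if not arguments[name]:
            raise ValueError(f"{name} is required")

    # focus_region is optional; its structure is not documented in the
    # listing, so it is forwarded as-is (assumption).
    if focus_region is not None:
        arguments["focus_region"] = focus_region

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "analyze_screenshot", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_analyze_screenshot_call("page-1", "identify all clickable buttons")
    print(json.dumps(request, indent=2))
```

Because the listing gives no output schema, the sketch stops at the request; how the response should be parsed is left open.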
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries full burden. It mentions AI-powered analysis but fails to disclose whether it captures a new screenshot or uses an existing one, what the output format is, or any side effects. Minimal behavioral context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, no redundant words. Efficient and clear.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description should clarify return value. It does not. Also lacks context on whether a screenshot must be pre-captured or is taken automatically. Incomplete for effective use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds 'focus on specific regions' aligning with focus_region, but no new meaning beyond schema descriptions. No added value for page_id or analysis_prompt.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it performs AI-powered analysis of page screenshots using custom prompts, and can focus on regions. This distinguishes it from sibling tools like 'take_screenshot' (capture) and 'analyze_dom_structure' (DOM analysis), though 'contextual insights' is vague.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance on when to use this tool versus alternatives, such as other analysis tools. The description does not mention prerequisites, such as whether a screenshot must already exist, or when to prefer this over DOM-based analysis.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ZachHandley/ZMCPTools'
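
The same lookup can be sketched in Python using only the standard library. The function names `server_url` and `fetch_server` are hypothetical, and the sketch assumes the endpoint returns a JSON body, which the listing does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
API_BASE = "https://glama.ai/api/mcp/v1/servers"


def server_url(owner, name):
    """Build the directory API URL for one server (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    return f"{API_BASE}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_server(owner, name, timeout=10.0):
    """GET the server record and decode it.

    Assumption: the response body is JSON; the listing does not document
    the response format.
    """
    request = urllib.request.Request(server_url(owner, name), method="GET")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request when run directly.
    print(fetch_server("ZachHandley", "ZMCPTools"))
```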

If you have feedback or need assistance with the MCP directory API, please join our Discord server.