Skip to main content
Glama

doubao_vision_analyze

Analyze local screenshots with a vision model to detect UI issues, text, layout, and accessibility problems. Provides insights for code improvements.

Instructions

Analyze a local image or screenshot with a Doubao vision model.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
promptNoWhat to inspect in the image.请详细描述这张图片。如果这是 UI 截图,请指出可见文字、布局问题、交互问题、可访问性问题,以及可能需要修改的代码方向。请用清晰的中文分点回答。
image_pathYesAbsolute path to the local image file.
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description must disclose behavioral traits. It only states that analysis occurs, but does not mention if the tool is read-only, requires network access, has rate limits, or what the output format is. Significant behavioral gaps exist.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence with no fluff, front-loading the purpose. However, it is overly concise and omits critical contextual information that a few extra sentences could provide.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The tool has no output schema and no annotations, so the description should cover what the tool returns and any important context. It does not mention return format, error behavior, or file size limits, leaving the agent with incomplete information.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% and the schema descriptions are clear ('What to inspect in the image', 'Absolute path to the local image file'). The tool description adds no additional meaning beyond the schema, so a baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's action (analyze) and resource (local image or screenshot) using a specific model (Doubao vision). However, the verb 'analyze' is somewhat generic and could be more specific about the type of analysis.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided on when to use this tool, prerequisites (e.g., supported image formats, file existence), or when not to use it. The absence of sibling tools reduces the need for alternative comparisons, but usage context is still missing.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/amazing1102/doubao-tools-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server