Skip to main content
Glama

describe_image_to_file

Describes image content and writes the result to a UTF-8 text file, bypassing terminal encoding issues on Windows. Automatically generates a .md file next to the image when no output path is specified.

Instructions

识别图片内容并保存到文件(解决 Windows 终端中文乱码问题)。

将AI识别结果直接写入UTF-8编码的文本文件,完全绕过终端编码问题。 如果未指定输出路径,默认在原图同目录下生成同名的 .md 文件。

参数:

  • image_path: 图片的绝对路径

  • output_path: 输出文件的路径(可选,默认与原图同名 .md)

  • prompt: 自定义提示词

  • max_tokens: 最大输出长度

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
image_pathYes
output_pathNo
promptNo
max_tokensNo

Output Schema

TableJSON Schema
NameRequiredDescriptionDefault
resultYes
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses writing UTF-8 files, default output path, and parameters. However, it does not mention overwrite behavior, error handling, or permission requirements, which are relevant for a file-writing tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise, with a clear opening statement and a bullet list for parameters. It fronts the main purpose and encoding benefit. Minor improvement could be more structured, but it is efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers the tool's core function, default behavior, and parameter details. Given an output schema exists (not shown), it does not need to explain return values. It is complete for a file-writing tool with a sibling, though could add error conditions.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate. It explains each parameter's purpose (image_path absolute, output_path optional default, prompt custom, max_tokens limit). While not exhaustive (e.g., missing image format constraints), it adds meaning beyond the schema titles.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool recognizes image content and saves to a file, solving Windows terminal encoding issues. It explicitly distinguishes itself from the sibling tool 'describe_image' by focusing on file output.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains when to use this tool (to avoid encoding issues, for file output) and implies the alternative (describe_image) for terminal display. However, it does not explicitly state when not to use or provide exclusion criteria.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/coffe-d/MCP-Vision-Server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server