MCP Vision Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MCP Vision Serverdescribe what's in /home/user/photo.jpg"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP Vision Server · 视觉识别服务
基于 Kimi/Moonshot 视觉 API 的 MCP 服务器,作为 Claude Code 全局插件使用。传入本地图片路径,返回 AI 对图片内容的详细描述、文字提取等。
MCP server for image recognition via Kimi/Moonshot vision API. Works as a global Claude Code plugin.
中文
功能
describe_image — 识别图片内容,返回文字描述
describe_image_to_file — 识别并保存为 UTF-8 文件(解决 Windows 终端中文乱码)
支持 PNG / JPG / GIF / WebP / BMP,最大 20MB
支持自定义提示词(如"提取所有文字""描述图表结构")
安装
pip install mcp-vision-server或从源码安装:
git clone https://github.com/coffe-d/MCP-Vision-Server.git
cd mcp-vision-server
pip install -e .获取 API Key
在 Moonshot 开放平台 注册并创建 API Key。
注册到 Claude Code
claude mcp add vision-server \
--env KIMI_API_KEY="sk-你的密钥" \
-- mcp-vision-server注册后 Claude Code 即可使用 describe_image 和 describe_image_to_file 两个工具。
配置
环境变量 | 必填 | 默认值 | 说明 |
| 是 | — | Moonshot API 密钥 |
| 否 |
| API 地址 |
| 否 |
| 模型名称 |
工具说明
describe_image — 识别图片,返回文本描述。
参数 | 类型 | 必填 | 默认值 | 说明 |
| string | 是 | — | 图片绝对路径 |
| string | 否 | — | 自定义提示词 |
| int | 否 | 4096 | 最大输出长度 |
describe_image_to_file — 识别图片,结果保存为 UTF-8 文件。适合中文环境避免终端乱码。
参数 | 类型 | 必填 | 默认值 | 说明 |
| string | 是 | — | 图片绝对路径 |
| string | 否 | 自动(同名 .md) | 输出文件路径 |
常见问题
"KIMI_API_KEY environment variable is not set"
未设置环境变量。注册时确保使用了 --env KIMI_API_KEY="sk-..."。
终端中文乱码
使用 describe_image_to_file 代替 describe_image,结果直接写入 UTF-8 文件。
"不支持的图片格式"
仅支持 PNG、JPG、JPEG、GIF、WebP、BMP 格式。
许可
MIT — 详见 LICENSE。
English
Features
describe_image — Recognize image content and return text description
describe_image_to_file — Recognize and save result to a UTF-8 file
Supports PNG / JPG / GIF / WebP / BMP up to 20MB
Customizable prompt for targeted extraction
Install
pip install mcp-vision-serverOr from source:
git clone https://github.com/coffe-d/MCP-Vision-Server.git
cd mcp-vision-server
pip install -e .Get an API key
Sign up at Moonshot Platform and create an API key.
Register with Claude Code
claude mcp add vision-server \
--env KIMI_API_KEY="sk-your-key-here" \
-- mcp-vision-serverConfiguration
Variable | Required | Default | Description |
| Yes | — | Moonshot API key |
| No |
| API base URL |
| No |
| Model name |
API Reference
describe_image — Return image description as text.
Parameter | Type | Required | Default | Description |
| string | Yes | — | Absolute path to image |
| string | No | — | Custom prompt |
| int | No | 4096 | Max output tokens |
describe_image_to_file — Save result to a UTF-8 file.
Parameter | Type | Required | Default | Description |
| string | Yes | — | Absolute path to image |
| string | No | auto (.md) | Output file path |
Troubleshooting
"KIMI_API_KEY environment variable is not set" — Make sure you passed --env KIMI_API_KEY="sk-..." when running claude mcp add.
Garbled Chinese in terminal — Use describe_image_to_file to write directly to UTF-8 file.
License
MIT — see LICENSE.
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/coffe-d/MCP-Vision-Server'
If you have feedback or need assistance with the MCP directory API, please join our Discord server