MCP Vision Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MCP Vision Serverdescribe what's in /home/user/photo.jpg"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP Vision Server · 视觉识别服务
基于 Kimi/Moonshot 视觉 API 的 MCP 服务器,作为 Claude Code 全局插件使用。传入本地图片路径,返回 AI 对图片内容的详细描述、文字提取等。
MCP server for image recognition via Kimi/Moonshot vision API. Works as a global Claude Code plugin.
中文
功能
describe_image — 识别图片内容,返回文字描述
describe_image_to_file — 识别并保存为 UTF-8 文件(解决 Windows 终端中文乱码)
支持 PNG / JPG / GIF / WebP / BMP,最大 20MB
支持自定义提示词(如"提取所有文字""描述图表结构")
安装
pip install mcp-vision-server或从源码安装:
git clone https://github.com/coffe-d/MCP-Vision-Server.git
cd mcp-vision-server
pip install -e .获取 API Key
在 Moonshot 开放平台 注册并创建 API Key。
注册到 Claude Code
claude mcp add vision-server \
--env KIMI_API_KEY="sk-你的密钥" \
-- mcp-vision-server注册后 Claude Code 即可使用 describe_image 和 describe_image_to_file 两个工具。
配置
环境变量 | 必填 | 默认值 | 说明 |
| 是 | — | Moonshot API 密钥 |
| 否 |
| API 地址 |
| 否 |
| 模型名称 |
工具说明
describe_image — 识别图片,返回文本描述。
参数 | 类型 | 必填 | 默认值 | 说明 |
| string | 是 | — | 图片绝对路径 |
| string | 否 | — | 自定义提示词 |
| int | 否 | 4096 | 最大输出长度 |
describe_image_to_file — 识别图片,结果保存为 UTF-8 文件。适合中文环境避免终端乱码。
参数 | 类型 | 必填 | 默认值 | 说明 |
| string | 是 | — | 图片绝对路径 |
| string | 否 | 自动(同名 .md) | 输出文件路径 |
常见问题
"KIMI_API_KEY environment variable is not set"
未设置环境变量。注册时确保使用了 --env KIMI_API_KEY="sk-..."。
终端中文乱码
使用 describe_image_to_file 代替 describe_image,结果直接写入 UTF-8 文件。
"不支持的图片格式"
仅支持 PNG、JPG、JPEG、GIF、WebP、BMP 格式。
许可
MIT — 详见 LICENSE。
Related MCP server: Vision MCP
English
Features
describe_image — Recognize image content and return text description
describe_image_to_file — Recognize and save result to a UTF-8 file
Supports PNG / JPG / GIF / WebP / BMP up to 20MB
Customizable prompt for targeted extraction
Install
pip install mcp-vision-serverOr from source:
git clone https://github.com/coffe-d/MCP-Vision-Server.git
cd mcp-vision-server
pip install -e .Get an API key
Sign up at Moonshot Platform and create an API key.
Register with Claude Code
claude mcp add vision-server \
--env KIMI_API_KEY="sk-your-key-here" \
-- mcp-vision-serverConfiguration
Variable | Required | Default | Description |
| Yes | — | Moonshot API key |
| No |
| API base URL |
| No |
| Model name |
API Reference
describe_image — Return image description as text.
Parameter | Type | Required | Default | Description |
| string | Yes | — | Absolute path to image |
| string | No | — | Custom prompt |
| int | No | 4096 | Max output tokens |
describe_image_to_file — Save result to a UTF-8 file.
Parameter | Type | Required | Default | Description |
| string | Yes | — | Absolute path to image |
| string | No | auto (.md) | Output file path |
Troubleshooting
"KIMI_API_KEY environment variable is not set" — Make sure you passed --env KIMI_API_KEY="sk-..." when running claude mcp add.
Garbled Chinese in terminal — Use describe_image_to_file to write directly to UTF-8 file.
License
MIT — see LICENSE.
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
- Your AI Chatbot Just Exposed Your CEO's Salary to an InternBy Om-Shree-0709 on .Agent IdentityMCP SecurityOAuth Delegation
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/coffe-d/MCP-Vision-Server'
If you have feedback or need assistance with the MCP directory API, please join our Discord server