Vison-MCP
Provides vision AI capabilities (image analysis, OCR, UI-to-code, error diagnosis, etc.) via OpenAI's GPT-4o model.
MCP server for vision AI — screenshots to code, OCR, error diagnosis, and image analysis via OpenAI-compatible APIs.
Supported Tools
| Tool | Description |
| | General visual understanding: describe any image in detail |
| | OCR optimized for terminals, code, documents, and general content |
| | Convert UI screenshots into code, prompts, specs, or descriptions |
| | Analyze error screenshots and propose actionable fixes |
| | Interpret architecture diagrams, flowcharts, UML, ER, and system diagrams |
| | Read charts and dashboards to surface insights, trends, and anomalies |
| | Compare two UI screenshots to flag visual differences and implementation drift |
| | Inspect videos (MP4/MOV/M4V): scene detection, event analysis, content summarization |
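MCP clients invoke server tools through the protocol's JSON-RPC `tools/call` method. The sketch below shows what such a request could look like for the error-diagnosis capability listed above. The tool name `diagnose_error_screenshot` and the argument names `image_path` and `prompt` are hypothetical placeholders, since the actual identifiers are not shown here; only the JSON-RPC envelope follows the MCP specification.

```typescript
// JSON-RPC 2.0 envelope that MCP uses for tool invocation.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Build a tools/call request for a given tool name and argument object.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Hypothetical tool and argument names, for illustration only.
const exampleCall = buildToolCall(1, "diagnose_error_screenshot", {
  image_path: "/tmp/error.png",
  prompt: "Diagnose this error and suggest fixes",
});
console.log(JSON.stringify(exampleCall, null, 2));
```

In practice the MCP client (for example Claude Code) builds this message for you; the sketch is only meant to show what travels over the wire.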
Installation
git clone https://github.com/Lin-zhibo/Vison-MCP.git
cd Vison-MCP
npm install
npm run build
Configuration
Set the following environment variables:
| Variable | Required | Default | Description |
| VISIONAI_API_KEY | Yes | - | API authentication key |
| VISIONAI_BASE_URL | Yes | - | OpenAI-compatible API endpoint |
| VISIONAI_MODEL_NAME | No | | Vision model to use |
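As a rough illustration of how a server might read and validate these settings at startup, here is a minimal sketch. It is not the project's actual implementation: the `loadVisionConfig` function, the `VisionConfig` type, and the `gpt-4o` fallback are assumptions (the fallback simply mirrors the example configuration in the Usage section, not a confirmed default).

```typescript
interface VisionConfig {
  apiKey: string;
  baseUrl: string;
  modelName: string;
}

type Env = Record<string, string | undefined>;

// Assumption: mirrors the example configuration, not a confirmed default.
const ASSUMED_DEFAULT_MODEL = "gpt-4o";

// Validate required variables and normalize the values.
function loadVisionConfig(env: Env): VisionConfig {
  const required = ["VISIONAI_API_KEY", "VISIONAI_BASE_URL"];
  const missing = required.filter((key) => !env[key]);
  if (missing.length > 0) {
    throw new Error(`Missing required environment variables: ${missing.join(", ")}`);
  }
  return {
    apiKey: env["VISIONAI_API_KEY"] as string,
    // Strip trailing slashes so request paths can be appended consistently.
    baseUrl: (env["VISIONAI_BASE_URL"] as string).replace(/\/+$/, ""),
    modelName: env["VISIONAI_MODEL_NAME"] || ASSUMED_DEFAULT_MODEL,
  };
}
```

In a Node.js process you would call `loadVisionConfig(process.env)`. Taking the environment as a parameter keeps the function easy to test.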
Usage
With Claude Code
Add to your .claude/settings.json or claude_desktop_config.json:
{
  "mcpServers": {
    "vison-mcp": {
      "command": "node",
      "args": ["/path/to/Vison-MCP/dist/index.js"],
      "env": {
        "VISIONAI_API_KEY": "your-api-key",
        "VISIONAI_BASE_URL": "https://api.openai.com/v1",
        "VISIONAI_MODEL_NAME": "gpt-4o"
      }
    }
  }
}
Local Development
# Copy environment template
cp .env.example .env
# Edit .env with your API credentials
# Build and run
npm run build
npm start
Requirements
Node.js >= 18.0.0
An OpenAI-compatible vision API endpoint (GPT-4o, Claude Vision, or compatible)
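To make "OpenAI-compatible vision API endpoint" more concrete, the sketch below builds a request in the OpenAI Chat Completions format, with the image passed as a base64 data URL. It assumes the endpoint accepts that format at `/chat/completions`. The `buildVisionRequest` helper and its parameters are illustrative and not taken from the Vison-MCP source.

```typescript
interface VisionRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Build a Chat Completions request carrying a text prompt and one image.
function buildVisionRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  prompt: string,
  imageBase64: string,
  mimeType: string = "image/png"
): VisionRequest {
  const payload = {
    model,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          // Images are sent inline as a data URL.
          { type: "image_url", image_url: { url: `data:${mimeType};base64,${imageBase64}` } },
        ],
      },
    ],
  };
  return {
    url: `${baseUrl.replace(/\/+$/, "")}/chat/completions`,
    headers: { "Content-Type": "application/json", Authorization: `Bearer ${apiKey}` },
    body: JSON.stringify(payload),
  };
}
```

On Node.js 18 or later, the result can be sent with the built-in `fetch`: `fetch(req.url, { method: "POST", headers: req.headers, body: req.body })`.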
License
MIT