Vison-MCP
Server Configuration
Describes the environment variables used to configure the server.
| Name | Required | Description | Default |
|---|---|---|---|
| VISIONAI_API_KEY | Yes | API authentication key | |
| VISIONAI_BASE_URL | Yes | OpenAI-compatible API endpoint | |
| VISIONAI_MODEL_NAME | No | Vision model to use | gpt-4o |
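As a rough illustration of how the three variables in the table fit together, the sketch below reads them from the environment, rejects a missing required value, and falls back to `gpt-4o` for the model name. The `VisionConfig` class and `load_config` function are hypothetical names introduced for this example; they are not taken from the server's source code.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class VisionConfig:
    """Hypothetical container for the settings listed in the table above."""
    api_key: str
    base_url: str
    model_name: str = "gpt-4o"


def load_config(env: Optional[Mapping[str, str]] = None) -> VisionConfig:
    """Build a VisionConfig from environment variables.

    Raises ValueError when a variable marked "Yes" under Required is unset or empty.
    """
    env = os.environ if env is None else env
    required = ("VISIONAI_API_KEY", "VISIONAI_BASE_URL")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise ValueError("Missing required environment variables: " + ", ".join(missing))
    return VisionConfig(
        api_key=env["VISIONAI_API_KEY"],
        base_url=env["VISIONAI_BASE_URL"],
        # Optional variable: fall back to the documented default.
        model_name=env.get("VISIONAI_MODEL_NAME") or "gpt-4o",
    )
```

Passing a plain dictionary instead of relying on `os.environ` makes the function easy to exercise without touching the real environment.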
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | `{"listChanged": true}` |
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| image_analysis | Analyze any image with a general vision model. Returns a detailed description of the image content, key elements, text, colors, layout, and context clues. |
| extract_text_from_screenshot | Extract text from screenshots. Optimized for terminals, code editors, documents, and general content. Returns extracted text preserving original structure. |
| ui_to_artifact | Convert UI screenshots into structured deliverables: production-ready code, image-generation prompts, technical specifications, or detailed descriptions. |
| diagnose_error_screenshot | Analyze error screenshots (build errors, runtime errors, stack traces) and propose actionable fixes with root cause analysis. |
| understand_technical_diagram | Interpret architecture diagrams, flowcharts, UML, ER, sequence, and system topology diagrams. Returns structured analysis of components, relationships, design patterns, and improvement suggestions. |
| analyze_data_visualization | Read charts, dashboards, and statistical visualizations to surface insights, trends, patterns, and anomalies with actionable recommendations. |
| ui_diff_check | Compare two UI screenshots (design vs. implementation) to identify visual differences, layout drift, style inconsistencies, missing elements, and typography discrepancies. |
| video_analysis | Inspect videos (local files ≤8MB, remote URLs) to describe scenes, detect events, and answer questions about visual moments. Supports MP4, MOV, M4V. |
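The video tool's stated limits (local files up to 8MB; MP4, MOV, or M4V) can be checked on the client side before a call is made. The sketch below is one possible pre-flight check, not part of the server. The function name `check_local_video` is hypothetical, and reading "8MB" as 8 × 1024 × 1024 bytes is an assumption; the server may count the limit differently.

```python
from pathlib import Path

# Assumption: "8MB" is interpreted as 8 MiB. The server's exact threshold is not documented here.
MAX_LOCAL_VIDEO_BYTES = 8 * 1024 * 1024
SUPPORTED_VIDEO_SUFFIXES = {".mp4", ".mov", ".m4v"}


def check_local_video(path: str) -> list[str]:
    """Return a list of problems with a local video file; an empty list means it passes."""
    problems = []
    video = Path(path)
    # Compare the extension case-insensitively against the documented formats.
    if video.suffix.lower() not in SUPPORTED_VIDEO_SUFFIXES:
        problems.append(f"unsupported format: {video.suffix or '(none)'}")
    if not video.is_file():
        problems.append("file not found")
    elif video.stat().st_size > MAX_LOCAL_VIDEO_BYTES:
        problems.append(f"file exceeds {MAX_LOCAL_VIDEO_BYTES} bytes")
    return problems
```

Returning a list rather than raising on the first failure lets a caller report every issue at once, for example both a wrong extension and an oversized file.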
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
| No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
| No resources | |
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/Lin-zhibo/Vison-MCP'
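The same request can be issued from Python with only the standard library. This is a minimal sketch: the helper names `build_server_request` and `fetch_server` are hypothetical, and the example assumes the endpoint returns a JSON body, which the page does not state explicitly. No response fields are assumed.

```python
import json
import urllib.request

API_ROOT = "https://glama.ai/api/mcp/v1/servers"


def build_server_request(owner: str, name: str) -> urllib.request.Request:
    """Prepare a GET request for one server entry, mirroring the curl command above."""
    url = f"{API_ROOT}/{owner}/{name}"
    return urllib.request.Request(url, method="GET", headers={"Accept": "application/json"})


def fetch_server(owner: str, name: str, timeout: float = 10.0):
    """Send the request and decode the body, assuming it is JSON."""
    request = build_server_request(owner, name)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_server("Lin-zhibo", "Vison-MCP")` performs the network request; `build_server_request` alone can be used to inspect the URL without sending anything.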
If you have feedback or need assistance with the MCP directory API, please join our Discord server.