Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
VISIONAI_API_KEYYesAPI authentication key
VISIONAI_BASE_URLYesOpenAI-compatible API endpoint
VISIONAI_MODEL_NAMENoVision model to usegpt-4o

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}

Tools

Functions exposed to the LLM to take actions

NameDescription
image_analysisB

Analyze any image with a general vision model. Returns a detailed description of the image content, key elements, text, colors, layout, and context clues.

extract_text_from_screenshotA

Extract text from screenshots. Optimized for terminals, code editors, documents, and general content. Returns extracted text preserving original structure.

ui_to_artifactA

Convert UI screenshots into structured deliverables: production-ready code, image-generation prompts, technical specifications, or detailed descriptions.

diagnose_error_screenshotA

Analyze error screenshots (build errors, runtime errors, stack traces) and propose actionable fixes with root cause analysis.

understand_technical_diagramA

Interpret architecture diagrams, flowcharts, UML, ER, sequence, and system topology diagrams. Returns structured analysis of components, relationships, design patterns, and improvement suggestions.

analyze_data_visualizationA

Read charts, dashboards, and statistical visualizations to surface insights, trends, patterns, and anomalies with actionable recommendations.

ui_diff_checkA

Compare two UI screenshots — design vs implementation — to identify visual differences, layout drift, style inconsistencies, missing elements, and typography discrepancies.

video_analysisA

Inspect videos (local files ≤8MB, remote URLs) to describe scenes, detect events, and answer questions about visual moments. Supports MP4, MOV, M4V.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Lin-zhibo/Vison-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server