Vision MCP Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Vision MCP Serveranalyze this error screenshot and suggest a fix"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Vision MCP Server
Free, unlimited vision capabilities for your AI coding assistant using Groq API and Meta Llama 4 Vision model.
Features
Image Analysis - Understand and describe images
Text Extraction (OCR) - Extract text from screenshots, documents, photos
UI Analysis - Describe UI components, layouts, and design
Error Diagnosis - Analyze error screenshots and suggest fixes
Diagram Understanding - Interpret flowcharts, UML, architecture diagrams
Chart Analysis - Read charts and dashboards for insights
Image Comparison - Compare two images for differences
Code Extraction - Extract code from IDE screenshots
Installation
Prerequisites
Python 3.10 or higher
Free Groq API key
Get Groq API Key (Free)
Visit https://console.groq.com/keys
Sign up (free)
Create a new API key
Install Dependencies
cd vision-mcp-server
# Option 1: Using install script (recommended)
./install.sh
# Option 2: Manual installation
pip3 install mcp groq pillow aiofilesConfiguration
Claude Desktop
Add to ~/.claude/config.json:
{
"mcpServers": {
"vision-mcp-server": {
"command": "python",
"args": ["-m", "vision_mcp_server.server"],
"env": {
"GROQ_API_KEY": "your-groq-api-key-here"
}
}
}
}OpenCode
Add to OpenCode settings:
{
"$schema": "https://opencode.ai/config.json",
"mcp": {
"vision-mcp-server": {
"type": "local",
"command": ["python", "-m", "vision_mcp_server.server"],
"environment": {
"GROQ_API_KEY": "your-groq-api-key-here"
}
}
}
}Cline (VS Code)
Add to Cline settings:
{
"mcpServers": {
"vision-mcp-server": {
"command": "python",
"args": ["-m", "vision_mcp_server.server"],
"env": {
"GROQ_API_KEY": "your-groq-api-key-here"
}
}
}
}Usage
Analyze Image
Describe this image: screenshot.pngExtract Text
Extract text from this document: scan.jpgDiagnose Error
What's wrong with this error screenshot: error.pngUnderstand Diagram
Explain this architecture diagram: system-diagram.pngCompare Images
Compare these two UI screenshots: old-ui.png vs new-ui.pngAvailable Tools
analyze_image- General image analysisextract_text- OCR text extractiondescribe_ui- UI component analysisdiagnose_error- Error screenshot analysisunderstand_diagram- Diagram interpretationanalyze_chart- Chart and dashboard analysiscompare_images- Image comparisoncode_from_screenshot- Code extraction from screenshots
Models Used
meta-llama/llama-4-scout-17b-16e-instruct - Latest Meta Llama 4 vision model
Available for free via Groq API
No quotas, no limits
Superior vision capabilities and multimodal performance
Testing
Run locally:
export GROQ_API_KEY=your-api-key
python -m vision_mcp_server.serverLicense
MIT
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/itcomgroup/vision-mcp-server'
If you have feedback or need assistance with the MCP directory API, please join our Discord server