Image Parse MCP
Allows analyzing images using Google Gemini's vision models (via OpenAI-compatible endpoint) to extract information, describe contents, perform OCR, and interpret visual data.
Allows analyzing images using local vision models served by Ollama (e.g., llava) to extract information, describe contents, perform OCR, and interpret visual data.
Allows analyzing images using OpenAI's vision models (e.g., GPT-4o, GPT-4-vision) to extract information, describe contents, perform OCR, and interpret visual data.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Image Parse MCPCan you analyze this screenshot for UI issues? URL: https://example.com/ui.png"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Image Parse MCP
A multimodal image analysis MCP server that connects to any OpenAI-compatible vision API. Provide an image (URL, local path, or base64) and a prompt — get back a detailed analysis from the multimodal LLM of your choice.
Supported Providers
Any provider with an OpenAI-compatible chat completions endpoint:
OpenAI — GPT-4o, GPT-4-vision, GPT-4.1-mini
Anthropic (via compatible proxy / gateway)
Google Gemini (via OpenAI-compatible endpoint)
Azure OpenAI
Local models (Ollama, vLLM, LM Studio with OpenAI-compatible servers)
Alibaba Bailian — Qwen-VL (via OpenAI-compatible endpoint)
Third-party (DeepSeek, Groq, Together.ai, OpenRouter, etc.)
Related MCP server: Vision MCP
Configuration
Set these environment variables before launching the server:
Variable | Required | Default | Description |
| Yes | — | API key for your provider |
| No |
| API base URL |
| No |
| Multimodal model name |
Example: OpenAI
export IMAGE_PARSE_API_KEY=sk-...
export IMAGE_PARSE_BASE_URL=https://api.openai.com/v1
export IMAGE_PARSE_MODEL=gpt-4oExample: Google Gemini (via AI Studio)
export IMAGE_PARSE_API_KEY=your-gemini-api-key
export IMAGE_PARSE_BASE_URL=https://generativelanguage.googleapis.com/v1beta/openai
export IMAGE_PARSE_MODEL=gemini-2.5-flashExample: Ollama (local)
export IMAGE_PARSE_API_KEY=ollama
export IMAGE_PARSE_BASE_URL=http://localhost:11434/v1
export IMAGE_PARSE_MODEL=llavaExample: Azure OpenAI
export IMAGE_PARSE_API_KEY=your-azure-api-key
export IMAGE_PARSE_BASE_URL=https://your-resource.openai.azure.com/openai/deployments/your-deployment
export IMAGE_PARSE_MODEL=gpt-4oExample: Alibaba Bailian (Qwen-VL)
export IMAGE_PARSE_API_KEY=your-dashscope-api-key
export IMAGE_PARSE_BASE_URL=https://dashscope.aliyuncs.com/compatible-mode/v1
export IMAGE_PARSE_MODEL=qwen-vl-maxExample: DeepSeek
export IMAGE_PARSE_API_KEY=your-deepseek-api-key
export IMAGE_PARSE_BASE_URL=https://api.deepseek.com/v1
export IMAGE_PARSE_MODEL=deepseek-chatInstall & Run
# Clone or enter the project directory
cd image-parse
# Run directly (uv handles venv + deps automatically)
uv run image-parse-mcpClaude Code Configuration
Add to your Claude Code MCP config (~/.claude/claude.json or project .mcp.json):
{
"mcpServers": {
"image-parse": {
"type": "stdio",
"command": "uv",
"args": ["run", "--directory", "path/to/image-parse", "image-parse-mcp"],
"env": {
"IMAGE_PARSE_API_KEY": "sk-...",
"IMAGE_PARSE_BASE_URL": "https://api.openai.com/v1",
"IMAGE_PARSE_MODEL": "gpt-4o"
}
}
}
}Tool: analyze_image
Parameter | Required | Description |
| Yes | URL, local file path, base64 string, or data URI |
| Yes | What to analyze / extract from the image |
| No | Override auto-detected MIME type (e.g. |
What agents use it for
Describe the contents of an image
Extract text from a screenshot (OCR)
Read and interpret charts, graphs, data visualizations
Analyze UI screenshots (layout, elements, issues)
Identify objects, colours, people, or scenes in photos
Compare visual information across multiple images
Diagnose errors from error screenshots
Input forms for image_source
# URL
https://example.com/screenshot.png
# Local file path (on the host machine)
/Users/me/Downloads/chart.png
# Base64 data URI
data:image/png;base64,iVBORw0KGgo...
# Raw base64
iVBORw0KGgo...Development
# Create venv and install deps
uv venv
uv pip install -e .
# Run tests
uv run python -m pytestMaintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Tools
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/1617110693/Image-Parse-MCP'
If you have feedback or need assistance with the MCP directory API, please join our Discord server