Which integrations are available for this server?

Allows analyzing images using Google Gemini's vision models (via OpenAI-compatible endpoint) to extract information, describe contents, perform OCR, and interpret visual data. Allows analyzing images using local vision models served by Ollama (e.g., llava) to extract information, describe contents, perform OCR, and interpret visual data. Allows analyzing images using OpenAI's vision models (e.g., GPT-4o, GPT-4-vision) to extract information, describe contents, perform OCR, and interpret visual data.

How do I use Image Parse MCP?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Image Parse MCP Can you analyze this screenshot for UI issues? URL: https://example.com/ui.png" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

Image Parse MCP

by 1617110693

Overview Schema Related Servers Score Discussions

Python

Remote

Image Parse MCP

中文文档

A multimodal image analysis MCP server that connects to any OpenAI-compatible vision API. Provide an image (URL, local path, or base64) and a prompt — get back a detailed analysis from the multimodal LLM of your choice.

Supported Providers

Any provider with an OpenAI-compatible chat completions endpoint:

OpenAI — GPT-4o, GPT-4-vision, GPT-4.1-mini
Anthropic (via compatible proxy / gateway)
Google Gemini (via OpenAI-compatible endpoint)
Azure OpenAI
Local models (Ollama, vLLM, LM Studio with OpenAI-compatible servers)
Alibaba Bailian — Qwen-VL (via OpenAI-compatible endpoint)
Third-party (DeepSeek, Groq, Together.ai, OpenRouter, etc.)

Related MCP server: Vision MCP

Configuration

Set these environment variables before launching the server:

Variable	Required	Default	Description
`IMAGE_PARSE_API_KEY`	Yes	—	API key for your provider
`IMAGE_PARSE_BASE_URL`	No	`https://api.openai.com/v1`	API base URL
`IMAGE_PARSE_MODEL`	No	`gpt-4o`	Multimodal model name

Example: OpenAI

export IMAGE_PARSE_API_KEY=sk-...
export IMAGE_PARSE_BASE_URL=https://api.openai.com/v1
export IMAGE_PARSE_MODEL=gpt-4o

Example: Google Gemini (via AI Studio)

export IMAGE_PARSE_API_KEY=your-gemini-api-key
export IMAGE_PARSE_BASE_URL=https://generativelanguage.googleapis.com/v1beta/openai
export IMAGE_PARSE_MODEL=gemini-2.5-flash

Example: Ollama (local)

export IMAGE_PARSE_API_KEY=ollama
export IMAGE_PARSE_BASE_URL=http://localhost:11434/v1
export IMAGE_PARSE_MODEL=llava

Example: Azure OpenAI

export IMAGE_PARSE_API_KEY=your-azure-api-key
export IMAGE_PARSE_BASE_URL=https://your-resource.openai.azure.com/openai/deployments/your-deployment
export IMAGE_PARSE_MODEL=gpt-4o

Example: Alibaba Bailian (Qwen-VL)

export IMAGE_PARSE_API_KEY=your-dashscope-api-key
export IMAGE_PARSE_BASE_URL=https://dashscope.aliyuncs.com/compatible-mode/v1
export IMAGE_PARSE_MODEL=qwen-vl-max

Example: DeepSeek

export IMAGE_PARSE_API_KEY=your-deepseek-api-key
export IMAGE_PARSE_BASE_URL=https://api.deepseek.com/v1
export IMAGE_PARSE_MODEL=deepseek-chat

Install & Run

# Clone or enter the project directory
cd image-parse

# Run directly (uv handles venv + deps automatically)
uv run image-parse-mcp

Claude Code Configuration

Add to your Claude Code MCP config (~/.claude/claude.json or project .mcp.json):

{
  "mcpServers": {
    "image-parse": {
      "type": "stdio",
      "command": "uv",
      "args": ["run", "--directory", "path/to/image-parse", "image-parse-mcp"],
      "env": {
        "IMAGE_PARSE_API_KEY": "sk-...",
        "IMAGE_PARSE_BASE_URL": "https://api.openai.com/v1",
        "IMAGE_PARSE_MODEL": "gpt-4o"
      }
    }
  }
}

Tool: `analyze_image`

Parameter	Required	Description
`image_source`	Yes	URL, local file path, base64 string, or data URI
`prompt`	Yes	What to analyze / extract from the image
`mime_type`	No	Override auto-detected MIME type (e.g. `image/webp`)

What agents use it for

Describe the contents of an image
Extract text from a screenshot (OCR)
Read and interpret charts, graphs, data visualizations
Analyze UI screenshots (layout, elements, issues)
Identify objects, colours, people, or scenes in photos
Compare visual information across multiple images
Diagnose errors from error screenshots

Input forms for `image_source`

# URL
https://example.com/screenshot.png

# Local file path (on the host machine)
/Users/me/Downloads/chart.png

# Base64 data URI
data:image/png;base64,iVBORw0KGgo...

# Raw base64
iVBORw0KGgo...

Development

# Create venv and install deps
uv venv
uv pip install -e .

# Run tests
uv run python -m pytest

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

analyze_imageA

Related MCP Servers

Computer Vision MCP Server
Image & Video Processing Multimedia Processing Autonomous Agents
samhains
F
license
C
quality
D
maintenance
Enables image captioning and analysis through natural language by processing images from URLs or local files. Supports both OpenRouter's Gemini 2.5 Flash and local vision models for generating concise, descriptive captions.
Last updated 2026-05-25
4
Vision MCP
Image & Video Processing Multimedia Processing
i-richardwang
A
license
A
quality
D
maintenance
Enables image analysis and understanding using Vision Language Models through OpenAI-compatible APIs. Supports analyzing images from URLs or local files with custom prompts.
Last updated 2025-12-26
1
2
MIT
Kolosal Vision MCP
Image & Video Processing Multimedia Processing
madebyaris
A
license
-
quality
D
maintenance
Provides AI-powered image analysis and OCR capabilities using the Kolosal Vision API. Supports analyzing images from URLs, local files, or base64 data with natural language queries for object detection, scene description, text extraction, and visual assessment.
Last updated 2025-12-14
14
MIT
Vision MCP Server
Image & Video Processing Autonomous Agents
TheNomadInOrbit
A
license
B
quality
D
maintenance
Enables vision capabilities for any AI model by routing image analysis requests through OpenRouter's vision models. It provides tools to analyze images from URLs, local file paths, or base64 data.
Last updated 2025-10-09
2
91
17
MIT

View all related MCP servers

Related MCP Connectors

Frenchie
OCR, transcription, file extraction, and image generation for AI agents via MCP.
huuthangntk-claude-vision-mcp-server
Analyze images from multiple angles to extract detailed insights or quick summaries. Describe visu…
picdefenseio-mcp-server
Image risk scoring, EXIF, reverse-image backlinks, and image content detection via PicDefense.io.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/1617110693/Image-Parse-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

Image Parse MCP

Supported Providers

Configuration

Example: OpenAI

Example: Google Gemini (via AI Studio)

Example: Ollama (local)

Example: Azure OpenAI

Example: Alibaba Bailian (Qwen-VL)

Example: DeepSeek

Install & Run

Claude Code Configuration

Tool: analyze_image

What agents use it for

Input forms for image_source

Development

Maintenance

Resources

Looking for Admin?

Tools

Related MCP Servers

Computer Vision MCP Server

Vision MCP

Kolosal Vision MCP

Vision MCP Server

Related MCP Connectors

Latest Blog Posts

MCP directory API

Tool: `analyze_image`

Input forms for `image_source`