Schema | Image Parse MCP

Image Parse MCP

Overview Schema Related Servers Score Discussions

Server Configuration

Describes the environment variables required to run the server.

Name	Required	Description	Default
`IMAGE_PARSE_MODEL`	No	Multimodal model name	gpt-4o
`IMAGE_PARSE_API_KEY`	Yes	API key for your provider
`IMAGE_PARSE_BASE_URL`	No	API base URL	https://api.openai.com/v1

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": false }
`prompts`	{ "listChanged": false }
`resources`	{ "subscribe": false, "listChanged": false }
`experimental`	{}

Tools

Functions exposed to the LLM to take actions

Name	Description
analyze_imageA	Analyze an image with a multimodal LLM (GPT-4o, Claude, Gemini, etc.). Provide an image (URL, local path, or base64) and a description of what you want to know. The tool calls an OpenAI-compatible vision API and returns the model's text response. Use this tool whenever you have an image and need to: Describe its contents Extract text / OCR Understand a chart, diagram, or data visualization Analyse a UI screenshot (layout, elements, issues) Identify objects, colours, people, or scenes in a photo Compare or summarise visual information Args: params (AnalyzeImageInput): - image_source (str): URL, local file path, or base64 image data. - prompt (str): What to analyze or extract from the image. - mime_type (Optional[str]): Override auto-detected MIME type. Returns: str: The multimodal model's analysis as plain text / Markdown.

Name

Description

analyze_imageA

Analyze an image with a multimodal LLM (GPT-4o, Claude, Gemini, etc.).

Provide an image (URL, local path, or base64) and a description of what you want to know. The tool calls an OpenAI-compatible vision API and returns the model's text response.

Use this tool whenever you have an image and need to:

Describe its contents
Extract text / OCR
Understand a chart, diagram, or data visualization
Analyse a UI screenshot (layout, elements, issues)
Identify objects, colours, people, or scenes in a photo
Compare or summarise visual information

Args: params (AnalyzeImageInput): - image_source (str): URL, local file path, or base64 image data. - prompt (str): What to analyze or extract from the image. - mime_type (Optional[str]): Override auto-detected MIME type.

Returns: str: The multimodal model's analysis as plain text / Markdown.

Prompts

Interactive templates invoked by user choice

Name	Description
No prompts

Resources

Contextual data attached and managed by the client

Name	Description
No resources

Server Configuration
Capabilities
Tools
Prompts
Resources

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/1617110693/Image-Parse-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server