Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
IMAGE_PARSE_MODELNoMultimodal model namegpt-4o
IMAGE_PARSE_API_KEYYesAPI key for your provider
IMAGE_PARSE_BASE_URLNoAPI base URLhttps://api.openai.com/v1

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": false
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
analyze_imageA

Analyze an image with a multimodal LLM (GPT-4o, Claude, Gemini, etc.).

Provide an image (URL, local path, or base64) and a description of what you want to know. The tool calls an OpenAI-compatible vision API and returns the model's text response.

Use this tool whenever you have an image and need to:

  • Describe its contents

  • Extract text / OCR

  • Understand a chart, diagram, or data visualization

  • Analyse a UI screenshot (layout, elements, issues)

  • Identify objects, colours, people, or scenes in a photo

  • Compare or summarise visual information

Args: params (AnalyzeImageInput): - image_source (str): URL, local file path, or base64 image data. - prompt (str): What to analyze or extract from the image. - mime_type (Optional[str]): Override auto-detected MIME type.

Returns: str: The multimodal model's analysis as plain text / Markdown.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/1617110693/Image-Parse-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server