Skip to main content
Glama

MCP Read Images

MCP Read Images

An MCP server for analyzing images using OpenRouter vision models. This server provides a simple interface to analyze images using various vision models like Claude-3.5-sonnet and Claude-3-opus through the OpenRouter API.

Installation

npm install @catalystneuro/mcp_read_images

Configuration

The server requires an OpenRouter API key. You can get one from OpenRouter.

Add the server to your MCP settings file (usually located at ~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json for VSCode):

{ "mcpServers": { "read_images": { "command": "read_images", "env": { "OPENROUTER_API_KEY": "your-api-key-here", "OPENROUTER_MODEL": "anthropic/claude-3.5-sonnet" // optional, defaults to claude-3.5-sonnet }, "disabled": false, "autoApprove": [] } } }

Usage

The server provides a single tool analyze_image that can be used to analyze images:

// Basic usage with default model use_mcp_tool({ server_name: "read_images", tool_name: "analyze_image", arguments: { image_path: "/path/to/image.jpg", question: "What do you see in this image?" // optional } }); // Using a specific model for this call use_mcp_tool({ server_name: "read_images", tool_name: "analyze_image", arguments: { image_path: "/path/to/image.jpg", question: "What do you see in this image?", model: "anthropic/claude-3-opus-20240229" // overrides default and settings } });

Model Selection

The model is selected in the following order of precedence:

  1. Model specified in the tool call (model argument)
  2. Model specified in MCP settings (OPENROUTER_MODEL environment variable)
  3. Default model (anthropic/claude-3.5-sonnet)

Supported Models

The following OpenRouter models have been tested:

  • anthropic/claude-3.5-sonnet
  • anthropic/claude-3-opus-20240229

Features

  • Automatic image resizing and optimization
  • Configurable model selection
  • Support for custom questions about images
  • Detailed error messages
  • Automatic JPEG conversion and quality optimization

Error Handling

The server handles various error cases:

  • Invalid image paths
  • Missing API keys
  • Network errors
  • Invalid model selections
  • Image processing errors

Each error will return a descriptive message to help diagnose the issue.

Development

To build from source:

git clone https://github.com/catalystneuro/mcp_read_images.git cd mcp_read_images npm install npm run build

License

MIT License. See LICENSE for details.

-
security - not tested
A
license - permissive license
-
quality - not tested

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

An MCP server for analyzing images using OpenRouter vision models, offering capabilities like automatic image resizing, model configuration, and handling custom queries about images.

  1. Installation
    1. Configuration
      1. Usage
        1. Model Selection
        2. Supported Models
      2. Features
        1. Error Handling
          1. Development
            1. License

              Related MCP Servers

              • A
                security
                A
                license
                A
                quality
                This MCP server aids users in searching and analyzing their photo library by location, labels, and people, offering functionalities like photo analysis and fuzzy matching for enhanced photo management.
                Last updated -
                21
                Python
                MIT License
                • Apple
              • A
                security
                A
                license
                A
                quality
                MCP OpenVision is a Model Context Protocol (MCP) server that provides image analysis capabilities powered by OpenRouter vision models. It enables AI assistants to analyze images via a simple interface within the MCP ecosystem.
                Last updated -
                1
                10
                Python
                MIT License
                • Apple
              • -
                security
                F
                license
                -
                quality
                An MCP server that analyzes webpage design images using vision models and generates development documentation in Markdown format.
                Last updated -
                Python
                • Linux
              • A
                security
                A
                license
                A
                quality
                A Model Context Protocol server that enables AI assistants to analyze images using OpenRouter vision models through a simple interface.
                Last updated -
                1
                1
                Python
                MIT License
                • Apple

              View all related MCP servers

              MCP directory API

              We provide all the information about MCP servers via our MCP API.

              curl -X GET 'https://glama.ai/api/mcp/v1/servers/catalystneuro/mcp_read_images'

              If you have feedback or need assistance with the MCP directory API, please join our Discord server