Uses the GPT-4o-mini model to analyze and describe images from URLs or local file paths
image-mcp-server
An MCP server that receives image URLs or local file paths and analyzes image content using the GPT-4o-mini model.
Features
- Receives image URLs or local file paths as input and provides detailed analysis of the image content
- High-precision image recognition and description using the GPT-4o-mini model
- Image URL validity checking
- Image loading from local files and Base64 encoding
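The last two features can be illustrated with a minimal TypeScript sketch. It is not the server's actual code: the helper names isHttpUrl and toDataUrl are hypothetical, and the sketch assumes that a validity check means "parses as an absolute http(s) URL" and that local images are forwarded as Base64 data URLs.

```typescript
// Hypothetical helpers for illustration; not taken from the server's source.

// Returns true when the input parses as an absolute http(s) URL.
function isHttpUrl(input: string): boolean {
  try {
    const url = new URL(input);
    return url.protocol === "http:" || url.protocol === "https:";
  } catch {
    return false; // new URL() throws on malformed input
  }
}

// Encodes raw image bytes as a Base64 data URL with the given MIME type.
function toDataUrl(bytes: Uint8Array, mimeType: string): string {
  let binary = "";
  for (let i = 0; i < bytes.length; i++) {
    binary += String.fromCharCode(bytes[i]);
  }
  // btoa is a global in browsers and in current Node.js releases.
  return `data:${mimeType};base64,${btoa(binary)}`;
}
```

In a real server the bytes would come from reading the file at the given path, and the MIME type would typically be derived from the file extension.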
Installation
Installing via Smithery
To install Image Analysis Server for Claude Desktop automatically via Smithery:
Manual Installation
Configuration
To use this server, you need an OpenAI API key. Set the following environment variable:
MCP Server Configuration
To use with tools like Cline, add the following settings to your MCP server configuration file:
For Cline
Add the following to cline_mcp_settings.json:
For Claude Desktop App
Add the following to claude_desktop_config.json:
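As a rough illustration of the general shape such an entry takes, the TypeScript sketch below builds a settings object and prints it as JSON. The mcpServers layout with command, args, and env is the usual MCP client convention; everything else is an assumption made for the example: the entry name "image-analysis", the script path, and the variable name OPENAI_API_KEY are placeholders, not values confirmed by this README.

```typescript
// Shape of one entry under "mcpServers" in an MCP client settings file.
interface McpServerEntry {
  command: string;
  args: string[];
  env?: Record<string, string>;
}

// All values below are placeholders (assumptions), not taken from this README.
const settings: { mcpServers: Record<string, McpServerEntry> } = {
  mcpServers: {
    "image-analysis": {
      command: "node",
      args: ["/path/to/image-mcp-server/dist/index.js"],
      env: { OPENAI_API_KEY: "your_openai_api_key" },
    },
  },
};

// Print the JSON so it can be adapted and pasted into the settings file.
console.log(JSON.stringify(settings, null, 2));
```

Replace the placeholder path and key with the values for your own installation before using the output.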
Usage
Once the MCP server is configured, the following tools become available:
- analyze_image: Receives an image URL and analyzes its content.
- analyze_image_from_path: Receives a local file path and analyzes its content.
Usage Examples
Analyzing from URL:
Analyzing from local file path:
Note: Specifying Local File Paths
When using the analyze_image_from_path tool, the AI assistant (client) must specify a file path that is valid in the environment where this server is running.
- If the server is running on WSL:
  - If the AI assistant has a Windows path (e.g., C:\...), it needs to convert it to a WSL path (e.g., /mnt/c/...) before passing it to the tool.
  - If the AI assistant has a WSL path, it can pass it as is.
- If the server is running on Windows:
  - If the AI assistant has a WSL path (e.g., /home/user/...), it needs to convert it to a UNC path (e.g., \\wsl$\Distro\...) before passing it to the tool.
  - If the AI assistant has a Windows path, it can pass it as is.
Path conversion is the responsibility of the AI assistant (or its execution environment). The server will try to interpret the received path as is.
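The two conversions described above can be sketched in TypeScript as follows. These are hypothetical client-side helpers (windowsToWslPath and wslToUncPath are names invented for this example), assuming the default /mnt/<drive> mount point on WSL and a distro name supplied by the caller; the server itself performs no conversion.

```typescript
// Hypothetical client-side helpers; the server itself does no path conversion.

// Converts a Windows drive path (C:\Users\me\a.png) to a WSL path (/mnt/c/Users/me/a.png).
// Assumes the default WSL mount point /mnt/<drive>.
function windowsToWslPath(winPath: string): string {
  const match = /^([A-Za-z]):[\\/](.*)$/.exec(winPath);
  if (!match) {
    throw new Error(`Not an absolute Windows drive path: ${winPath}`);
  }
  const drive = match[1].toLowerCase();
  const rest = match[2].replace(/\\/g, "/");
  return `/mnt/${drive}/${rest}`;
}

// Converts an absolute WSL path (/home/user/a.png) to a UNC path
// (\\wsl$\<distro>\home\user\a.png). The distro name is supplied by the caller.
function wslToUncPath(wslPath: string, distro: string): string {
  if (wslPath.charAt(0) !== "/") {
    throw new Error(`Not an absolute WSL path: ${wslPath}`);
  }
  return `\\\\wsl$\\${distro}${wslPath.replace(/\//g, "\\")}`;
}
```

A client would call windowsToWslPath when the server runs on WSL and wslToUncPath when the server runs on Windows, then pass the result to analyze_image_from_path.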
Note: Type Errors During Build
When running npm run build, you may see an error (TS7016) about missing TypeScript type definitions for the mime-types module.
This is a type checking error, and since the JavaScript compilation itself succeeds, it does not affect the server's execution. If you want to resolve this error, install the type definition file as a development dependency.
Development
License
MIT