The MCP Image Extractor server extracts and converts images to base64 for LLM analysis.
- Extract images from various sources:
- Local files (PNG, JPG, GIF, WebP) using
extract_image_from_file
tool - URLs using
extract_image_from_url
tool - Process existing base64-encoded images using
extract_image_from_base64
tool
- Local files (PNG, JPG, GIF, WebP) using
- Automatic optimization: Resizes images to maximum 512x512 pixels to optimize context window usage
- Integration features: Compatible with Cursor for automated image extraction and supports Docker for containerized deployment
- Backward compatibility: Supports legacy parameters for existing implementations
Supports containerized deployment of the MCP image extraction service through Docker, allowing for isolated and portable execution environments.
Provides installation directly from GitHub repository, with specific instructions for cloning and building from the GitHub source.
Enables installation and management through npm, with support for npm link to make the command globally available.
MCP Image Extractor
MCP server for extracting and converting images to base64 for LLM analysis.
This MCP server provides tools for AI assistants to:
- Extract images from local files
- Extract images from URLs
- Process base64-encoded images
How it looks in Cursor:
Suitable cases:
- analyze playwright test results: screenshots
Installation
Recommended: Using npx in mcp.json (Easiest)
The recommended way to install this MCP server is using npx directly in your .cursor/mcp.json
file:
This approach:
- Automatically installs the latest version
- Does not require global installation
- Works reliably across different environments
Alternative: Local Path Installation
If you prefer to use a local installation of the package, you can clone the repository and point to the built files:
Manual Installation
This will make the mcp-image-extractor
command available globally.
Then configure in .cursor/mcp.json
:
Troubleshooting for Cursor Users: If you see "Failed to create client" error, try the local path installation method above or ensure you're using the correct path to the executable.
Available Tools
extract_image_from_file
Extracts an image from a local file and converts it to base64.
Parameters:
file_path
(required): Path to the local image file
Note: All images are automatically resized to optimal dimensions (max 512x512) for LLM analysis to limit the size of the base64 output and optimize context window usage.
extract_image_from_url
Extracts an image from a URL and converts it to base64.
Parameters:
url
(required): URL of the image to extract
Note: All images are automatically resized to optimal dimensions (max 512x512) for LLM analysis to limit the size of the base64 output and optimize context window usage.
extract_image_from_base64
Processes a base64-encoded image for LLM analysis.
Parameters:
base64
(required): Base64-encoded image datamime_type
(optional, default: "image/png"): MIME type of the image
Note: All images are automatically resized to optimal dimensions (max 512x512) for LLM analysis to limit the size of the base64 output and optimize context window usage.
Example Usage
Here's an example of how to use the tools from Claude:
Claude will automatically use the extract_image_from_file
tool to load and analyze the image content.
Claude will automatically use the extract_image_from_url
tool to fetch and analyze the image content.
Docker
Build and run with Docker:
License
MIT
You must be authenticated.
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
A Model Context Protocol server that extracts images from URLs or base64 data and converts them into a format suitable for LLM analysis, allowing AI models to process and understand visual content.
Related Resources
Related MCP Servers
- -securityFlicense-qualityA Model Context Protocol server that enables LLMs to extract and use content from unstructured documents across a wide variety of file formats.Last updated -2Python
- -securityAlicense-qualityA Model Context Protocol (MCP) server that enables Claude or other LLMs to fetch content from URLs, supporting HTML, JSON, text, and images with configurable request parameters.Last updated -PythonMIT License
- -securityAlicense-qualityA Model Context Protocol server that enables fetching and processing images from URLs, local file paths, and numpy arrays, returning them as base64-encoded strings with proper MIME types.Last updated -1PythonMIT License
- -securityFlicense-qualityA Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.Last updated -TypeScript