MCP Servers for Image & Video Processing

Tools for image or video recognition, editing, and processing. Enables deep analysis and AI-driven visual content generation.

View all MCP Servers

  • -
    security
    A
    license
    -
    quality
    A server that integrates Flux's advanced image generation and manipulation features into AI coding assistants, enabling seamless text-to-image and image control workflows in IDEs like Cursor and Windsurf.
    3
    Python
    MIT License
  • -
    security
    F
    license
    -
    quality
    An advanced MCP server for Cline that leverages EverArt's AI models to generate vector and raster images, supporting flexible storage, multiple formats, and robust image generation capabilities.
    JavaScript
  • A
    security
    A
    license
    A
    quality
    Facilitates running Python code in a sandbox and generating images using the FLUX model via an MCP server compatible with clients like Goose and the Claude Desktop App.
    2
    10
    Python
    MIT License
  • A
    security
    F
    license
    A
    quality
    A Node.js server that provides advanced video and image processing capabilities through the Model Context Protocol, enabling operations like conversion, compression, editing, and effects application.
    10
    7
    JavaScript
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    The Comfy MCP Server uses the FastMCP framework to generate images from prompts by interacting with a remote Comfy server, allowing automated image creation based on workflow configurations.
    1
    Python
    MIT License
  • A
    security
    A
    license
    A
    quality
    This server generates placeholder image URLs from various providers, supporting input validation and integration with desktop applications like Claude and Cursor.
    1
    1
    MIT License
  • A
    security
    A
    license
    A
    quality
    Facilitates the creation of DecentSampler drum kit configurations, supporting WAV file analysis and XML generation to ensure accurate sample lengths and well-structured presets.
    2
    63
    TypeScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Enables browser automation and real-time computer vision tasks through AI-driven commands, offering zero-cost digital navigation and interaction for enhanced web experiences.
    12
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    An AI-powered development toolkit for Cursor providing intelligent coding assistance through advanced reasoning, UI screenshot analysis, and code review tools.
    24
    175
    TypeScript
    MIT License
  • A
    security
    F
    license
    A
    quality
    Enables users to generate images from text prompts using Replicate's model, with configurable parameters and full MCP protocol compliance.
    1
    27
    TypeScript
  • A
    security
    A
    license
    A
    quality
    This MCP server aids users in searching and analyzing their photo library by location, labels, and people, offering functionalities like photo analysis and fuzzy matching for enhanced photo management.
    4
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server for analyzing images using OpenRouter vision models, offering capabilities like automatic image resizing, model configuration, and handling custom queries about images.
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables AI assistants to interact with Gyazo images using the Model Context Protocol, providing access to image URIs, metadata, and OCR data via the Gyazo API.
    4
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables the generation of images using Together AI's models through an MCP server, supporting customizable parameters such as model selection, image dimensions, and output directory.
    1
    1
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Enables users to send live webcam images to Claude Desktop or other MCP clients, facilitating interaction through capturing images, screenshots, and providing a webcam view for visual input.
    2
    3
    9
    TypeScript
    MIT License
    • Apple
  • A
    security
    F
    license
    A
    quality
    Integrates Dify AI API to provide code generation for Ant Design components, supporting both text and image inputs with stream processing capabilities.
    1
    3
    JavaScript
  • A
    security
    A
    license
    A
    quality
    This server provides tools for uploading images and videos directly to Cloudinary using Claude/Cline, facilitating resource management with customizable options like resource type and public ID.
    1
    2
    4
    JavaScript
    MIT License
    • Apple
  • A
    security
    F
    license
    A
    quality
    A FastMCP server implementation that facilitates resource-based access to AI model inference, focusing on image generation through the Replicate API, with features like real-time updates, webhook integration, and secure API key management.
    18
    7
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
    6
    Python
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A powerful server that integrates the Moondream vision model to enable advanced image analysis, including captioning, object detection, and visual question answering, through the Model Context Protocol, compatible with AI assistants like Claude and Cline.
    6
    JavaScript
    Apache 2.0
  • A
    security
    F
    license
    A
    quality
    A TypeScript-based Model Context Protocol (MCP) server enabling integration with PiAPI for media content generation using platforms like Midjourney, Flux, and others through MCP-compatible applications.
    1
    13
    TypeScript
    • Apple
  • A
    security
    A
    license
    A
    quality
    Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.
    1
    31
    3
    JavaScript
    MIT License
    • Apple
  • -
    security
    F
    license
    -
    quality
    A server that provides Luma AI's video generation API as the Model Context Protocol (MCP)
    2
    TypeScript
  • A
    security
    A
    license
    A
    quality
    Provides access to Amazon Bedrock's Nova Canvas model for AI image generation.
    1
    2
    12
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Allows you to search for artworks, retrieve detailed information about specific artworks, access image tiles for artworks, and explore user-created collections from the Rijksmuseum.
    6
    18
    9
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    An intelligent MCP server with a fully automated batch pipeline for web-ready images. Features include noise reduction, auto levels/curves, JPEG artifact removal, 4K resizing, smart sharpening with shadow/highlight enhancement, and advanced WebP conversion.
    1
    3
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Provides screenshot and OCR capabilities for macOS.
    1
    10
    7
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables AI assistants to download images from URLs and perform basic image optimization tasks.
    2
    2
    JavaScript
    Apache 2.0
  • -
    security
    F
    license
    -
    quality
    Upload, edit, and generate videos from everyone's favorite LLM and Video Jungle.
    21
    Python
    • Apple
  • A
    security
    A
    license
    A
    quality
    Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
    2
    62
    63
    TypeScript
    MIT License
    • Apple