MCP Servers for Image & Video Processing

Tools for image and video recognition, editing, and processing, enabling deep analysis and AI-driven visual content generation.

  • A
    security
    A
    license
    A
    quality
    Model Context Protocol server for Magic UI that lets AI assistants implement UI components for web applications by providing code for components like marquees, animations, special effects, and interactive backgrounds.
    Last updated -
    8
    772
    51
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Enables MCP clients like Claude Desktop and Cursor to interact with MiniMax APIs for generating speech, cloning voices, creating videos, and generating images.
    Last updated -
    6
    302
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    JavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.
    Last updated -
    6
    359
    31
    TypeScript
    MIT License
  • Glif (official)
    A
    security
    A
    license
    A
    quality
    Run AI workflows hosted on Glif.app via MCP, including ComfyUI-based image generators, meme generators, selfies, chained LLM calls, and more.
    Last updated -
    5
    64
    17
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.
    Last updated -
    1
    6
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.
    Last updated -
    1
    170
    13
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP server implementation that integrates with the MiniMax API to provide AI-powered image generation and text-to-speech functionality in editors like Windsurf and Cursor.
    Last updated -
    2
    192
    1
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A universal Model Context Protocol implementation that serves as a semantic layer between LLMs and 3D creative software, providing a standardized interface for interacting with various Digital Content Creation tools through a unified API.
    Last updated -
    125
    9
    Python
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    Creates and manipulates PowerPoint presentations with capabilities for adding various slide types, generating images, and incorporating tables and charts through natural language commands.
    Last updated -
    11
    24
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables users to send live webcam images to Claude Desktop or other MCP clients by capturing images and screenshots and providing a webcam view for visual input.
    Last updated -
    2
    143
    15
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that exposes Cloudinary Upload & Admin API methods as tools for AI assistants. This integration allows AI systems to trigger and interact with your Cloudinary cloud.
    Last updated -
    5
    506
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    This is a server implementation for performing Optical Character Recognition (OCR) using the Google Cloud Vision API. It is built on top of the FastMCP framework, which allows for the creation of modular and extensible command processing tools.
    Last updated -
    1
    1
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Provides an interface between AI assistants and Tripo AI via Model Context Protocol, enabling generation of 3D assets from natural language and importing them to Blender.
    Last updated -
    15
    139
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    Image Tools MCP is a Model Context Protocol (MCP) service that retrieves image dimensions and compresses images from URLs and local files using the TinyPNG API. It supports converting images to formats like webp, jpeg/jpg, and png, providing detailed information on width, height, type, and compressi
    Last updated -
    4
    27
    3
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A server that enables AI assistants to create and edit PowerPoint presentations with features for adding various slide types, tables, charts, and AI-generated images through Stable Diffusion.
    Last updated -
    11
    1
    Python
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    An MCP server that creates a virtual traveling environment on Google Maps, allowing users to guide an avatar on journeys with photo reports and SNS integration.
    Last updated -
    2
    140
    11
    TypeScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    Allow your AI coding agents to access Figma files & prototypes directly. You can DM me for any issues / improvements: https://x.com/jasonzhou1993 (1) Access all Figma pages. (2) Access all Figma components. (3) Access Figma prototype flows.
    Last updated -
    3
    49
    Python
    MIT License
  • A
    security
    A
    license
    A
    quality
    This MCP server aids users in searching and analyzing their photo library by location, labels, and people, offering functionalities like photo analysis and fuzzy matching for enhanced photo management.
    Last updated -
    14
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    MCP server for Hugeicons integration and documentation. This is a TypeScript-based MCP server that provides tools and resources for integrating Hugeicons into various platforms. It implements a Model Context Protocol (MCP) server that helps AI assistants provide accurate guidance for using Hugeicons
    Last updated -
    3
    23
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A TypeScript-based Model Context Protocol server that integrates with Volcengine's Jimeng AI image generation service, allowing users to generate AI images through simple tool calls.
    Last updated -
    1
    70
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server that automates generating LinkedIn post drafts from YouTube videos. This server provides high-quality, editable content drafts based on YouTube video transcripts.
    Last updated -
    6
    2
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that provides image generation capabilities using Google's Gemini 2 API, allowing users to generate multiple images with customizable parameters like prompts, aspect ratios, and person generation settings.
    Last updated -
    1
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Provides access to Amazon Bedrock's Nova Canvas model for AI image generation.
    Last updated -
    1
    14
    20
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Generates realistic human face images that don't represent real people, offering various output shapes, configurable dimensions, and batch generation capabilities.
    Last updated -
    1
    3
    1
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Provides image recognition capabilities using Anthropic Claude Vision and OpenAI GPT-4 Vision APIs, supporting multiple image formats and offering optional text extraction via Tesseract OCR.
    Last updated -
    3
    9
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables the generation of images using Together AI's models through an MCP server, supporting customizable parameters such as model selection, image dimensions, and output directory.
    Last updated -
    1
    4
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    MCP server for the Replicate Flux model. Generates images from prompts.
    Last updated -
    7
    487
    6
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A simple Model Context Protocol server that allows AI models to generate meme images using the ImgFlip API, enabling users to create memes from text prompts.
    Last updated -
    1
    32
    12
    JavaScript
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    The first open-source MCP server that enables AI to fully control remote macOS systems.
    Last updated -
    6
    68
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    MCP server that exposes Google's Veo2 video generation capabilities, allowing clients to generate videos from text prompts or images.
    Last updated -
    7
    7
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables AI assistants to generate images, text, and audio through the Pollinations APIs without requiring authentication.
    Last updated -
    7
    325
    4
    JavaScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.
    Last updated -
    1
    278
    15
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A TypeScript-based Model Context Protocol (MCP) server enabling integration with PiAPI for media content generation using platforms like Midjourney, Flux, and others through MCP-compatible applications.
    Last updated -
    1
    22
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Facilitates the creation of DecentSampler drum kit configurations, supporting WAV file analysis and XML generation to ensure accurate sample lengths and well-structured presets.
    Last updated -
    5
    93
    1
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    This server generates placeholder image URLs from various providers, supporting input validation and integration with desktop applications like Claude and Cursor.
    Last updated -
    1
    6
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables high-quality image generation using the Flux.1 Schnell model via Together AI with customizable parameters.
    Last updated -
    1
    60
    8
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    MCP (Model Context Protocol) server that utilizes the Google Gemini Vision API to interact with YouTube videos. It allows users to get descriptions, summaries, answers to questions, and extract key moments from YouTube videos.
    Last updated -
    4
    141
    JavaScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    An intelligent MCP server with a fully automated batch pipeline for web-ready images. Features include noise reduction, auto levels/curves, JPEG artifact removal, 4K resizing, smart sharpening with shadow/highlight enhancement, and advanced WebP conversion.
    Last updated -
    1
    5
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that validates and renders Mermaid diagrams.
    Last updated -
    1
    40
    11
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Bridges YouTube API and AI assistants, enabling video analysis by downloading and processing closed captions to create summaries of YouTube videos.
    Last updated -
    1
    3
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that provides AI vision capabilities for analyzing UI screenshots, offering tools for screen analysis, file operations, and UI/UX report generation.
    Last updated -
    26
    1
    JavaScript
    ISC License
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    An MCP server that enables users to generate summaries of YouTube videos in multiple languages and formats through integration with DeepSRT's API.
    Last updated -
    1
    33
    JavaScript
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server integration that enables Cursor AI to communicate with Figma, allowing users to read designs and modify them programmatically through natural language commands.
    Last updated -
    19
    5,185
    3,528
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server for Adobe After Effects that enables AI assistants and other applications to control After Effects through a standardized protocol.
    Last updated -
    13
    16
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP server that integrates with Stable Diffusion WebUI to provide text-to-image generation and image upscaling capabilities through simple API calls.
    Last updated -
    5
    4
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP server that enables Claude and other MCP-compatible assistants to generate images from text prompts using Together AI's image generation models.
    Last updated -
    1
    2
    TypeScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos. This server provides direct access to video transcripts and subtitles through a simple interface, making it ideal for content analysis and processing.
    Last updated -
    1
    258
    10
    TypeScript
    MIT License
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
    Last updated -
    3
    15,453
    1,627
    TypeScript
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables searching for images on Pixabay with query parameters and returns formatted results with image URLs and metadata.
    Last updated -
    1
    2
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Provides tools for image, audio, and video recognition using Google's Gemini AI through the Model Context Protocol.
    Last updated -
    3
    6
    TypeScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server that allows you to generate and edit images using Amazon Bedrock's Nova Canvas model, supporting features like text-to-image generation, inpainting, outpainting, image variation, and background removal.
    Last updated -
    8
    2
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    This server provides tools for uploading images and videos directly to Cloudinary using Claude/Cline, facilitating resource management with customizable options like resource type and public ID.
    Last updated -
    1
    71
    4
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that extracts images from URLs or base64 data and converts them into a format suitable for LLM analysis, allowing AI models to process and understand visual content.
    Last updated -
    3
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP server that allows users to generate images using Replicate's Stable Diffusion model and save them to the local filesystem.
    Last updated -
    3
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.
    Last updated -
    3
    60
    5
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP Server that integrates with Stability AI's API to provide high-quality image generation, editing, and manipulation capabilities including background removal, outpainting, search-and-replace, and upscaling.
    Last updated -
    13
    35
    55
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables generation of high-quality images using the Flux.1 Schnell model via Together AI, allowing users to create images from text prompts with customizable dimensions.
    Last updated -
    1
    8
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables AI assistants to download images from URLs and perform basic image optimization tasks.
    Last updated -
    2
    4
    JavaScript
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    Provides stealth browser capabilities using Playwright with anti-detection techniques, allowing MCP clients to navigate websites and take screenshots while evading common bot detection systems.
    Last updated -
    1
    4
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A server that enables generating videos from static images using Vidu's AI models, with features for image-to-video conversion, task monitoring, and image uploading.
    Last updated -
    3
    1
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Allows you to search for artworks, retrieve detailed information about specific artworks, access image tiles for artworks, and explore user-created collections from the Rijksmuseum.
    Last updated -
    7
    12
    22
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Provides screenshot and OCR capabilities for macOS.
    Last updated -
    1
    35
    10
    JavaScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol (MCP) server built on Qiniu Cloud products that lets users access Qiniu Cloud Storage, intelligent multimedia services, and more from within AI large-model clients.
    Last updated -
    9
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    Drawing Tool for AI Assistants
    Last updated -
    4
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    Enables users to generate images from text prompts using Replicate's model, with configurable parameters and full MCP protocol compliance.
    Last updated -
    1
    63
    TypeScript
  • A
    security
    F
    license
    A
    quality
    A TypeScript-based Model Context Protocol server that enables large language models to directly invoke Alibaba Cloud's Tongyi Wanxiang text-to-image generation API.
    Last updated -
    4
    44
    3
    TypeScript
  • A
    security
    F
    license
    A
    quality
    A server that converts various file formats (PDF, images, Office documents, etc.) to Markdown descriptions using Cloudflare AI services.
    Last updated -
    1
    4
    30
    JavaScript
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Provides image generation capabilities using Amazon Nova Canvas through Amazon Bedrock, enabling the creation of visuals from text prompts and color palettes—perfect for mockups, diagrams, and UI design concepts.
    Last updated -
    2,765
    Python
    Apache 2.0
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    A server that accepts image URLs and analyzes their content using GPT-4-turbo, enabling Claude AI assistants to understand and describe images through natural language.
    Last updated -
    1
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    An MCP server that visually reviews UI edit requests by comparing screenshots before and after edits, ensuring changes satisfy user requests.
    Last updated -
    1
    22
    15
    JavaScript
  • A
    security
    F
    license
    A
    quality
    An advanced MCP server for Cline that leverages EverArt's AI models to generate vector and raster images, supporting flexible storage, multiple formats, and robust image generation capabilities.
    Last updated -
    3
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that enables Claude to generate and upscale images through the Letz AI API, allowing users to create images directly within Claude conversations.
    Last updated -
    2
    1
    JavaScript
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    A TypeScript-based MCP server that generates images using OpenAI's dall-e-3 model based on text prompts and saves them to a specified directory.
    Last updated -
    1
    6
    JavaScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that generates images using Replicate's FLUX model and stores them in Cloudflare R2, allowing users to create images through simple prompts and retrieve accessible URLs.
    Last updated -
    1
    6
  • A
    security
    F
    license
    A
    quality
    An MCP server designed to work with FFmpeg for media processing tasks, offering enhanced performance and secure communication for handling media processing requests.
    Last updated -
    2
    4
    TypeScript
  • A
    security
    F
    license
    A
    quality
    A TypeScript server that implements the Model Context Protocol, enabling AI-powered design creation in Figma using natural language prompts through Cursor Agent.
    Last updated -
    5
    647
    2
    JavaScript
  • A
    security
    F
    license
    A
    quality
    Enables AI assistants to interact with Figma files through the Model Context Protocol, allowing viewing, commenting, and analyzing Figma designs directly in chat interfaces.
    Last updated -
    5
    647
    119
    TypeScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that provides image generation capabilities using the Ideogram API, allowing users to create images from text prompts with customizable parameters.
    Last updated -
    1
    1
    3
    JavaScript
  • A
    security
    F
    license
    A
    quality
    Integrates Dify AI API to provide code generation for Ant Design components, supporting both text and image inputs with stream processing capabilities.
    Last updated -
    1
    22
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that manages desktop images, providing functionality to count, list, and compress image files stored on the desktop.
    Last updated -
    3
    154
    1
    JavaScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A FastMCP server implementation that facilitates resource-based access to AI model inference, focusing on image generation through the Replicate API, with features like real-time updates, webhook integration, and secure API key management.
    Last updated -
    18
    10
    Python
    • Apple
  • A
    security
    F
    license
    A
    quality
    A Node.js server that provides advanced video and image processing capabilities through the Model Context Protocol, enabling operations like conversion, compression, editing, and effects application.
    Last updated -
    10
    13
    JavaScript
    • Apple
    • Linux
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that converts PDF documents into PNG images through a simple MCP tool call.
    Last updated -
    1
    2
    Python
    • Apple
    • Linux
  • A
    security
    F
    license
    A
    quality
    An integration that allows Cursor AI to generate images through the Draw Things API using natural language prompts.
    Last updated -
    1
    56
    3
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that provides an image generation tool using Templated.io, allowing users to create customized images based on templates with text and image layers.
    Last updated -
    TypeScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A tool that integrates Figma with Cursor through the Model Context Protocol, allowing users to retrieve, optimize, and convert design data from Figma files into structured CSS and design tokens.
    Last updated -
    1
    JavaScript
  • -
    security
    A
    license
    -
    quality
    A server that implements the Model Context Protocol (MCP), providing an interface for LLM applications to generate mermaid.js visualizations and diagrams.
    Last updated -
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP tool server that enables generating and editing images through OpenAI's image models, supporting text-to-image generation and advanced image editing (inpainting, outpainting) across various MCP-compatible clients.
    Last updated -
    11
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    An AI-powered development toolkit for Cursor providing intelligent coding assistance through advanced reasoning, UI screenshot analysis, and code review tools.
    Last updated -
    1,261
    240
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A CLIP-Based Fashion Recommender system with MCP that provides fashion recommendations based on uploaded images.
    Last updated -
    Python
    Apache 2.0
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Cursor-compatible toolkit that provides intelligent coding assistance through custom AI tools for code architecture planning, screenshot analysis, code review, and file reading capabilities.
    Last updated -
    1,261
    2
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Provides image generation capabilities using the Flux Schnell model on Replicate, allowing users to create images from text prompts.
    Last updated -
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables browser automation and real-time computer vision tasks through AI-driven commands, offering zero-cost digital navigation and interaction for enhanced web experiences.
    Last updated -
    0
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Generates 3Blue1Brown-style animations from a single prompt.
    Last updated -
    30
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Connects Claude Desktop to Hugging Face Spaces with minimal setup, enabling capabilities like image generation, vision tasks, text-to-speech, and chat with AI models.
    Last updated -
    188
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol (MCP) server for searching and retrieving Lottie animations from LottieFiles.
    Last updated -
    12
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Facilitates running Python code in a sandbox and generating images using the FLUX model via an MCP server compatible with clients like Goose and the Claude Desktop App.
    Last updated -
    2
    16
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A FastMCP server implementation that provides a standardized interface for accessing AI models hosted on Replicate's API, currently supporting image generation with customizable parameters.
    Last updated -
    2
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A server that enables LLM applications to interact directly with DaVinci Resolve video editing software, allowing AI-assisted capabilities like accessing timeline information and automating editing workflows.
    Last updated -
    72
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A customizable Model Context Protocol server implementation that enables AI models to interact with external tools including weather queries, Google search, and camera control functionality.
    Last updated -
    8
    Python
    Apache 2.0
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    The Comfy MCP Server uses the FastMCP framework to generate images from prompts by interacting with a remote Comfy server, allowing automated image creation based on workflow configurations.
    Last updated -
    7
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    Use Hugging Face Spaces directly from Claude, including open-source image generation, chat, vision tasks, and more. Supports image, audio, and text uploads and downloads.
    Last updated -
    2
    188
    241
    TypeScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server for analyzing images using OpenRouter vision models, offering capabilities like automatic image resizing, model configuration, and handling custom queries about images.
    Last updated -
    5
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that provides desktop automation capabilities using RobotJS and screenshot capabilities, enabling LLMs to control mouse movements, keyboard inputs, and capture screenshots of the desktop environment.
    Last updated -
    42
    2
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Powerful image generation system leveraging multiple Stable Diffusion models (flux-schnell, flux-dev, sdxl, sd3, sd15) for creating high-quality AI-generated images with precise customization.
    Last updated -
    8
    MIT License
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.
    Last updated -
    1
    JavaScript
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    An API that exposes Unsplash image search, listing, and random photo capabilities as MCP tools, enabling AI models like Claude to directly interact with Unsplash's services.
    Last updated -
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    MCP server that provides computer control capabilities including mouse movements, keyboard actions, screenshot capture with OCR, and window management through a unified API.
    Last updated -
    4
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server that generates 2D and 3D game assets from text prompts using AI models from Hugging Face Spaces, allowing developers to easily create game art through Claude Desktop or other MCP clients.
    Last updated -
    34
    JavaScript
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables text-to-image generation using Cloudflare's Flux Schnell model API.
    Last updated -
    3
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables AI models to search, retrieve, and utilize GIFs from Giphy with features like content filtering, multiple search methods, and comprehensive metadata.
    Last updated -
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server that integrates with Cursor IDE to generate images based on text descriptions using JiMeng AI, allowing users to create and save custom images directly within their development environment.
    Last updated -
    82
    Python
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A server that integrates Flux's advanced image generation and manipulation features into AI coding assistants, enabling seamless text-to-image and image control workflows in IDEs like Cursor and Windsurf.
    Last updated -
    4
    10
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables semantic search, image search, and cross-modal search functionalities through integration with Jina AI's neural search capabilities.
    Last updated -
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos with language-specific support.
    Last updated -
    723
    MIT License
  • -
    security
    A
    license
    -
    quality
    An image search and icon generation service built on multiple image APIs and designed for integration with Cursor's MCP service. Supports image search, download, and AI-generated icons.
    Last updated -
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    MCP Tool Server for Logo Generation. This server provides logo generation capabilities using FAL AI, with tools for image generation, background removal, and image scaling.
    Last updated -
    159
    Python
    GPL 3.0
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Model Context Protocol server that enables Claude Desktop (or any MCP client) to fetch web content and process images appropriately.
    Last updated -
    11
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server for TikTok videos that retrieves video subtitles and post details, such as the number of likes, hashtags, and publishing time.
    Last updated -
    54
    31
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A powerful server that integrates the Moondream vision model to enable advanced image analysis, including captioning, object detection, and visual question answering, through the Model Context Protocol, compatible with AI assistants like Claude and Cline.
    Last updated -
    11
    JavaScript
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables AI assistants to interact with Gyazo images using the Model Context Protocol, providing access to image URIs, metadata, and OCR data via the Gyazo API.
    Last updated -
    10
    TypeScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that provides multiple file conversion tools for AI agents, supporting various document and image format conversions including DOCX to PDF, PDF to DOCX, image conversions, Excel to CSV, HTML to PDF, and Markdown to PDF.
    Last updated -
    3
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A FreeCAD addon that implements the Model Context Protocol (MCP) to enable communication between FreeCAD and Claude AI through Claude Desktop.
    Last updated -
    17
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender for AI-assisted 3D modeling, scene manipulation, and rendering.
    Last updated -
    10,093
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.
    Last updated -
    6
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A server that provides AI-powered image generation, modification, and processing capabilities through the Model Context Protocol, leveraging Google Gemini models and other image services.
    Last updated -
    6
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
    Last updated -
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol (MCP) server for running Genesis World simulations with integrated visualization support, using stdio transport to enable local runtime visualization features.
    Last updated -
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A server that integrates ComfyUI with MCP, allowing users to generate images and download them through natural language interactions.
    Last updated -
    1
    Python
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that integrates Figma design files with AI coding tools like Cursor, Windsurf, and Cline, allowing AI tools to access and understand Figma design data for generating more accurate code.
    Last updated -
    124
    6
    TypeScript
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A CLIP-Based Fashion Recommender system that allows users to upload clothing images and receive tags and recommendations based on visual analysis.
    Last updated -
    Python
    Apache 2.0
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that provides Sketch layout information to AI coding agents like Cursor.
    Last updated -
    29
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Connects Cinema 4D to Claude, enabling AI-assisted 3D modeling and scene manipulation through natural language commands.
    Last updated -
    10
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Provides seamless access to Grok AI's capabilities (chat completion, image understanding, and function calling) directly from Cline via the Model Context Protocol.
    Last updated -
    3
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Pinterest Model Context Protocol (MCP) server for image search and information retrieval.
    Last updated -
    91
    10
    TypeScript
    MIT License
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides API functionality for creating, managing, and exporting Excalidraw drawings in various formats like SVG, PNG, and JSON.
    Last updated -
    3
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Node.js server that enables video manipulation through natural language requests, including resizing videos to different resolutions (360p to 1080p) and extracting audio in various formats (MP3, AAC, WAV, OGG).
    Last updated -
    34
    2
    TypeScript
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that analyzes YouTube videos, enabling users to extract transcripts, generate summaries, and query video content using Gemini AI.
    Last updated -
    7
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A server for downloading, processing, and managing YouTube content with features like video quality selection, format conversion, and metadata extraction.
    Last updated -
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A multi-agent human-computer interaction system that enables natural interaction through integrated visual recognition, speech recognition, and speech synthesis capabilities.
    Last updated -
    1
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Upload, edit, and generate videos from everyone's favorite LLM and Video Jungle.
    Last updated -
    120
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    Provides tools to get financial data (stock prices, company information) and generate financial visualizations through the Model Context Protocol.
    Last updated -
    2
    Python
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
    Last updated -
    17
    Python
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    Connects Houdini to Claude AI through Model Context Protocol, enabling AI-assisted 3D modeling, scene creation, simulation setup, and rendering through natural language commands.
    Last updated -
    37
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    An MCP server that allows users to generate, edit, and create variations of images through OpenAI's DALL-E API, supporting both DALL-E 2 and DALL-E 3 models.
    Last updated -
    2
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Provides chat and image analysis capabilities through OpenRouter.ai's diverse model ecosystem, enabling both text conversations and powerful multimodal image processing with various AI models.
    Last updated -
    292
    3
    TypeScript
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    Enables users to generate parametric 3D models from text descriptions or images using multi-view reconstruction and OpenSCAD, with support for AI image generation and remote processing.
    Last updated -
    6
    Python
  • -
    security
    F
    license
    -
    quality
    An MCP server that enables LLMs to 'see' what's happening in browser-based games and applications through vectorized canvas visualization and debug information.
    Last updated -
    35
    23
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Automatically captures and processes screenshots from YouTube videos and Shorts at specified intervals, supporting customizable screenshot timing and providing API endpoints for image management.
    Last updated -
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Node.js-based server that integrates with the xAI Grok API to provide AI-driven analysis tools for the Solana blockchain, supporting transaction analysis, address investigation, image processing, and general queries.
    Last updated -
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI assistants to extract transcripts from YouTube videos, allowing AI to analyze and work with video content directly.
    Last updated -
    28
    1
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that allows management and execution of Blender Python scripts, enabling users to create, edit and run scripts in a headless Blender environment through natural language interfaces.
    Last updated -
    4
    Python
  • -
    security
    F
    license
    -
    quality
    Enables Cursor AI to interact with Figma designs, allowing users to read design information and programmatically modify elements through natural language commands.
    Last updated -
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI assistants to create images and videos using Amazon Nova Canvas and Nova Reel models.
    Last updated -
    2
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Extracts components from Figma designs and transforms them into standardized JSON format for easy consumption by AI models and tools for interface reconstruction.
    Last updated -
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A server that connects to the xAI/Grok image generation API, allowing users to generate images from text prompts with support for multiple image generation and different response formats.
    Last updated -
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Create videos and images using Luma AI. This MCP server handles all API functionality for Luma Dream Machine from Claude Desktop.
    Last updated -
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    Converts addresses to GPS coordinates and creates map visualizations using the Geoapify API, allowing Claude users to generate GeoJSON data and map images from location lists.
    Last updated -
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    A server that exposes Luma AI's video generation API through the Model Context Protocol (MCP).
    Last updated -
    2
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Connects SketchUp to Claude AI through the Model Context Protocol, allowing Claude to directly interact with and control SketchUp for prompt-assisted 3D modeling and scene manipulation.
    Last updated -
    20
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables LLMs to create, modify, and manipulate Excalidraw diagrams through a structured API.
    Last updated -
    7
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Generates and returns an image using Together.ai.
    Last updated -
    3
    TypeScript
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Allows AI assistants to generate and transform high-quality images from text prompts using Google's Gemini model via MCP.
    Last updated -
    1
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    Converts Figma designs to React Native components, allowing users to extract components from Figma designs and generate corresponding React Native components with proper typing and styling.
    Last updated -
    111
    1
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Runs OCR on images or PDFs, from local files or URLs, using the Mistral OCR API (paid).
    Last updated -
    10
    Python
    • Linux
  • -
    security
    F
    license
    -
    quality
    Enables programmatic creation of Whimsical diagrams from Mermaid markup generated by AI models like Claude through the Model Context Protocol.
    Last updated -
    7
    TypeScript