MCP Servers for Image & Video Processing

Tools for image and video recognition, editing, and processing. These servers enable deep analysis and AI-driven visual content generation.

  • Glif (official)

    A
    security
    A
    license
    A
    quality
    Run AI workflows hosted on Glif.app via MCP, including ComfyUI-based image generators, meme generators, selfies, chained LLM calls, and more
    5
    316
    16
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.
    1
    6
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An intelligent MCP server with a fully automated batch pipeline for web-ready images. Features include noise reduction, auto levels/curves, JPEG artifact removal, 4K resizing, smart sharpening with shadow/highlight enhancement, and advanced WebP conversion.
    1
    5
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables AI assistants to generate images, text, and audio through the Pollinations APIs without requiring authentication.
    7
    276
    1
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP Server that integrates with Stability AI's API to provide high-quality image generation, editing, and manipulation capabilities including background removal, outpainting, search-and-replace, and upscaling.
    13
    79
    40
    TypeScript
    MIT License
    • Apple
  • A
    security
    F
    license
    A
    quality
    An MCP server that enables users to generate summaries of YouTube videos in multiple languages and formats through integration with DeepSRT's API.
    1
    33
    JavaScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
    3
    7,048
    1,041
  • A
    security
    A
    license
    A
    quality
    Provides access to Amazon Bedrock's Nova Canvas model for AI image generation.
    1
    8
    16
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server that integrates with Stable Diffusion WebUI to provide text-to-image generation and image upscaling capabilities through simple API calls.
    5
    4
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Provides image recognition capabilities using Anthropic Claude Vision and OpenAI GPT-4 Vision APIs, supporting multiple image formats and offering optional text extraction via Tesseract OCR.
    3
    9
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server that allows you to generate and edit images using Amazon Bedrock's Nova Canvas model, supporting features like text-to-image generation, inpainting, outpainting, image variation, and background removal.
    8
    2
    Python
    Apache 2.0
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    Provides stealth browser capabilities using Playwright with anti-detection techniques, allowing MCP clients to navigate websites and take screenshots while evading common bot detection systems.
    1
    4
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.
    1
    326
    10
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that exposes Cloudinary Upload & Admin API methods as tools for AI assistants. This integration allows AI systems to trigger and interact with your Cloudinary cloud.
    5
    506
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Allows you to search for artworks, retrieve detailed information about specific artworks, access image tiles for artworks, and explore user-created collections from the Rijksmuseum.
    7
    12
    22
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Provides screenshot and OCR capabilities for macOS.
    1
    35
    10
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Generates realistic human face images that don't represent real people, offering various output shapes, configurable dimensions, and batch generation capabilities.
    1
    3
    1
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server that automates generating LinkedIn post drafts from YouTube videos. This server provides high-quality, editable content drafts based on YouTube video transcripts.
    6
    2
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server for the Replicate Flux model that generates images from text prompts.
    7
    487
    6
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Bridges YouTube API and AI assistants, enabling video analysis by downloading and processing closed captions to create summaries of YouTube videos.
    1
    3
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    The first open-source MCP server that enables AI to fully control remote macOS systems.
    6
    54
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    This is a server implementation for performing Optical Character Recognition (OCR) using the Google Cloud Vision API. It is built on top of the FastMCP framework, which allows for the creation of modular and extensible command processing tools.
    1
    1
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server that enables Claude and other MCP-compatible assistants to generate images from text prompts using Together AI's image generation models.
    1
    2
    TypeScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    An MCP server integration that enables Cursor AI to communicate with Figma, allowing users to read designs and modify them programmatically through natural language commands.
    19
    5,006
    2,555
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    A TypeScript-based Model Context Protocol (MCP) server enabling integration with PiAPI for media content generation using platforms like Midjourney, Flux, and others through MCP-compatible applications.
    1
    22
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Facilitates the creation of DecentSampler drum kit configurations, supporting WAV file analysis and XML generation to ensure accurate sample lengths and well-structured presets.
    5
    93
    1
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    This server generates placeholder image URLs from various providers, supporting input validation and integration with desktop applications like Claude and Cursor.
    1
    6
    MIT License
  • A
    security
    A
    license
    A
    quality
    A server that enables AI assistants to create and edit PowerPoint presentations with features for adding various slide types, tables, charts, and AI-generated images through Stable Diffusion.
    11
    1
    Python
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    MCP (Model Context Protocol) server that utilizes the Google Gemini Vision API to interact with YouTube videos. It allows users to get descriptions, summaries, answers to questions, and extract key moments from YouTube videos.
    4
    141
    JavaScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    A simple Model Context Protocol server that allows AI models to generate meme images using the ImgFlip API, enabling users to create memes from text prompts.
    1
    32
    12
    JavaScript
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that validates and renders Mermaid diagrams.
    1
    179
    9
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A universal Model Context Protocol implementation that serves as a semantic layer between LLMs and 3D creative software, providing a standardized interface for interacting with various Digital Content Creation tools through a unified API.
    125
    6
    Python
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    Allows your AI coding agents to access Figma files and prototypes directly, including (1) all Figma pages, (2) all Figma components, and (3) Figma prototype flows. Contact for issues or improvements: https://x.com/jasonzhou1993
    3
    49
    Python
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that provides image generation capabilities using Google's Gemini 2 API, allowing users to generate multiple images with customizable parameters like prompts, aspect ratios, and person generation settings.
    1
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Creates and manipulates PowerPoint presentations with capabilities for adding various slide types, generating images, and incorporating tables and charts through natural language commands.
    11
    24
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos. This server provides direct access to video transcripts and subtitles through a simple interface, making it ideal for content analysis and processing.
    1
    119
    3
    TypeScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server implementation that integrates with Minimax API to provide AI-powered image generation and text-to-speech functionality in editors like Windsurf and Cursor.
    2
    192
    1
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Image Tools MCP is a Model Context Protocol (MCP) service that retrieves image dimensions and compresses images from URLs and local files using the TinyPNG API. It supports converting images to formats like webp, jpeg/jpg, and png, providing detailed information on width, height, type, and compressi
    4
    194
    2
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Provides an interface between AI assistants and Tripo AI via Model Context Protocol, enabling generation of 3D assets from natural language and importing them to Blender.
    15
    137
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    This MCP server aids users in searching and analyzing their photo library by location, labels, and people, offering functionalities like photo analysis and fuzzy matching for enhanced photo management.
    14
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A server that enables generating videos from static images using Vidu's AI models, with features for image-to-video conversion, task monitoring, and image uploading.
    3
    1
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP server that creates a virtual traveling environment on Google Maps, allowing users to guide an avatar on journeys with photo reports and SNS integration.
    2
    100
    10
    TypeScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    This server provides tools for uploading images and videos directly to Cloudinary using Claude/Cline, facilitating resource management with customizable options like resource type and public ID.
    1
    71
    4
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables high-quality image generation using the Flux.1 Schnell model via Together AI with customizable parameters.
    1
    13
    5
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that extracts images from URLs or base64 data and converts them into a format suitable for LLM analysis, allowing AI models to process and understand visual content.
    3
    MIT License
  • A
    security
    A
    license
    A
    quality
    An MCP server that allows users to generate images using Replicate's Stable Diffusion model and save them to the local filesystem.
    3
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables the generation of images using Together AI's models through an MCP server, supporting customizable parameters such as model selection, image dimensions, and output directory.
    1
    4
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.
    1
    170
    13
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Enables users to send live webcam images to Claude Desktop or other MCP clients, facilitating interaction through capturing images, screenshots, and providing a webcam view for visual input.
    2
    143
    15
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables generation of high-quality images using the Flux.1 Schnell model via Together AI, allowing users to create images from text prompts with customizable dimensions.
    1
    8
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables AI assistants to download images from URLs and perform basic image optimization tasks.
    2
    4
    JavaScript
    Apache 2.0
  • A
    security
    F
    license
    A
    quality
    Enables AI assistants to interact with Figma files through the ModelContextProtocol, allowing viewing, commenting, and analyzing Figma designs directly in chat interfaces.
    5
    562
    119
    TypeScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A server that accepts image URLs and analyzes their content using GPT-4-turbo, enabling Claude AI assistants to understand and describe images through natural language.
    1
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    An advanced MCP server for Cline that leverages EverArt's AI models to generate vector and raster images, supporting flexible storage, multiple formats, and robust image generation capabilities.
    3
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A TypeScript-based MCP server that generates images using OpenAI's dall-e-3 model based on text prompts and saves them to a specified directory.
    1
    6
    JavaScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A TypeScript server that implements the Model Context Protocol, enabling AI-powered design creation in Figma using natural language prompts through Cursor Agent.
    5
    562
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    Drawing Tool for AI Assistants
    4
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Node.js server that provides advanced video and image processing capabilities through the Model Context Protocol, enabling operations like conversion, compression, editing, and effects application.
    10
    13
    JavaScript
    • Apple
    • Linux
  • A
    security
    F
    license
    A
    quality
    Enables users to generate images from text prompts using Replicate's model, with configurable parameters and full MCP protocol compliance.
    1
    63
    TypeScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that converts PDF documents into PNG images through a simple MCP tool call.
    1
    2
    Python
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Provides image generation capabilities using Amazon Nova Canvas through Amazon Bedrock, enabling the creation of visuals from text prompts and color palettes—perfect for mockups, diagrams, and UI design concepts.
    2,102
    Python
    Apache 2.0
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that provides an image generation tool using Templated.io, allowing users to create customized images based on templates with text and image layers.
    TypeScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    An MCP server that visually reviews UI edit requests by comparing screenshots before and after edits, ensuring changes satisfy user requests.
    1
    25
    15
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that enables Claude to generate and upscale images through the Letz AI API, allowing users to create images directly within Claude conversations.
    2
    1
    JavaScript
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that provides image generation capabilities using the Ideogram API, allowing users to create images from text prompts with customizable parameters.
    1
    1
    3
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A FastMCP server implementation that facilitates resource-based access to AI model inference, focusing on image generation through the Replicate API, with features like real-time updates, webhook integration, and secure API key management.
    18
    10
    Python
    • Apple
  • A
    security
    F
    license
    A
    quality
    An MCP server designed to work with FFmpeg for media processing tasks, offering enhanced performance and secure communication for handling media processing requests.
    2
    1
    TypeScript
  • A
    security
    F
    license
    A
    quality
    Integrates Dify AI API to provide code generation for Ant Design components, supporting both text and image inputs with stream processing capabilities.
    1
    22
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that manages desktop images, providing functionality to count, list, and compress image files stored on the desktop.
    3
    154
    1
    JavaScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    An integration that allows Cursor AI to generate images through the Draw Things API using natural language prompts.
    1
    56
    3
    JavaScript
  • -
    security
    A
    license
    -
    quality
    A server that implements the Model Context Protocol (MCP), providing an interface for LLM applications to generate mermaid.js visualizations and diagrams.
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server that provides multiple file conversion tools for AI agents, supporting various document and image format conversions including DOCX to PDF, PDF to DOCX, image conversions, Excel to CSV, HTML to PDF, and Markdown to PDF.
    3
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Powerful image generation system leveraging multiple Stable Diffusion models (flux-schnell, flux-dev, sdxl, sd3, sd15) for creating high-quality AI-generated images with precise customization.
    8
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    An AI-powered development toolkit for Cursor providing intelligent coding assistance through advanced reasoning, UI screenshot analysis, and code review tools.
    2,384
    240
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables browser automation and real-time computer vision tasks through AI-driven commands, offering zero-cost digital navigation and interaction for enhanced web experiences.
    0
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.
    6
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Generates 3Blue1Brown-style animations from a single prompt.
    30
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables AI assistants to interact with Gyazo images using the Model Context Protocol, providing access to image URIs, metadata, and OCR data via the Gyazo API.
    7
    JavaScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Connects Claude Desktop to Hugging Face Spaces with minimal setup, enabling capabilities like image generation, vision tasks, text-to-speech, and chat with AI models.
    338
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Facilitates running Python code in a sandbox and generating images using the FLUX model via an MCP server compatible with clients like Goose and the Claude Desktop App.
    2
    16
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A server that integrates Flux's advanced image generation and manipulation features into AI coding assistants, enabling seamless text-to-image and image control workflows in IDEs like Cursor and Windsurf.
    4
    10
    Python
    MIT License
  • -
    security
    F
    license
    -
    quality
    Enables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.
    1
    JavaScript
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that integrates Figma design files with AI coding tools like Cursor, Windsurf, and Cline, allowing AI tools to access and understand Figma design data for generating more accurate code.
    124
    6
    TypeScript
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A FastMCP server implementation that provides a standardized interface for accessing AI models hosted on Replicate's API, currently supporting image generation with customizable parameters.
    2
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A customizable Model Context Protocol server implementation that enables AI models to interact with external tools including weather queries, Google search, and camera control functionality.
    8
    Python
    Apache 2.0
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    The Comfy MCP Server uses the FastMCP framework to generate images from prompts by interacting with a remote Comfy server, allowing automated image creation based on workflow configurations.
    7
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    Use Hugging Face Spaces directly from Claude, including open-source image generation, chat, vision tasks, and more. Supports image, audio, and text uploads and downloads.
    2
    338
    152
    TypeScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server for analyzing images using OpenRouter vision models, offering capabilities like automatic image resizing, model configuration, and handling custom queries about images.
    5
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Cursor-compatible toolkit that provides intelligent coding assistance through custom AI tools for code architecture planning, screenshot analysis, code review, and file reading capabilities.
    2,384
    2
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Provides image generation capabilities using the Flux Schnell model on Replicate, allowing users to create images from text prompts.
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A powerful server that integrates the Moondream vision model to enable advanced image analysis, including captioning, object detection, and visual question answering, through the Model Context Protocol, compatible with AI assistants like Claude and Cline.
    11
    JavaScript
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol (MCP) server for running Genesis World simulations with integrated visualization support, using stdio transport to enable local runtime visualization features.
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Model Context Protocol server that enables Claude Desktop (or any MCP client) to fetch web content and process images appropriately.
    11
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables text-to-image generation using Cloudflare's Flux Schnell model API.
    3
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server that integrates with Cursor IDE to generate images based on text descriptions using JiMeng AI, allowing users to create and save custom images directly within their development environment.
    82
    Python
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender for AI-assisted 3D modeling, scene manipulation, and rendering.
    9,873
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that generates 2D and 3D game assets from text prompts using AI models from Hugging Face Spaces, allowing developers to easily create game art through Claude Desktop or other MCP clients.
    34
    JavaScript
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Enables AI models to search, retrieve, and utilize GIFs from Giphy with features like content filtering, multiple search methods, and comprehensive metadata.
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos with language-specific support.
    723
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables semantic search, image search, and cross-modal search functionalities through integration with Jina AI's neural search capabilities.
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A search service based on multiple image APIs and icon generation capabilities, specifically designed for integration with Cursor MCP service. Supports image search, download, and AI-generated icons.
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A FreeCAD addon that implements the Model Context Protocol (MCP) to enable communication between FreeCAD and Claude AI through Claude Desktop.
    17
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol (MCP) server for searching and retrieving Lottie animations from LottieFiles.
    224
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Pinterest Model Context Protocol (MCP) server for image search and information retrieval
    6
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    MCP Tool Server for Logo Generation. This server provides logo generation capabilities using FAL AI, with tools for image generation, background removal, and image scaling.
    159
    Python
    GPL 3.0
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A server that enables LLM applications to interact directly with DaVinci Resolve video editing software, allowing AI-assisted capabilities like accessing timeline information and automating editing workflows.
    72
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A server that provides AI-powered image generation, modification, and processing capabilities through the Model Context Protocol, leveraging Google Gemini models and other image services.
    6
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A server that exposes Luma AI's video generation API through the Model Context Protocol (MCP).
    2
    TypeScript
  • -
    security
    F
    license
    -
    quality
    An MCP server that enables LLMs to 'see' what's happening in browser-based games and applications through vectorized canvas visualization and debug information.
    21
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Converts addresses to GPS coordinates and creates map visualizations using the Geoapify API, allowing Claude users to generate GeoJSON data and map images from location lists.
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    A multi-agent human-computer interaction system that enables natural interaction through integrated visual recognition, speech recognition, and speech synthesis capabilities.
    1
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Generates and returns an image using Together.ai.
    3
    TypeScript
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Upload, edit, and generate videos from everyone's favorite LLM and Video Jungle.
    99
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    A server for downloading, processing, and managing YouTube content with features like video quality selection, format conversion, and metadata extraction.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Provides chat and image analysis capabilities through OpenRouter.ai's diverse model ecosystem, enabling both text conversations and powerful multimodal image processing with various AI models.
    292
    3
    TypeScript
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A server that converts various file formats (PDF, images, Office documents, etc.) to Markdown descriptions using Cloudflare AI services.
    114
    29
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Connects SketchUp to Claude AI through the Model Context Protocol, allowing Claude to directly interact with and control SketchUp for prompt-assisted 3D modeling and scene manipulation.
    20
    • Apple
  • -
    security
    F
    license
    -
    quality
    Extracts components from Figma designs and transforms them into standardized JSON format for easy consumption by AI models and tools for interface reconstruction.
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Enables users to generate parametric 3D models from text descriptions or images using multi-view reconstruction and OpenSCAD, with support for AI image generation and remote processing.
    6
    Python
  • -
    security
    F
    license
    -
    quality
    An MCP server for TikTok videos that retrieves video subtitles and post details, such as the number of likes, hashtags, and publishing time.
    6
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
    17
    Python
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that allows management and execution of Blender Python scripts, enabling users to create, edit and run scripts in a headless Blender environment through natural language interfaces.
    4
    Python
  • -
    security
    F
    license
    -
    quality
    Enables Cursor AI to interact with Figma designs, allowing users to read design information and programmatically modify elements through natural language commands.
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides API functionality for creating, managing, and exporting Excalidraw drawings in various formats like SVG, PNG, and JSON.
    3
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A server that connects to the xAI/Grok image generation API, allowing users to generate images from text prompts with support for multiple image generation and different response formats.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides AI vision capabilities for analyzing UI screenshots, offering tools for screen analysis, file operations, and UI/UX report generation.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Connects Houdini to Claude AI through Model Context Protocol, enabling AI-assisted 3D modeling, scene creation, simulation setup, and rendering through natural language commands.
    37
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that generates images using Replicate's FLUX model and stores them in Cloudflare R2, allowing users to create images through simple prompts and retrieve accessible URLs.
    4
  • -
    security
    F
    license
    -
    quality
    Enables programmatic creation of Whimsical diagrams from Mermaid markup generated by AI models like Claude through the Model Context Protocol.
    7
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Converts Figma designs to React Native components, allowing users to extract components from Figma designs and generate corresponding React Native components with proper typing and styling.
    111
    1
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI assistants to create images and videos using Amazon Nova Canvas and Nova Reel models.
    2
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI assistants to extract transcripts from YouTube videos, allowing AI to analyze and work with video content directly.
    12
    1
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Runs OCR on images or PDFs, from local files or URLs, using the Mistral OCR API (paid).
    10
    Python
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables LLMs to create, modify, and manipulate Excalidraw diagrams through a structured API.
    7
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Node.js-based server that integrates with the xAI Grok API to provide AI-driven analysis tools for the Solana blockchain, supporting transaction analysis, address investigation, image processing, and general queries.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Automatically captures and processes screenshots from YouTube videos and Shorts at specified intervals, supporting customizable screenshot timing and providing API endpoints for image management.
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Node.js server that enables video manipulation through natural language requests, including resizing videos to different resolutions (360p to 1080p) and extracting audio in various formats (MP3, AAC, WAV, OGG).
    34
    2
    TypeScript
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    An MCP server that allows users to generate, edit, and create variations of images through OpenAI's DALL-E API, supporting both DALL-E 2 and DALL-E 3 models.
    2
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that analyzes YouTube videos, enabling users to extract transcripts, generate summaries, and query video content using Gemini AI.
    7
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Creates videos and images using Luma AI. This MCP server handles all API functionality for Luma Dream Machine from Claude Desktop.
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    Allows AI assistants to generate and transform high-quality images from text prompts using Google's Gemini model via MCP.
    1
    Python
    • Apple