MCP Servers for Image & Video Processing

Tools for image and video recognition, editing, and processing, enabling deep analysis and AI-driven visual content generation.


  • A
    security
    A
    license
    A
    quality
    An official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.
    1
    6
    TypeScript
    MIT License
    • Apple
  • Glif (official)
    A
    security
    A
    license
    A
    quality
    Run AI workflows hosted on Glif.app via MCP, including ComfyUI-based image generators, meme generators, selfies, chained LLM calls, and more
    5
    14
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables high-quality image generation using the Flux.1 Schnell model via Together AI with customizable parameters.
    1
    13
    5
    JavaScript
    MIT License
  • A
    security
    F
    license
    A
    quality
    An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
    3
    617
    TypeScript
  • A
    security
    A
    license
    A
    quality
    Allows AI coding agents to access Figma files and prototypes directly: (1) all Figma pages, (2) all Figma components, and (3) Figma prototype flows. For issues or improvements, contact the author at https://x.com/jasonzhou1993.
    3
    41
    Python
    MIT License
  • A
    security
    A
    license
    A
    quality
    Creates and manipulates PowerPoint presentations with capabilities for adding various slide types, generating images, and incorporating tables and charts through natural language commands.
    11
    24
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    This MCP server aids users in searching and analyzing their photo library by location, labels, and people, offering functionalities like photo analysis and fuzzy matching for enhanced photo management.
    13
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Allows you to search for artworks, retrieve detailed information about specific artworks, access image tiles for artworks, and explore user-created collections from the Rijksmuseum.
    7
    12
    22
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    This server provides tools for uploading images and videos directly to Cloudinary using Claude/Cline, facilitating resource management with customizable options like resource type and public ID.
    1
    71
    4
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos. This server provides direct access to video transcripts and subtitles through a simple interface, making it ideal for content analysis and processing.
    1
    0
    1
    TypeScript
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server that enables Claude and other MCP-compatible assistants to generate images from text prompts using Together AI's image generation models.
    1
    2
    TypeScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    An MCP server that allows users to generate images using Replicate's Stable Diffusion model and save them to the local filesystem.
    3
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables AI assistants to download images from URLs and perform basic image optimization tasks.
    2
    4
    JavaScript
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    Enables the generation of images using Together AI's models through an MCP server, supporting customizable parameters such as model selection, image dimensions, and output directory.
    1
    4
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    Provides stealth browser capabilities using Playwright with anti-detection techniques, allowing MCP clients to navigate websites and take screenshots while evading common bot detection systems.
    1
    4
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.
    1
    326
    10
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Facilitates the creation of DecentSampler drum kit configurations, supporting WAV file analysis and XML generation to ensure accurate sample lengths and well-structured presets.
    5
    93
    1
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.
    1
    170
    13
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Generates realistic human face images that don't represent real people, offering various output shapes, configurable dimensions, and batch generation capabilities.
    1
    3
    1
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A simple Model Context Protocol server that allows AI models to generate meme images using the ImgFlip API, enabling users to create memes from text prompts.
    1
    32
    12
    JavaScript
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    An MCP server for the Replicate Flux model that generates images from prompts.
    7
    487
    6
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Provides screenshot and OCR capabilities for macOS.
    1
    35
    10
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server integration that enables Cursor AI to communicate with Figma, allowing users to read designs and modify them programmatically through natural language commands.
    19
    767
    1,560
    JavaScript
    MIT License
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    An intelligent MCP server with a fully automated batch pipeline for web-ready images. Features include noise reduction, auto levels/curves, JPEG artifact removal, 4K resizing, smart sharpening with shadow/highlight enhancement, and advanced WebP conversion.
    1
    5
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A TypeScript-based Model Context Protocol (MCP) server enabling integration with PiAPI for media content generation using platforms like Midjourney, Flux, and others through MCP-compatible applications.
    1
    22
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables generation of high-quality images using the Flux.1 Schnell model via Together AI, allowing users to create images from text prompts with customizable dimensions.
    1
    8
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Image Tools MCP is a Model Context Protocol (MCP) service that retrieves image dimensions and compresses images from URLs and local files using the TinyPNG API. It supports converting images to formats like webp, jpeg/jpg, and png, providing detailed information on width, height, type, and compressi
    4
    12
    1
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    This server generates placeholder image URLs from various providers, supporting input validation and integration with desktop applications like Claude and Cursor.
    1
    6
    MIT License
  • A
    security
    F
    license
    A
    quality
    An MCP server that enables users to generate summaries of YouTube videos in multiple languages and formats through integration with DeepSRT's API.
    1
    32
    JavaScript
    • Apple
  • A
    security
    A
    license
    A
    quality
    Provides access to Amazon Bedrock's Nova Canvas model for AI image generation.
    1
    8
    16
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Enables users to send live webcam images to Claude Desktop or other MCP clients, facilitating interaction through capturing images, screenshots, and providing a webcam view for visual input.
    2
    6
    14
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    Bridges YouTube API and AI assistants, enabling video analysis by downloading and processing closed captions to create summaries of YouTube videos.
    1
    3
    Python
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP Server that integrates with Stability AI's API to provide high-quality image generation, editing, and manipulation capabilities including background removal, outpainting, search-and-replace, and upscaling.
    13
    79
    40
    TypeScript
    MIT License
    • Apple
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that provides an image generation tool using Templated.io, allowing users to create customized images based on templates with text and image layers.
    TypeScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A FastMCP server implementation that facilitates resource-based access to AI model inference, focusing on image generation through the Replicate API, with features like real-time updates, webhook integration, and secure API key management.
    18
    10
    Python
    • Apple
  • A
    security
    F
    license
    A
    quality
    An advanced MCP server for Cline that leverages EverArt's AI models to generate vector and raster images, supporting flexible storage, multiple formats, and robust image generation capabilities.
    3
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    Enables users to generate images from text prompts using Replicate's model, with configurable parameters and full MCP protocol compliance.
    1
    63
    TypeScript
  • A
    security
    F
    license
    A
    quality
    Integrates Dify AI API to provide code generation for Ant Design components, supporting both text and image inputs with stream processing capabilities.
    1
    17
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that converts PDF documents into PNG images through a simple MCP tool call.
    1
    2
    Python
    • Apple
    • Linux
  • A
    security
    F
    license
    A
    quality
    An integration that allows Cursor AI to generate images through the Draw Things API using natural language prompts.
    1
    56
    3
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A TypeScript-based MCP server that generates images using OpenAI's dall-e-3 model based on text prompts and saves them to a specified directory.
    1
    6
    JavaScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    Enables AI assistants to interact with Figma files through the ModelContextProtocol, allowing viewing, commenting, and analyzing Figma designs directly in chat interfaces.
    5
    896
    98
    TypeScript
    • Apple
  • A
    security
    F
    license
    A
    quality
    A Node.js server that provides advanced video and image processing capabilities through the Model Context Protocol, enabling operations like conversion, compression, editing, and effects application.
    10
    13
    JavaScript
    • Apple
    • Linux
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that provides image generation capabilities using the Ideogram API, allowing users to create images from text prompts with customizable parameters.
    1
    1
    3
    JavaScript
  • A
    security
    F
    license
    A
    quality
    Drawing Tool for AI Assistants
    4
    1
    JavaScript
  • A
    security
    F
    license
    A
    quality
    An MCP server that visually reviews UI edit requests by comparing screenshots before and after edits, ensuring changes satisfy user requests.
    1
    25
    15
    JavaScript
  • A
    security
    F
    license
    A
    quality
    A Model Context Protocol server that enables Claude to generate and upscale images through the Letz AI API, allowing users to create images directly within Claude conversations.
    2
    JavaScript
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Provides image generation capabilities using the Flux Schnell model on Replicate, allowing users to create images from text prompts.
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server implementation that integrates with Minimax API to provide AI-powered image generation and text-to-speech functionality in editors like Windsurf and Cursor.
    38
    JavaScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that provides multiple file conversion tools for AI agents, supporting various document and image format conversions including DOCX to PDF, PDF to DOCX, image conversions, Excel to CSV, HTML to PDF, and Markdown to PDF.
    3
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that integrates with Cursor IDE to generate images based on text descriptions using JiMeng AI, allowing users to create and save custom images directly within their development environment.
    67
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A server that provides AI-powered image generation, modification, and processing capabilities through the Model Context Protocol, leveraging Google Gemini models and other image services.
    5
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A FastMCP server implementation that provides a standardized interface for accessing AI models hosted on Replicate's API, currently supporting image generation with customizable parameters.
    2
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Cursor-compatible toolkit that provides intelligent coding assistance through custom AI tools for code architecture planning, screenshot analysis, code review, and file reading capabilities.
    2,657
    2
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Provides a complete set of Figma API methods through the Model Context Protocol, allowing interaction with Figma files, components, styles, and other Figma resources.
    86
    2
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A powerful server that integrates the Moondream vision model to enable advanced image analysis, including captioning, object detection, and visual question answering, through the Model Context Protocol, compatible with AI assistants like Claude and Cline.
    11
    JavaScript
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    The Comfy MCP Server uses the FastMCP framework to generate images from prompts by interacting with a remote Comfy server, allowing automated image creation based on workflow configurations.
    7
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    An AI-powered development toolkit for Cursor providing intelligent coding assistance through advanced reasoning, UI screenshot analysis, and code review tools.
    2,657
    240
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Provides logo generation capabilities using FAL AI with tools for image generation, background removal, and automatic scaling to different sizes while maintaining transparency.
    159
    Python
    GPL 3.0
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables AI assistants to interact with Gyazo images using the Model Context Protocol, providing access to image URIs, metadata, and OCR data via the Gyazo API.
    6
    JavaScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Connects Claude Desktop to Hugging Face Spaces with minimal setup, enabling capabilities like image generation, vision tasks, text-to-speech, and chat with AI models.
    540
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server implementation that interfaces with KoboldAI, enabling text generation with persistent memory, OpenAI-compatible API endpoints, and Stable Diffusion integration.
    0
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    MCP Tool Server for Logo Generation. This server provides logo generation capabilities using FAL AI, with tools for image generation, background removal, and image scaling.
    159
    Python
    GPL 3.0
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    PromptShopMCP is an AI-powered image editing server that generates or transforms photos using natural language commands. It allows you to modify images by simply describing what you want.
    5
    Python
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol (MCP) server for running Genesis World simulations with integrated visualization support, using stdio transport to enable local runtime visualization features.
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that uses Replicate's Stable Diffusion to generate images, with options to save them and list previously saved images.
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that analyzes the screen with OmniParser and automatically operates the GUI, allowing users to control applications through natural language commands.
    13
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server that creates a virtual traveling environment on Google Maps, allowing users to guide an avatar on journeys with photo reports and SNS integration.
    123
    4
    TypeScript
    MIT License
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.
    1
    JavaScript
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    An MCP server for analyzing images using OpenRouter vision models, offering capabilities like automatic image resizing, model configuration, and handling custom queries about images.
    5
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables AI models to search, retrieve, and utilize GIFs from Giphy with features like content filtering, multiple search methods, and comprehensive metadata.
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A TypeScript-based MCP server that enables text-to-image generation using Cloudflare's Flux Schnell model API.
    3
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos with language-specific support.
    723
    MIT License
  • -
    security
    A
    license
    -
    quality
    Static code analysis tool that converts code into UML diagrams and flowcharts, helping users understand code structure through visualization.
    4
    JavaScript
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    AI-powered tool that enables creation of 2D and 3D game assets from text prompts by integrating Hugging Face models and providing seamless interaction through MCP for game developers.
    32
    JavaScript
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
    2
    540
    129
    TypeScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender for AI-assisted 3D modeling, scene manipulation, and rendering.
    7,935
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Facilitates running Python code in a sandbox and generating images using the FLUX model via an MCP server compatible with clients like Goose and the Claude Desktop App.
    2
    16
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    Model Context Protocol server that enables Claude Desktop (or any MCP client) to fetch web content and process images appropriately.
    11
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that integrates the Moondream vision model with AI assistants like Claude and Cline, enabling advanced image analysis capabilities including captioning, object detection, and visual question answering.
    11
    JavaScript
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    A server that enables LLM applications to interact directly with DaVinci Resolve video editing software, allowing AI-assisted capabilities like accessing timeline information and automating editing workflows.
    22
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that generates 2D and 3D game assets from text prompts using AI models from Hugging Face Spaces, allowing developers to easily create game art through Claude Desktop or other MCP clients.
    32
    JavaScript
    MIT License
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that exposes Cloudinary Upload & Admin API methods as tools for AI assistants. This integration allows AI systems to trigger and interact with your Cloudinary cloud.
    274
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A universal Model Context Protocol implementation that serves as a semantic layer between LLMs and 3D creative software, providing a standardized interface for interacting with various Digital Content Creation tools through a unified API.
    3
    TypeScript
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    A server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.
    3
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Provides an interface between AI assistants and Tripo AI via Model Context Protocol, enabling generation of 3D assets from natural language and importing them to Blender.
    26
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    Provides image recognition capabilities using Anthropic Claude Vision and OpenAI GPT-4 Vision APIs, supporting multiple image formats and offering optional text extraction via Tesseract OCR.
    4
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A server that integrates Flux's advanced image generation and manipulation features into AI coding assistants, enabling seamless text-to-image and image control workflows in IDEs like Cursor and Windsurf.
    4
    10
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    Enables browser automation and real-time computer vision tasks through AI-driven commands, offering zero-cost digital navigation and interaction for enhanced web experiences.
    0
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    Integrates libraries with an LLM to analyze music audio.
    4
    Python
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Pinterest Model Context Protocol (MCP) server for image search and information retrieval
    4
    TypeScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables generation of various chart types (bar, line, pie, etc.) using QuickChart.io, allowing users to create visual data representations through MCP tools.
    302
    1
    JavaScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Gives AI-powered coding tools like Cursor access to Figma files, enabling them to accurately implement designs by fetching and simplifying Figma design data.
    10,638
    2,903
    TypeScript
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    Enables semantic search, image search, and cross-modal search functionalities through integration with Jina AI's neural search capabilities.
    1
    JavaScript
    MIT License
  • -
    security
    A
    license
    -
    quality
    An MCP server that allows AI coding agents to directly access and interact with Figma files and prototypes.
    41
    Python
    MIT License
  • -
    security
    A
    license
    -
    quality
    A FreeCAD addon that implements the Model Context Protocol (MCP) to enable communication between FreeCAD and Claude AI through Claude Desktop.
    17
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that analyzes YouTube videos, enabling users to extract transcripts, generate summaries, and query video content using Gemini AI.
    7
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
    12
    Python
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    Enables programmatic creation of Whimsical diagrams from Mermaid markup generated by AI models like Claude through the Model Context Protocol.
    6
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Converts addresses to GPS coordinates and creates map visualizations using the Geoapify API, allowing Claude users to generate GeoJSON data and map images from location lists.
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    A server that exposes Luma AI's video generation API through the Model Context Protocol (MCP).
    2
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Upload, edit, and generate videos from everyone's favorite LLM and Video Jungle.
    48
    Python
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables LLMs to create, modify, and manipulate Excalidraw diagrams through a structured API.
    7
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A server for downloading, processing, and managing YouTube content with features like video quality selection, format conversion, and metadata extraction.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Connects to the xAI/Grok image generation API, allowing users to generate AI images through natural language prompts with support for multiple image generation and different response formats.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Performs OCR on images or PDFs, from local files or URLs, using the Mistral OCR API (paid).
    5
    Python
    • Linux
  • -
    security
    F
    license
    -
    quality
    Enables Cursor AI to interact with Figma designs, allowing users to read design information and programmatically modify elements through natural language commands.
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Enables users to generate parametric 3D models from text descriptions or images using multi-view reconstruction and OpenSCAD, with support for AI image generation and remote processing.
    Python
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that generates images using Replicate's FLUX model and stores them in Cloudflare R2, allowing users to create images through simple prompts and retrieve accessible URLs.
    4
  • -
    security
    F
    license
    -
    quality
    Provides read-only integration with Figma's API through Claude and other MCP-compatible clients, allowing users to access Figma files and projects through natural language interactions.
    87
    TypeScript
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides AI vision capabilities for analyzing UI screenshots, offering tools for screen analysis, file operations, and UI/UX report generation.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    An MCP server for TikTok videos that allows you to get video subtitles and post details, such as the number of likes, hashtags, and publishing time.
    4
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that allows management and execution of Blender Python scripts, enabling users to create, edit and run scripts in a headless Blender environment through natural language interfaces.
    4
    Python
  • -
    security
    F
    license
    -
    quality
    Enables users to generate parametric 3D models from text descriptions or images using AI image generation and multi-view reconstruction techniques.
    Python
  • -
    security
    F
    license
    -
    quality
    A server that connects to the xAI/Grok image generation API, allowing users to generate images from text prompts with support for multiple image generation and different response formats.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Connects SketchUp to Claude AI through the Model Context Protocol, allowing Claude to directly interact with and control SketchUp for prompt-assisted 3D modeling and scene manipulation.
    20
    • Apple
  • -
    security
    F
    license
    -
    quality
    Enables AI assistants to interact with Figma files through ModelContextProtocol, allowing for viewing, commenting, and analyzing Figma designs.
    896
    98
    TypeScript
    • Apple
  • -
    security
    F
    license
    -
    quality
    An MCP server that allows users to generate, edit, and create variations of images through OpenAI's DALL-E API, supporting both DALL-E 2 and DALL-E 3 models.
    2
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A server that converts various file formats (PDF, images, Office documents, etc.) to Markdown descriptions using Cloudflare AI services.
    15
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides API functionality for creating, managing, and exporting Excalidraw drawings in various formats like SVG, PNG, and JSON.
    3
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Extracts components from Figma designs and transforms them into standardized JSON format for easy consumption by AI models and tools for interface reconstruction.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Node.js server that enables video manipulation through natural language requests, including resizing videos to different resolutions (360p to 1080p) and extracting audio in various formats (MP3, AAC, WAV, OGG).
    34
    2
    TypeScript
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides image generation functionality using the Ideogram API, allowing users to create images from text prompts with customizable parameters.
    1
    3
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI models to programmatically create Whimsical diagrams from Mermaid markup.
    6
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI assistants to create images and videos using Amazon Nova Canvas and Nova Reel models.
    2
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    Integrates TikTok access into Claude AI and other apps, allowing users to analyze video virality factors, extract content from videos, and chat with TikTok videos through TikNeuron.
    4
    JavaScript
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables AI assistants to extract transcripts from YouTube videos, allowing AI to analyze and work with video content directly.
    12
    1
    TypeScript
  • -
    security
    F
    license
    -
    quality
    Converts Figma designs to React Native components, allowing users to extract components from Figma designs and generate corresponding React Native components with proper typing and styling.
    95
    TypeScript
  • -
    security
    F
    license
    -
    quality
    A multi-agent human-computer interaction system that enables natural interaction through integrated visual recognition, speech recognition, and speech synthesis capabilities.
    1
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A powerful video editing server that processes natural language commands to perform FFmpeg operations including trimming, merging, format conversion, speed adjustment, audio management, and subtitle integration.
    12
    Python
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Node.js-based server that integrates with the xAI Grok API to provide AI-driven analysis tools for the Solana blockchain, supporting transaction analysis, address investigation, image processing, and general queries.
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Generates and returns an image using Together.ai.
    2
    TypeScript
    • Linux
    • Apple