Image & Video Processing
Tools for image or video recognition, editing, and processing. Enables deep analysis and AI-driven visual content generation.
MCP ServersBrowse all →
- TypeScriptMIT

AWS Nova Canvasofficial
AlicenseAqualityCmaintenanceProvides image generation capabilities using Amazon Nova Canvas through Amazon Bedrock, enabling the creation of visuals from text prompts and color palettes—perfect for mockups, diagrams, and UI design concepts.Last updated29,030
vchart-mcp-serverofficial
AlicenseAqualityCmaintenanceA Model Context Protocol (MCP) server for the @visactor/vchart that enables AI assistants to generate interactive charts and visualizations.Last updated105251MIT
Graphistry MCPofficial
AlicenseAqualityCmaintenanceGPU-accelerated graph visualization and analytics server for Large Language Models that integrates with Model Control Protocol (MCP), enabling AI assistants to visualize and analyze complex network data.Last updated172711MIT- AlicenseAqualityCmaintenanceA security-hardened MCP server for generating and editing images using Google Gemini models. It provides tools for text-to-image creation and iterative image editing with strict input validation and secure file handling.Last updated13MIT
- AlicenseAqualityBmaintenancePost AI-generated images to Vynly, the AI-only social feed. Four tools for posting, reading, searching, and ephemeral sparks — auto-claims a demo token on first run so it works with no signup.Last updated440MIT
- AlicenseBqualityCmaintenanceA server that enables AI assistants to create and edit PowerPoint presentations with features for adding various slide types, tables, charts, and AI-generated images through Stable Diffusion.Last updated21153MIT
- AlicenseAqualityBmaintenanceConvert HTML to PDF/PNG/WebP/PPTX slide carousels with 11 themes. For LinkedIn carousels, decks, Instagram posts, and infographics — Puppeteer-based pixel-perfect rendering.Last updated2261MIT
- AlicenseAqualityBmaintenanceMCP server for ComfyUI — text-to-image, variations, img2img refine, upscale, image proxy, and workflow runner.Last updated41545MIT

Jsoncut MCP Serverofficial
AlicenseAqualityFmaintenanceEnables AI agents to generate JSON configurations for creating images and videos programmatically through the Jsoncut API, with support for layers, positioning, transitions, and validation.Last updated561MIT
MiniMax MCP JSofficial
AlicenseAqualityDmaintenanceJavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.Last updated10604122MIT- AlicenseBqualityCmaintenanceMCP server for video enhancement and SAM3 image segmentation, enabling tasks like upscaling videos and segmenting objects in images via natural language.Last updated4MIT
- AlicenseBqualityCmaintenanceAn official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.Last updated11836MIT

Jina AI Remote MCP Serverofficial
AlicenseAqualityCmaintenanceEnables web content extraction, screenshot capture, web search, arXiv paper search, and image search through Jina AI's APIs. Provides tools for reading URLs as markdown, searching the web for current information, and finding academic papers or images.Last updated19671Apache 2.0- AlicenseAqualityBmaintenanceRoutes one brief to the right image model across 60+ (gpt-image-1.5, Ideogram 3, Recraft V4, Flux), validates the output, and fans out to iOS/Android/PWA/favicon/visionOS/Flutter bundles. Works without an API key via Pollinations, HF Inference, Stable Horde, or host-LLM inline SVG.Last updated2244
- AlicenseAqualityBmaintenanceMCP server for Shopify Admin API with a ComfyUI bridge for AI product image generation. Covers products, orders, inventory, and customers.Last updated2587MIT

Runway API MCP Serverofficial
AlicenseAqualityCmaintenanceEnables AI video and image generation through the Runway API. Supports video generation from images and text prompts, image creation, video upscaling and editing, and task management.Last updated7819MIT
Recraft AI MCP Serverofficial
AlicenseAqualityBmaintenanceAn MCP server that integrates with Recraft AI to enable generation and manipulation of high-quality raster and vector images through tools like image generation, editing, vectorization, background removal, and upscaling.Last updated914654MIT- AlicenseBqualityCmaintenanceEnables AI assistants to search for and retrieve images, illustrations, and videos directly from Pixabay. It provides specialized tools for discovering diverse media content like photos and animations using the Pixabay API.Last updated2MIT
- AlicenseAqualityCmaintenanceAn MCP server that reviews UI edit requests by comparing before and after screenshots, providing visual feedback on whether changes satisfy the user's requirements.Last updated1915GPL 2.0
- AlicenseAqualityCmaintenanceEnables AI agents to manage ComfyUI workflows using a human-readable Domain Specific Language (DSL), with automatic conversion to/from JSON format. Supports workflow creation, validation, execution, and monitoring through natural language interactions.Last updated162MIT
- AlicenseAqualityCmaintenanceEnables Claude Desktop to interact with local ComfyUI installations for AI-powered image generation, including workflow management, model selection, real-time monitoring, and custom workflow execution through natural language.Last updated1453215MIT
- AlicenseAqualityCmaintenanceProvides screenshot and OCR capabilities for macOS.Last updated110823MIT
- AlicenseAqualityCmaintenanceAbout MCP server for image conversion, resizing, and merging — runs locally, no uploadsLast updated501MIT
- AlicenseBqualityCmaintenanceEnables users to send live webcam images to Claude Desktop or other MCP clients, facilitating interaction through capturing images, screenshots, and providing a webcam view for visual input.Last updated292117MIT
- AlicenseBqualityCmaintenanceEnables Claude Desktop to generate text and analyze images using Google's Gemini Pro API. Provides seamless integration between Claude and Gemini's AI capabilities through natural language commands.Last updated2MIT
- AlicenseAqualityCmaintenanceEnables AI assistants to generate product mockups by integrating with the Dynamic Mockups API. Supports browsing mockup catalogs, creating single or batch renders with design assets, and managing PSD templates.Last updated1312
- AlicenseBqualityCmaintenanceModel Context Protocol server that enables generating videos from text prompts and/or images using AI models (Luma Ray2 Flash and Kling v1.6 Pro) with configurable parameters like aspect ratio, resolution, and duration.Last updated23MIT
- AlicenseAqualityCmaintenanceProvides machine learning researchers with tools for creating publication-quality scientific visualizations, statistical plots, and 2D data representations. It streamlines the research workflow by enabling AI assistants to generate complex figures from CSV, JSON, or direct data inputs.Last updated9MIT
- AlicenseBqualityBmaintenanceAgent-native media processing: video encoding, image manipulation, document conversion, audio transcription, and more via 86+ cloud Robots.Last updated771
MCP ConnectorsBrowse all →
Background removal, 4x upscaling, and face restoration via GPU
FFmpeg Micro MCP Server. Transcode videos from n8n or Make using FFmpeg in the cloud. Code+Docs: https://github.com/javidjamae/ffmpeg-micro-mcp/
Composable APIs for document extraction, image transformation, and document & sheet generation.
Media intelligence analysis for audio, video, and images via the Echosaw MCP server.
AI-powered image processing via GPU. Remove backgrounds and upscale images (2x/4x) directly from any MCP client. OAuth 2.1 authenticated, returns processed images inline with download links. Free credits on signup at maskr.io.
Create and track AI music videos and audio-reactive visuals from songs.
OCR, transcription, file extraction, and image generation for AI agents via MCP.
Search and browse a curated gallery of memes, infographics, and visual content.
Detect AI-generated images, videos, and audio with identifAI's deepfake detection tools.
Create and manage cinematic AI video renders through the Future Video Studio Agent API.
AI virtual staging for real estate — stage rooms, beautify floor plans, classify images.
MCP server for meme generation, template search, caption rendering, and AI meme creation.
CATAAS MCP — Cat as a Service (free, no auth)
Image processing for AI agents. Resize, convert, compress, and pipeline images.
Hosted MCP server for meme generation, meme template search, caption rendering, and AI meme creation.
Remote MCP for image/video anonymization, face/body/license-plate masking, and de-identification.
Imgflip MCP — wraps Imgflip API (free, no auth for template listing)
Explore and analyze Cupix construction site data: 360 images, progress, and insights.
125+ browser tools for PDF, Image, Video, Audio, AI, Scanner. Files never leave your device.