Image & Video Processing
Tools for image or video recognition, editing, and processing. Enables deep analysis and AI-driven visual content generation.
MCP ServersBrowse all →
AlicenseAqualityDmaintenanceJavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.Last updated210410125MIT
forgemesh-imagegenofficial
AlicenseAqualityCmaintenanceMCP server for AI image generation with automatic USDC payments on Base mainnet. Generate, remove backgrounds, and upscale images via simple tool calls.Last updated64MIT- AlicenseAqualityBmaintenanceMCP server for creating Apple .icon bundles with Liquid Glass effects (iOS 26+). 12 tools for programmatic icon creation, glass effect tuning, dark mode, and App Store export.Last updated212283MIT
- AlicenseBqualityCmaintenanceMCP server for video enhancement and SAM3 image segmentation, enabling tasks like upscaling videos and segmenting objects in images via natural language.Last updated4211MIT

Jina AI Remote MCP Serverofficial
AlicenseAqualityCmaintenanceEnables web content extraction, screenshot capture, web search, arXiv paper search, and image search through Jina AI's APIs. Provides tools for reading URLs as markdown, searching the web for current information, and finding academic papers or images.Last updated19699Apache 2.0
ZeroTrue MCP Serverofficial
AlicenseAqualityBmaintenanceEnables detection of AI-generated content in text, images, video, and audio via the ZeroTrue API, supporting multiple analysis tools and MCP-compatible clients.Last updated6MIT- AlicenseAqualityCmaintenanceOn-demand live vision MCP for AI agents — open a session at a lat/lng, receive a JPEG snapshot or WebRTC stream, settled per-second in USDC on Base. Four tools (get_session, get_frame, get_stream_url, cancel_session); currently Base Sepolia testnet.Last updated241MIT
- AlicenseAqualityCmaintenanceMCP server that turns articles, transcripts, and markdown into LinkedIn carousel PDFs, Instagram PNGs, and Threads PNGs. Content in, slides out. No web UI, no cloud service.Last updated85MIT

Recraft AI MCP Serverofficial
AlicenseAqualityDmaintenanceAn MCP server that integrates with Recraft AI to enable generation and manipulation of high-quality raster and vector images through tools like image generation, editing, vectorization, background removal, and upscaling.Last updated97357MIT
vchart-mcp-serverofficial
AlicenseAqualityCmaintenanceA Model Context Protocol (MCP) server for the @visactor/vchart that enables AI assistants to generate interactive charts and visualizations.Last updated102251MIT
@avclabs.ai/enhance-mcpofficial
AlicenseAqualityDmaintenanceEnables video enhancement through MCP tools for creating tasks, querying status, and synchronous enhancement, supporting URL or local file inputs.Last updated342Apache 2.0- AlicenseAqualityAmaintenanceMCP server + Claude Code plugin for ComfyUI: execute workflows, generate images, visualize pipelines as Mermaid diagrams, compose/validate workflows, manage and download models, control VRAM, and explore custom nodes. 36 tools, cross-platform, installs via npx -y comfyui-mcp.Last updated18853,734116MIT
- AlicenseBqualityCmaintenanceAn official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.Last updated1536MIT

Jsoncut MCP Serverofficial
AlicenseAqualityFmaintenanceEnables AI agents to generate JSON configurations for creating images and videos programmatically through the Jsoncut API, with support for layers, positioning, transitions, and validation.Last updated5141MIT
Runway API MCP Serverofficial
AlicenseAqualityCmaintenanceEnables AI video and image generation through the Runway API. Supports video generation from images and text prompts, image creation, video upscaling and editing, and task management.Last updated72120MIT
flo-pluginofficial
AlicenseBqualityCmaintenanceIntegrates Flo's AI-powered media automation into Claude, enabling tasks like quality control, content moderation, delivery validation, and asset search via slash commands and MCP tools.Last updated120MIT- AlicenseBqualityCmaintenanceGenerates realistic human face images that don't represent real people, offering various output shapes, configurable dimensions, and batch generation capabilities.Last updated41236MIT

AWS Nova Canvasofficial
AlicenseAqualityCmaintenanceProvides image generation capabilities using Amazon Nova Canvas through Amazon Bedrock, enabling the creation of visuals from text prompts and color palettes—perfect for mockups, diagrams, and UI design concepts.Last updated29,166- AlicenseBqualityCmaintenanceProduction-ready MCP server with 40+ tools — QR codes, PDFs, text processing, TTS, web scraping, image generation and more. Built for AI agents.Last updated22515MIT

Graphistry MCPofficial
AlicenseAqualityFmaintenanceGPU-accelerated graph visualization and analytics server for Large Language Models that integrates with Model Control Protocol (MCP), enabling AI assistants to visualize and analyze complex network data.Last updated172311MIT- TypeScriptMIT
- AlicenseAqualityCmaintenanceAn MCP server that brings Google Gemini's image generation and editing capabilities to Claude Desktop, Claude Code, and Cursor. It supports 2K image creation, natural language image transformations, and session consistency to maintain styles across generations.Last updated7820MIT
- AlicenseAqualityBmaintenanceGenerates family pedigree tree diagrams as PNG or SVG images using standard genetic notation compliant with Bennett 2008/2022 NSGC guidelines. Supports comprehensive genealogical features including conditions, genetic testing results, twin relationships, carrier status, and adoption indicators.Last updated22210MIT
- MIT
- AlicenseBqualityCmaintenanceAn MCP server that enables image generation using Replicate's Flux 1.1 Pro model. It provides a tool for creating visuals from text prompts with customizable settings for aspect ratio, output format, and quality.Last updated1MIT
- AlicenseBqualityCmaintenanceEnables users to send live webcam images to Claude Desktop or other MCP clients, facilitating interaction through capturing images, screenshots, and providing a webcam view for visual input.Last updated230117MIT
- AlicenseBqualityCmaintenanceEnables AI agents to search, analyze, and extract insights from YouTube videos including transcripts, visual frames, and benchmarks without requiring API keys. Supports semantic search across playlists, sentiment analysis, and visual content indexing with automatic fallback chains for reliable access.Last updated4119225MIT
- AlicenseAqualityCmaintenanceEnables AI assistants to capture screenshots and read clipboard content from Windows applications while operating within a WSL environment. It supports monitor or window-specific targeting and features intelligent image optimization for efficient data transfer.Last updated24MIT
- AlicenseAqualityCmaintenanceUse XAI's latest api functionalities with Grok MCP. It supports image understanding and generation, live search, latest models and more.Last updated2233MIT
- AlicenseAqualityCmaintenanceEnables AI assistants to generate product mockups by integrating with the Dynamic Mockups API. Supports browsing mockup catalogs, creating single or batch renders with design assets, and managing PSD templates.Last updated13188MIT
MCP ConnectorsBrowse all →
Tag, rename, and enrich PDFs and images. Free tier: 1,500 tags/month, no credit card.
Background removal, 4x upscaling, and face restoration via GPU
- mcpA
Generate images, GIFs, and PDFs from HTML, URLs, or templates — from your AI agent.
FFmpeg Micro MCP Server. Transcode videos from n8n or Make using FFmpeg in the cloud. Code+Docs: https://github.com/javidjamae/ffmpeg-micro-mcp/
Composable APIs for document extraction, image transformation, and document & sheet generation.
- mcpA
ROC biometrics & computer vision: face, LPR, OCR, pedestrian, vehicle, gun detection.
Render charts and data visualizations as SVG or PNG images from a JSON config.
Decode video ads, load brand intelligence, generate ad scripts.
AI-powered image processing via GPU. Remove backgrounds and upscale images (2x/4x) directly from any MCP client. OAuth 2.1 authenticated, returns processed images inline with download links. Free credits on signup at maskr.io.
Media intelligence analysis for audio, video, and images via the Echosaw MCP server.
Create and track AI music videos and audio-reactive visuals from songs.
Random food images by category (pizza, burger, dosa, etc.) via Foodish. Keyless.
OCR, transcription, file extraction, and image generation for AI agents via MCP.
Detect AI-generated images, videos, and audio with identifAI's deepfake detection tools.
Create and manage cinematic AI video renders through the Future Video Studio Agent API.
AI virtual staging for real estate — stage rooms, beautify floor plans, classify images.
Image processing for AI agents. Resize, convert, compress, and pipeline images.
CATAAS MCP — Cat as a Service (free, no auth)