Multimedia Processing
Provides the ability to handle multimedia, such as audio and video editing, playback, format conversion, also includes video filters, enhancements, and so on.
MCP ServersBrowse all →
AlicenseAqualityDmaintenanceJavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.Last updated210426125MIT- AlicenseAqualityBmaintenanceMCP server that fetches YouTube video transcripts and optionally summarizes them. Supports multiple transcript formats (text, JSON, SRT, WebVTT), multi-language retrieval, and flexible YouTube URL parsing.Last updated452MIT
- AlicenseAqualityCmaintenanceExtracts YouTube video metadata, titles, and descriptions along with transcripts generated from subtitles or OpenAI Whisper speech-to-text. This server enables users to retrieve and analyze detailed video content directly within MCP-compatible environments.Last updated418MIT
- AlicenseBqualityCmaintenanceMCP server for video enhancement and SAM3 image segmentation, enabling tasks like upscaling videos and segmenting objects in images via natural language.Last updated4211MIT
- MIT

Jsoncut MCP Serverofficial
AlicenseAqualityFmaintenanceEnables AI agents to generate JSON configurations for creating images and videos programmatically through the Jsoncut API, with support for layers, positioning, transitions, and validation.Last updated5141MIT
flo-pluginofficial
AlicenseBqualityCmaintenanceIntegrates Flo's AI-powered media automation into Claude, enabling tasks like quality control, content moderation, delivery validation, and asset search via slash commands and MCP tools.Last updated120MIT- AlicenseAqualityCmaintenanceAll Voice Lab MCP ServerLast updated1256MIT

Qiniu MCP Serverofficial
AlicenseBqualityDmaintenanceThe Model Context Protocol (MCP) Server built on Qiniu Cloud products supports users in accessing Qiniu Cloud Storage, intelligent multimedia services, and more through this MCP Server within the context of AI large model clients.Last updated2236MIT- AlicenseAqualityBmaintenanceRemove vocals, extract instrumentals, and split any song into up to six stems — directly from Claude Desktop, Cursor, or any MCP client. Supports local audio files, YouTube URLs, and SoundCloud trackLast updated11570MIT

@avclabs.ai/enhance-mcpofficial
AlicenseAqualityDmaintenanceEnables video enhancement through MCP tools for creating tasks, querying status, and synchronous enhancement, supporting URL or local file inputs.Last updated342Apache 2.0- AlicenseBqualityDmaintenanceProvides programmatic access to Baidu's Xiling Digital Human platform, enabling AI assistants to generate digital human videos, clone voices, and create synthesized speech through 13 standardized MCP protocol interfaces.Last updated134MIT

MMAudio MCPofficial
AlicenseBqualityBmaintenanceEnables AI-powered video-to-audio and text-to-audio generation using MMAudio's API. Create synchronized audio from video content or generate audio from text descriptions with configurable parameters.Last updated393MIT- AlicenseAqualityCmaintenanceEnables AI assistants to download Instagram content including posts, videos, reels, stories, highlights, and profile pictures using Instaloader, with optional metadata and caption extraction.Last updated75MIT

Cosmic MCP Serverofficial
AlicenseAqualityFmaintenanceAn MCP server that enables AI assistants to manage content, media, and schemas within Cosmic CMS buckets. It allows users to perform CRUD operations on objects and types while providing tools for AI-driven text, image, and video generation.Last updated17251MIT
Runway API MCP Serverofficial
AlicenseAqualityCmaintenanceEnables AI video and image generation through the Runway API. Supports video generation from images and text prompts, image creation, video upscaling and editing, and task management.Last updated72120MIT- AlicenseBqualityCmaintenanceGenerates realistic human face images that don't represent real people, offering various output shapes, configurable dimensions, and batch generation capabilities.Last updated41396MIT
- AlicenseBqualityCmaintenanceEnables text-to-image generation through the ModelScope platform using the Qwen/Qwen-Image model. It supports customizable parameters such as negative prompts, resolution, and sampling steps within MCP-compatible clients.Last updated1141MIT
- AlicenseBqualityCmaintenanceAn unofficial MCP-compatible server that enables advanced automation, querying, and remote control of Adobe Premiere Pro projects for power users, workflow automation, and AI integration.Last updated1514MIT
- AlicenseBqualityAmaintenanceAn MCP server that enables AI assistants to control Blender through 108 specialized tools for 3D modeling, animation, and rendering. It provides a secure, thread-safe interface to execute validated operations in Blender using natural language commands.Last updated10029AGPL 3.0
- AlicenseAqualityBmaintenanceUnified MCP server for Gemini media generation --Nano Banana (images), Veo 3.1 (video), TTS, and Lyria 3 (music). Single Go binary, 12 tools, supports Gemini API key and Vertex AI.Last updated125Apache 2.0
- AlicenseBqualityCmaintenanceAn MCP server that enables image generation using Replicate's Flux 1.1 Pro model. It provides a tool for creating visuals from text prompts with customizable settings for aspect ratio, output format, and quality.Last updated1MIT
- AlicenseAqualityDmaintenanceAn MCP server for TouchDesigner that lets you control TouchDesigner with ClaudeLast updated5MIT
- AlicenseAqualityDmaintenanceEnables advanced audio transcription, text-to-speech generation, and audio processing using OpenAI's Whisper and GPT-4o models with support for multiple audio formats, file management, and parallel processing.Last updated854MIT
- AlicenseAqualityCmaintenanceEnables AI-assisted RAW photo development via RawTherapee CLI, with a visual feedback loop that allows the LLM to see and iteratively edit images.Last updated49MIT
- AlicenseAqualityCmaintenanceAn MCP server that parses Douyin share links and performs intelligent content analysis using the Doubao video understanding model. It provides structured outputs including video summaries, categorized outlines, and step-by-step tutorial information.Last updated12MIT
- AlicenseAqualityCmaintenanceAn MCP server that brings Google Gemini's image generation and editing capabilities to Claude Desktop, Claude Code, and Cursor. It supports 2K image creation, natural language image transformations, and session consistency to maintain styles across generations.Last updated7820MIT
- AlicenseAqualityBmaintenanceFetches YouTube video subtitles and transcripts with support for multiple languages and output formats (SRT, VTT, TXT, JSON).Last updated118Apache 2.0
- AlicenseAqualityBmaintenanceAI 3D model generation and post-processing MCP server — text/image/multiview-to-3D via Tripo, retopology, format conversion (GLB/FBX/OBJ/STL/USDZ), and stylization. Single Go binary, 10 tools.Last updated103Apache 2.0
- AlicenseBquality-maintenanceAn MCP server that automatically renames local subtitle files to match corresponding videos using statistical token matching and episode verification. It enables users to scan media directories, preview matches, and manage subtitle configurations through natural language commands.Last updated6
MCP ConnectorsBrowse all →
Video transcoding and document conversion for AI agents. Transcode to MP4 (H.264), WebM/VP9, ProRes 422, GIF, or MP3 audio. Convert PDFs, DOCX, PPTX, XLSX, HTML, Markdown, and images. Prepaid wallet with per-job billing — no FFmpeg, no storage, no infrastructure to manage.
Background removal, 4x upscaling, and face restoration via GPU
- mcpA
Generate images, GIFs, and PDFs from HTML, URLs, or templates — from your AI agent.
AI audio tools for music producers — stem splitting, vocal removal, BPM & key detection, audio-to-MIDI, format conversion, trimming, video-to-audio extraction and AI song generation.
The first artist-owned MCP server. Discover, narrate, and stream Matthew Hartley's debut album The Time Is Now from any compatible AI client. Exposes 8 tools (list_songs, get_song, list_chapters, get_chapter, get_artist, get_experience, get_experience_prompt, refresh_stream_urls) over a public HTTP endpoint with no auth. Apache 2.0 licensed.
Composable APIs for document extraction, image transformation, and document & sheet generation.
Media intelligence analysis for audio, video, and images via the Echosaw MCP server.
AI-powered image processing via GPU. Remove backgrounds and upscale images (2x/4x) directly from any MCP client. OAuth 2.1 authenticated, returns processed images inline with download links. Free credits on signup at maskr.io.
- apiA
Quiz.Video MCP: list, create, AI-generate, and render quiz and flashcard videos.
Create and track AI music videos and audio-reactive visuals from songs.
MCP server for meme generation, template search, caption rendering, and AI meme creation.
Image processing for AI agents. Resize, convert, compress, and pipeline images.
Hosted MCP server for meme generation, meme template search, caption rendering, and AI meme creation.
Imgflip MCP — wraps Imgflip API (free, no auth for template listing)
125+ browser tools for PDF, Image, Video, Audio, AI, Scanner. Files never leave your device.
AI music, video, image, and voice tools callable by agents with USDC payments via x402 on Base.
84+ free local-first tools: image, PDF, docs, dev utils. Wasm, zero upload, x402 API.
Analyze images and videos with Gemini to get fast, reliable visual insights. Handle content from U…
Search your Flashback video library with natural language to instantly find relevant moments. Get…
- ChiR24-unreal_mcpOAuth
Control Unreal Engine to browse assets, import content, and manage levels and sequences. Automate…