"Using Hugging Face for Text-to-Audio, Image, and Video Generation" matching MCP servers:

image-video-generation-mcp
AI & Machine Learning Image & Video Processing
156554395
A
license
B
quality
D
maintenance
Enables image and video generation using BigModel AI's CogView and CogVideoX models via MCP, supporting batch image generation and various configuration options.
Last updated 2025-10-22
6
23
4
MIT
Video to Text MCP Server
Multimedia Processing Audio Processing Speech Processing
strzhao
A
license
B
quality
D
maintenance
Enables downloading videos from platforms like YouTube and converting them to text using OpenAI Whisper and ffmpeg. It supports multiple output formats including TXT, JSON, SRT, and VTT for transcriptions.
Last updated 2026-01-13
2
3
ISC
Doubao Image/Video Generation
Image & Video Processing Multimedia Processing
156554395
A
license
A
quality
C
maintenance
Enables AI image generation using Doubao Seedream models and video generation using Doubao Seedance models through Volcano Engine's API, supporting text-to-image, image-to-image, text-to-video, and task status queries.
Last updated 2025-12-30
3
31
3
MIT
Zhipu Text-to-Image MCP Server
Image & Video Processing Multimedia Processing
2716025420
A
license
-
quality
D
maintenance
Enables text-to-image generation using Zhipu AI's CogView-4 API. Supports generating images from text prompts with configurable size and quality parameters through MCP-compatible clients like Claude Desktop and Cline.
Last updated 2025-12-07
7
MIT
io.github.pvliesdonk/image
Image & Video Processing AI & Machine Learning
pvliesdonk
A
license
-
quality
B
maintenance
Multi-provider image generation MCP server that enables image generation from Claude Desktop, Claude Code, or any MCP client using OpenAI, Google Gemini, Stable Diffusion, or a placeholder provider.
Last updated 2026-06-08
1
MIT
text-to-model
App Automation Developer Tools Software Architecture
mikan-atomoki
A
license
B
quality
B
maintenance
Turn natural language into 3D models in Fusion 360. 64 CAD tools including sketches, extrudes, fillets,and JIS standard parts.
Last updated 2026-03-16
65
2
MIT
Azure Image Generation MCP
Image & Video Processing Multimedia Processing Developer Tools
malikmalikayesha
A
license
B
quality
C
maintenance
Enables AI-powered image generation using Azure DALL-E 3 and FLUX models with intelligent automatic model selection. Generates stunning photorealistic or creative images directly within LibreChat through simple text prompts.
Last updated 2026-02-20
1
6
2
MIT
Video & Audio Editing MCP
Multimedia Processing Audio Processing
misbahsy
F
license
B
quality
C
maintenance
Provides powerful video and audio editing capabilities through FFmpeg, enabling AI assistants to perform professional-grade operations including format conversion, trimming, overlays, transitions, and advanced audio processing.
Last updated 2025-05-24
27
77
Image Generation MCP Server
Image & Video Processing Autonomous Agents
GongRzhe
A
license
B
quality
F
maintenance
Provides image generation capabilities for Claude using the Replicate Flux model, allowing users to create images from text prompts with customizable parameters like aspect ratio and output format.
Last updated 2025-04-15
1
40
51
MIT
MCP Video & Audio Text
Speech Processing Multimedia Processing
SealinGp
A
license
-
quality
C
maintenance
An MCP server that downloads videos/extracts audio from various platforms like YouTube, Bilibili, and TikTok, then transcribes them to text using OpenAI's Whisper model.
Last updated 2025-05-27
9
MIT
Image Generation MCP
Image & Video Processing Marketing Social Media
12-days-of-shipmas-2025
A
license
A
quality
B
maintenance
Generates blog and social media images using Google's Gemini AI with pre-configured platform presets for Ghost, Medium, Instagram, Twitter, LinkedIn, YouTube, and more.
Last updated 2025-12-31
2
13
MIT
Content & Image Generation MCP
Image & Video Processing Marketing Content Management Systems
vanman2024
A
license
-
quality
C
maintenance
AI-powered content and image generation server with Google Imagen 3/4 for images, Veo 2/3 for videos, and Claude/Gemini for marketing copywriting, including batch processing, cost estimation, and campaign planning tools.
Last updated 2025-12-25
MIT
Hugging Face MCP Server
Text Summarization Image & Video Processing Speech Processing
NimbleBrainInc
A
license
-
quality
-
maintenance
Enables access to 200,000+ machine learning models through the Hugging Face Inference API. Supports text generation, image creation, classification, translation, speech processing, embeddings, and more AI tasks.
Last updated 2025-10-09
Image Generation MCP Server
Image & Video Processing Multimedia Processing
marc-shade
A
license
-
quality
C
maintenance
Provides multi-provider image generation with automatic fallback across services like Pollinations.ai, Cloudflare, and Hugging Face. It features specialized pixel art generation, cost tracking, and automatic saving of generated visual assets to disk.
Last updated 2026-02-22
MIT
Hugging Face MCP Server
AI & Machine Learning Search
huggingface
A
license
-
quality
B
maintenance
Connects LLMs to the Hugging Face Hub and thousands of Gradio AI applications, enabling model interaction and search.
Last updated 2026-06-09
247
MIT
PDF to Text MCP Server
xxx87
-
license
-
quality
-
maintenance
Converts PDF files to text for use with MCP-compatible applications like Cursor IDE.
Last updated 2025-09-19
Hugging Face Hub MCP Server
Search Databases RAG Systems
michaelwaves
F
license
A
quality
D
maintenance
Enables access to the Hugging Face Hub API to search and retrieve information about machine learning models, datasets, and their metadata. Provides comprehensive tools for exploring the Hugging Face ecosystem including model details, dataset information, and parquet file access.
Last updated 2025-08-24
8
Image Generation MCP Server
Image & Video Processing Multimedia Processing
Ichigo3766
A
license
A
quality
C
maintenance
A MCP server that integrates with Stable Diffusion WebUI to provide text-to-image generation and image upscaling capabilities through simple API calls.
Last updated 2025-07-21
5
37
MIT
Hugging Face MCP Server
RAG Systems Databases App Automation
shreyaskarnik
A
license
B
quality
C
maintenance
A Model Context Protocol server that provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language.
Last updated 2025-03-19
10
71
MIT
Volcengine Image Generation
Image & Video Processing Multimedia Processing
stvlynn
F
license
B
quality
C
maintenance
Enables AI-powered text-to-image generation using Volcengine's API with support for multiple image sizes, customizable parameters like guidance scale and seed, and flexible output formats.
Last updated 2025-08-21
1