Using Hugging Face for Text-to-Audio, Image, and Video Generation


Why this server?
This server lets you use Hugging Face Spaces directly from Claude, with capabilities including image generation, chat, and vision tasks. It supports image, audio, and text uploads and downloads, so it can handle a range of media types.
mcp-hfspace
Image & Video Processing, Audio Processing, App Automation
evalstate
License: A, Quality: C, Maintenance: D
Use Hugging Face Spaces directly from Claude. Use open-source image generation, chat, vision tasks, and more. Supports image, audio, and text uploads and downloads.
Last updated 2025-06-13
3
143
387
MIT
Why this server?
This server provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language. This makes it useful for discovering and using Hugging Face resources.
Hugging Face MCP Server
RAG Systems, Databases, App Automation
shreyaskarnik
License: A, Quality: B, Maintenance: D
A Model Context Protocol server that provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language.
Last updated 2025-03-19
10
71
MIT
Why this server?
Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
Video Editor MCP Server
Multimedia Processing, App Automation
Kush36Agrawal
License: F, Quality: -, Maintenance: D
Last updated 2025-01-04
49
Why this server?
A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Speech MCP Server
Text-to-Speech, Audio Processing, Multimedia Processing
hammeiam
License: A, Quality: B, Maintenance: D
Last updated 2025-03-28
4
7
1
MIT
Why this server?
Integrates ElevenLabs Text-to-Speech capabilities with Cursor through the Model Context Protocol, allowing users to convert text to speech with selectable voices within the Cursor editor.
ElevenLabs Text-to-Speech MCP
Text-to-Speech, Code Execution
georgi-io
License: F, Quality: -, Maintenance: D
Last updated 2025-03-25
1