Search for:

Using Hugging Face for Text-to-Audio, Image, and Video Generation

  • Why this server?

    This server allows you to use HuggingFace Spaces directly from Claude, offering capabilities for image generation, chat, vision tasks, and more. It supports image, audio, and text uploads/downloads, making it suitable for handling various media types.

    -
    security
    A
    license
    -
    quality
    Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
    2
    184
    219
    TypeScript
    MIT License
    • Apple
  • Why this server?

    This server provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language. This is essential for discovering and utilizing Hugging Face resources.

    -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language.
    4
    Python
    MIT License
    • Apple
  • Why this server?

    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.

    -
    security
    F
    license
    -
    quality
    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
    17
    Python
    • Apple
    • Linux
  • Why this server?

    A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.

    -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
    239
    JavaScript
    • Apple
    • Linux
  • Why this server?

    Integrates ElevenLabs Text-to-Speech capabilities with Cursor through the Model Context Protocol, allowing users to convert text to speech with selectable voices within the Cursor editor.

    -
    security
    F
    license
    -
    quality
    Integrates ElevenLabs Text-to-Speech capabilities with Cursor through the Model Context Protocol, allowing users to convert text to speech with selectable voices within the Cursor editor.
    1
    Python
    • Linux
    • Apple