Search for:

Using Hugging Face for Text-to-Audio, Image, and Video Generation

  • Why this server?

    This server allows you to use HuggingFace Spaces directly from Claude, offering capabilities for image generation, chat, vision tasks, and more. It supports image, audio, and text uploads/downloads, making it suitable for handling various media types.

    -
    security
    A
    license
    -
    quality
    Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
    Last updated -
    2
    188
    241
    TypeScript
    MIT License
    • Apple
  • Why this server?

    This server provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language. This is essential for discovering and utilizing Hugging Face resources.

    -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that provides Claude and other LLMs with read-only access to Hugging Face Hub APIs, enabling interaction with models, datasets, spaces, papers, and collections through natural language.
    Last updated -
    4
    Python
    MIT License
    • Apple
  • Why this server?

    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.

    -
    security
    F
    license
    -
    quality
    Enables video editing using natural language commands powered by FFmpeg, supporting operations like trimming, merging, format conversion, and more with real-time progress tracking and error handling.
    Last updated -
    17
    Python
    • Apple
    • Linux
  • Why this server?

    A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.

    -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
    Last updated -
    239
    JavaScript
    • Apple
    • Linux
  • Why this server?

    Integrates ElevenLabs Text-to-Speech capabilities with Cursor through the Model Context Protocol, allowing users to convert text to speech with selectable voices within the Cursor editor.

    -
    security
    F
    license
    -
    quality
    Integrates ElevenLabs Text-to-Speech capabilities with Cursor through the Model Context Protocol, allowing users to convert text to speech with selectable voices within the Cursor editor.
    Last updated -
    1
    Python
    • Linux
    • Apple