MCP Servers for Speech Processing

Voice interaction and speech processing capabilities. Enables converting speech to text, audio commands, and voice generation.

View all MCP Servers

  • A
    security
    F
    license
    A
    quality
    Facilitates direct speech generation using Claude for multiple languages and emotions, integrating with a Zonos TTS setup via the Model Context Protocol.
    1
    2
    TypeScript
  • -
    security
    A
    license
    -
    quality
    Enables seamless integration between Ollama's local LLM models and MCP-compatible applications, supporting model management and chat interactions.
    10
    3
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    This server enables AI models to send SMS messages and initiate Text-to-Speech calls programmatically using ClickSend's API with built-in rate limiting and input validation.
    2
    1
    JavaScript
    MIT License
  • -
    security
    F
    license
    -
    quality
    Enables users to manage Gmail accounts using AI agent-assisted operations via an MCP protocol, supporting email search, reading, deletion, and sending with a voice-powered interface.
    200
    4
    TypeScript
  • A
    security
    A
    license
    A
    quality
    Enables integration of DeepSeek's language models with MCP-compatible applications, offering features like chat completion, custom model selection, and parameter control for enhancing language-based interactions.
    1
    388
    56
    JavaScript
    MIT License
  • -
    security
    F
    license
    -
    quality
    A server providing text-to-speech and speech-to-text functionalities using Windows' native speech services without external dependencies.
    3
    JavaScript
  • A
    security
    A
    license
    A
    quality
    Enables text-to-speech functionality on macOS using the say command, offering extensive control over speech parameters like voice, rate, volume, and pitch for a customizable auditory experience.
    2
    3
    8
    JavaScript
    MIT License
    • Apple
  • -
    security
    A
    license
    -
    quality
    Expose all Home Assistant voice intents through a Model Context Protocol Server allowing home control.
    23
    Python
    Apache 2.0
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables retrieval of transcripts from YouTube videos. This server provides direct access to video captions and subtitles through a simple interface.
    1
    86
    31
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
    2
    62
    63
    TypeScript
    MIT License
    • Apple