MCP Servers for Text-to-Speech

Tools for converting text-to-speech and vice-versa.

View all MCP Servers

  • A
    security
    A
    license
    A
    quality
    Enables MCP clients like Claude Desktop and Cursor to interact with MiniMax APIs for generating speech, cloning voices, creating videos, and generating images.
    Last updated -
    6
    381
    Python
    MIT License
    • Linux
    • Apple
  • A
    security
    A
    license
    A
    quality
    JavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.
    Last updated -
    6
    359
    31
    TypeScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    A simple MCP server that can send notifications on mac devices.
    Last updated -
    5
    9
    8
    TypeScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    An MCP server implementation that integrates with Minimax API to provide AI-powered image generation and text-to-speech functionality in editors like Windsurf and Cursor.
    Last updated -
    2
    192
    1
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Node.js server that enables AI assistants to interact with Bouyomi-chan's text-to-speech functionality through Model Context Protocol (MCP), allowing for voice reading of text with adjustable parameters.
    Last updated -
    1
    1
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    This server enables AI models to send SMS messages and initiate Text-to-Speech calls programmatically using ClickSend's API with built-in rate limiting and input validation.
    Last updated -
    2
    1
    JavaScript
    MIT License
  • A
    security
    A
    license
    A
    quality
    Enables text-to-speech functionality on macOS using the say command, offering extensive control over speech parameters like voice, rate, volume, and pitch for a customizable auditory experience.
    Last updated -
    2
    7
    11
    JavaScript
    MIT License
    • Apple
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables AI models to generate and play high-quality text-to-speech audio through your device's native audio system using Rime's voice synthesis API.
    Last updated -
    1
    176
    4
    JavaScript
    The Unlicense
    • Apple
    • Linux
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables AI assistants like Claude to use Bouyomichan (a Japanese text-to-speech program) for voice reading with adjustable voice types, volume, speed, and pitch.
    Last updated -
    1
    1
    JavaScript
    MIT License
    • Apple
  • A
    security
    F
    license
    A
    quality
    Facilitates direct speech generation using Claude for multiple languages and emotions, integrating with a Zonos TTS setup via the Model Context Protocol.
    Last updated -
    1
    9
    TypeScript
    • Linux
  • A
    security
    F
    license
    A
    quality
    A Python-based server that provides access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.
    Last updated -
    5
    Python
    • Linux
    • Apple
  • A
    security
    F
    license
    A
    quality
    Integrates Textwell with the Model Context Protocol for facilitating text operations such as writing and appending text through a GitHub Pages bridge.
    Last updated -
    1
    JavaScript
    • Apple
  • -
    security
    A
    license
    -
    quality
    A server that generates MP3 audio files from text using Kokoro TTS technology with optional S3 upload capabilities.
    Last updated -
    Python
    Apache 2.0
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
    Last updated -
    TypeScript
    MIT License
  • -
    security
    -
    license
    -
    quality
    An MCP server that enables LLMs to generate spoken audio from text using OpenAI's Text-to-Speech API, supporting various voices, models, and audio formats.
    Last updated -
    1
    JavaScript
    MIT License
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides text-to-speech functionality for AI agents using Microsoft Edge's text-to-speech technology, supporting multiple voices, languages, and voice customization.
    Last updated -
    1
    Python
  • -
    security
    F
    license
    -
    quality
    A server providing text-to-speech and speech-to-text functionalities using Windows' native speech services without external dependencies.
    Last updated -
    4
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Integrates ElevenLabs Text-to-Speech capabilities with Cursor through the Model Context Protocol, allowing users to convert text to speech with selectable voices within the Cursor editor.
    Last updated -
    1
    Python
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
    Last updated -
    239
    JavaScript
    • Apple
    • Linux
  • -
    security
    -
    license
    -
    quality
    Official ElevenLabs Model Context Protocol server that enables AI assistants like Claude to interact with Text to Speech and audio processing APIs, allowing them to generate speech, clone voices, transcribe audio, and create soundscapes.
    Last updated -
    Python
    MIT License
  • -
    security
    -
    license
    -
    quality
    An MCP server that enables LLMs to access the NijiVoice API for text-to-speech generation, supporting features like fetching available voice actors and checking credit balance.
    Last updated -
    1
    Python
    MIT License
  • -
    security
    F
    license
    -
    quality
    Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
    Last updated -
    2
    Python
  • -
    security
    F
    license
    -
    quality
    Enables seamless integration with Typecast API through the Model Context Protocol, allowing clients to manage voices, convert text to speech, and play audio in a standardized way.
    Last updated -
    1
    Python