Why this server?
Integrates with the ElevenLabs text-to-speech API, directly addressing the request for TTS capabilities.
A security | A license | - quality
Integrates with ElevenLabs text-to-speech API.
Last updated a year ago | 6119 | MIT

Why this server?
Provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
A security | F license | - quality
A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Last updated a year ago | 481

Why this server?
Hugging Face Spaces integrations often include text-to-speech capabilities.
A security | A license | - quality
Connects Claude Desktop to Hugging Face Spaces with minimal setup, enabling capabilities like image generation, vision tasks, text-to-speech, and chat with AI models.
Last updated a year ago | 3121 | MIT

Why this server?
Integrates with MiniMax AI services, offering image generation, video generation, text-to-speech, and voice cloning through MCP.

MiniMax MCP JS (official)
A security | A license | - quality
JavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.
Last updated 9 months ago | 10657114 | MIT

Why this server?
Offers speech-to-text, diarization, translation, and text summarization through a Python API, so it can be used as part of a speech-to-speech workflow.
A security | F license | - quality
A Python-based server that provides access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.
Last updated a year ago | 52

Why this server?
Enables AI assistants to utilize AivisSpeech Engine's high-quality voice synthesis capabilities.
A security | F license | - quality
A Model Context Protocol server that enables AI assistants to utilize AivisSpeech Engine's high-quality voice synthesis capabilities through a standardized API interface.
Last updated a year ago | 11

Why this server?
Provides desktop automation capabilities, which could potentially include controlling TTS applications.
- security | A license | - quality
A Model Context Protocol server that provides desktop automation capabilities using RobotJS and screenshot capabilities, enabling LLMs to control mouse movements, keyboard inputs, and capture screenshots of the desktop environment.
Last updated 5 months ago | 7026 | MIT

Why this server?
Because the server interacts with email, it could be used to extract text content, which a separate TTS service would then convert to speech.
- security | A license | - quality
This MCP server provides email sending functionality using Protonmail's SMTP service. It allows both Claude Desktop and Cline VSCode extension to send emails on your behalf using your Protonmail credentials.
Last updated a year ago | 35 | MIT

Why this server?
Can enable direct user control of music with voice commands.
- security | A license | - quality
An MCP server that allows AI models to control YouTube Music playback through Google Chrome by searching and playing songs using song and artist names.
Last updated a year ago | 21 | MIT