Kokoro TTS MCP Server
A Model Context Protocol (MCP) server that provides text-to-speech capabilities using the Kokoro TTS engine. This server exposes TTS functionality through MCP tools, making it easy to integrate speech synthesis into your applications.
Prerequisites
Python 3.10 or higher
uvpackage manager
Related MCP server: Typecast API MCP Server
Installation
First, install the
uvpackage manager:
Clone this repository and install dependencies:
Features
Text-to-speech synthesis with customizable voices
Adjustable speech speed
Support for saving audio to files or direct playback
Cross-platform audio playback support (Windows, macOS, Linux)
Usage
The server provides a single MCP tool generate_speech with the following parameters:
text(required): The text to convert to speechvoice(optional): Voice to use for synthesis (default: "af_heart")speed(optional): Speech speed multiplier (default: 1.0)save_path(optional): Directory to save audio filesplay_audio(optional): Whether to play the audio immediately (default: False)
Example Usage
Dependencies
kokoro >= 0.8.4
mcp[cli] >= 1.3.0
soundfile >= 0.13.1
Platform Support
Audio playback is supported on:
Windows (using
start)macOS (using
afplay)Linux (using
aplay)
MCP Configuration
Add the following configuration to your MCP settings file:
License
[Add your license information here]