Provides a Docker configuration for easily setting up and running the Piper TTS service, with pre-configured voice models and automatic startup.
🎤 Piper TTS MCP Server
A Model Context Protocol (MCP) server that integrates Piper TTS for high-quality text-to-speech functionality. This server provides a speak
tool that converts text to speech and plays it directly through your speakers with customizable volume control.
✨ Features
- 🔊 High-quality text-to-speech using Piper TTS
- 🎚️ Volume control (0.01 to 1.00)
- ⚡ Memory-based audio playback (no temporary files)
- 🔧 Customizable voice parameters (speaker, speed, voice variation)
- 🚀 Fast and lightweight MCP integration
- 🔇 Silent operation (no console output)
📋 Requirements
⚠️ Important Prerequisites:
- Python 3.12+ installed on your system
- Piper TTS service running on
localhost:5000
You can set this up using the included Docker configuration or by running Piper TTS separately.
🚀 Quick Start
1. 🐳 Start Piper TTS Service
Clone this repo and start the TTS service:
Option A: Using Docker Compose (Recommended)
Option B: Using Docker Build
2. 🔧 Configure MCP Client
Add the server to your MCP configuration (e.g., .mcp.json
for Claude Desktop):
3. 🎉 Enjoy!
The speak
tool is now available in your MCP client!
🛠️ Usage
The MCP server provides a speak
tool with the following parameters:
Parameter | Type | Default | Description |
---|---|---|---|
text | string | required | Text to convert to speech |
speaker_id | int | 0 | Voice speaker ID |
length_scale | float | 1.1 | Speech speed (lower = faster) |
noise_scale | float | 0.667 | Voice variation control |
noise_w_scale | float | 0.333 | Pronunciation variation |
volume | float | 0.15 | Volume level (0.01 to 1.00) |
Example Usage
🔧 Development
Dependencies
- Python 3.12+
mcp[cli]
- MCP frameworkrequests
- HTTP client for TTS APIpygame
- Audio playback
Local Development
📦 Docker Configuration
The included Docker setup provides:
- Piper TTS service on port 5000
- Pre-configured voice models
- Automatic startup
🎙️ Voice Models
The default voice model used in this repository is en_GB-cori-high
(British English, female voice).
🔍 Exploring Voice Options
- Voice Samples: Listen to different voice models at Piper Voice Samples
- Piper Documentation: Learn more about Piper TTS at rhasspy/piper
🛠️ Changing Voice Models
To use a different voice model:
- Choose a voice from the voice samples page
- Update the Dockerfile - Replace
en_GB-cori-high
with your chosen voice model: - Rebuild the Docker image:
🎨 Creating Custom Voices
Creating your own custom voice requires additional effort and research. Please refer to the Piper documentation for guidance on voice training and customization.
🤝 Contributing
Contributions are welcome! Please feel free to submit issues and pull requests.
📄 License
This project is open source and available under the MIT License.
🙏 Acknowledgments
- Piper TTS - Fast, local neural text to speech
- Model Context Protocol - Standardized protocol for AI model interactions
Made with ❤️ for the MCP community
This server cannot be installed
Integrates Piper TTS into the Model Context Protocol, allowing AI assistants to convert text to speech and play it through speakers with customizable voice settings and volume control.
Related MCP Servers
- -securityFlicense-qualityProvides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.Last updated -6Python
- AsecurityAlicenseAqualityProvides an interface between AI assistants and Tripo AI via Model Context Protocol, enabling generation of 3D assets from natural language and importing them to Blender.Last updated -15166PythonMIT License
- AsecurityFlicenseAqualityA Model Context Protocol server that enables AI assistants to utilize AivisSpeech Engine's high-quality voice synthesis capabilities through a standardized API interface.Last updated -1TypeScript
- -securityFlicense-qualityEnables seamless integration with Typecast API through the Model Context Protocol, allowing clients to manage voices, convert text to speech, and play audio in a standardized way.Last updated -2Python