The Zonos TTS MCP Server enables text-to-speech functionality in Claude through the speak_response tool, allowing it to generate and play spoken audio from text with the following capabilities:
Multi-Language Support: Generate speech in different languages (default:
en-us)Emotion Control: Customize the emotional tone (neutral, happy, sad, angry)
PulseAudio Integration: Ensures proper audio playback
MCP Integration: Seamlessly works with Claude's Model Context Protocol
The server requires a running instance of Zonos API with proper Node.js and PulseAudio setup.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Zonos TTS MCP Serverspeak this in French with a happy tone: Bonjour, comment allez-vous?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Zonos MCP Integration
A Model Context Protocol integration for Zonos TTS, allowing Claude to generate speech directly.
Setup
Installing via Smithery
To install Zonos TTS Integration for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @PhialsBasement/zonos-tts-mcp --client claudeManual installation
Make sure you have Zonos running with our API implementation (PhialsBasement/zonos-api)
Install dependencies:
npm install @modelcontextprotocol/sdk axiosConfigure PulseAudio access:
# Your pulse audio should be properly configured for audio playback
# The MCP server will automatically try to connect to your pulse serverBuild the MCP server:
npm run build
# This will create the dist folder with the compiled serverAdd to Claude's config file: Edit your Claude config file (usually in
~/.config/claude/config.json) and add this to themcpServerssection:
"zonos-tts": {
"command": "node",
"args": [
"/path/to/your/zonos-mcp/dist/server.js"
]
}Replace /path/to/your/zonos-mcp with the actual path where you installed the MCP server.
Related MCP server: TTS-MCP
Using with Claude
Once configured, Claude automatically knows how to use the speak_response tool:
speak_response(
text="Your text here",
language="en-us", # optional, defaults to en-us
emotion="happy" # optional: "neutral", "happy", "sad", "angry"
)Features
Text-to-speech through Claude
Multiple emotions support
Multi-language support
Proper audio playback through PulseAudio
Requirements
Node.js
PulseAudio setup
Running instance of Zonos API (PhialsBasement/zonos-api)
Working audio output device
Notes
Make sure both the Zonos API server and this MCP server are running
Audio playback requires proper PulseAudio configuration