The Zonos TTS MCP Server enables text-to-speech functionality in Claude through the speak_response tool, allowing it to generate and play spoken audio from text with the following capabilities:
Multi-Language Support: Generate speech in different languages (default:
en-us)Emotion Control: Customize the emotional tone (neutral, happy, sad, angry)
PulseAudio Integration: Ensures proper audio playback
MCP Integration: Seamlessly works with Claude's Model Context Protocol
The server requires a running instance of Zonos API with proper Node.js and PulseAudio setup.

Zonos MCP Integration
A Model Context Protocol integration for Zonos TTS, allowing Claude to generate speech directly.
Setup
Installing via Smithery
To install Zonos TTS Integration for Claude Desktop automatically via Smithery:
Manual installation
Make sure you have Zonos running with our API implementation (PhialsBasement/zonos-api)
Install dependencies:
Configure PulseAudio access:
Build the MCP server:
Add to Claude's config file: Edit your Claude config file (usually in
~/.config/claude/config.json) and add this to themcpServerssection:
Replace /path/to/your/zonos-mcp with the actual path where you installed the MCP server.
Related MCP server: TranscriptionTools MCP Server
Using with Claude
Once configured, Claude automatically knows how to use the speak_response tool:
Features
Text-to-speech through Claude
Multiple emotions support
Multi-language support
Proper audio playback through PulseAudio
Requirements
Node.js
PulseAudio setup
Running instance of Zonos API (PhialsBasement/zonos-api)
Working audio output device
Notes
Make sure both the Zonos API server and this MCP server are running
Audio playback requires proper PulseAudio configuration