VOICEVOX TTS MCP
A text-to-speech MCP server using VOICEVOX
What You Can Do
Make your AI assistant speak — Text-to-speech from MCP clients like Claude Desktop
Multi-character conversations — Switch speakers per segment in a single call
Smooth playback — Queue management, immediate playback, prefetching, streaming
Cross-platform — Works on Windows, macOS, Linux (including WSL)
Quick Start
Requirements
Node.js 18.0.0 or higher
VOICEVOX Engine (must be running)
ffplay (optional, recommended)
Installing FFplay
ffplay is a lightweight player included with FFmpeg that supports playback from stdin. When available, it automatically enables low-latency streaming playback.
💡 FFplay is optional. Without it, playback falls back to temp file-based playback (Windows: PowerShell, macOS: afplay, Linux: aplay, etc.).
Easy setup: One-liner installation for each OS (see steps below)
Required: ffplay must be in PATH (restart terminal/apps after installation)
Installation examples:
Windows (any of these)
Winget:
winget install --id=Gyan.FFmpeg -e
Chocolatey:
choco install ffmpeg
Scoop:
scoop install ffmpeg
Official builds: Download from https://www.gyan.dev/ffmpeg/builds/ or https://github.com/BtbN/FFmpeg-Builds and add the bin folder to PATH
macOS
Homebrew:
brew install ffmpeg
Linux
Debian/Ubuntu:
sudo apt-get update && sudo apt-get install -y ffmpeg
Fedora:
sudo dnf install -y ffmpeg
Arch:
sudo pacman -S ffmpeg
PATH Setup:
Windows: Add ...\ffmpeg\bin to your environment variables, then restart PowerShell/terminal and your editor (Claude, VS Code, etc.).
Verify: the output of powershell -c "$env:Path" should include the ffmpeg path.
macOS/Linux: Usually auto-detected. Check with echo $PATH and restart the shell if needed.
MCP clients (Claude Desktop/Code): Restart the app to reload PATH.
Verification:
Run ffplay -version. If version info is displayed, installation is complete. The CLI/MCP server will automatically detect ffplay and use stdin streaming playback.
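The detection and fallback behavior described in this section can be pictured with a small sketch. It is not the server's own code (the package itself is a Node.js project); it is an illustrative Python sketch in which the function name choose_player and the mode labels are hypothetical. The fallback players are the ones named above (PowerShell, afplay, aplay).

```python
import platform
import shutil

# Fallback players named in the text above, keyed by platform.system() value.
FALLBACK_PLAYERS = {
    "Windows": "powershell",
    "Darwin": "afplay",
    "Linux": "aplay",
}


def choose_player(which=shutil.which, system=None):
    """Return (player, mode): prefer ffplay streaming, else a temp-file fallback.

    `which` and `system` are injectable so the logic can be checked without
    touching the real PATH. The mode labels are hypothetical.
    """
    system = system or platform.system()
    if which("ffplay"):
        return ("ffplay", "stdin-stream")
    fallback = FALLBACK_PLAYERS.get(system)
    if fallback and which(fallback):
        return (fallback, "temp-file")
    return (None, "unavailable")


if __name__ == "__main__":
    # Reports what would be used on the current machine.
    print(choose_player())
```

Running the script on the machine that hosts the MCP client shows whether ffplay is visible on the PATH that the client inherits.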
3 Steps to Get Started
1. Start VOICEVOX Engine
2. Add to Claude Desktop config file
Config file location:
Windows:
%APPDATA%\Claude\claude_desktop_config.json
macOS:
~/Library/Application Support/Claude/claude_desktop_config.json
3. Restart Claude Desktop
That's it! Ask Claude to "say hello" and it will speak!
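For step 2, the entry added to claude_desktop_config.json can be sketched as follows. This is a hedged illustration, not the project's documented configuration: the top-level mcpServers key is the standard Claude Desktop layout, the package name is the one used elsewhere in this README, but the server label "voicevox-tts" is hypothetical and launching through npx -y is an assumption.

```python
import json

PACKAGE = "@kajidog/mcp-tts-voicevox"  # npm package named in this README


def add_server(config, name="voicevox-tts"):
    """Return a copy of a Claude Desktop config with one more MCP server entry.

    Assumptions: `name` is a hypothetical label, and starting the package
    through `npx -y` is assumed rather than taken from the source.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = {"command": "npx", "args": ["-y", PACKAGE]}
    merged["mcpServers"] = servers
    return merged


if __name__ == "__main__":
    # Starts from an empty config; in practice, load the existing file first
    # so that other configured servers are preserved.
    print(json.dumps(add_server({}), indent=2))
```

The printed JSON can be compared against the existing config file before editing it by hand.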
MCP Tools
speak — Text-to-Speech
The main feature callable from Claude.
Parameter | Description | Default |
| Text to speak (multiple segments separated by newlines) | Required |
| Speaker ID | 1 |
| Playback speed | 1.0 |
| Immediate playback (clears queue) | true |
| Wait for playback completion | false |
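The defaults in the table above can be read as a small queueing model: text is split into segments on newlines, and immediate playback clears whatever is still waiting. The sketch below only illustrates that reading. The class and argument names are hypothetical, and it does not model audio output or the server's real implementation.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Segment:
    text: str
    speaker: int
    speed: float


class PlaybackQueue:
    """Toy model of the queue semantics; names are hypothetical."""

    def __init__(self):
        self.pending = deque()

    def speak(self, text, speaker=1, speed=1.0, immediate=True):
        # Immediate playback discards segments that are still waiting.
        if immediate:
            self.pending.clear()
        # Multiple segments are separated by newlines; blank lines are skipped.
        for line in text.splitlines():
            line = line.strip()
            if line:
                self.pending.append(Segment(line, speaker, speed))
        return len(self.pending)


if __name__ == "__main__":
    queue = PlaybackQueue()
    queue.speak("Hello\nNice to meet you")
    queue.speak("One more line", immediate=False)
    print([segment.text for segment in queue.pending])
```

With immediate left at its default, a new call replaces queued speech; passing immediate=False appends to it instead.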
Examples:
Tool | Description |
| Check VOICEVOX Engine connection |
| Get list of available speakers |
| Get speaker details |
| Stop playback and clear queue |
| Generate speech synthesis query |
| Generate audio file |
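The last two tools in the table appear to mirror the two-step flow of the VOICEVOX Engine HTTP API: first request a synthesis query, then send that query back to obtain audio. The sketch below calls the engine directly rather than through MCP. That these tools wrap exactly these endpoints is an assumption; the helper names are hypothetical, and http://localhost:50021 is the engine's usual default address.

```python
import json
import urllib.parse
import urllib.request

ENGINE_URL = "http://localhost:50021"  # VOICEVOX Engine's usual default address


def audio_query_url(text, speaker, base_url=ENGINE_URL):
    """Build the URL for the engine's /audio_query endpoint."""
    params = urllib.parse.urlencode({"text": text, "speaker": speaker})
    return f"{base_url}/audio_query?{params}"


def synthesize(text, speaker=1, speed=1.0, base_url=ENGINE_URL):
    """Return WAV bytes for `text`. Requires a running engine."""
    # Step 1: ask the engine for a synthesis query (a JSON document).
    request = urllib.request.Request(
        audio_query_url(text, speaker, base_url), method="POST"
    )
    with urllib.request.urlopen(request) as response:
        query = json.loads(response.read().decode("utf-8"))
    # Playback speed is a field of the query itself.
    query["speedScale"] = speed
    # Step 2: send the query back to receive the audio.
    request = urllib.request.Request(
        f"{base_url}/synthesis?speaker={speaker}",
        data=json.dumps(query).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

Writing the returned bytes to a .wav file gives a quick way to confirm that the engine itself works, independently of any MCP client.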
Configuration
VOICEVOX Settings
Variable | Description | Default |
| Engine URL |
|
| Default speaker ID |
|
| Playback speed |
|
Playback Options
Variable | Description | Default |
| Streaming playback (requires ffplay) |
|
| Immediate playback |
|
| Wait for playback start |
|
| Wait for playback end |
|
Restriction Settings
Restrict AI from specifying certain options.
Variable | Description |
| Restrict immediate |
| Restrict waitForStart |
| Restrict waitForEnd |
Disable Tools
Server Settings
Variable | Description | Default |
| Enable HTTP mode |
|
| HTTP port |
|
| HTTP host |
|
| Allowed hosts (comma-separated) |
|
| Allowed origins (comma-separated) |
|
Command line arguments take priority over environment variables.
Argument | Description |
| Show help |
| Show version |
| VOICEVOX Engine URL |
| Default speaker ID |
| Playback speed |
| Streaming playback |
| Immediate playback |
| Wait for start |
| Wait for end |
| Restrict immediate |
| Restrict waitForStart |
| Restrict waitForEnd |
| Disable tools |
| HTTP mode |
| HTTP port |
| HTTP host |
| Allowed hosts (comma-separated) |
| Allowed origins (comma-separated) |
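The precedence rule stated above (command line arguments over environment variables) can be sketched as a three-level lookup. The variable name EXAMPLE_ENGINE_URL and the function name are hypothetical placeholders, not the server's real option names.

```python
import os


def resolve_setting(cli_value, env_name, default, env=os.environ):
    """CLI argument wins, then the environment variable, then the default."""
    if cli_value is not None:
        return cli_value
    if env_name in env:
        return env[env_name]
    return default


if __name__ == "__main__":
    # EXAMPLE_ENGINE_URL is a made-up name used only for this illustration.
    env = {"EXAMPLE_ENGINE_URL": "http://192.0.2.10:50021"}
    default = "http://localhost:50021"
    print(resolve_setting(None, "EXAMPLE_ENGINE_URL", default, env))
    print(resolve_setting("http://192.0.2.20:50021", "EXAMPLE_ENGINE_URL", default, env))
```

The first call falls back to the environment value because no argument was given; the second returns the argument even though the environment variable is set.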
For remote connections:
Start Server:
Claude Desktop Config (using mcp-remote):
Connecting from WSL to an MCP server running on Windows:
1. Get Windows Host IP from WSL
2. Start Server on Windows
Add the WSL gateway IP to MCP_ALLOWED_HOSTS to allow access from WSL:
Or with CLI arguments:
3. WSL Configuration (.mcp.json)
⚠️ Within WSL, localhost refers to WSL itself. Use the WSL gateway IP to access the Windows host.
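One common way to find the gateway IP from inside WSL is to read the default route, for example from the output of ip route show default. The sketch below parses that output and builds a comma-separated value for MCP_ALLOWED_HOSTS. The helper names are hypothetical, and keeping localhost and 127.0.0.1 in the list is an assumption about what a typical setup wants.

```python
from typing import Optional


def parse_default_gateway(route_output: str) -> Optional[str]:
    """Extract the gateway from `ip route` output, e.g. 'default via 172.24.176.1 dev eth0'."""
    for line in route_output.splitlines():
        fields = line.split()
        if len(fields) >= 3 and fields[0] == "default" and fields[1] == "via":
            return fields[2]
    return None


def allowed_hosts_value(gateway, base=("localhost", "127.0.0.1")):
    """Build a comma-separated host list; the `base` entries are an assumption."""
    return ",".join([*base, gateway])


if __name__ == "__main__":
    sample = "default via 172.24.176.1 dev eth0 proto kernel\n"
    gateway = parse_default_gateway(sample)
    print(gateway)
    print(allowed_hosts_value(gateway))
```

The address in the sample is only an example; the real value differs per machine and can change after a restart of WSL.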
Troubleshooting
1. Check if VOICEVOX Engine is running
2. Check platform-specific playback tools
OS | Required Tool |
Linux | One of several players, e.g. aplay |
macOS | afplay |
Windows | PowerShell (pre-installed) |
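For the first check, whether VOICEVOX Engine is running, a request to the engine's /version endpoint is usually enough. The sketch below assumes the engine listens on its usual default address, http://localhost:50021; adjust the URL if the engine runs elsewhere. The function name is hypothetical.

```python
import json
import urllib.error
import urllib.request


def engine_version(base_url="http://localhost:50021", timeout=3.0):
    """Return the engine's version string, or None if it cannot be reached."""
    try:
        with urllib.request.urlopen(f"{base_url}/version", timeout=timeout) as response:
            return json.loads(response.read().decode("utf-8"))
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, or a non-JSON reply all count as "not running".
        return None


if __name__ == "__main__":
    version = engine_version()
    if version is None:
        print("VOICEVOX Engine is not reachable")
    else:
        print(f"VOICEVOX Engine version: {version}")
```

If this reports that the engine is not reachable, start VOICEVOX Engine before investigating the playback tools.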
Check package installation:
npm list -g @kajidog/mcp-tts-voicevox
Verify JSON syntax in config file
Restart the client
Package Structure
Package | Description |
| MCP server |
| General-purpose VOICEVOX client library (can be used independently) |
Setup
Commands
Command | Description |
| Build all packages |
| Run tests |
| Run lint |
| Start dev server |
| Dev with stdio mode |
License
ISC