speaches-mcp
An MCP (Model Context Protocol) server that exposes speaches as transcribe_audio and text_to_speech tools.
Speaches is a local, OpenAI API-compatible server for speech-to-text (via faster-whisper) and text-to-speech (via Kokoro/Piper). This MCP server lets AI assistants like Claude use it directly.
Tools
transcribe_audio
Transcribe an audio file using your speaches instance.
| Parameter | Required | Description |
| | ✅ | Absolute path to the audio file |
| | ❌ | ISO-639-1 language code (e.g. |
| | ❌ | Whisper model ID. Defaults to |
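For orientation, here is a minimal sketch of the kind of request a transcription tool could send. It assumes speaches accepts the OpenAI-style `POST /v1/audio/transcriptions` multipart form (`file`, `model`, optional `language`) and returns JSON with a `text` field. The helper names `build_multipart` and `transcribe` are hypothetical and are not taken from this project's source.

```python
import json
import mimetypes
import urllib.request
import uuid
from pathlib import Path
from typing import Optional


def build_multipart(fields: dict, filename: str, data: bytes) -> tuple:
    """Encode text fields plus one file as multipart/form-data.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    file_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    parts = []
    # Plain text fields such as "model" and "language".
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    # The audio payload goes in a part named "file".
    parts.append(
        (
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
            f'Content-Type: {file_type}\r\n\r\n'
        ).encode("utf-8")
        + data
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(base_url: str, audio_path: str, model: str,
               language: Optional[str] = None) -> str:
    """Send one audio file to the (assumed) transcription endpoint."""
    path = Path(audio_path)
    if not path.is_absolute():
        raise ValueError("audio_path must be an absolute path")
    fields = {"model": model}
    if language:
        fields["language"] = language
    body, content_type = build_multipart(fields, path.name, path.read_bytes())
    request = urllib.request.Request(
        f"{base_url.rstrip('/')}/v1/audio/transcriptions",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["text"]
```

A call such as `transcribe("http://your-speaches-host:8000", "/tmp/recording.mp3", "Systran/faster-whisper-large-v3", language="en")` would need a running speaches instance with that model already downloaded.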
text_to_speech
Convert text to speech and save to a file.
| Parameter | Required | Description |
| | ✅ | Text to convert |
| | ✅ | Absolute path for the output audio file (e.g. |
| | ❌ | Voice ID. Defaults to |
| | ❌ | TTS model ID. Defaults to |
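As a rough illustration of what happens behind this tool, the sketch below builds a speech request and writes the returned audio to disk. It assumes speaches accepts the OpenAI-style `POST /v1/audio/speech` JSON body (`model`, `voice`, `input`) and responds with raw audio bytes. The function names are hypothetical, and the voice ID is a placeholder you would replace with one your instance provides.

```python
import json
import urllib.request
from pathlib import Path


def build_speech_request(base_url: str, text: str, model: str,
                         voice: str) -> urllib.request.Request:
    """Build (but do not send) a JSON request for the assumed speech endpoint."""
    payload = {"model": model, "voice": voice, "input": text}
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def text_to_speech(base_url: str, text: str, output_path: str,
                   model: str, voice: str) -> Path:
    """Synthesize `text` and save the audio bytes to `output_path`."""
    out = Path(output_path)
    if not out.is_absolute():
        raise ValueError("output_path must be an absolute path")
    request = build_speech_request(base_url, text, model, voice)
    with urllib.request.urlopen(request) as response:
        out.write_bytes(response.read())
    return out
```

Keeping request construction separate from sending makes the payload easy to inspect without a server. For example, `build_speech_request("http://your-speaches-host:8000", "Hello", "speaches-ai/Kokoro-82M-v1.0-ONNX", "your-voice-id").data` shows the exact JSON that would be posted.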
Usage
With Docker + Supergateway (SSE transport)
This exposes the MCP server over SSE on port 8010, suitable for remote clients.
docker compose up --build
Then connect your MCP client to http://localhost:8010/sse.
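If you want to confirm the endpoint is reachable before configuring a client, a small check like the following may help. It is a sketch that assumes the gateway follows the MCP HTTP+SSE transport, in which the first server-sent event is typically named `endpoint` and carries the URL for posting messages. `parse_sse_event` and `first_event` are hypothetical helper names.

```python
import urllib.request


def parse_sse_event(lines):
    """Return (event_name, data) for the first complete event, or None.

    `lines` is any iterable of text lines from a text/event-stream body.
    """
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends an event; dispatch only if it carried data.
            if data:
                return event, "\n".join(data)
            event = "message"
            continue
        if line.startswith(":"):
            continue  # comment / keep-alive line
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]
        if field == "event":
            event = value
        elif field == "data":
            data.append(value)
    return None


def first_event(url: str, timeout: float = 5.0):
    """Open the SSE stream and return its first event, then disconnect."""
    request = urllib.request.Request(url, headers={"Accept": "text/event-stream"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_sse_event(line.decode("utf-8") for line in response)
```

Calling `first_event("http://localhost:8010/sse")` against a running container should return a tuple whose first element is the event name. A connection error or timeout suggests the gateway is not up yet or the port is not published.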
Standalone (stdio transport)
Build the image:
docker build -t speaches-mcp .
Run it:
docker run --rm -i \
  -e SPEACHES_URL=http://your-speaches-host:8000 \
  speaches-mcp
For Claude Desktop, add to your config:
{
  "mcpServers": {
    "speaches": {
      "command": "docker",
      "args": [
        "run", "--rm", "-i",
        "-e", "SPEACHES_URL=http://your-speaches-host:8000",
        "speaches-mcp"
      ]
    }
  }
}
Environment Variables
| Variable | Default | Description |
| | | Base URL of your speaches instance |
| | | Default speech-to-text model |
| | | Default text-to-speech model |
| | | Default TTS voice |
| | | Required by the OpenAI SDK but not used by speaches |
Downloading Models
Before transcribing or synthesizing speech, make sure the models you need are downloaded into your speaches instance:
# Speech-to-text
curl http://your-speaches-host:8000/v1/models/Systran/faster-whisper-large-v3 -X POST
# Text-to-speech
curl http://your-speaches-host:8000/v1/models/speaches-ai/Kokoro-82M-v1.0-ONNX -X POST
License
MIT