Which integrations are available for this server?

Integrates with Ollama to enable local LLM-based speech-to-text and text-to-speech, leveraging locally running models for on-device processing. Integrates with OpenAI API to provide cloud-based speech-to-text (STT) and text-to-speech (TTS) capabilities, offering high-accuracy transcription and a variety of voices.

How do I use STT2TTS MCP?

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@STT2TTS MCP transcribe meeting_recording.mp3"

That's it! The server will respond to your query, and you can continue using it as needed.

STT2TTS MCP

by pygodzilla

Python

Local

STT2TTS MCP Server

Local-first speech-to-text and text-to-speech MCP server. Engines are hot-swappable via config.yaml: no code changes, and no API keys unless you select the OpenAI API engine.

┌──────────────┐     stdio      ┌──────────────────┐
│ MCP client   │ ◀────────────▶ │ stt2tts-mcp      │
│              │                │  ├─ STT engine   │ ──▶ faster-whisper
│              │                │  └─ TTS engine   │ ──▶ piper / kokoro / coqui
└──────────────┘                └──────────────────┘
                                       │
                                       ▼
                              config.yaml (hot-reload)

Why

Replaces whisper-mcp. Works offline with the local engines, ships with five STT and six TTS engines, and switches between them per task via config.yaml.

Related MCP server: speaches-mcp

Install

pip install stt2tts-mcp

# Add the engines you actually use:
# (extras are quoted so shells such as zsh do not expand the brackets)
pip install "stt2tts-mcp[stt-faster-whisper]"   # local STT
pip install "stt2tts-mcp[tts-piper]"            # local TTS (~50 MB voices)

Register with your MCP client. Consult your client's docs for the exact config file location; most use mcp_config.json or a per-client equivalent:

{
  "mcp": {
    "stt2tts": {
      "type": "local",
      "command": ["stt2tts-mcp"],
      "enabled": true
    }
  }
}
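
If your client already has other servers registered, the entry above has to be merged into the existing file rather than pasted over it. A minimal sketch of that merge, assuming the client keeps its servers under a top-level "mcp" key as in the snippet above (the `register_server` helper and the "other-server" entry are hypothetical, for illustration only):

```python
import json
from copy import deepcopy

# Entry copied from the registration snippet above.
STT2TTS_ENTRY = {
    "type": "local",
    "command": ["stt2tts-mcp"],
    "enabled": True,
}


def register_server(client_config: dict, name: str = "stt2tts") -> dict:
    """Return a copy of client_config with the stt2tts entry added under "mcp"."""
    merged = deepcopy(client_config)
    # setdefault keeps any servers that are already registered.
    merged.setdefault("mcp", {})[name] = deepcopy(STT2TTS_ENTRY)
    return merged


# Hypothetical existing client config with one unrelated server.
existing = {
    "mcp": {
        "other-server": {"type": "local", "command": ["other"], "enabled": True}
    }
}
updated = register_server(existing)
print(json.dumps(updated, indent=2))
```

The helper returns a new dict and leaves the input untouched, so you can inspect the result before writing it back to whatever file your client uses.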

Engines

| STT | Size | License | Best for |
|---|---|---|---|
| faster-whisper | 39M – 2.9 GB | MIT | English, INT8 CPU, fastest |
| sherpa-onnx | 39M – large | Apache 2.0 | Multilingual |
| OpenAI API | cloud | Proprietary | Highest accuracy, needs key |
| Ollama | varies | MIT | Local LLM integration |
| LMStudio | varies | MIT | Local model server |

| TTS | Voice size | License | Best for |
|---|---|---|---|
| Piper | 20 – 50 MB | Apache 2.0 | Smallest, 10-20× realtime |
| Kokoro-82M | ~330 MB | Apache 2.0 | Quality/size ratio |
| Coqui XTTS | ~1.5 GB | MPL 2.0 | Voice cloning, needs GPU |
| OpenAI API | cloud | Proprietary | All voices, needs key |
| Ollama | varies | MIT | LLM-based voices |
| LMStudio | varies | MIT | Local model server |

Configure

config.yaml:

stt:
  engine: faster_whisper   # sherpa_onnx | openai_api | ollama | lmstudio
  enabled: true
  params:
    model_size: base.en     # tiny.en | base.en | small.en | medium.en
    device: cpu             # cpu | cuda

tts:
  engine: piper             # kokoro | coqui | openai_api | ollama | lmstudio
  enabled: true
  params:
    voice: en_US-lessac-medium
    model_dir: ~/.cache/piper

Reload without restart by calling the reload_config MCP tool.
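
A typo in an engine name is easy to make when switching engines, so it can help to check the file before calling reload_config. The sketch below is not part of stt2tts-mcp; it assumes config.yaml has already been parsed into a dict (for example with PyYAML's `yaml.safe_load`) and uses only the engine identifiers listed in the comments above. The `validate_config` helper is hypothetical.

```python
# Engine identifiers as listed in the config.yaml comments above.
STT_ENGINES = {"faster_whisper", "sherpa_onnx", "openai_api", "ollama", "lmstudio"}
TTS_ENGINES = {"piper", "kokoro", "coqui", "openai_api", "ollama", "lmstudio"}


def validate_config(config: dict) -> list:
    """Return a list of problems; an empty list means no problem was found."""
    problems = []
    for section, allowed in (("stt", STT_ENGINES), ("tts", TTS_ENGINES)):
        block = config.get(section)
        if not isinstance(block, dict):
            problems.append(f"missing section: {section}")
            continue
        engine = block.get("engine")
        if engine not in allowed:
            problems.append(f"{section}.engine: unknown engine {engine!r}")
        # Treating a missing "enabled" key as true is an assumption of this sketch.
        if not isinstance(block.get("enabled", True), bool):
            problems.append(f"{section}.enabled: expected true or false")
    return problems


# The example config from above, as it would look after YAML parsing.
config = {
    "stt": {
        "engine": "faster_whisper",
        "enabled": True,
        "params": {"model_size": "base.en", "device": "cpu"},
    },
    "tts": {
        "engine": "piper",
        "enabled": True,
        "params": {"voice": "en_US-lessac-medium", "model_dir": "~/.cache/piper"},
    },
}
print(validate_config(config))
```

The check only covers engine names and the enabled flag; engine-specific params are left to the server, since their accepted values are not documented here.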

MCP Tools

| Tool | What it does |
|---|---|
| `transcribe(audio_path, language?)` | Audio file → text |
| `speak(text, output_path, voice?)` | Text → WAV file |
| `list_stt_models` | Available STT models |
| `list_tts_voices` | Available TTS voices |
| `reload_config` | Re-read `config.yaml`, rebuild engines |
| `health_check` | Engine status |
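
MCP clients normally issue these calls for you, but it can be useful to see what goes over the wire. In MCP, a tool invocation is a JSON-RPC 2.0 request with method "tools/call". The sketch below only builds the request payloads; it assumes the argument keys match the parameter names in the table above, and it omits the initialize handshake a real session performs first. The `tool_call` helper is hypothetical.

```python
import itertools
import json

# JSON-RPC request ids only need to be unique within a session.
_ids = itertools.count(1)


def tool_call(name: str, arguments: dict = None) -> dict:
    """Build a JSON-RPC 2.0 request that invokes one MCP tool."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments or {}},
    }


requests = [
    tool_call("reload_config"),
    tool_call("health_check"),
    # Argument names taken from the tool table; the file name is an example.
    tool_call("transcribe", {"audio_path": "meeting_recording.mp3", "language": "en"}),
]

# One JSON document per line, as the stdio transport expects.
for request in requests:
    print(json.dumps(request))
```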

Any audio format ffmpeg can decode (for example wav, mp3, ogg, flac, m4a) is accepted as input; STT input is converted to 16 kHz mono automatically.
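
To reproduce the 16 kHz mono conversion by hand (for instance to check how a recording sounds after downsampling), the equivalent ffmpeg invocation can be assembled as below. This is a sketch of a manual conversion, not the server's internal code, which is not shown here; the `ffmpeg_to_16k_mono` helper and the output file name are hypothetical.

```python
import shlex


def ffmpeg_to_16k_mono(src: str, dst: str) -> list:
    """Build an ffmpeg argv that resamples src to 16 kHz mono and writes dst."""
    return [
        "ffmpeg",
        "-y",            # overwrite dst if it already exists
        "-i", src,       # input file, any format ffmpeg can decode
        "-ar", "16000",  # output sample rate in Hz
        "-ac", "1",      # one output channel (mono)
        dst,
    ]


cmd = ffmpeg_to_16k_mono("meeting_recording.mp3", "meeting_recording_16k.wav")
# Print a copy-pasteable shell command; run it with subprocess.run(cmd, check=True).
print(shlex.join(cmd))
```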

Develop

Source-only releases ship on main for clean installs. The dev branch carries the test suite (tests/test_config_loader.py, tests/test_mcp_integration.py, tests/test_piper_no_json.py) for contributors.

git clone https://github.com/pygodzilla/stt2tts-mcp
cd stt2tts-mcp
git checkout dev                 # for tests + dev iteration
pip install -e ".[all]"
python -m stt2tts_mcp.server

License

MIT
