Interactive Voice MCP Server (Kokoro TTS + NeMo ASR)

A Model Context Protocol server that provides Text-to-Speech (TTS) capabilities using Kokoro and Speech-to-Text (STT) capabilities using NVIDIA NeMo Parakeet models, enabling interactive voice dialogues.

Demo: https://www.youtube.com/watch?v=LxlUvTrZ93s
Available Tools
interactive_voice_dialog
- Synthesizes text to speech, plays it, then listens for user speech input and returns the transcription.
- Required arguments:
  - `text_to_speak` (string): The text for the assistant to speak.
- Optional arguments:
  - `voice` (string): The voice to use for TTS (e.g., 'af_heart'). Defaults to 'af_heart'.
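For illustration, the arguments of a call to this tool might look like the following (only the argument names come from the schema above; the exact request envelope depends on your MCP client):

```json
{
  "text_to_speak": "Hello! How can I help you today?",
  "voice": "af_heart"
}
```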
Installation
Prerequisites
Some of the underlying TTS models require `espeak-ng` to be installed on your system.

Windows Installation:
- Go to the espeak-ng releases page.
- Click on "Latest release".
- Download the appropriate `*.msi` file (e.g. `espeak-ng-20191129-b702b03-x64.msi`).
- Run the downloaded installer.
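On Linux or macOS, `espeak-ng` is typically available from the system package manager. The package names below are assumed from the upstream project name; verify them for your distribution:

```shell
# Debian/Ubuntu
sudo apt-get install espeak-ng

# macOS (Homebrew)
brew install espeak-ng
```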
Local Development Installation
To allow Claude Desktop to launch this server using `python -m mcp_server_tts`, you need to install it as a Python module. Installing in "editable" mode (`-e`) is recommended for development, as it means changes to the source code are reflected immediately without needing to reinstall.

Navigate to the directory containing the `pyproject.toml` file (the root of this server project) and run:
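Assuming a standard `pyproject.toml`-based layout, the editable install described above is:

```shell
pip install -e .
```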
After installation, you can run it as a script using:
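With the module installed, the invocation is the one named above (this is also the command Claude Desktop will use to launch the server):

```shell
python -m mcp_server_tts
```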
Configuration
To use this server with Claude Desktop, you need to add it to your `claude_desktop_config.json` file. The location of this file is typically: `C:\Users\<YourUsername>\AppData\Roaming\Claude\claude_desktop_config.json`

Add an entry for this server under the `mcpServers` object in that file. For example, your `mcpServers` section might look like this:
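A minimal sketch, assuming the `python -m mcp_server_tts` invocation described above; the server key name (`interactive-voice` here) is arbitrary, and the exact schema should be checked against Claude Desktop's documentation:

```json
{
  "mcpServers": {
    "interactive-voice": {
      "command": "python",
      "args": ["-m", "mcp_server_tts"]
    }
  }
}
```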
Recommended prompt instruction
The `interactive_voice_dialog` tool, when available, enables a two-way voice conversation.

1. Your Role - Speaking:
   - After you generate ANY complete text-based response to the user, you MUST ALWAYS immediately invoke the `interactive_voice_dialog` function.
   - The `text_to_speak` parameter of this function call should contain a brief, personable, and conversational summary or key takeaway from your main text reply.
   - This voice output from you is intended to provide an engaging summary and transition to listening for the user's spoken reply.

2. Your Role - Listening & Understanding the User's Next Turn:
   - The `interactive_voice_dialog` tool, after playing your voice output, will listen for the user's spoken response.
   - The tool will then return its result to you. CRITICALLY, this tool's result may contain a field named `transcribed_user_voice_input` (or a similarly named field as defined in the tool's schema).
   - Example structure of a tool result:
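A sketch of what such a result might look like; field names other than `transcribed_user_voice_input` are illustrative, not taken from the tool's actual schema:

```json
{
  "status": "success",
  "spoken_text": "Here's a quick summary of my answer.",
  "transcribed_user_voice_input": "Yes, tell me more about that."
}
```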
3. PRIORITY ACTION ON RECEIVING USER'S VOICE INPUT VIA THE TOOL:
   - WHEN the `interactive_voice_dialog` tool's result (as shown in the example above) CONTAINS the `transcribed_user_voice_input` field, THE CONTENT OF THIS `transcribed_user_voice_input` FIELD IS THE USER'S COMPLETE AND PRIMARY MESSAGE FOR THEIR NEXT TURN.
   - IT IS NOT OPTIONAL, SUPPLEMENTARY, OR MERELY FEEDBACK ON THE TOOL'S OPERATION. It is the user's direct communication to you.
   - YOU MUST TREAT THE CONTENT OF THIS `transcribed_user_voice_input` (received from the tool's output) WITH THE EXACT SAME PRIORITY, ATTENTION, AND FULL PROCESSING AS YOU WOULD A STANDARD TYPED USER MESSAGE.
   - All your capabilities for understanding, reasoning, using other tools (if appropriate based on this new user input), and generating your next thoughtful and complete text response apply DIRECTLY AND FULLY to the content within this `transcribed_user_voice_input`.
   - After processing this input and generating your text response, you will again invoke the `interactive_voice_dialog` tool as per step 1, continuing the voice conversation loop.

4. Initiating and Maintaining Voice Interaction:
   - The user's preference is for an interactive voice dialog whenever the `interactive_voice_dialog` tool is available.
   - Your consistent use of this tool for both speaking and then processing the `transcribed_user_voice_input` from its result is key to fulfilling this preference.

5. Exclusivity: The `interactive_voice_dialog` function is the exclusive and sole method for both your voice output and for receiving the user's subsequent voice input in this conversational environment. Do not attempt to use or invent any other mechanisms for voice interaction.
Illustrative Conversational Flow:
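One turn of the loop described above can be sketched in Python with the MCP tool stubbed out. The stub bodies and the `generate_text_response` helper are illustrative placeholders, not the server's real implementation:

```python
def interactive_voice_dialog(text_to_speak: str, voice: str = "af_heart") -> dict:
    """Stub: a real call synthesizes text_to_speak with Kokoro, plays it,
    then records the user's reply and transcribes it with NeMo."""
    return {"transcribed_user_voice_input": "What's the weather like today?"}


def generate_text_response(user_message: str) -> str:
    """Stub for the assistant's normal text-generation step."""
    return f"Responding to: {user_message}"


# Step 1: the assistant produces its full text reply as usual.
reply = generate_text_response("Tell me about Kokoro TTS.")

# Step 2: it immediately speaks a brief conversational summary via the tool.
result = interactive_voice_dialog(text_to_speak="Kokoro is a lightweight TTS model!")

# Step 3: the tool's result carries the user's next spoken message, which is
# then treated exactly like a typed user message, and the loop repeats.
next_user_message = result["transcribed_user_voice_input"]
```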
Note: this is a local-only server. It can only run on the client's local machine because it depends on local resources (audio playback and microphone capture).