Skip to main content
Glama
thundersoft-td

speech-mcp-server

Speech Model Context Protocol Server

An MCP server implementation for speech of volcengine

Features

Tools

  • asr Automatic Speech Recognition: Converts audio to text.

    • Args:

      • content: url or absolute path of the audio file to transcribe.

    • Returns:

      • Asr text

  • tts Text-to-Speech: Synthesizes text into audio.

    • Args:

      • text: The text to synthesize into speech.

      • speed: Speech speed (e.g., 1.0 for normal). default: 1.0.

      • encoding: Desired audio output format (e.g., 'mp3', 'wav'). default: 'mp3'.

    • Returns:

      • Return the path of audio file.

Related MCP server: MCP Video & Audio Text Extraction Server

Configuration

The server requires the following environment variables to be set:

  • VOLC_APPID: Required, The APP ID for the VolcEngine.

  • VOLC_TOKEN: Required, The Access Token for the VolcEngine.

  • VOLC_VOICE_TYPE: Optional, Large speech synthesis model service voice_type, default is 'zh_female_meilinvyou_moon_bigtts'

  • VOLC_CLUSTER: Required, Large speech synthesis model service cluster ID

The services that need to be activated on Volcengine are: Large speech synthesis modelStreaming speech recognition large modelLarge model for audio file recognition

You can set these environment variables in your shell.

MCP Settings Configuration

To add this server to your MCP configuration, add the following to your MCP settings file:

{
  "mcpServers": {
    "speech-mcp-server": {
      "command": "uv",
      "args": [
        "--directory",
        "/ABSOLUTE/PATH/TO/PARENT/FOLDER/src/mcp_server_speech",
        "run",
        "main.py"
      ]
    }
  }
}

or

{
    "mcpServers": {
        "speech-mcp-server": {
            "command": "uvx",
            "args": [
                "--from",
                "git+https://github.com/thundersoft-td/mcp-server-speech",
                "mcp-server-speech"
            ],
            "env": {
                "VOLC_APPID": "your appid",
                "VOLC_TOKEN": "your token",
                "VOLC_VOICE_TYPE": "tts voice type",
                "VOLC_CLUSTER": "tts cluster id",
            }
        }
    }
}

Usage

Running the Server

# Run the server with stdio transport (default)
python -m mcp_server_speech [--transport/-t {sse,stdio}]

License

This library is licensed under the MIT-0 License. See the LICENSE file.

A
license - permissive license
-
quality - not tested
D
maintenance

Maintenance

Maintainers
Response time
Release cycle
Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/thundersoft-td/mcp-server-speech'

If you have feedback or need assistance with the MCP directory API, please join our Discord server