by Hwenyi

Edge TTS MCP

A Model Context Protocol (MCP) server for Microsoft Edge Text-to-Speech service that allows AI assistants to read text aloud with natural-sounding voices.

Features

  • Generate lifelike speech from text input

  • Support for multiple voice options

  • Customizable speech parameters (rate, volume, pitch)

  • Optional audio saving capability

  • Easy integration with Cline and other MCP-compatible clients

Installation

Prerequisites

  • Bun, used below to install dependencies and build the project

  • Git, to clone the repository

Setup

  1. Clone the repository:

git clone https://github.com/Hwenyi/edge-tts-mcp.git
cd edge-tts-mcp

  2. Install dependencies:

bun install

  3. Build the project:

bun run build

Configuration

Environment Variables

The Edge TTS MCP server supports the following environment variables:

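| Variable   | Description                              | Default Value        | Example Values                       |
|------------|------------------------------------------|----------------------|--------------------------------------|
| VOICE      | The voice to use for speech generation   | zh-CN-XiaoxiaoNeural | en-US-AriaNeural, ja-JP-NanamiNeural |
| RATE       | The speech rate                          | 0%                   | -10%, +20%                           |
| VOLUME     | The speech volume                        | 0%                   | -50%, +50%                           |
| PITCH      | The speech pitch                         | 0Hz                  | -10Hz, +5Hz                          |
| SAVE_AUDIO | Whether to save audio files (true/false) | false                | true                                 |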

You can set these environment variables before starting the server.
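As a minimal sketch of how these variables and their defaults fit together, the following TypeScript resolves a settings object from an environment map. The TtsConfig type and the loadConfig function are hypothetical names used for illustration, not the server's actual code, and treating only the exact string true as enabling SAVE_AUDIO is an assumption.

```typescript
// Hypothetical sketch: resolve Edge TTS settings from environment variables,
// falling back to the defaults documented above.
interface TtsConfig {
  voice: string;
  rate: string;
  volume: string;
  pitch: string;
  saveAudio: boolean;
}

type Env = Record<string, string | undefined>;

function loadConfig(env: Env): TtsConfig {
  return {
    voice: env.VOICE ?? "zh-CN-XiaoxiaoNeural",
    rate: env.RATE ?? "0%",
    volume: env.VOLUME ?? "0%",
    pitch: env.PITCH ?? "0Hz",
    // Assumption: only the exact string "true" enables saving.
    saveAudio: env.SAVE_AUDIO === "true",
  };
}

// Example: override the voice and rate, keep every other default.
const config = loadConfig({ VOICE: "en-US-AriaNeural", RATE: "+10%" });
console.log(config);
```

In a real process you would pass process.env instead of a literal object; taking the map as a parameter simply keeps the sketch easy to test.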

Usage

Starting the Server

# Using default settings
bun run start

# Or with custom configuration
VOICE=en-US-AriaNeural RATE="+10%" SAVE_AUDIO=true bun run start

Integrating with Cline

To use this MCP server with Cline, add the following configuration to your Cline config:

{
  "mcpServers": {
    "edge-tts-mcp": {
      "command": "bun",
      "args": [
        "/path/to/edge-tts-mcp/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}

Replace /path/to/edge-tts-mcp with the actual path to your installation.

MCP Tool Parameters

The MCP server exposes the following tool:

Tool Name: speech_text_aloud

Parameters:

  • input (string): The text to be converted to speech and read aloud
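For readers curious what a client sends under the hood, here is a rough sketch of the JSON-RPC 2.0 message that an MCP client would use to invoke this tool through the protocol's tools/call method. The ToolCallRequest type and the buildSpeechRequest helper are hypothetical names for illustration; MCP clients such as Cline construct this message for you.

```typescript
// Hypothetical sketch: build the JSON-RPC 2.0 message an MCP client sends
// to invoke the speech_text_aloud tool.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: { input: string } };
}

function buildSpeechRequest(id: number, text: string): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    // The tool takes a single string parameter named "input".
    params: { name: "speech_text_aloud", arguments: { input: text } },
  };
}

const speechRequest = buildSpeechRequest(1, "Hello world, this is a test of the Edge TTS system.");
console.log(JSON.stringify(speechRequest, null, 2));
```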

Using with Node.js

You can also run the server using Node.js instead of Bun:

# Run with Node.js
node dist/index.js

# Or with custom environment variables
VOICE=en-US-AriaNeural RATE="+10%" SAVE_AUDIO=true node dist/index.js

For Cline integration with Node.js, update your configuration:

{
  "mcpServers": {
    "edge-tts-mcp": {
      "command": "node",
      "args": [
        "/path/to/edge-tts-mcp/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}

Audio File Storage

When the SAVE_AUDIO environment variable is set to true, audio files will be saved in the dist directory by default. Each file is named with a random UUID to prevent overwriting.
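The naming scheme can be sketched as follows. This is an illustration under stated assumptions rather than the server's own code: the audioOutputPath helper, its outputDir parameter, and the .mp3 extension are all hypothetical. Only randomUUID from node:crypto and join from node:path are real library calls.

```typescript
import { randomUUID } from "node:crypto";
import { join } from "node:path";

// Hypothetical sketch: pick a unique output path for each generated clip.
// The ".mp3" extension and the outputDir parameter are assumptions.
function audioOutputPath(outputDir: string): string {
  // A fresh random UUID per call makes name collisions practically impossible.
  return join(outputDir, `${randomUUID()}.mp3`);
}

const firstClip = audioOutputPath("dist");
const secondClip = audioOutputPath("dist");
console.log(firstClip, secondClip);
```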

Configuration for Other Clients

5ire or Claude

You can also configure this MCP server in other clients like 5ire or Claude. Here's an example configuration:

{
  "name": "edge-tts-mcp",
  "key": "EdgeTTSMCP",
  "description": "Read text aloud using Edge TTS",
  "command": "bun",
  "args": [
    "/path/to/edge-tts-mcp/dist/index.js"
  ]
}

⚠️ Path Format Warning: Pay attention to the path format in your configuration:

  • Windows: Paths use backslashes (\), which must be escaped in JSON as \\ or replaced with forward slashes (/)

  • macOS/Linux: Paths use forward slashes (/)

Examples:

  • Windows path: C:\\Users\\username\\edge-tts-mcp\\dist\\index.js or C:/Users/username/edge-tts-mcp/dist/index.js

  • macOS/Linux path: /Users/username/edge-tts-mcp/dist/index.js

Incorrect path formatting is a common cause of setup issues across different operating systems.

Make sure to adjust the file path according to your actual installation directory.
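One way to sidestep escaping mistakes is to generate the configuration instead of typing it by hand. The sketch below converts backslashes to forward slashes and lets JSON.stringify handle any remaining escaping. The toForwardSlashes and buildServerEntry helpers are hypothetical names, and the username path is only a placeholder.

```typescript
// Hypothetical sketch: normalize an install path and emit a client config entry.
function toForwardSlashes(path: string): string {
  // Replace every backslash so the path needs no JSON escaping.
  return path.replace(/\\/g, "/");
}

function buildServerEntry(installDir: string, command: "bun" | "node") {
  // Drop any trailing slash before appending the built entry point.
  const script = `${toForwardSlashes(installDir).replace(/\/+$/, "")}/dist/index.js`;
  return {
    mcpServers: {
      "edge-tts-mcp": { command, args: [script], disabled: false, autoApprove: [] as string[] },
    },
  };
}

// Placeholder Windows install directory; substitute your own.
const entry = buildServerEntry("C:\\Users\\username\\edge-tts-mcp", "bun");
console.log(JSON.stringify(entry, null, 2));
```

If you prefer to keep backslashes, passing the raw path through JSON.stringify produces the doubled \\ form automatically.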

⚠️ Important Notice: Currently, there are known issues with MCP integration in Cherry-Studio. The configuration above may not work properly in Cherry-Studio. We recommend using Cline or other well-tested MCP clients until these issues are resolved.

Example Usage in an AI Assistant

To have your AI assistant read text aloud, give it a prompt like:

I need to read this text aloud: "Hello world, this is a test of the Edge TTS system."

The assistant will call the speech_text_aloud tool with the appropriate input text.

Voice Options

Microsoft Edge TTS provides many voices across different languages. Some popular options include:

  • en-US-AriaNeural (English, US, Female)

  • en-US-GuyNeural (English, US, Male)

  • zh-CN-XiaoxiaoNeural (Chinese, Female)

  • ja-JP-NanamiNeural (Japanese, Female)

  • de-DE-KatjaNeural (German, Female)

  • fr-FR-DeniseNeural (French, Female)

For a complete list of available voices, refer to the Microsoft Edge TTS documentation.

License

MIT
