The server can run as a FastAPI-based service with endpoints for API information, health checks, voice listings, and MCP communication.
Edge-TTS MCP Server
A Model Context Protocol (MCP) server that provides speech synthesis services for AI agents that leverage the text-to-speech capabilities of Microsoft Edge.
overview
This MCP server uses the edge-tts library to provide text-to-speech capabilities, and is designed as a tool to enable AI agents to respond in a natural voice.
function
- Text to speech conversion
- Multiple voice and language support
- Adjust audio speed and pitch
- Streaming audio data
install
Or if you want to install in development mode:
How to use
Example setup in VS Code
Example of setting in VS Code settings.json:
Use with MCP Inspector
Runs as a standard MCP server:
Running with uvx (uvicorn)
If you run it as a FastAPI based server under uv:
Command line options:
API endpoint
When running in FastAPI mode, the following endpoints are available:
/
- API information/health
- health check/voices
- List of available voices (optionally filterable by?locale=ja-JP
, etc.)/mcp
- MCP API endpoint
license
MIT
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
A Model Context Protocol server that provides text-to-speech functionality for AI agents using Microsoft Edge's text-to-speech technology, supporting multiple voices, languages, and voice customization.
Related MCP Servers
- -securityFlicense-qualityA Model Context Protocol server that enables integration with the TESS API, allowing users to list and manage agents, execute agents with custom messages, and manage files through natural language interfaces.Last updated -TypeScript
Gladia MCPofficial
-securityAlicense-qualityOfficial Model Context Protocol server that enables interaction with powerful Speech-to-Text and Audio Intelligence APIs, allowing clients like Claude Desktop to transcribe audio, analyze speech, translate content, and more.Last updated -2PythonMIT License- -securityAlicense-qualityA Model Context Protocol server that enables developers to integrate advanced text-to-speech and video translation capabilities into their applications through simple API calls.Last updated -PythonMIT License
- AsecurityAlicenseAqualityA Model Context Protocol server that integrates with VOICEVOX engine to provide text-to-speech synthesis and speaker information retrieval, allowing users to generate and play voice audio from text.Last updated -2TypeScriptMIT License