How do I use io.github.Engr-FaizanAli/text-to-speech?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@io.github.Engr-FaizanAli/text-to-speech Read aloud: The deployment completed successfully." That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

io.github.Engr-FaizanAli/text-to-speech

by Engr-FaizanAli

Overview Schema Related Servers Score Discussions

Python

Local

Text to Speech MCP Server

Text to Speech is an open-source Model Context Protocol (MCP) server that lets AI assistants read text aloud on the user's computer. On Windows it uses the built-in Speech API (SAPI) by default, so no API key, account, subscription, or cloud text-to-speech service is required.

The server exposes one model-controlled tool:

speak_text(text: string)

Use it for user-provided text, assistant answers, accessibility workflows, or spoken progress updates while an agent works.

Features

Local playback through Windows SAPI by default.
No cloud API and no API key for the default setup.
FIFO playback: concurrent requests are spoken one at a time, in order.
Blocking tool completion: each call returns after its audio finishes.
Bounded input and queue sizes to prevent unbounded resource use.
Temporary generated WAV files are removed after playback by default.
Standard MCP stdio transport through the official Python SDK.
Optional Piper, Transformers MMS, and local HTTP backends for advanced users.

The MCP server source is open source under the MIT License. Windows SAPI is a proprietary component included with Windows; it is not an open-source speech engine.

Related MCP server: VOICEVOX TTS MCP

Requirements

Windows 10 or Windows 11 for the zero-configuration SAPI backend.
Python 3.10 or newer.
An MCP client that supports stdio MCP servers.
uv/uvx is recommended for package-based MCP installation.

Install

Configure an MCP client to run the published PyPI package:

uvx text-to-speech-mcp

For MCP clients that accept command-based server configuration, use:

command = "uvx"
args = ["text-to-speech-mcp"]
startup_timeout_sec = 30
tool_timeout_sec = 300
enabled = true

Some clients use TOML, JSON, or a graphical settings page. Use uvx text-to-speech-mcp as the server command and restart the client after changing its configuration.

Install from source

git clone https://github.com/engr-faizanali/text-to-speech-mcp.git
cd text-to-speech-mcp
python -m pip install .

Then configure the client to run text-to-speech-mcp directly.

Prompt Examples

Read arbitrary text:

Use the Text to Speech tool to read aloud: The deployment completed successfully.

Read the final answer:

Use the Text to Speech tool to read your final response aloud before displaying it.

Read visible intermediate progress updates in order:

Use the text_to_speech MCP server's speak_text tool for spoken progress updates.

For every meaningful intermediate update that you display to me:
1. Call speak_text with the exact update text you are about to display.
2. Wait for the call to finish before producing or speaking the next update.
3. Then display the same update in text.

Also call speak_text with the exact final answer before displaying it. Never
narrate hidden reasoning, chain-of-thought, secrets, credentials, raw tool
output, terminal logs, or source code unless I explicitly ask you to read that
content aloud. Do not invoke speech calls in parallel. If the tool is
unavailable, continue normally in text and report the failure once.

The text_to_speech portion is an example client-side server name. Clients may display a different namespace while keeping the tool name speak_text.

Tool Contract

Field	Value
Tool name	`speak_text`
Input	`text`, required string, 1-50,000 characters
Result	Completion message after local playback finishes
Ordering	FIFO, one active playback at a time
Queue limit	32 pending requests
Network use with SAPI	None

The tool is model-controlled under MCP. The user decides when to ask the model to call it, and the MCP client may show or require approval for tool calls.

Privacy

With the default SAPI backend, text is passed from the MCP client to a local Python process and then to Windows speech components. It is not sent to this project, an external API, or a cloud TTS provider. Generated WAV files are written under %TEMP%\text-to-speech-mcp and deleted after playback unless TEXT_TO_SPEECH_KEEP_AUDIO=true is set.

Do not ask an AI assistant to speak secrets, credentials, private keys, hidden reasoning, or sensitive tool output.

Optional Backends

The default requires no configuration:

TEXT_TO_SPEECH_BACKEND = "sapi"

Advanced users can set TEXT_TO_SPEECH_BACKEND to piper, transformers_mms, or http. These options require their own local model, binary, Python dependencies, or endpoint. See backend configuration.

Claude Code Skill

For Claude Code users, skills/project-tts-responder/SKILL.md defines streaming, batch, and read-aloud narration modes built on speak_text. Copy it into your project's .claude/skills/ directory to use it.

MCP Compatibility

MCP transport: stdio
MCP tool implementation: official Python MCP SDK
Registry metadata: server.json using the 2025-12-11 schema
Package registry: PyPI
Registry ownership marker: this README's mcp-name comment
Registry namespace: io.github.Engr-FaizanAli/text-to-speech

License

MIT. See LICENSE.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

5dRelease cycle

2Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Related MCP Servers

TTS-MCP
Text-to-Speech Audio Processing
nakamurau1
A
license
-
quality
C
maintenance
A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
Last updated 2025-03-25
29
1
MIT
VOICEVOX TTS MCP
Text-to-Speech Speech Processing Multimedia Processing
kajidog
A
license
A
quality
D
maintenance
A text-to-speech MCP server that enables AI assistants to speak using the VOICEVOX engine with support for multi-character conversations. It features queue management, low-latency streaming via FFplay, and cross-platform playback across Windows, macOS, and Linux.
Last updated 2026-07-04
7
149
16
ISC
speaches-mcp
Speech Processing Text-to-Speech
xavier-hernandez
F
license
C
quality
C
maintenance
An MCP server that exposes speech-to-text and text-to-speech capabilities using a local speaches instance, allowing AI assistants to transcribe audio and generate speech.
Last updated 2026-05-16
2
Edge TTS MCP
Text-to-Speech Speech Processing
Hwenyi
A
license
-
quality
D
maintenance
An MCP server that converts text into lifelike speech using Microsoft Edge's Text-to-Speech service, supporting customizable voice, rate, volume, and pitch.
Last updated 2025-04-09
4
MIT

View all related MCP servers

Related MCP Connectors

mcp-fish
MCP server exposing the AceDataCloud Fish Audio API (text-to-speech with voice conditioning)
mcp-aichat
MCP server for AI dialogue using various LLM models via AceDataCloud
hithereiamaliff-mcp-nextcloud
A comprehensive Model Context Protocol (MCP) server that enables AI assistants to interact with yo…

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Engr-FaizanAli/text-to-speech-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server