MS-Lucidia-Voice-Gateway-MCP

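A Model Context Protocol (MCP) server that provides text-to-speech and speech-to-text using the speech services built into Windows. The server calls the native Windows Speech API (SAPI) through PowerShell commands, so it needs no external APIs or services.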
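The README does not show how the server calls PowerShell. As a rough sketch of the general approach, assuming Node's `child_process` module and PowerShell's `-EncodedCommand` option, a script can be passed to `powershell.exe` without shell-quoting problems. The names `encodePowerShell` and `runPowerShell` are hypothetical and are not taken from the project.

```javascript
// Sketch only: the project's real invocation code is not shown in this README.
const { execFile } = require('node:child_process');

// PowerShell's -EncodedCommand expects the script as base64-encoded UTF-16LE.
function encodePowerShell(script) {
  return Buffer.from(script, 'utf16le').toString('base64');
}

// Runs a script in a non-interactive PowerShell process and resolves with its stdout.
function runPowerShell(script) {
  const args = ['-NoProfile', '-NonInteractive', '-EncodedCommand', encodePowerShell(script)];
  return new Promise((resolve, reject) => {
    execFile('powershell.exe', args, (error, stdout, stderr) => {
      if (error) reject(new Error(stderr || error.message));
      else resolve(stdout.trim());
    });
  });
}
```

On a Windows machine, `runPowerShell("Write-Output 'ready'")` would be expected to resolve with the script's output. Encoding the script avoids having to escape quotes and pipes on the command line.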

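Features

Text-to-speech (TTS) using Windows SAPI voices
Speech-to-text (STT) using Windows Speech Recognition
Simple web interface for testing
No external API dependencies
Uses native Windows capabilities

Prerequisites

Windows 10/11 with speech recognition enabled
Node.js 16+
PowerShell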

Installation

Clone the repository:

git clone https://github.com/ExpressionsBot/MS-Lucidia-Voice-Gateway-MCP.git
cd MS-Lucidia-Voice-Gateway-MCP

Install dependencies:

npm install

Build the project:

npm run build

Usage

Test interface

Start the test server:

npm run test

Open http://localhost:3000 in a browser
Use the web interface to test the TTS and STT features

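Available tools

Text-to-speech

Converts text to speech using Windows SAPI.

Parameters:

text (required): The text to convert to speech
voice (optional): The voice to use (e.g. "Microsoft David Desktop")
speed (optional): Speech rate from 0.5 to 2.0 (default: 1.0)

Example: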

fetch('http://localhost:3000/tts', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    text: "Hello, this is a test",
    voice: "Microsoft David Desktop",
    speed: 1.0
  })
});
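On the server side, a request like this has to be turned into a PowerShell script. The sketch below shows one way that could look, assuming the .NET `System.Speech.Synthesis.SpeechSynthesizer` class. The helper names and the mapping from the 0.5 to 2.0 speed multiplier onto the synthesizer's -10 to 10 `Rate` scale are assumptions for illustration; the project's actual mapping is not documented here.

```javascript
// Hypothetical helpers; the server's actual implementation may differ.

// PowerShell single-quoted strings escape an embedded quote by doubling it.
// Curly quotes are doubled too, because PowerShell also treats them as quote characters.
function quotePs(value) {
  return "'" + String(value).replace(/['\u2018\u2019\u201A\u201B]/g, '$&$&') + "'";
}

// Assumed mapping: 0.5 -> -10, 1.0 -> 0, 2.0 -> 10 (SpeechSynthesizer.Rate range).
function speedToRate(speed = 1.0) {
  const clamped = Math.min(2.0, Math.max(0.5, Number(speed) || 1.0));
  const rate = clamped >= 1 ? (clamped - 1) * 10 : (clamped - 1) * 20;
  return Math.round(rate);
}

// Builds a one-line PowerShell script that speaks the text through SAPI.
function buildTtsScript({ text, voice, speed }) {
  if (typeof text !== 'string' || text.length === 0) {
    throw new TypeError('text is required');
  }
  const lines = [
    'Add-Type -AssemblyName System.Speech',
    '$synth = New-Object System.Speech.Synthesis.SpeechSynthesizer',
  ];
  if (voice) lines.push(`$synth.SelectVoice(${quotePs(voice)})`);
  lines.push(`$synth.Rate = ${speedToRate(speed)}`);
  lines.push(`$synth.Speak(${quotePs(text)})`);
  return lines.join('; ');
}
```

Quoting the user-supplied `text` and `voice` values matters here because they end up inside a script that PowerShell executes.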

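Speech-to-text

Records audio and converts it to text using Windows Speech Recognition.

Parameters:

duration (optional): Recording duration in seconds (default: 5, maximum: 60)

Example: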

fetch('http://localhost:3000/stt', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    duration: 5
  })
}).then(response => response.json())
  .then(data => console.log(data.text));
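A caller may want to keep `duration` within the documented bounds and surface HTTP errors instead of silently logging `undefined`. The wrapper below is a minimal sketch of that idea. `clampDuration` and `transcribe` are hypothetical names, and the README does not say whether the server itself rejects or clamps out-of-range values. It also assumes a runtime with a global `fetch` (browsers, or Node.js 18 and later).

```javascript
// Assumed bounds, taken from the parameter description (default 5, maximum 60).
const DEFAULT_DURATION = 5;
const MAX_DURATION = 60;

// Falls back to the default for missing or invalid input and caps at the maximum.
function clampDuration(duration) {
  const seconds = Number(duration);
  if (!Number.isFinite(seconds) || seconds <= 0) return DEFAULT_DURATION;
  return Math.min(seconds, MAX_DURATION);
}

// Posts to the /stt endpoint and returns the recognized text.
async function transcribe(duration, baseUrl = 'http://localhost:3000') {
  const response = await fetch(`${baseUrl}/stt`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ duration: clampDuration(duration) }),
  });
  if (!response.ok) {
    throw new Error(`STT request failed with HTTP ${response.status}`);
  }
  const data = await response.json();
  return data.text;
}
```

With the test server running, `transcribe(10).then(console.log)` would record for ten seconds and print whatever text the server returns.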

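Troubleshooting

Make sure Windows speech recognition is enabled:
- Open Windows Settings
- Go to Time & Language > Speech
- Enable speech recognition
Check the available voices:
- Open PowerShell and run: GXP7
Test speech recognition:
- Open Speech Recognition in Windows Settings
- Run the setup wizard if you have not already done so
- Check that Windows can recognize your voice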
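
The voice check can also be scripted from Node. The sketch below is not the README's own command. It assumes the .NET `System.Speech` assembly, whose `SpeechSynthesizer.GetInstalledVoices()` method reports the installed voices, and the names `LIST_VOICES_SCRIPT`, `parseVoiceNames`, and `listVoices` are hypothetical.

```javascript
const { execFile } = require('node:child_process');

// Standard System.Speech calls; prints one installed voice name per line.
const LIST_VOICES_SCRIPT = [
  'Add-Type -AssemblyName System.Speech',
  '(New-Object System.Speech.Synthesis.SpeechSynthesizer).GetInstalledVoices() | ForEach-Object { $_.VoiceInfo.Name }',
].join('; ');

// Splits PowerShell output into trimmed, non-empty lines.
function parseVoiceNames(stdout) {
  return stdout.split(/\r?\n/).map((line) => line.trim()).filter(Boolean);
}

// Resolves with an array of voice names (Windows only).
function listVoices() {
  return new Promise((resolve, reject) => {
    execFile('powershell.exe', ['-NoProfile', '-Command', LIST_VOICES_SCRIPT], (error, stdout) => {
      if (error) reject(error);
      else resolve(parseVoiceNames(stdout));
    });
  });
}
```

Any name returned by `listVoices()` should be usable as the `voice` parameter of the text-to-speech tool, assuming the server selects voices by their display name.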

Contributing

Fork the repository
Create your feature branch
Commit your changes
Push to the branch
Create a new pull request

License

MIT
