MS-Lucidia-Voice-Gateway-MCP

Windows の組み込み音声サービスを使用して、テキスト読み上げ機能と音声テキスト変換機能を提供するモデルコンテキストプロトコル (MCP) サーバーです。このサーバーは、PowerShell コマンドを通じてネイティブの Windows Speech API (SAPI) を活用するため、外部 API やサービスは必要ありません。

特徴

Windows SAPI 音声を使用したテキスト読み上げ (TTS)
Windows 音声認識を使用した音声テキスト変換 (STT)
テスト用のシンプルなWebインターフェース
外部APIへの依存なし
ネイティブのWindows機能を使用する

前提条件

音声認識が有効になっているWindows 10/11
Node.js 16以上
パワーシェル

インストール

リポジトリをクローンします。

git clone https://github.com/ExpressionsBot/MS-Lucidia-Voice-Gateway-MCP.git
cd MS-Lucidia-Voice-Gateway-MCP

依存関係をインストールします:

npm install

プロジェクトをビルドします。

npm run build

使用法

テストインターフェース

テストサーバーを起動します。

npm run test

ブラウザでhttp://localhost:3000を開きます。
Webインターフェースを使用してTTSおよびSTT機能をテストする

利用可能なツール

テキスト読み上げ

Windows SAPI を使用してテキストを音声に変換します。

パラメータ:

text （必須）: 音声に変換するテキスト
voice (オプション): 使用する音声 (例: "Microsoft David Desktop")
speed （オプション）：0.5～2.0の音声速度（デフォルト：1.0）

例：

fetch('http://localhost:3000/tts', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    text: "Hello, this is a test",
    voice: "Microsoft David Desktop",
    speed: 1.0
  })
});

音声テキスト変換

Windows 音声認識を使用して音声を録音し、テキストに変換します。

パラメータ:

duration （オプション）：録画時間（秒）（デフォルト：5、最大：60）

例：

fetch('http://localhost:3000/stt', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    duration: 5
  })
}).then(response => response.json())
  .then(data => console.log(data.text));

トラブルシューティング

Windows 音声認識が有効になっていることを確認します。
- Windowsの設定を開く
- 「時間と言語」>「スピーチ」へ移動
- 音声認識を有効にする
利用可能な音声を確認してください:
- PowerShellを開いて実行: GXP7
音声認識をテストする:
- Windows設定で音声認識を開く
- まだ実行していない場合はセットアップウィザードを実行します
- Windowsがあなたの声を認識できるかテストする

貢献

リポジトリをフォークする
機能ブランチを作成する
変更をコミットする
ブランチにプッシュする
新しいプルリクエストを作成する

ライセンス

マサチューセッツ工科大学

This server cannot be installed

security - not tested

license - not found

quality - not tested

How are these scores calculated?

local-only server

The server can only run on the client's local machine because it depends on local resources.

外部依存なしで Windows のネイティブ音声サービスを使用して、テキスト読み上げ機能および音声テキスト変換機能を提供するサーバー。

Related Resources

Reddit Discussion about this server

Related MCP Servers

ElevenLabs MCP Server
mamertofabian
A
security
A
license
A
quality
Integrates with ElevenLabs text-to-speech API.
Last updated -
6
36
Python
MIT License
Kokoro TTS MCP Server
giannisanni
-
security
F
license
-
quality
Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
Last updated -
2
Python
Speech MCP Server
hammeiam
-
security
F
license
-
quality
A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Last updated -
239
JavaScript
TTS-MCP
nakamurau1
-
security
A
license
-
quality
A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
Last updated -
TypeScript
MIT License

View all related MCP servers

MS-Lucidia-Voice-Gateway-MCP

MS-Lucidia-Voice-Gateway-MCP

特徴

前提条件

インストール

使用法

テストインターフェース

利用可能なツール

テキスト読み上げ

音声テキスト変換

トラブルシューティング

貢献

ライセンス

Related Resources

Related MCP Servers

ElevenLabs MCP Server

Kokoro TTS MCP Server

Speech MCP Server

TTS-MCP

Appeared in Searches

New MCP Servers

MCP directory API