Used for version control and distribution of the MCP server code.
Hosts the repository for the MCP server, enabling collaboration and distribution.
Provides a runtime environment for the MCP server, allowing it to execute JavaScript code for handling text-to-speech and speech-to-text requests.
Package manager used for installing dependencies and running scripts for the MCP server.
MS-Lucidia-Voice-Gateway-MCP
A Model Context Protocol (MCP) server that provides text-to-speech and speech-to-text capabilities using Windows' built-in speech services. This server leverages the native Windows Speech API (SAPI) through PowerShell commands, eliminating the need for external APIs or services.
Features
Text-to-Speech (TTS) using Windows SAPI voices
Speech-to-Text (STT) using Windows Speech Recognition
Simple web interface for testing
No external API dependencies
Uses native Windows capabilities
Related MCP server: Gladia MCP
Prerequisites
Windows 10/11 with Speech Recognition enabled
Node.js 16+
PowerShell
Installation
Clone the repository:
Install dependencies:
Build the project:
Usage
Testing Interface
Start the test server:
Open
http://localhost:3000in your browserUse the web interface to test TTS and STT capabilities
Available Tools
text_to_speech
Converts text to speech using Windows SAPI.
Parameters:
text(required): The text to convert to speechvoice(optional): The voice to use (e.g., "Microsoft David Desktop")speed(optional): Speech rate from 0.5 to 2.0 (default: 1.0)
Example:
speech_to_text
Records audio and converts it to text using Windows Speech Recognition.
Parameters:
duration(optional): Recording duration in seconds (default: 5, max: 60)
Example:
Troubleshooting
Make sure Windows Speech Recognition is enabled:
Open Windows Settings
Go to Time & Language > Speech
Enable Speech Recognition
Check available voices:
Open PowerShell and run:
Add-Type -AssemblyName System.Speech (New-Object System.Speech.Synthesis.SpeechSynthesizer).GetInstalledVoices().VoiceInfo.NameTest speech recognition:
Open Speech Recognition in Windows Settings
Run through the setup wizard if not already done
Test that Windows can recognize your voice
Contributing
Fork the repository
Create your feature branch
Commit your changes
Push to the branch
Create a new Pull Request
License
MIT