Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Claude Voice Commands activate voice mode".
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Claude Voice Commands
Enable natural voice interaction with Claude Code through MCP (Model Context Protocol).
Features
Voice-to-text commands - Speak naturally to Claude Code
Wake word activation - "Hey Claude" to start listening
Low latency - Sub-second response with Google Speech Recognition
Multiple backends - Google, Whisper, and OpenAI Realtime support
Command queue - Handle multiple rapid commands
Privacy-focused - Only listens when activated
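The command queue feature can be pictured with a short sketch. This is not the project's implementation: the `CommandQueue` class and its `handler` callback are hypothetical names chosen for illustration, and it assumes recognized commands arrive as plain strings that should be processed one at a time, in the order they were spoken.

```python
import queue
import threading
from typing import Callable


class CommandQueue:
    """Hypothetical FIFO queue that processes voice commands sequentially."""

    def __init__(self, handler: Callable[[str], None]) -> None:
        self._queue: "queue.Queue[str]" = queue.Queue()
        self._handler = handler
        # A single daemon worker keeps commands in spoken order.
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def submit(self, command: str) -> None:
        """Enqueue a command; returns immediately so listening can continue."""
        self._queue.put(command)

    def wait(self) -> None:
        """Block until every submitted command has been handled."""
        self._queue.join()

    def _run(self) -> None:
        while True:
            command = self._queue.get()
            try:
                self._handler(command)
            except Exception as exc:
                # Keep the worker alive if one command fails.
                print(f"command failed: {command!r}: {exc}")
            finally:
                self._queue.task_done()
```

Because `submit` only enqueues, several commands spoken in quick succession are not dropped while an earlier one is still being handled.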
Quick Start
Prerequisites
Python 3.8+
macOS, Windows, or Linux
Working microphone
Claude Desktop app
Installation
Clone the repository:
git clone https://github.com/JWitcoff/Claude_Chat.git
cd Claude_Chat
Create a virtual environment:
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
Install dependencies:
pip install -r requirements.txt
Test your microphone:
python tests/test_microphone.py
Configure Claude Desktop (Sprint 2+): Add to
~/Library/Application Support/Claude/claude_desktop_config.json:
{
  "mcpServers": {
    "voice-commands": {
      "command": "python",
      "args": ["-m", "mcp_server.voice_server"],
      "cwd": "/path/to/Claude_Chat",
      "env": {
        "PYTHONPATH": "/path/to/Claude_Chat"
      }
    }
  }
}
Restart Claude Desktop
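If the config file already lists other MCP servers, editing it by hand risks overwriting them or leaving invalid JSON. The following is a minimal sketch of merging the entry programmatically. The `add_voice_server` helper is a hypothetical name, not part of this repository, and `/path/to/Claude_Chat` remains a placeholder for your own checkout.

```python
import json
from pathlib import Path


def add_voice_server(config_path: Path, project_dir: str) -> dict:
    """Merge the "voice-commands" entry into a Claude Desktop config file.

    Hypothetical helper for illustration; it writes the same entry shown
    in the configuration step above.
    """
    # Start from the existing config so other MCP servers are preserved.
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}

    servers = config.setdefault("mcpServers", {})
    servers["voice-commands"] = {
        "command": "python",
        "args": ["-m", "mcp_server.voice_server"],
        "cwd": project_dir,
        "env": {"PYTHONPATH": project_dir},
    }

    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return config


# Example call (macOS location from the step above):
# add_voice_server(
#     Path.home() / "Library/Application Support/Claude/claude_desktop_config.json",
#     "/path/to/Claude_Chat",
# )
```

Restart Claude Desktop after running it, just as with a manual edit.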
Usage
Basic Commands
Say to Claude Code:
"Activate voice mode" - Start continuous listening
"Stop voice mode" - Stop listening
"Hey Claude, [command]" - Wake word activation
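To illustrate the wake word form, here is a sketch of how a transcript such as "Hey Claude, create a function" might be split into wake word and command. The `extract_command` function is a hypothetical name, and the default wake words are taken from the sample `WAKE_WORDS` setting in this README. The project's actual matching logic may differ.

```python
from typing import Optional, Sequence


def extract_command(
    transcript: str,
    wake_words: Sequence[str] = ("hey claude", "claude"),
) -> Optional[str]:
    """Return the command following a wake word, or None if none is present.

    Illustrative only: assumes the wake word must start the utterance and
    that wake_words are given in lowercase.
    """
    text = transcript.strip()
    # Try longer wake words first so "hey claude" wins over "claude".
    for wake in sorted(wake_words, key=len, reverse=True):
        if text[: len(wake)].lower() != wake:
            continue
        rest = text[len(wake):]
        # Require a word boundary so "Claudette ..." does not match.
        if rest and rest[0].isalnum():
            continue
        # Drop the punctuation and spaces that separate wake word and command.
        return rest.lstrip(" ,.:;!?").strip()
    return None
```

With this sketch, an utterance that does not begin with a wake word returns `None`, which matches the "only listens when activated" behavior described under Features.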
Example Interactions
You: "Activate voice mode"
Claude: Voice mode activated
You: "Create a Python function to sort a list"
Claude: [Creates the function]
You: "Now add error handling to it"
Claude: [Modifies the code]
You: "Stop voice mode"
Claude: Voice mode deactivated
Configuration
Create a .env file:
# Recognition Settings
VOICE_BACKEND=google # Primary: google, whisper, openai
VOICE_ENERGY_THRESHOLD=4000 # Microphone sensitivity
WAKE_WORDS=hey claude,claude
# Optional API Keys (for premium features)
OPENAI_API_KEY=sk-... # For ultra-low latency
Development
See ROADMAP.md for the development plan and CLAUDE.md for Claude Code guidance.
Current Status
Sprint 0: Project Setup (Complete)
Sprint 1: Basic Voice Capture (In Progress)
Troubleshooting
Microphone Issues
# List available microphones
python -c "import speech_recognition as sr; print(sr.Microphone.list_microphone_names())"
# Calibrate for ambient noise
python tests/calibrate_mic.py
Recognition Issues
Speak clearly and at a normal pace
Reduce background noise
Adjust VOICE_ENERGY_THRESHOLD in .env
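When tuning values in .env, it can help to see how such settings might be read. The sketch below parses the `KEY=value # comment` format used in the Configuration section. The function names are hypothetical, the fallback values are simply copied from the sample file, and the project may well load its settings differently (for example through a dotenv library).

```python
from typing import Dict, List

# Fallbacks copied from the sample .env in this README (an assumption,
# not the project's documented defaults).
DEFAULTS = {
    "VOICE_BACKEND": "google",
    "VOICE_ENERGY_THRESHOLD": "4000",
    "WAKE_WORDS": "hey claude,claude",
}


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=value lines, ignoring blank lines and # comments."""
    settings = dict(DEFAULTS)
    for raw in text.splitlines():
        # Everything after "#" is treated as a comment, so values
        # containing "#" are not supported by this sketch.
        line = raw.split("#", 1)[0].strip()
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


def wake_words(settings: Dict[str, str]) -> List[str]:
    """Split the comma-separated WAKE_WORDS value into lowercase phrases."""
    return [w.strip().lower() for w in settings["WAKE_WORDS"].split(",") if w.strip()]


def energy_threshold(settings: Dict[str, str]) -> int:
    """Return VOICE_ENERGY_THRESHOLD as an integer."""
    return int(settings["VOICE_ENERGY_THRESHOLD"])
```

In the SpeechRecognition library, a higher energy threshold makes the recognizer less sensitive to quiet sounds. If this project passes the value through unchanged, raising it should help in noisy rooms and lowering it should help when speech is not being picked up.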
License
MIT
Contributing
See CONTRIBUTING.md for guidelines.
Acknowledgments
Google Speech Recognition API
OpenAI Whisper
FastMCP for MCP integration