Enable continuous, hands-free two-way voice conversations with Claude Code through browser-based speech recognition and text-to-speech. Voice Input: Speak continuously to Claude with real-time interruptions, redirections, and feedback without stopping ongoing tasks. Voice Output: Receive spoken responses using browser-based or system-level text-to-speech (including high-quality Mac voices). Seamless Integration: Automatically delivers voice input after tool use and integrates with Claude Code's existing workflow through hooks, requiring no downloads or API keys. Flexible Control: Configure automatic processing or enable manual control for debugging, with customizable settings for server port, browser behavior, and voice delivery timing.
Voice Mode for Claude Code
Voice Mode for Claude Code allows you to have a continuous two-way conversation with Claude Code, hands-free.
It uses the new Claude Code hooks to deliver voice input to Claude while it works.
This lets you speak continuously to Claude - interrupt, redirect, or provide feedback without stopping what Claude is doing.
Optionally enable text-to-speech to have Claude speak back to you.
Voice recognition and text-to-speech are handled by the browser, so there is nothing to download, and no API keys are needed.
Demo Video
Installation
Installation is easy.
1. Install Claude Code
2. Install Voice Mode
Usage
1. Start Claude Code
2. Start Listening
The browser interface will automatically open after 3 seconds (http://localhost:5111).
Click "Start Listening"
3. Speak
Say something to Claude. You will need to send one message in the Claude Code CLI to start the conversation.
Browser Compatibility
- ✅ Chrome: Full support for speech recognition, browser text-to-speech, and system text-to-speech
- ⚠️ Safari: Full support for speech recognition and system text-to-speech, but browser text-to-speech cannot load high-quality voices
- ❌ Edge: Speech recognition not working on Apple Silicon (language-not-supported error)
Voice responses
There are two options for voice responses:
- Browser Text-to-Speech
- System Text-to-Speech
Selecting and downloading high quality System Voices (Mac only)
Mac has built-in text to speech, but high quality voices are not available by default.
You can download high quality voices from the system voice menu: System Settings > Accessibility > Spoken Content > System Voice
Click the info icon next to the system voice dropdown. Search for "Siri" to find the highest quality voices. You'll have to trigger a download of the voice.
Once it's downloaded, you can select it in the Browser Voice (Local) menu in Chrome.
Test it with the bash command:
To use Siri voices with voice-hooks, you need to set your system voice and select "Mac System Voice" in the voice-hooks browser interface.
Other downloaded voices will show up in the voice dropdown in the voice-hooks browser interface so you can select them there directly, instead of using the "Mac System Voice" option.
There is a bug in Safari that prevents browser text-to-speech from loading high-quality voices after browser restart. This is a Safari Web Speech API limitation. To use high-quality voices in Safari you need to set your system voice to Siri and select "Mac System Voice" in the voice-hooks browser interface.
Manual Hook Installation
The hooks are automatically installed/updated when the MCP server starts. However, if you need to manually install or reconfigure the hooks:
This will configure your project's .claude/settings.local.json
with the necessary hook commands.
Uninstallation
To completely remove MCP Voice Hooks:
This will:
- Clean up voice hooks from your project's
.claude/settings.local.json
- Preserve any custom hooks you've added
Development Mode
If you're developing mcp-voice-hooks itself:
Important: When developing with npm link
:
- Claude runs the compiled JavaScript from the
dist
folder, not your TypeScript source - After making changes to TypeScript files (
src/*.ts
), you must runnpm run build
- For changes to browser files (
public/*
), just restart Claude Code - Then restart Claude Code to use the updated code
Configuration
Port Configuration
The default port is 5111. To use a different port, add to your project's .claude/settings.local.json
:
Browser Auto-Open
When running in MCP-managed mode, the browser will automatically open if no frontend connects within 3 seconds. To disable this behavior:
Auto-Deliver Voice Input (Default)
By default, mcp-voice-hooks automatically delivers voice input to Claude after tool use, before speaking, and before stopping:
- The
dequeue_utterances
andwait_for_utterance
MCP tools are hidden from Claude - Voice input is automatically delivered when Claude performs any action
- Claude receives voice input naturally without needing to explicitly call mcp-voice-hooks tools
To disable auto-delivery:
When auto-delivery is disabled:
- The
dequeue_utterances
andwait_for_utterance
tools become visible - Hooks no longer automatically process voice input
- Claude will be blocked from making tool calls until it manually dequeues voice input
- This mode is useful for debugging or when you want manual control
local-only server
The server can only run on the client's local machine because it depends on local resources.
Tools
Voice Mode for Claude Code
Related MCP Servers
- MIT License
- MIT License
- MIT License