Voice Recorder MCP Server
local-only server
The server can only run on the client’s local machine because it depends on local resources.
Integrations
Provides installation support via GitHub repository, allowing users to clone and install the voice-recorder-mcp from DefiBax's GitHub account
Uses OpenAI's Whisper model for audio transcription, enabling conversion of recorded voice to text with different model sizes for varying accuracy and performance needs
Voice Recorder MCP Server
An MCP server for recording audio and transcribing it using OpenAI's Whisper model. Designed to work as a Goose custom extension or standalone MCP server.
Features
- Record audio from the default microphone
- Transcribe recordings using Whisper
- Integrates with Goose AI agent as a custom extension
- Includes prompts for common recording scenarios
Installation
Usage
As a Standalone MCP Server
Testing with MCP Inspector
The MCP Inspector provides an interactive interface to test your server:
With Goose AI Agent
- Open Goose and go to Settings > Extensions > Add > Command Line Extension
- Set the name to
voice-recorder
- In the Command field, enter the full path to the voice-recorder-mcp executable:Or for a specific model:To find the path, run:
- No environment variables are needed for basic functionality
- Start a conversation with Goose and introduce the recorder with: "I want you to take action from transcriptions returned by voice-recorder. For example, if I dictate a calculation like 1+1, please return the result."
Available Tools
start_recording
: Start recording audio from the default microphonestop_and_transcribe
: Stop recording and transcribe the audio to textrecord_and_transcribe
: Record audio for a specified duration and transcribe it
Whisper Models
This extension supports various Whisper model sizes:
Model | Speed | Accuracy | Memory Usage | Use Case |
---|---|---|---|---|
tiny.en | Fastest | Lowest | Minimal | Testing, quick transcriptions |
base.en | Fast | Good | Low | Everyday use (default) |
small.en | Medium | Better | Moderate | Good balance |
medium.en | Slow | High | High | Important recordings |
large | Slowest | Highest | Very High | Critical transcriptions |
The .en
suffix indicates models specialized for English, which are faster and more accurate for English content.
Requirements
- Python 3.12+
- An audio input device (microphone)
Configuration
You can configure the server using environment variables:
Troubleshooting
Common Issues
- No audio being recorded: Check your microphone permissions and settings
- Model download errors: Ensure you have a stable internet connection for the initial model download
- Integration with Goose: Make sure the command path is correct
- Audio quality issues: Try adjusting the sample rate (default: 16000)
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/amazing-feature
) - Commit your changes (
git commit -m 'Add some amazing feature'
) - Push to the branch (
git push origin feature/amazing-feature
) - Open a Pull Request
License
This project is licensed under the MIT License - see the LICENSE file for details.
This server cannot be installed
Enables recording audio from a microphone and transcribing it using OpenAI's Whisper model. Works as both a standalone MCP server and a Goose AI agent extension.
- Features
- Installation
- Usage
- Available Tools
- Whisper Models
- Requirements
- Configuration
- Troubleshooting
- Contributing
- License