Replaces the built-in macOS dictation system with enhanced speech-to-text capabilities using local AI processing
Integrates OpenAI's Whisper model for local speech-to-text transcription on Apple Silicon devices, replacing macOS native dictation with superior accuracy
Enables transcription of YouTube videos by extracting audio content and processing it through local Whisper models
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Whisperatranscribe this audio file from my meeting"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Whispera
A native macOS app that replaces the built-in dictation with OpenAI's Whisper for superior transcription accuracy. Transcribe speech, local files, YouTube videos, and network streams - all processed locally on your Neural Engine.
⬇️ Download Latest Release
Demos
Related MCP server: Voice Recorder MCP Server
Features
Live transcription (beta)
Speech-to-text - Replaces macOS native dictation with WhisperKit (OpenAI's Whisper model on Neural Engine) for better accuracy
File transcription - Audio and video files
Network media transcription - Stream video/music URLs
YouTube transcription
All processing runs locally. Internet required only for initial model download.
Roadmap
Multi-language support beyond English
Real-time translation capabilities
Additional customization options
Usage
Simply use your configured global shortcut to start transcribing with Whisper instead of the default macOS dictation.
Known Issues
The app does not work with Intel mac(see Issue 15
Auto install does not work, after an app has been downloaded, please manually drag and drop the app to you
/ApplicationfolderThere is a weird issue with app quiting unexpectedly if you get that please report it here: Issue 21
Requirements
macOS 13.0 or later
Apple Silicon
We are working on support for Intel Mac
Credits
Built with:
WhisperKit - On-device Whisper transcription for Apple Silicon
YouTubeKit - YouTube content extraction
Thanks to these projects for making privacy-focused, local transcription a reality.
License
MIT License