Replaces the built-in macOS dictation system with enhanced speech-to-text capabilities using local AI processing
Integrates OpenAI's Whisper model for local speech-to-text transcription on Apple Silicon devices, replacing macOS native dictation with superior accuracy
Enables transcription of YouTube videos by extracting audio content and processing it through local Whisper models
Whispera
A native macOS app that replaces the built-in dictation with OpenAI's Whisper for superior transcription accuracy. Transcribe speech, local files, YouTube videos, and network streams - all processed locally on your Neural Engine.
⬇️ Download Latest Release
Demos
Features
- Live transcription (beta)
- Speech-to-text - Replaces macOS native dictation with WhisperKit (OpenAI's Whisper model on Neural Engine) for better accuracy
- File transcription - Audio and video files
- Network media transcription - Stream video/music URLs
- YouTube transcription
All processing runs locally. Internet required only for initial model download.
Roadmap
- Multi-language support beyond English
- PR: https://github.com/sapoepsilon/Whispera/pull/2
- Release: https://github.com/sapoepsilon/Whispera/releases/tag/v1.0.3
- Real-time translation capabilities
- PR: https://github.com/sapoepsilon/Whispera/pull/17
- Release: https://github.com/sapoepsilon/Whispera/releases/tag/v1.0.18
- Additional customization options
Usage
Simply use your configured global shortcut to start transcribing with Whisper instead of the default macOS dictation.
Known Issues
- The app does not work with Intel mac(see Issue 15
- Auto install does not work, after an app has been downloaded, please manually drag and drop the app to you
/Application
folder
Requirements
- macOS 13.0 or later
- Apple Silicon
- We are working on support for Intel Mac
Credits
Built with:
- WhisperKit - On-device Whisper transcription for Apple Silicon
- YouTubeKit - YouTube content extraction
- swift-markdown-ui
Thanks to these projects for making privacy-focused, local transcription a reality.
License
MIT License
This server cannot be installed
local-only server
The server can only run on the client's local machine because it depends on local resources.
AI-powered voice transcription app for macOS using WhisperKit
Related MCP Servers
- AsecurityAlicenseAqualityEnables text-to-speech functionality on macOS using the say command, offering extensive control over speech parameters like voice, rate, volume, and pitch for a customizable auditory experience.Last updated -2317MIT License
- -securityAlicense-qualityEnables recording audio from a microphone and transcribing it using OpenAI's Whisper model. Works as both a standalone MCP server and a Goose AI agent extension.Last updated -6MIT License
- AsecurityFlicenseAqualityA Python-based server that provides access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.Last updated -52
- -securityAlicense-qualityA portable, Dockerized Python tool that implements Model Context Protocol for audio transcription using Whisper models, featuring both CLI and web UI interfaces for converting audio files to JSON transcriptions.Last updated -MIT License