transcribe_local
Transcribe local audio or video files with timestamps using Whisper. Supports formats like mp3, mp4, and more, with cached results for instant repeats.
Instructions
Transcribe a local audio or video file with timestamps using Whisper.
Supports any format FFmpeg can decode: mp3, wav, m4a, flac, ogg, aac, mp4, mkv, webm, avi, mov, wma, opus, and more.
Results are cached — repeat requests for the same file are instant.
Sample prompts that trigger this tool: - "Transcribe this recording: /path/to/meeting.mp3" - "What's said in this video? /path/to/lecture.mp4" - "Transcribe ~/Downloads/interview.wav" - "Transcribe the audio file on my desktop"
Args: file_path: Absolute path to the audio or video file. model_size: Whisper model size (tiny/base/small/medium/large). Default: tiny. language: Language code (e.g. "en", "de", "fr"). Auto-detected if empty.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file_path | Yes | ||
| model_size | No | tiny | |
| language | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |