soul_listen
Transcribe audio files to text with timestamps using local Whisper speech-to-text. Provide audio file path (WAV, MP3, M4A) for accurate transcription.
Instructions
Convert speech to text using local Whisper STT.
Transcribes audio into text with timestamps for each segment. First call downloads whisper base model (~150MB).
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| audio_path | Yes | Path to audio file (WAV, MP3, M4A, etc.) |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |