transcribe_audio
Transcribe audio files in formats like wav, mp3, or m4a into text using Whisper models. Supports optional timestamps and multiple output formats.
Instructions
Transcribe an audio file using Whisper. Supports various audio formats (wav, mp3, m4a, etc.). Returns the transcribed text.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file_path | Yes | Absolute path to the audio file to transcribe | |
| model | No | Whisper model to use (tiny.en, base.en, small.en, medium.en, large). Default: base.en | |
| language | No | Language code (e.g., en, es, fr). Default: en | |
| output_format | No | Output format: text (plain text), timestamps (with timestamps), json (structured). Default: text |