transcribe_audio
Transcribe local audio files to text or subtitle formats (SRT, VTT) with speaker diarization. Supports JSON and verbose output, optional context prompts, and language selection.
Instructions
Transcribe an audio file using VibeVoice-ASR.
Args: file_path: Absolute path to the audio file (wav, mp3, flac, etc.) response_format: Output format: text, json, verbose_json, srt, vtt (default: text) prompt: Optional prompt/context to guide transcription language: Language code for verbose_json output (default: en)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | No | ||
| language | No | ||
| file_path | Yes | ||
| response_format | No | text |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |