transcribe_audio
Convert audio files to text with Whisper AI. Supports Telegram voice messages, multiple formats, automatic language detection, and optional word timestamps for transcription analysis.
Instructions
Transcribe an audio file to text using Whisper.
Supports OGG (Telegram voice), WAV, MP3, FLAC, and most common audio formats.
Args:
- file_path: Absolute path to the audio file to transcribe.
- language: Optional ISO-639-1 language code (e.g. 'en', 'fr'). None = auto-detect.
- word_timestamps: If True, include word-level timestamps in segments.

Returns: A dict with the keys text, language, language_probability, duration, segments, backend, success, and error.
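The argument and return conventions above can be sketched in Python. This is a minimal illustration, not the tool's own code: `build_transcribe_args` and `extract_text` are hypothetical helper names, the `False` default for `word_timestamps` is an assumption (the listing does not state a default), and the two-lowercase-letter check is only a rough stand-in for ISO-639-1 validation. How the arguments are actually sent to the tool depends on your MCP client and is not shown.

```python
import os.path
import re
from typing import Any, Dict, Optional


def build_transcribe_args(
    file_path: str,
    language: Optional[str] = None,
    word_timestamps: bool = False,  # assumed default; not stated in the listing
) -> Dict[str, Any]:
    """Validate inputs and build the argument dict for transcribe_audio."""
    # The tool documents file_path as an absolute path.
    if not os.path.isabs(file_path):
        raise ValueError(f"file_path must be absolute, got {file_path!r}")
    # Rough shape check only: ISO-639-1 codes are two lowercase letters.
    if language is not None and not re.fullmatch(r"[a-z]{2}", language):
        raise ValueError(f"language must look like 'en' or 'fr', got {language!r}")
    args: Dict[str, Any] = {
        "file_path": file_path,
        "word_timestamps": word_timestamps,
    }
    # Omitting language leaves auto-detection to the tool.
    if language is not None:
        args["language"] = language
    return args


def extract_text(result: Dict[str, Any]) -> str:
    """Return the transcript, or raise if the result reports a failure."""
    if not result.get("success"):
        raise RuntimeError(result.get("error") or "transcription failed")
    return result["text"]
```

Checking `success` before reading `text` keeps a failed transcription from being mistaken for an empty transcript.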
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
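| file_path | Yes | Absolute path to the audio file to transcribe. | |
| language | No | ISO-639-1 language code (e.g. 'en', 'fr'). None = auto-detect. | |
| word_timestamps | No | If True, include word-level timestamps in segments. | |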
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |