skill_transcribe
Transcribe audio into clean text. Provide base64-encoded audio, optionally specify language and format, to get accurate speech recognition results.
Instructions
High-level transcription skill. Returns clean text from audio.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| audio_base64 | Yes | Audio data encoded as Base64 | |
| lang | No | Language (ru-RU, en-US, kk-KK) | ru-RU |
| format | No | Audio format (oggopus, lpcm) | oggopus |
| return_raw | No | Return raw API response instead of plain text |