synthesize
Convert text to speech using Yandex SpeechKit and receive audio as Base64. Supports multiple languages, voices, emotions, and speech speed adjustment.
Instructions
Speech synthesis (TTS) via Yandex SpeechKit. Takes text, returns Base64 audio.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Text to synthesize (max 5000 chars) | |
| lang | No | Synthesis language (ru-RU, en-US, tr-TR, kk-KK) | ru-RU |
| voice | No | Voice name (filipp, alena, jane, dasha, john, etc.) | filipp |
| format | No | Audio format (oggopus, lpcm, mp3) | oggopus |
| emotion | No | Emotion (neutral, good, evil) — only for voices that support it | |
| speed | No | Speech speed multiplier (0.1–3.0) |