skill_synthesize
Convert text to speech with automatic language detection and smart voice defaults. Choose a voice and an output format to produce spoken audio.
Instructions
High-level speech synthesis skill. It applies smart voice defaults and detects the language from the selected voice when lang is omitted.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Text to speak (max 5000 chars) | |
| voice | No | Voice name | filipp |
| lang | No | Language; auto-detected from the voice if omitted | |
| format | No | Output format (mp3, oggopus, lpcm) | mp3 |
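The schema above can be sketched as a small argument builder. This is a minimal illustration, not the skill's actual client code: the helper name `build_synthesize_args` and the `audio_format` parameter name are hypothetical, and how the resulting arguments are passed to skill_synthesize depends on the host that exposes the skill. The defaults, the 5000-character limit, and the list of formats come from the table. Rejecting an empty string is an assumption.

```python
from typing import Optional

# Limits and defaults taken from the Input Schema table.
MAX_TEXT_CHARS = 5000
ALLOWED_FORMATS = ("mp3", "oggopus", "lpcm")
DEFAULT_VOICE = "filipp"
DEFAULT_FORMAT = "mp3"


def build_synthesize_args(
    text: str,
    voice: str = DEFAULT_VOICE,
    lang: Optional[str] = None,
    audio_format: str = DEFAULT_FORMAT,
) -> dict:
    """Hypothetical helper: validate inputs and build the argument dict."""
    # Assumption: an empty string is treated the same as a missing value.
    if not text:
        raise ValueError("text is required")
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    if audio_format not in ALLOWED_FORMATS:
        raise ValueError(f"format must be one of {ALLOWED_FORMATS}")

    args = {"text": text, "voice": voice, "format": audio_format}
    # Leave lang out entirely when it is not given, so the skill can
    # detect the language from the voice.
    if lang is not None:
        args["lang"] = lang
    return args


if __name__ == "__main__":
    print(build_synthesize_args("Hello, world"))
```

Omitting lang from the dictionary, instead of sending an empty value, keeps the request aligned with the documented behavior that the language is detected from the voice when lang is absent.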