text_to_speech
Convert text to natural speech with customizable voice, model, and audio settings. Saves the generated audio file to your specified location.
Instructions
Convert text to natural speech using ElevenLabs' industry-leading TTS. Saves audio file to disk. Trigger: 'elevenlabs tts', 'elevenlabs speak', or 'elevenlabs text to speech'.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | The text to convert to speech | |
| voice | No | Voice name (e.g., "Rachel", "Adam", "Bella") or voice ID. Use list_voices to see options. | Rachel |
| model | No | TTS model: "eleven_multilingual_v2" (default, 29 languages), "eleven_v3" (latest), "eleven_turbo_v2_5" (fast), "eleven_flash_v2_5" (ultra-fast) | |
| stability | No | Voice stability (0.0-1.0). Lower = more expressive, higher = more consistent. Default: 0.5 | |
| similarity_boost | No | Voice clarity/similarity (0.0-1.0). Higher = closer to original voice. Default: 0.75 | |
| style | No | Style exaggeration (0.0-1.0). Higher = more expressive delivery. Default: 0 | |
| speed | No | Speech speed (0.25-4.0). Default: 1.0 | |
| output_format | No | Audio format: "mp3_44100_128" (default), "mp3_44100_192", "pcm_44100", "pcm_24000", "pcm_16000" | |
| save_path | No | File path to save the audio. If not provided, auto-saves to output directory. |