generate_voice
Generate expressive speech and narration from text with customizable tone and style. Create character dialogue, scripts, and stories using Gemini 2.5 Native Audio.
Instructions
Generates expressive speech and narration from text. Best for scripts, character dialogue, and narration. Uses Gemini 2.5 Native Audio.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | The script or text to be read by the voice. | |
| voice_direction | No | Optional instructions for the tone and style (e.g., 'Speak like a fast-talking auctioneer' or 'Use a whispery tone'). | |
| format | No | Optional output format (wav, mp3, ogg, flac, etc.). | |
| auto_play | No | If true, automatically plays the generated audio. | |
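
To make the schema above concrete, here is a minimal sketch of assembling an arguments object for `generate_voice`. The helper name `build_generate_voice_args` is hypothetical, and the sketch assumes the tool accepts a JSON-style object keyed by the parameter names in the table; how that object is actually sent depends on the client in use, which is not shown here.

```python
import json
from typing import Optional


def build_generate_voice_args(
    text: str,
    voice_direction: Optional[str] = None,
    format: Optional[str] = None,
    auto_play: Optional[bool] = None,
) -> dict:
    """Assemble arguments for generate_voice (hypothetical helper).

    Only `text` is required by the schema; optional fields are
    omitted when unset so the tool can apply its own defaults.
    """
    if not isinstance(text, str) or not text.strip():
        raise ValueError("text is required and must be a non-empty string")

    args = {"text": text}
    optional = {
        "voice_direction": voice_direction,
        "format": format,
        "auto_play": auto_play,
    }
    for name, value in optional.items():
        if value is not None:
            args[name] = value
    return args


# Example values: the voice direction is taken from the table's description.
example_args = build_generate_voice_args(
    text="Going once, going twice, sold!",
    voice_direction="Speak like a fast-talking auctioneer",
    format="mp3",
    auto_play=True,
)
print(json.dumps(example_args, indent=2))
```

Leaving unset optional fields out of the object, instead of sending nulls, is a design choice of this sketch: the table lists no defaults, so the tool is left to decide them.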