modal_voice_clone
Synthesize speech from text using F5-TTS voice cloning. Configure speed and temperature settings to generate audio outputs as URLs or base64.
Instructions
Synthesize speech using Modal F5-TTS voice clone (Isaiah voice by default). Returns audio URL or base64.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Text to synthesize | |
| speed | No | Speed multiplier 0.5–2.0 (default: 1.0) | |
| temperature | No | Sampling temperature (default: 0.7) |