vocametrix_synthesize_speech
Synthesize speech from text with Azure neural voices, providing an audio URL and word-level timing data for voice analysis or pronunciation assessment.
Instructions
Synthesize speech from text using Azure neural text-to-speech. Returns an audio URL and word-level timing data. Supports all Azure Neural voice names for the requested locale.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Text to synthesize (max 1000 characters) | |
| speakerLocale | No | BCP-47 locale code, e.g. "en-US", "fr-FR", "es-ES" | en-US |
| voiceName | No | Azure Neural voice name, e.g. "en-US-JennyNeural" |