vocametrix_assess_pronunciation
Assess pronunciation accuracy at the phoneme level by comparing audio to reference text. Returns accuracy, fluency, completeness, and prosody scores (0–100) with per-word and per-phoneme breakdowns.
Instructions
Score pronunciation accuracy at the phoneme level against a reference text. Returns accuracy, fluency, completeness, and prosody scores (0–100) plus per-word and per-phoneme breakdowns. Supports 30+ locales, including en-US, fr-FR, de-DE, zh-CN, and ar-SA. The audio should be a clear reading of the reference text.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| audioPath | Yes | WAV recording of the speaker reading the reference text | |
| referenceText | Yes | The text the speaker was reading aloud | |
| speakerLocale | No | BCP-47 locale code, e.g. "en-US", "fr-FR", "es-ES" | en-US |
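As a minimal sketch of how the parameters above fit together, the following Python builds the argument object and wraps it in a request. It assumes the tool is invoked through an MCP-style JSON-RPC `tools/call` request, which this page does not state. The helper names `build_arguments` and `build_call_request`, the sample path, and the `.wav` extension check are hypothetical and only illustrate the schema. The shape of the response is not documented here, so it is not modeled.

```python
import json
from typing import Any

TOOL_NAME = "vocametrix_assess_pronunciation"
DEFAULT_LOCALE = "en-US"  # default listed in the Input Schema table


def build_arguments(
    audio_path: str,
    reference_text: str,
    speaker_locale: str = DEFAULT_LOCALE,
) -> dict[str, str]:
    """Assemble the tool arguments using the parameter names from the schema."""
    # Assumption: a ".wav" extension is used as a cheap stand-in for
    # "WAV recording"; the tool itself may validate the file differently.
    if not audio_path.lower().endswith(".wav"):
        raise ValueError("audioPath should point to a WAV recording")
    if not reference_text.strip():
        raise ValueError("referenceText must not be empty")
    return {
        "audioPath": audio_path,
        "referenceText": reference_text,
        "speakerLocale": speaker_locale,
    }


def build_call_request(arguments: dict[str, str], request_id: int = 1) -> dict[str, Any]:
    """Wrap the arguments in a JSON-RPC 2.0 envelope.

    Assumption: the server accepts an MCP-style "tools/call" method.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": TOOL_NAME, "arguments": arguments},
    }


if __name__ == "__main__":
    # Hypothetical recording path and reference sentence.
    args = build_arguments(
        "recordings/sample.wav",
        "The quick brown fox jumps over the lazy dog.",
    )
    print(json.dumps(build_call_request(args), indent=2))
```

Omitting `speaker_locale` falls back to `en-US`, matching the default in the table. For a recording in another language, pass the matching BCP-47 code explicitly (for example `"fr-FR"`).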