text_to_phonemes
Convert English text to phoneme strings for speech synthesis using dictionary lookup, with options to adjust pitch, rate, and formant.
Instructions
Convert English text to an approximate klattsch phoneme string. This does a dictionary lookup word-by-word. Unknown words are spelled out letter-by-letter (which sounds robotic — hand-craft those for best results).
Returns a full phoneme string ready for speak/speak_file. You can (and should) edit the output before passing to speak — add stress marks (!), pitch contours (+N/-N), adjust pauses, or fix mispronounced words.
The output includes control prefixes based on your voice selections.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | The English text to convert (e.g. "Hello, how are you?"). | |
| pitch | No | Base pitch in Hz. 100-140 = male, 180-220 = female, 250-300 = child. | |
| rate | No | Per-phoneme rate in ms. 80-100 = fast, 100-120 = normal, 200-400 = sung. | |
| formantScale | No | Formant scale: 1.0 = male, 1.17 = female, 1.3 = child. | |
| vibrato | No | Vibrato depth in Hz. 0 = off, 2-3 = natural, 5-6 = dramatic/operatic. |