Retrieve a list of available voices with their capabilities and supported features for text-to-speech synthesis, enabling users to select the best fit for expressive and professional speech output.
Transform multiple text segments into audio with customizable voice, speed, emotion, and pacing. Optionally merge the resulting segments into a single audio file.
Provides high-quality text-to-speech synthesis with 10 natural voices, emotion control, and dynamic pacing for professional applications requiring expressive speech output.
Lets Claude generate speech directly in multiple languages and emotions by connecting to a Zonos TTS setup via the Model Context Protocol.
Integrates Piper TTS into the Model Context Protocol, allowing AI assistants to convert text to speech and play it through speakers with customizable voice settings and volume control.