Rime MCP
A Model Context Protocol (MCP) server that provides text-to-speech capabilities using the Rime API. This server downloads audio and plays it using the system's native audio player.
Features
- Exposes a speak tool that converts text to speech and plays it through system audio
- Uses Rime's high-quality voice synthesis API
Requirements
- Node.js 16.x or higher
- A working audio output device
- macOS: Uses afplay
There's sample code from Claude for the following that is not tested 🤙✨
- Windows: Built-in Media.SoundPlayer (PowerShell)
- Linux: mpg123, mplayer, aplay, or ffplay
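The per-platform players listed above could be selected roughly as follows. This is a minimal sketch, not the server's actual code: the `playerCandidates` helper, the `PlayerCommand` type, the PowerShell invocation, and the ffplay flags are all assumptions made for illustration. The platform strings are the values Node.js reports in `process.platform`.

```typescript
// Hypothetical helper, not taken from the Rime MCP source.
interface PlayerCommand {
  command: string;
  args: string[];
}

// Returns the players to try, in order, for a given platform and audio file.
function playerCandidates(platform: string, filePath: string): PlayerCommand[] {
  switch (platform) {
    case "darwin":
      return [{ command: "afplay", args: [filePath] }];
    case "win32": {
      // Double any single quotes so the path survives inside a
      // PowerShell single-quoted string.
      const escaped = filePath.replace(/'/g, "''");
      return [
        {
          command: "powershell",
          args: [
            "-NoProfile",
            "-Command",
            `(New-Object Media.SoundPlayer '${escaped}').PlaySync()`,
          ],
        },
      ];
    }
    case "linux":
      // Same order as the requirements list; the ffplay flags are assumed.
      return [
        { command: "mpg123", args: [filePath] },
        { command: "mplayer", args: [filePath] },
        { command: "aplay", args: [filePath] },
        { command: "ffplay", args: ["-nodisp", "-autoexit", filePath] },
      ];
    default:
      // Unknown platform: no known player.
      return [];
  }
}

const candidates = playerCandidates("linux", "/tmp/speech.mp3");
console.log(candidates.map((c) => c.command).join(", "));
```

A caller would typically walk the returned list and use the first player that is actually installed, for example by spawning it with Node's `child_process` module.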
MCP Configuration
All of the optional env vars are part of the tool definition and are prompts to
All voice options are listed here.
You can get your API key from the Rime Dashboard.
The following environment variables can be used to customize the behavior:
- RIME_GUIDANCE: The main description of when and how to use the speak tool
- RIME_WHO_TO_ADDRESS: Who the speech should address (default: "user")
- RIME_WHEN_TO_SPEAK: When the tool should be used (default: "when asked to speak or when finishing a command")
- RIME_VOICE: The default voice to use (default: "cove")
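To make the defaults above concrete, here is a minimal sketch of how the variables might be resolved at startup. The `resolveSpeakConfig` function and the `SpeakConfig` field names are hypothetical, and treating an empty string as unset is an assumption; only the variable names and default values come from the list above.

```typescript
// Hypothetical config shape; field names are illustrative only.
interface SpeakConfig {
  guidance: string | undefined; // no default is documented for RIME_GUIDANCE
  whoToAddress: string;
  whenToSpeak: string;
  voice: string;
}

type Env = Record<string, string | undefined>;

// Resolves the documented environment variables, applying the documented defaults.
function resolveSpeakConfig(env: Env): SpeakConfig {
  // Assumption: an empty or whitespace-only value counts as unset.
  const orDefault = (value: string | undefined, fallback: string): string =>
    value !== undefined && value.trim() !== "" ? value : fallback;

  return {
    guidance: env["RIME_GUIDANCE"],
    whoToAddress: orDefault(env["RIME_WHO_TO_ADDRESS"], "user"),
    whenToSpeak: orDefault(
      env["RIME_WHEN_TO_SPEAK"],
      "when asked to speak or when finishing a command",
    ),
    voice: orDefault(env["RIME_VOICE"], "cove"),
  };
}

// With nothing set, every documented default applies.
const config = resolveSpeakConfig({});
console.log(config.voice, config.whoToAddress);
```

In a real Node.js process the argument would be `process.env`; a plain object is used here so the sketch runs without any setup.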
Example use cases
Example 1: Coding agent announcements
Example 2: Learn how the kids talk these days
Example 3: Different languages based on context
Development
- Install dependencies:
- Build the server:
- Run in development mode with hot reload:
License
MIT