📢 Blabber-MCP 🗣️

An MCP server that gives your LLMs a voice using OpenAI's Text-to-Speech API! 🔊


✨ Features

  • Text-to-Speech: Converts input text into high-quality spoken audio.
  • Voice Selection: Choose from various OpenAI voices (alloy, echo, fable, onyx, nova, shimmer).
  • Model Selection: Use standard (tts-1) or high-definition (tts-1-hd) models.
  • Format Options: Get audio output in mp3, opus, aac, or flac.
  • File Saving: Saves the generated audio to a local file.
  • Optional Playback: Automatically play the generated audio using a configurable system command.
  • Configurable Defaults: Set a default voice via configuration.
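The features above map closely onto the parameters of OpenAI's speech endpoint (`POST https://api.openai.com/v1/audio/speech`). As a rough illustration only, the sketch below builds such a request without sending it. It assumes the server forwards `model`, `input`, `voice`, and `response_format` more or less unchanged; the helper name `build_speech_request` is hypothetical and is not part of Blabber-MCP.

```python
import json
import urllib.request

# Public OpenAI endpoint for text-to-speech.
OPENAI_SPEECH_URL = "https://api.openai.com/v1/audio/speech"


def build_speech_request(api_key, text, voice="nova", model="tts-1", response_format="mp3"):
    """Build (but do not send) a POST request for the speech endpoint.

    Hypothetical helper: it only shows how the documented options could
    translate into a request body.
    """
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        OPENAI_SPEECH_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_speech_request(
        "YOUR_OPENAI_API_KEY", "Hello from Blabber MCP!", voice="shimmer"
    )
    print(request.full_url)
    print(request.data.decode("utf-8"))
```

Sending the request (for example with `urllib.request.urlopen`) would return the audio bytes, which a server like this one would then write to a file.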

🔧 Configuration

To use this server, you need to add its configuration to your MCP client's settings file (e.g., mcp_settings.json).

  1. Get OpenAI API Key: You need an API key from OpenAI.
  2. Add to MCP Settings: Add the following block to the mcpServers object in your settings file, replacing "YOUR_OPENAI_API_KEY" with your actual key.
{
  "mcpServers": {
    "blabber-mcp": {
      "command": "node",
      "args": ["/full/path/to/blabber-mcp/build/index.js"],
      "env": {
        "OPENAI_API_KEY": "YOUR_OPENAI_API_KEY",
        "AUDIO_PLAYER_COMMAND": "xdg-open",
        "DEFAULT_TTS_VOICE": "nova"
      },
      "disabled": false,
      "alwaysAllow": []
    }
  }
}

  • AUDIO_PLAYER_COMMAND (optional): Command used to play audio (e.g., "cvlc", "vlc", "mpv", "ffplay", "afplay", "xdg-open"). Defaults to "cvlc".
  • DEFAULT_TTS_VOICE (optional): Default voice (alloy, echo, fable, onyx, nova, shimmer). Defaults to nova.

Important: Make sure the args path points to the correct location of the build/index.js file within your blabber-mcp project directory. Use the full absolute path.
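If you prefer to generate the settings entry rather than edit it by hand, something like the following could work. This is a minimal sketch: `build_server_entry` and `add_server` are hypothetical helper names, and the only facts taken from this README are the key names and the requirement that the `args` path be absolute.

```python
import json
import os


def build_server_entry(index_js_path, api_key, player_command=None, default_voice=None):
    """Return the settings entry for blabber-mcp as a dict.

    Rejects relative paths, since the README asks for the full absolute
    path to build/index.js.
    """
    if not os.path.isabs(index_js_path):
        raise ValueError("args path must be absolute: " + index_js_path)
    env = {"OPENAI_API_KEY": api_key}
    # Optional variables are only written when a value is supplied.
    if player_command:
        env["AUDIO_PLAYER_COMMAND"] = player_command
    if default_voice:
        env["DEFAULT_TTS_VOICE"] = default_voice
    return {
        "command": "node",
        "args": [index_js_path],
        "env": env,
        "disabled": False,
        "alwaysAllow": [],
    }


def add_server(settings, name, entry):
    """Return a copy of the settings dict with the entry added under mcpServers."""
    merged = dict(settings)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = entry
    merged["mcpServers"] = servers
    return merged


if __name__ == "__main__":
    entry = build_server_entry(
        "/full/path/to/blabber-mcp/build/index.js",
        "YOUR_OPENAI_API_KEY",
        player_command="xdg-open",
        default_voice="nova",
    )
    print(json.dumps(add_server({}, "blabber-mcp", entry), indent=2))
```

Other servers already present under `mcpServers` are kept, so the printed result can be compared against your existing settings file before you save it.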


🚀 Usage

Once configured and running, you can use the text_to_speech tool via your MCP client.

Tool: text_to_speech
Server: blabber-mcp (or the key you used in the config)

Arguments:

  • input (string, required): The text to synthesize.
  • voice (string, optional): The voice to use (alloy, echo, fable, onyx, nova, shimmer). Defaults to the DEFAULT_TTS_VOICE set in the config, or to nova if that variable is not set.
  • model (string, optional): The model (tts-1, tts-1-hd). Defaults to tts-1.
  • response_format (string, optional): Audio format (mp3, opus, aac, flac). Defaults to mp3.
  • play (boolean, optional): Set to true to automatically play the audio after saving. Defaults to false.
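The defaulting rules in the list above can be sketched as a small function. This illustrates the documented behavior and is not the server's actual code: `normalize_arguments` is a hypothetical name, and rejecting an empty `input` or an unknown value with an error is an assumption.

```python
import os

VOICES = ("alloy", "echo", "fable", "onyx", "nova", "shimmer")
MODELS = ("tts-1", "tts-1-hd")
FORMATS = ("mp3", "opus", "aac", "flac")


def normalize_arguments(arguments, env=None):
    """Apply the documented defaults and validate the allowed values."""
    env = os.environ if env is None else env
    text = arguments.get("input")
    if not isinstance(text, str) or not text:
        # Assumption: an empty string is treated like a missing input.
        raise ValueError("input is required and must be a non-empty string")

    resolved = {
        "input": text,
        # An explicit voice wins, then DEFAULT_TTS_VOICE, then nova.
        "voice": arguments.get("voice") or env.get("DEFAULT_TTS_VOICE") or "nova",
        "model": arguments.get("model", "tts-1"),
        "response_format": arguments.get("response_format", "mp3"),
        "play": arguments.get("play", False),
    }

    if resolved["voice"] not in VOICES:
        raise ValueError("unknown voice: " + str(resolved["voice"]))
    if resolved["model"] not in MODELS:
        raise ValueError("unknown model: " + str(resolved["model"]))
    if resolved["response_format"] not in FORMATS:
        raise ValueError("unknown response_format: " + str(resolved["response_format"]))
    if not isinstance(resolved["play"], bool):
        raise ValueError("play must be a boolean")
    return resolved


if __name__ == "__main__":
    print(normalize_arguments({"input": "Hello from Blabber MCP!"}, env={}))
```

Passing `env` explicitly keeps the function easy to test; leaving it out reads `DEFAULT_TTS_VOICE` from the process environment.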

Example Tool Call (with playback):

<use_mcp_tool>
  <server_name>blabber-mcp</server_name>
  <tool_name>text_to_speech</tool_name>
  <arguments>
    {
      "input": "Hello from Blabber MCP!",
      "voice": "shimmer",
      "play": true
    }
  </arguments>
</use_mcp_tool>

Output:

The tool saves the audio file to the output/ directory within the blabber-mcp project folder and returns a JSON response like this:

{
  "message": "Audio saved successfully. Playback initiated using command: cvlc",
  "filePath": "path/to/speech_1743908694848.mp3",
  "format": "mp3",
  "voiceUsed": "shimmer"
}
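A client that consumes this response might check it before using the file. The sketch below assumes the four keys shown in the example are always present and that the file extension matches `format`; both are inferred from the single sample above, not guaranteed by the server, and `parse_result` is a hypothetical helper name.

```python
import json
from pathlib import PurePath

# Keys seen in the example response; assumed to be present on success.
REQUIRED_KEYS = ("message", "filePath", "format", "voiceUsed")


def parse_result(raw):
    """Parse the tool's JSON response and run basic consistency checks."""
    result = json.loads(raw)
    missing = [key for key in REQUIRED_KEYS if key not in result]
    if missing:
        raise ValueError("response is missing keys: " + ", ".join(missing))
    # Assumption: the saved file's extension equals the reported format.
    suffix = PurePath(result["filePath"]).suffix.lstrip(".")
    if suffix != result["format"]:
        raise ValueError("file extension does not match format: " + result["filePath"])
    return result


if __name__ == "__main__":
    sample = """
    {
      "message": "Audio saved successfully. Playback initiated using command: cvlc",
      "filePath": "path/to/speech_1743908694848.mp3",
      "format": "mp3",
      "voiceUsed": "shimmer"
    }
    """
    parsed = parse_result(sample)
    print(parsed["filePath"], parsed["voiceUsed"])
```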

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.


🕒 Changelog

See the CHANGELOG.md file for details on version history.

