📢 Blabber-MCP 🗣️

An MCP server that gives your LLMs a voice using OpenAI's Text-to-Speech API! 🔊

✨ Features

Text-to-Speech: Converts input text into high-quality spoken audio.
Voice Selection: Choose from various OpenAI voices (alloy, echo, fable, onyx, nova, shimmer).
Model Selection: Use standard (tts-1) or high-definition (tts-1-hd) models.
Format Options: Get audio output in mp3, opus, aac, or flac.
File Saving: Saves the generated audio to a local file.
Optional Playback: Automatically play the generated audio using a configurable system command.
Configurable Defaults: Set a default voice via configuration.

🔧 Configuration

To use this server, you need to add its configuration to your MCP client's settings file (e.g., mcp_settings.json).

Get OpenAI API Key: You need an API key from OpenAI.
Add to MCP Settings: Add the following block to the mcpServers object in your settings file, replacing "YOUR_OPENAI_API_KEY" with your actual key.

{
  "mcpServers": {
    "blabber-mcp": {
      "command": "node",
      "args": ["/full/path/to/blabber-mcp/build/index.js"],
      "env": {
        "OPENAI_API_KEY": "YOUR_OPENAI_API_KEY",
        "AUDIO_PLAYER_COMMAND": "xdg-open",
        "DEFAULT_TTS_VOICE": "nova"
      },
      "disabled": false,
      "alwaysAllow": []
    }
  }
}

Notes on the settings above:

args (IMPORTANT): Use the full, absolute path to the built build/index.js file inside your blabber-mcp project directory.
AUDIO_PLAYER_COMMAND (optional): Command used to play audio (e.g., "cvlc", "vlc", "mpv", "ffplay", "afplay", "xdg-open"). Defaults to "cvlc".
DEFAULT_TTS_VOICE (optional): Default voice (alloy, echo, fable, onyx, nova, shimmer). Defaults to nova.
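To make the environment settings concrete, here is a minimal sketch of how a server could read them and apply the documented defaults. The helper name resolveConfig and the ServerConfig shape are hypothetical and not taken from the blabber-mcp source. Falling back to nova for an unrecognized voice is also an assumption.

```typescript
// Hypothetical helper: illustrates the documented defaults, not the server's actual code.
const KNOWN_VOICES = ["alloy", "echo", "fable", "onyx", "nova", "shimmer"];

interface ServerConfig {
  apiKey: string;
  playerCommand: string;
  defaultVoice: string;
}

function resolveConfig(env: Record<string, string | undefined>): ServerConfig {
  const apiKey = env["OPENAI_API_KEY"];
  if (!apiKey) {
    throw new Error("OPENAI_API_KEY is required");
  }
  const requestedVoice = env["DEFAULT_TTS_VOICE"] ?? "nova";
  return {
    apiKey,
    // Documented default player command is "cvlc".
    playerCommand: env["AUDIO_PLAYER_COMMAND"] ?? "cvlc",
    // Assumption: an unrecognized voice name falls back to "nova".
    defaultVoice: KNOWN_VOICES.indexOf(requestedVoice) !== -1 ? requestedVoice : "nova",
  };
}
```

In a Node.js process this would typically be called as resolveConfig(process.env), so the values from the "env" block in the settings file reach the server.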

🚀 Usage

Once configured and running, you can use the text_to_speech tool via your MCP client.

Tool: text_to_speech
Server: blabber-mcp (or the key you used in the config)

Arguments:

input (string, required): The text to synthesize.
voice (string, optional): The voice to use (alloy, echo, fable, onyx, nova, shimmer). Defaults to the DEFAULT_TTS_VOICE set in config, or nova.
model (string, optional): The model (tts-1, tts-1-hd). Defaults to tts-1.
response_format (string, optional): Audio format (mp3, opus, aac, flac). Defaults to mp3.
play (boolean, optional): Set to true to automatically play the audio after saving. Defaults to false.
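The argument rules above can be sketched as a small normalization step. The names normalizeArgs, pick, and TtsArgs are hypothetical. Rejecting an invalid or empty value with an error, rather than silently falling back, is an assumption about behavior the README does not specify.

```typescript
// Hypothetical validation sketch; allowed values and defaults come from the argument list.
const ALLOWED = {
  voice: ["alloy", "echo", "fable", "onyx", "nova", "shimmer"],
  model: ["tts-1", "tts-1-hd"],
  response_format: ["mp3", "opus", "aac", "flac"],
};

interface TtsArgs {
  input: string;
  voice: string;
  model: string;
  response_format: string;
  play: boolean;
}

// Return the fallback when the value is absent, otherwise require an allowed string.
function pick(name: keyof typeof ALLOWED, value: unknown, fallback: string): string {
  if (value === undefined) return fallback;
  if (typeof value !== "string" || ALLOWED[name].indexOf(value) === -1) {
    throw new Error(`${name} must be one of: ${ALLOWED[name].join(", ")}`);
  }
  return value;
}

function normalizeArgs(raw: Record<string, unknown>, defaultVoice: string = "nova"): TtsArgs {
  const input = raw["input"];
  // Assumption: an empty string is treated the same as a missing input.
  if (typeof input !== "string" || input.length === 0) {
    throw new Error("input is required and must be a non-empty string");
  }
  return {
    input,
    voice: pick("voice", raw["voice"], defaultVoice),
    model: pick("model", raw["model"], "tts-1"),
    response_format: pick("response_format", raw["response_format"], "mp3"),
    play: raw["play"] === true,
  };
}
```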

Example Tool Call (with playback):

<use_mcp_tool>
  <server_name>blabber-mcp</server_name>
  <tool_name>text_to_speech</tool_name>
  <arguments>
  {
    "input": "Hello from Blabber MCP!",
    "voice": "shimmer",
    "play": true
  }
  </arguments>
</use_mcp_tool>
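For orientation, a tool call like the one above ultimately has to become a request to OpenAI's speech endpoint (POST https://api.openai.com/v1/audio/speech). The sketch below shows one plausible mapping. The function buildSpeechRequest and the SpeechRequest shape are hypothetical, and the server may well use the official OpenAI SDK rather than raw HTTP.

```typescript
// Hypothetical mapping from tool arguments to an OpenAI speech request.
interface SpeechRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

function buildSpeechRequest(
  apiKey: string,
  args: { input: string; voice: string; model: string; response_format: string }
): SpeechRequest {
  return {
    url: "https://api.openai.com/v1/audio/speech",
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    // "play" is a local concern of the MCP server and is not sent to OpenAI.
    body: JSON.stringify({
      model: args.model,
      input: args.input,
      voice: args.voice,
      response_format: args.response_format,
    }),
  };
}
```

The response body of such a request is the raw audio, which the server would then write to a file and optionally hand to the configured player command.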

Output:

The tool saves the audio file to the output/ directory within the blabber-mcp project folder and returns a JSON response like this:

{
  "message": "Audio saved successfully. Playback initiated using command: cvlc",
  "filePath": "path/to/speech_1743908694848.mp3", 
  "format": "mp3",
  "voiceUsed": "shimmer"
}
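A client that consumes this response may want to check its shape before using filePath. Here is a minimal sketch, assuming the four fields shown in the example are always present as strings. The names TtsResult and parseTtsResult are hypothetical.

```typescript
// Hypothetical client-side check of the tool's JSON response.
interface TtsResult {
  message: string;
  filePath: string;
  format: string;
  voiceUsed: string;
}

function parseTtsResult(json: string): TtsResult {
  const data: unknown = JSON.parse(json);
  if (typeof data !== "object" || data === null) {
    throw new Error("expected a JSON object");
  }
  const record = data as Record<string, unknown>;
  // Assumption: every field in the README's example is required.
  for (const key of ["message", "filePath", "format", "voiceUsed"]) {
    if (typeof record[key] !== "string") {
      throw new Error(`missing or non-string field: ${key}`);
    }
  }
  return record as unknown as TtsResult;
}
```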

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🕒 Changelog

See the CHANGELOG.md file for details on version history.
