Skip to main content
Glama
134,441 tools. Last updated 2026-05-23 17:13

"A platform providing TTS (Text-to-Speech) capabilities" matching MCP tools:

  • Convert text to speech to generate MP3 audio files for accessibility, content creation, or multimedia projects using customizable voice options.
    MIT
  • Convert text to speech into OGG/Opus audio files that play as native Telegram voice notes when attached, using configurable TTS backends including local Kokoro or OpenAI options.
    MIT
  • Converts text to speech and plays it aloud. Uses a neural TTS server if configured, otherwise falls back to the system TTS engine.
    MIT

Matching MCP Servers

Matching MCP Connectors

  • Synchronize mouth movements in videos with audio using text-to-speech or custom audio upload. Works with human faces in real, 3D, or 2D videos to create lip-sync content.
    MIT
  • Convert text to speech using VOICEVOX TTS MCP server. Process text line by line for multi-character conversations with configurable playback controls.
    ISC
  • Convert text to speech using Microsoft Edge TTS with customizable voice, rate, volume, and pitch settings for audio output.
    MIT
  • Convert text to speech using Rime's API for audio output when users request spoken responses or need verbal explanations after completing commands.
    The Unlicense
  • Convert text to speech on macOS using customizable voice options, rate control, and background mode for uninterrupted MCP server interactions.
    MIT
  • Convert text to speech with Google's Gemini TTS. Choose from prebuilt voices and set language for audio generation.
    Apache 2.0
  • Retrieve available text-to-speech voices with IDs, names, languages, and previews for voice selection in speech synthesis.
    MIT
  • Convert text into spoken audio using OpenAI's TTS technology, with options for different voices, models, and audio formats. The generated audio can be saved to a file and optionally played automatically.
    MIT
  • Convert text to speech and play it through system audio using customizable voice options for accessibility, content consumption, or audio output needs.
    MIT