Text-to-Speech
Tools for converting text-to-speech and vice-versa.
MCP ServersBrowse all →
AlicenseAqualityFmaintenanceJavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.Last updated610309125MIT
@vocea.app/mcp-serverofficial
AlicenseAqualityCmaintenanceEnables AI agents to generate speech, transcribe audio, and manage voices via the Vocea API.Last updated6MIT- AlicenseAqualityBmaintenanceAI-powered multi-voice audiobook creation platform. Provides tools for pricing, language support, use cases, onboarding, FAQ, alternatives comparison, and cost estimation. npx echo3s-mcpLast updated126MIT

supertone-mcpofficial
AlicenseAqualityBmaintenanceMCP server for the Supertone TTS API. Generate natural speech, browse and preview the voice catalog, predict synthesis cost, and create cloned voices — directly from Claude Desktop, Cursor, or any MCP-compatible client. Supports Korean, English, Japanese, and 20+ other languages, with speed, pitch, and emotion-style control.Last updated102MIT- AlicenseAqualityCmaintenanceProvides speech recognition (STT) and synthesis (TTS) tools via the Sber SaluteSpeech API, enabling audio transcription and voice generation through natural language.Last updated25271MIT
- AlicenseAqualityCmaintenanceA simple MCP server that can send notifications on mac devices.Last updated152926MIT

MiniMax MCP Serverofficial
AlicenseAqualityDmaintenanceEnables MCP clients like Claude Desktop and Cursor to interact with MiniMax APIs for generating speech, cloning voices, creating videos, and generating images.Last updated61,515MIT- AlicenseAqualityCmaintenanceA Model Context Protocol server for FlowSpeech text-to-speech. It lets MCP-compatible clients generate human-like audio with context-aware emotion control, pause control, multi-speaker dialogue, and 30+ available voices.Last updated3MIT
- AlicenseAquality-maintenanceEnables integration with VOICEVOX text-to-speech services to convert text into audio using a variety of character voices. It provides tools for speech generation, listing available speakers, and monitoring system health.Last updated5
- AlicenseAqualityDmaintenanceA text-to-speech MCP server that enables AI assistants to speak using the VOICEVOX engine with support for multi-character conversations. It features queue management, low-latency streaming via FFplay, and cross-platform playback across Windows, macOS, and Linux.Last updated714915ISC
- AlicenseAqualityCmaintenanceBitcoin-powered AI tools via Lightning Network micropayments (L402). Image generation, text generation, video, music, speech, 3D models, file conversion, and SMS — no signup or API keys required.Last updated4913MIT
- AlicenseBqualityDmaintenanceA Model Context Protocol server that provides text-to-speech functionality for AI agents using Microsoft Edge's text-to-speech technology, supporting multiple voices, languages, and voice customization.Last updated27MIT
- AlicenseAqualityDmaintenanceProvides high-quality text-to-speech synthesis with 10 natural voices, emotion control, and dynamic pacing for professional applications requiring expressive speech output.Last updated52MIT
- AlicenseAqualityFmaintenanceEnables AI assistants to access Venice AI's capabilities including chat with open-source models, image generation, text-to-speech, embeddings, and API key management.Last updated13195MIT
- AlicenseBqualityCmaintenanceA Model Context Protocol server that integrates with AivisSpeech to enable AI assistants to convert text to natural-sounding Japanese speech with customizable voice parameters.Last updated1387Apache 2.0
- AlicenseBqualityFmaintenanceAn enhanced server for ElevenLabs that enables high-quality text-to-speech, voice cloning, and multi-speaker dialogue management. It features advanced conversational tools for transcript retrieval, history tracking, and emotional audio synthesis using the v3 model.Last updated291278MIT
- AlicenseAqualityDmaintenanceEnables Claude Desktop and Claude Code to synthesize and play speech using VOICEVOX text-to-speech engine. Supports multiple voice characters, session-based voice assignment, and queue management for audio playback.Last updated722MIT
- AlicenseBqualityCmaintenanceThis server enables AI models to send SMS messages and initiate Text-to-Speech calls programmatically using ClickSend's API with built-in rate limiting and input validation.Last updated23MIT
- AlicenseBqualityDmaintenanceEnables AI assistants to generate images, text, and audio content through the Pollinations APIs. Provides direct access to multimodal generation capabilities including image creation from text prompts, text-to-speech, and text generation.Last updated12336MIT
- AlicenseBqualityBmaintenanceEnables speech recognition, synthesis, and voice listing via Yandex SpeechKit API through 5 tools.Last updated525MIT
- AlicenseBqualityDmaintenanceA server that generates MP3 audio files from text using Kokoro TTS technology with optional S3 upload capabilities.Last updated178Apache 2.0
- AlicenseBqualityDmaintenanceProvides AI-powered audio generation and processing through the MusicGPT API, enabling music creation, voice conversion, audio manipulation, stem extraction, and audio analysis capabilities.Last updated24201MIT
- AlicenseAquality-maintenanceEnables interaction with MiniMax AI APIs for text-to-speech, voice cloning, video generation, image generation, and music creation through MCP clients like Claude Desktop and Cursor.Last updated9
- AlicenseBqualityFmaintenanceMCP server that exposes OpenClaw Gateway tools to Claude Code and other MCP clients, enabling messaging, session management, scheduling, node control, web search, memory search, and TTS.Last updated212814MIT
- AlicenseBqualityDmaintenanceProvides voice notifications using Grok's text-to-speech API to alert users when Claude Code completes tasks, with support for both local and remote server configurations.Last updated1MIT
- AlicenseAqualityDmaintenanceA cross-platform MCP server that enables Claude to speak using Microsoft Edge TTS with support for over 300 voices across 50+ languages. It requires no API keys and allows for customization of speech rate, volume, and pitch.Last updated31MIT
- AlicenseBqualityDmaintenanceProvides VOICEVOX text-to-speech as an MCP tool. Requires a running VOICEVOX engine on localhost.Last updated111771Apache 2.0
- AlicenseAqualityDmaintenanceAn MCP server that makes AI agents speak a brief summary of every response out loud using TTS.Last updated1GPL 3.0
- AlicenseAquality-maintenanceEnables interaction with ElevenLabs Text-to-Speech and audio processing APIs. Supports speech generation, voice cloning, audio transcription, and sound effect creation through natural language.Last updated24
- AlicenseBqualityDmaintenanceA Node.js server that enables AI assistants to interact with Bouyomi-chan's text-to-speech functionality through Model Context Protocol (MCP), allowing for voice reading of text with adjustable parameters.Last updated12MIT
MCP ConnectorsBrowse all →
Curated audio-news: search articles, narrated audio, vertical briefings, topic subscriptions.
Free receptionist tools: phone scripts, IVR menus (EN+ES), ElevenLabs prompts, missed-call math
Pronunciation scoring, speech-to-text, and text-to-speech for language learning
Manage Speko voice-AI agents, sessions, calls, phone numbers, knowledge bases, evals, and docs.
Send web articles or AI-written text to your personal podcast feed; listen in any podcast app.
The Listenetic MCP server is a remote, cloud-hosted server that enables AI assistants like ChatGPT and Claude to convert articles, documents, websites, and videos into high-quality AI-generated audio. It provides multi-format support for text and binary files, natural-sounding text-to-audio conversion using AI, and specialized processing for SSML, markup, markdown, and various media formats through three core tools: listentic_supported_mimetypes, listentic_add_content_text, and listentic_add_content_binary.