Why this server?
This server enables AI assistants to make real phone calls using OpenAI's Real-Time Voice API, directly handling voice conversations.
Grades: license F, quality -, maintenance D
Enables AI assistants to make real phone calls on your behalf using VoIP, handling conversations automatically through OpenAI's Real-Time Voice API. Simply tell Claude what you want to accomplish and it will call and manage the entire conversation for you.
Last updated 20

Why this server?
This server is explicitly designed for natural voice conversations and provides human-like voice interactions for AI assistants.
Grades: license A, quality -, maintenance B
Natural voice conversations for AI assistants, bringing human-like voice interactions to Claude, ChatGPT, and other LLMs through the Model Context Protocol (MCP).
Last updated 1,188
License: MIT

Why this server?
This server offers integration with ElevenLabs' Text to Speech and audio processing APIs, including speech generation and voice cloning.

ElevenLabs MCP Server (official)
Grades: license A, quality A, maintenance B
An official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.
Last updated 241,361
License: MIT

Why this server?
This server uses OpenAI's models specifically for Text-to-Speech conversion, a core 'voice' functionality.
Grades: license A, quality -, maintenance C
Enables agents to convert text to speech using OpenAI's TTS models with voice selection, delivery instructions, and queue-based audio playback. Supports both blocking and non-blocking modes for flexible audio generation and playback control.
Last updated 3
License: BSD 3-Clause

Why this server?
This server specializes in speech-to-text transcription and voice recognition, converting voice input into text.
Grades: license F, quality -, maintenance C
A powerful speech-to-text MCP server that supports multiple audio formats and recognition engines including remote APIs (Bailian, OpenAI Whisper, iFLYTEK), Google Speech Recognition, and CMU Sphinx.
Last updated 1

Why this server?
This server uses the AivisSpeech Engine to provide high-quality voice synthesis capabilities.
Grades: license F, quality D, maintenance C
A Model Context Protocol server that enables AI assistants to utilize AivisSpeech Engine's high-quality voice synthesis capabilities through a standardized API interface.
Last updated 11

Why this server?
This service supports text-to-speech and voice cloning, both directly related to the 'voice' query, alongside image generation and video generation.

MiniMax MCP JS (official)
Grades: license A, quality A, maintenance D
JavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.
Last updated 10396122
License: MIT

Why this server?
This entry explicitly mentions text-to-speech synthesis capabilities for converting text output into voice.
Grades: license A, quality -, maintenance C
A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
Last updated 161
License: MIT

Why this server?
This server provides Voice Mode functionality specifically for Claude Code, indicating direct support for voice interaction workflows.
Grades: license F, quality B, maintenance D
Voice Mode for Claude Code
Last updated 199114