Why this server?
This server enables AI assistants to make real phone calls using OpenAI's Real-Time Voice API, directly handling voice conversations.
-securityFlicense-qualityEnables AI assistants to make real phone calls on your behalf using VoIP, handling conversations automatically through OpenAI's Real-Time Voice API. Simply tell Claude what you want to accomplish and it will call and manage the entire conversation for you.Last updated 5 months ago18Why this server?
This server is explicitly designed for natural voice conversations and provides human-like voice interactions for AI assistants.
-securityAlicense-qualityNatural voice conversations for AI assistants that brings human-like voice interactions to Claude, ChatGPT, and other LLMs through the Model Context Protocol (MCP).Last updated 16 hours ago974MITWhy this server?
This server offers integration with ElevenLabs' Text to Speech and audio processing APIs, including speech generation and voice cloning.

ElevenLabs MCP Serverofficial
AsecurityAlicense-qualityAn official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.Last updated 20 days ago241,289MITWhy this server?
This server uses OpenAI's models specifically for Text-to-Speech conversion, a core 'voice' functionality.
-securityAlicense-qualityEnables agents to convert text to speech using OpenAI's TTS models with voice selection, delivery instructions, and queue-based audio playback. Supports both blocking and non-blocking modes for flexible audio generation and playback control.Last updated 7 months ago3BSD 3-ClauseWhy this server?
This server specializes in speech-to-text transcription and voice recognition, converting voice input into text.
-securityFlicense-qualityA powerful speech-to-text MCP server that supports multiple audio formats and recognition engines including remote APIs (Bailian, OpenAI Whisper, iFLYTEK), Google Speech Recognition, and CMU Sphinx.Last updated 9 months ago1Why this server?
This server uses the AivisSpeech Engine to provide high-quality voice synthesis capabilities.
AsecurityFlicense-qualityA Model Context Protocol server that enables AI assistants to utilize AivisSpeech Engine's high-quality voice synthesis capabilities through a standardized API interface.Last updated a year ago11Why this server?
This service supports image generation, video generation, text-to-speech, and voice cloning, all highly related to the 'voice' query.

MiniMax MCP JSofficial
AsecurityAlicense-qualityJavaScript implementation of MiniMax MCP that enables interaction with MiniMax AI services for image generation, video generation, text-to-speech, and voice cloning through MCP-compatible clients.Last updated 9 months ago10657114MITWhy this server?
This entry explicitly mentions text-to-speech synthesis capabilities for converting text output into voice.
-securityAlicense-qualityA Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.Last updated a year ago171MITWhy this server?
This server provides Voice Mode functionality specifically for Claude Code, indicating direct support for voice interaction workflows.
AsecurityFlicense-qualityVoice Mode for Claude CodeLast updated 15 days ago174104