Popular MCP Servers

Edge TTS MCP
Text-to-Speech Speech Processing Multimedia Processing
s-n-n
A
license
A
quality
D
maintenance
A cross-platform MCP server that enables Claude to speak using Microsoft Edge TTS with support for over 300 voices across 50+ languages. It requires no API keys and allows for customization of speech rate, volume, and pitch.
Last updated 2026-01-18
2
3
1
MIT
@vocea.app/mcp-serverofficial
Speech Processing Text-to-Speech
vocea-admin
A
license
A
quality
C
maintenance
Enables AI agents to generate speech, transcribe audio, and manage voices via the Vocea API.
Last updated 2026-05-14
6
MIT
mocoVoice MCP Serverofficial
Speech Processing Audio Processing AI & Machine Learning
mocomoco-inc
A
license
A
quality
B
maintenance
Enables transcription of audio and video files using mocoVoice API, allowing users to start transcription jobs and retrieve results directly from Claude Desktop.
Last updated 2026-05-31
6
3
MIT
ElevenLabs MCP Serverofficial
Speech Processing Audio Processing
elevenlabs
A
license
A
quality
B
maintenance
An official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.
Last updated 2026-06-25
26
1,432
MIT
Anam MCP Serverofficial
AI & Machine Learning Speech Processing Communication
anam-org
A
license
B
quality
F
maintenance
Enables managing AI personas, avatars, voices, and sessions from any MCP client, for integration with Anam AI.
Last updated 2026-04-21
54
27
MIT
salutespeech-mcp
Speech Processing Text-to-Speech
theYahia
A
license
A
quality
D
maintenance
Provides speech recognition (STT) and synthesis (TTS) tools via the Sber SaluteSpeech API, enabling audio transcription and voice generation through natural language.
Last updated 2026-05-19
2
5
23
1
MIT
Video Transcriber MCP Server
Multimedia Processing Audio Processing Speech Processing
nhatvu148
A
license
A
quality
D
maintenance
Transcribes videos from 1000+ platforms (YouTube, TikTok, Vimeo, etc.) and local video files using OpenAI's Whisper model, with support for 90+ languages and multiple output formats.
Last updated 2025-11-26
2
4
17
1
MIT
ContextPulseofficial
Knowledge & Memory OS Automation Speech Processing
ContextPulse
A
license
A
quality
D
maintenance
Lets AI assistants understand what you're working on — current screen content, recent dictation, clipboard, and saved notes — running entirely on your own machine with nothing sent to the cloud.
Last updated 2026-06-20
36
AGPL 3.0
supertone-mcpofficial
Text-to-Speech Speech Processing AI & Machine Learning
supertone-inc
A
license
A
quality
B
maintenance
MCP server for the Supertone TTS API. Generate natural speech, browse and preview the voice catalog, predict synthesis cost, and create cloned voices — directly from Claude Desktop, Cursor, or any MCP-compatible client. Supports Korean, English, Japanese, and 20+ other languages, with speed, pitch, and emotion-style control.
Last updated 2026-06-17
10
2
MIT
Ringback
Speech Processing Communication
mohitbadwal
A
license
A
quality
A
maintenance
Let your AI agent call your phone and talk to you — MCP servers for live, interruptible voice calls + tiered alerts, using free self-hosted pieces (pjsua2 + whisper.cpp + Linphone). No paid telephony, no extra API key.
Last updated 2026-06-23
4
3
15
Apache 2.0
MCP Server Whisper
Audio Processing Multimedia Processing Speech Processing
arcaputo3
A
license
A
quality
D
maintenance
Enables advanced audio transcription, text-to-speech generation, and audio processing using OpenAI's Whisper and GPT-4o models with support for multiple audio formats, file management, and parallel processing.
Last updated 2026-05-02
8
55
MIT
Apple Voice Memo MCP Server
Audio Processing Speech Processing
jwulff
A
license
A
quality
D
maintenance
Provides programmatic access to Apple Voice Memos on macOS, enabling AI assistants to list, retrieve details, get audio, and transcribe recordings.
Last updated 2026-01-12
5
16
5
MIT
biliscribe
Speech Processing Audio Processing Multimedia Processing
mcp-server-summary
A
license
A
quality
D
maintenance
Extracts and formats Bilibili video content into structured text for LLM processing and analysis.
Last updated 2025-05-20
1
4
MIT
live-audio-intelligence-mcp
Speech Processing Audio Processing
ykshah1309
A
license
A
quality
B
maintenance
Enables real-time transcription and heuristic vocal stress analysis of live financial webcasts, providing an LLM with rolling transcripts and a stress score.
Last updated 2026-04-16
4
1
MIT
whisper-telegram-mcp
Audio Processing Speech Processing AI & Machine Learning
abid-mahdi
A
license
A
quality
D
maintenance
An MCP server that enables transcribing local audio files and Telegram voice messages using OpenAI's Whisper via local inference or cloud API. It supports multiple audio formats, automatic language detection, and optional word-level timestamps for AI-powered audio analysis.
Last updated 2026-03-30
5
1
MIT
AssemblyAI MCP Server
Audio Processing Speech Processing Multimedia Processing
cogell
A
license
A
quality
F
maintenance
Enables AI assistants to transcribe audio files from URLs or local paths using AssemblyAI's services, with support for speaker diarization, language detection, and asynchronous job management through a standardized MCP interface.
Last updated 2025-08-05
4
8
1
MIT
Tavus MCP Server
Multimedia Processing Audio Processing Speech Processing
rakeshdavid
A
license
B
quality
D
maintenance
Enables AI video generation, replica management, conversational AI, lipsync, and speech synthesis through the Tavus API. Provides 29 tools across Phoenix replicas, video generation, personas, lipsync, and text-to-speech capabilities.
Last updated 2025-07-09
29
2
MIT
MCP FishAudio Server
Speech Processing Audio Processing
da-okazaki
A
license
B
quality
F
maintenance
An MCP (Model Context Protocol) server that provides seamless integration between Fish Audio's Text-to-Speech API and LLMs like Claude, enabling natural language-driven speech synthesis.
Last updated 2026-02-11
2
23
11
MIT
media-context-mcp
Image & Video Processing Speech Processing Multimedia Processing
vishalguptax
A
license
A
quality
A
maintenance
Give your AI assistant eyes and ears — analyze any video, audio, or image, entirely on your machine.
Last updated 2026-06-24
2
Apache 2.0
Vora
Speech Processing Communication Autonomous Agents
stefanstojanovicstefa-creator
A
license
A
quality
C
maintenance
First Voice AI MCP for AI Agents
Last updated 2026-04-25
5
MIT

Popular MCP Servers

@vocea.app/mcp-serverofficial

mocoVoice MCP Serverofficial

ElevenLabs MCP Serverofficial

Anam MCP Serverofficial

ContextPulseofficial

supertone-mcpofficial