Speech Processing

Voice interaction and speech processing capabilities. Enables converting speech to text, audio commands, and voice generation.

MCP ServersBrowse all →

ContextPulseofficial
Knowledge & Memory OS Automation Speech Processing
ContextPulse
A
license
A
quality
D
maintenance
Lets AI assistants understand what you're working on — current screen content, recent dictation, clipboard, and saved notes — running entirely on your own machine with nothing sent to the cloud.
Last updated 2026-07-21
36
1
AGPL 3.0
Anam MCP Serverofficial
AI & Machine Learning Speech Processing Communication
anam-org
A
license
B
quality
F
maintenance
Enables managing AI personas, avatars, voices, and sessions from any MCP client, for integration with Anam AI.
Last updated 2026-07-23
54
29
MIT
@vocea.app/mcp-serverofficial
Speech Processing Text-to-Speech
vocea-admin
A
license
A
quality
C
maintenance
Enables AI agents to generate speech, transcribe audio, and manage voices via the Vocea API.
Last updated 2026-05-14
6
MIT
Speak AI MCP Serverofficial
AI & Machine Learning Speech Processing Search
speakai
A
license
A
quality
B
maintenance
Connects Speak AI transcription and insight data to Claude and ChatGPT, enabling natural language queries for summaries, action items, and quotes from recordings.
Last updated 2026-07-27
100
348
MIT
SeaMeet MCPofficial
Note Taking Speech Processing Knowledge & Memory
seameet-ai
A
license
A
quality
C
maintenance
SeaMeet MCP connects Claude, Cursor, Codex, and other AI agents to SeaMeet meeting recordings, transcripts, AI summaries, screenshots, action items, webhooks, and desktop recording controls. Use it to search meeting memory, read synced cloud recordings, and automate meeting notes through the Model Context Protocol.
Last updated 2026-07-14
10
249
MIT
douyin-mcp-server
Web Scraping Speech Processing Multimedia Processing
yzfly
A
license
A
quality
F
maintenance
douyin-mcp-server
Last updated 2026-07-02
4
3
1,200
Apache 2.0
mocoVoice MCP Serverofficial
Speech Processing Audio Processing AI & Machine Learning
mocomoco-inc
A
license
A
quality
B
maintenance
Enables transcription of audio and video files using mocoVoice API, allowing users to start transcription jobs and retrieve results directly from Claude Desktop.
Last updated 2026-05-31
6
3
MIT
TypeWhisper MCPofficial
Speech Processing Developer Tools
TypeWhisper
A
license
A
quality
C
maintenance
Connects to the TypeWhisper macOS app to let coding agents transcribe local files, inspect model status, search history, and manage dictionary terms and corrections.
Last updated 2026-07-13
10
78
GPL 3.0
ElevenLabs MCP Serverofficial
Speech Processing Audio Processing
elevenlabs
A
license
A
quality
B
maintenance
An official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.
Last updated 2026-07-23
26
1,487
MIT
Voice MCP
Text-to-Speech Speech Processing Audio Processing
forgemeshlabs
A
license
A
quality
B
maintenance
Give your AI agent a voice with x402 pay-per-call speech synthesis, offering 20 voices, 10 personas, 31 languages, and granular controls.
Last updated 2026-07-01
4
6
22
MIT
supertone-mcpofficial
Text-to-Speech Speech Processing AI & Machine Learning
supertone-inc
A
license
A
quality
B
maintenance
MCP server for the Supertone TTS API. Generate natural speech, browse and preview the voice catalog, predict synthesis cost, and create cloned voices — directly from Claude Desktop, Cursor, or any MCP-compatible client. Supports Korean, English, Japanese, and 20+ other languages, with speed, pitch, and emotion-style control.
Last updated 2026-06-17
14
4
MIT
ElevenLabs
Speech Processing Audio Processing Text-to-Speech
wynandw87
A
license
A
quality
C
maintenance
MCP server that brings ElevenLabs to Claude Code — text-to-speech, sound effects, music generation, voice cloning, speech-to-speech, transcription, and voice isolation. 8 tools for industry-leading AI audio.
Last updated 2026-02-12
8
MIT
Ringback
Speech Processing Communication
mohitbadwal
A
license
A
quality
A
maintenance
Let your AI agent call your phone and talk to you — MCP servers for live, interruptible voice calls + tiered alerts, using free self-hosted pieces (pjsua2 + whisper.cpp + Linphone). No paid telephony, no extra API key.
Last updated 2026-06-23
3
16
Apache 2.0
Edge TTS MCP
Text-to-Speech Speech Processing Multimedia Processing
s-n-n
A
license
A
quality
D
maintenance
A cross-platform MCP server that enables Claude to speak using Microsoft Edge TTS with support for over 300 voices across 50+ languages. It requires no API keys and allows for customization of speech rate, volume, and pitch.
Last updated 2026-01-18
3
2
MIT
yuppie-mcp-tts
Text-to-Speech Speech Processing
yuppie1949
A
license
A
quality
B
maintenance
Provides text-to-speech synthesis using Microsoft Edge's free TTS engine, supporting multiple voices, languages, and audio output options (base64 or file).
Last updated 2026-07-10
3
MIT
Sats4AI
Speech Processing Text-to-Speech
cnghockey
A
license
A
quality
C
maintenance
Bitcoin-powered AI tools via Lightning Network micropayments (L402). Image generation, text generation, video, music, speech, 3D models, file conversion, and SMS — no signup or API keys required.
Last updated 2026-07-10
49
197
1
MIT
Video to Text MCP Server
Multimedia Processing Audio Processing Speech Processing
strzhao
A
license
B
quality
D
maintenance
Enables downloading videos from platforms like YouTube and converting them to text using OpenAI Whisper and ffmpeg. It supports multiple output formats including TXT, JSON, SRT, and VTT for transcriptions.
Last updated 2026-01-13
2
6
ISC
TranscriptionTools MCP Server
Text Summarization Speech Processing
MushroomFleet
A
license
B
quality
D
maintenance
Provides intelligent transcript processing capabilities for Claude, featuring natural formatting, contextual repair, and smart summarization powered by Deep Thinking LLMs.
Last updated 2026-04-07
4
19
MIT
voice-analysis-mcp
Audio Processing Speech Processing
Patience-dot-devl
A
license
A
quality
C
maintenance
Provides local audio analysis tools for LLMs, enabling transcription, conversation dynamics, prosody analysis, and visual inspection without API keys.
Last updated 2026-07-22
8
MIT
VOICEVOX TTS MCP
Text-to-Speech Speech Processing Multimedia Processing
kajidog
A
license
A
quality
D
maintenance
A text-to-speech MCP server that enables AI assistants to speak using the VOICEVOX engine with support for multi-character conversations. It features queue management, low-latency streaming via FFplay, and cross-platform playback across Windows, macOS, and Linux.
Last updated 2026-07-04
7
149
16
ISC
live-translate-mcp
Speech Processing Language Translation
waxberry-dev
A
license
A
quality
B
maintenance
Real-time English ↔ Mandarin Chinese speech translation for Claude. Transcribes audio locally with Whisper, translates via Claude API, and synthesises speech locally with Piper TTS. Pass a WAV file path. Claude handles the rest.
Last updated 2026-06-17
3
167
3
MIT
WhatsApp Web MCP
Browser Automation Communication Speech Processing
KingDonRush
A
license
B
quality
C
maintenance
Local MCP server for agents to search, structure, and export authorized WhatsApp Web conversations. Uses Playwright for DOM interaction, supports message search, export, media transcription, and controlled message sending.
Last updated 2026-07-25
18
MIT
salutespeech-mcp
Speech Processing Text-to-Speech
theYahia
A
license
A
quality
C
maintenance
Provides speech recognition (STT) and synthesis (TTS) tools via the Sber SaluteSpeech API, enabling audio transcription and voice generation through natural language.
Last updated 2026-06-23
5
86
1
MIT
whisper-windows-mcp
Audio Processing Speech Processing
eviscerations
A
license
A
quality
A
maintenance
A Windows-native MCP server that lets Claude Desktop transcribe audio files locally using whisper.cpp, with no internet connection required.
Last updated 2026-07-22
12
87
1
Sleepycat
Elba MCP Server
AI & Machine Learning Speech Processing Agent Orchestration
Kolsetu-Opensource
A
license
A
quality
C
maintenance
Manage voice AI agents from Claude Code, Cursor, VS Code, or any MCP-compatible assistant.
Last updated 2026-07-25
3
5
3
MIT
media-context-mcp
Image & Video Processing Speech Processing Multimedia Processing
vishalguptax
A
license
A
quality
A
maintenance
Give your AI assistant eyes and ears — analyze any video, audio, or image, entirely on your machine.
Last updated 2026-06-26
2
124
1
Apache 2.0
Whisper MCP Server
Audio Processing Speech Processing
jwulff
A
license
A
quality
F
maintenance
Provides local audio transcription using whisper.cpp, supporting multiple models and audio formats. Enables transcription of audio files via MCP tools with optional timestamps.
Last updated 2026-01-12
3
103
2
MIT
brainiall-mcp-server
Speech Processing Audio Processing Language Translation
fasuizu-br
A
license
A
quality
C
maintenance
AI-powered speech tools by Brainiall: pronunciation assessment with phoneme-level feedback, speech-to-text with language detection, and text-to-speech with multiple voices.
Last updated 2026-03-11
4
1
MIT
biliscribe
Speech Processing Audio Processing Multimedia Processing
mcp-server-summary
A
license
A
quality
D
maintenance
Extracts and formats Bilibili video content into structured text for LLM processing and analysis.
Last updated 2025-05-20
1
4
MIT
Voicevox MCP Server
Speech Processing Autonomous Agents
Dosugamea
A
license
B
quality
F
maintenance
A server that enables Claude 3.7 and other AI agents to access VOICEVOX-compatible speech synthesis engines (AivisSpeech, VOICEVOX, COEIROINK) through the Model Context Protocol.
Last updated 2025-03-23
1
11
MIT