Apache 2.0
21,839 servers. Last updated
Search Relevance ↓
Recent Usage ↓
Date Added ↓
Date Updated ↓
Weekly Downloads ↓
GitHub Stars ↓
Recent GitHub Stars ↓
app.linear
com.stayker
io.github.0nork
io.github.Servicedsi
Popular MCP Servers

ElevenLabs MCP Serverofficial
AsecurityAlicenseBqualityAn official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.Last updated241,310MIT- AsecurityAlicenseAqualityTranscribes videos from 1000+ platforms (YouTube, TikTok, Vimeo, etc.) and local video files using OpenAI's Whisper model, with support for 90+ languages and multiple output formats.Last updated14121MIT
- AsecurityAlicenseAqualityEnables interaction with ElevenLabs Text-to-Speech and audio processing APIs. Supports speech generation, voice cloning, audio transcription, and sound effect creation through natural language.Last updated24MIT
- AsecurityAlicenseAqualityEnables advanced audio transcription, text-to-speech generation, and audio processing using OpenAI's Whisper and GPT-4o models with support for multiple audio formats, file management, and parallel processing.Last updated852MIT
- AsecurityAlicenseBqualityA Node.js server that enables AI assistants to interact with Bouyomi-chan's text-to-speech functionality through Model Context Protocol (MCP), allowing for voice reading of text with adjustable parameters.Last updated12MIT
- AsecurityAlicenseBqualityA Model Context Protocol server that integrates with AivisSpeech to enable AI assistants to convert text to natural-sounding Japanese speech with customizable voice parameters.Last updated1278Apache 2.0
- AsecurityAlicenseAqualityA text-to-speech MCP server that enables AI assistants to speak using the VOICEVOX engine with support for multi-character conversations. It features queue management, low-latency streaming via FFplay, and cross-platform playback across Windows, macOS, and Linux.Last updated714914ISC
- AsecurityAlicenseAqualityMCP Server for automated conversational phone calls using Asterisk with Speech-to-Speech capabilities, allowing users to make phone conversations as easily as writing a prompt.Last updated9245MIT
- AsecurityAlicenseBqualityAn MCP server that enables LLMs to access the NijiVoice API for text-to-speech generation, supporting features like fetching available voice actors and checking credit balance.Last updated32MIT
- AsecurityAlicenseAqualityAI-powered speech tools by Brainiall: pronunciation assessment with phoneme-level feedback, speech-to-text with language detection, and text-to-speech with multiple voices.Last updated4MIT
- AsecurityAlicenseBqualityProvides intelligent transcript processing capabilities for Claude, featuring natural formatting, contextual repair, and smart summarization powered by Deep Thinking LLMs.Last updated419MIT
- AsecurityAlicenseAqualityA cross-platform MCP server that enables Claude to speak using Microsoft Edge TTS with support for over 300 voices across 50+ languages. It requires no API keys and allows for customization of speech rate, volume, and pitch.Last updated31MIT
- AsecurityAlicenseBqualityEnables AI video generation, replica management, conversational AI, lipsync, and speech synthesis through the Tavus API. Provides 29 tools across Phoenix replicas, video generation, personas, lipsync, and text-to-speech capabilities.Last updated292MIT
- AsecurityAlicenseAqualityBitcoin-powered AI tools via Lightning Network micropayments (L402). Image generation, text generation, video, music, speech, 3D models, file conversion, and SMS — no signup or API keys required.Last updated18161MIT
- AsecurityAlicenseAqualityAn MCP server that enables transcribing local audio files and Telegram voice messages using OpenAI's Whisper via local inference or cloud API. It supports multiple audio formats, automatic language detection, and optional word-level timestamps for AI-powered audio analysis.Last updated5MIT
- AsecurityAlicenseBqualityAn MCP (Model Context Protocol) server that provides seamless integration between Fish Audio's Text-to-Speech API and LLMs like Claude, enabling natural language-driven speech synthesis.Last updated23110MIT
- AsecurityAlicenseBqualityProvides voice notifications using Grok's text-to-speech API to alert users when Claude Code completes tasks, with support for both local and remote server configurations.Last updated1MIT
- AsecurityAlicenseBqualityA server that enables Claude 3.7 and other AI agents to access VOICEVOX-compatible speech synthesis engines (AivisSpeech, VOICEVOX, COEIROINK) through the Model Context Protocol.Last updated111MIT
- AsecurityAlicenseAqualityA Model Context Protocol server that enables AI models to generate and play high-quality text-to-speech audio through your device's native audio system using Rime's voice synthesis API.Last updated1428The Unlicense
- AsecurityAlicenseAqualityAn MCP server that enables LLMs to generate spoken audio from text using OpenAI's Text-to-Speech API, supporting various voices, models, and audio formats.Last updated121MIT
- AsecurityAlicenseBqualityA MCP server that enables transcription of audio files using OpenAI's Speech-to-Text API, with support for multiple languages and file saving options.Last updated129MIT
- AsecurityFlicenseAqualityEnables high-quality transcription and subtitle generation from local media files or URLs using Faster Whisper on local hardware. It supports automatic language detection and integration with MCP clients for seamless speech-to-text workflows.Last updated3
- AsecurityFlicenseBqualityA PowerShell-based MCP server that enables Claude Desktop to convert text to speech using Windows' built-in Speech API, offering features like playback control, speed and volume adjustment.Last updated10
- AsecurityFlicenseAqualityProvides various AI capabilities through DeepInfra's OpenAI-compatible API including image generation, text processing, embeddings, speech recognition, object detection, and classification tasks. Enables users to access multiple AI models for diverse tasks like generating images from prompts, transcribing audio, analyzing text sentiment, and performing computer vision operations.Last updated101

Deepgram MCPofficial
-securityAlicense-qualityProvides AI editors with access to Deepgram's speech-to-text transcription and audio intelligence tools. Dynamically fetches available capabilities from Deepgram's API at runtime, ensuring access to new features without package upgrades.Last updatedMIT
VoiceAI-MCP-VAVickyofficial
AsecurityFlicenseCqualityBest VoiceAI White Label Platform for Agencies with Full MCP Connectivity to Any Platform for Custom VoiceAi & AI Chat Integrations and You Keep 100% of the ProfitsLast updated32- AsecurityAlicenseCqualityConnects Claude Desktop to Hugging Face Spaces with minimal setup, enabling capabilities like image generation, vision tasks, text-to-speech, and chat with AI models.Last updated3131MIT
- AsecurityFlicenseAqualityA Python-based server that provides access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.Last updated52

Flyworks MCPofficial
-securityAlicense-qualityA Model Context Protocol server that enables fast and free lipsync video creation for a wide range of digital avatars, supporting both audio and text inputs to generate synchronized lip movements.Last updated94MIT- AsecurityAlicenseCqualityA Model Context Protocol server for macOS that enables AI assistants to play system sounds for audio feedback, offering informational, warning, and error sound options.Last updated41MIT
- AsecurityFlicenseBqualityA Model Context Protocol server that provides text-to-speech functionality for AI agents using Microsoft Edge's text-to-speech technology, supporting multiple voices, languages, and voice customization.Last updated27
- AsecurityFlicenseAqualityEnables AI assistants to transcribe audio files from URLs or local paths using AssemblyAI's services, with support for speaker diarization, language detection, and asynchronous job management through a standardized MCP interface.Last updated471
- AsecurityFlicenseAqualityGives Claude Desktop a maid personality with Japanese-accented text-to-speech, an interactive visual avatar with 16+ poses and animations, and speech recognition for voice input. Designed for fun rather than productivity.Last updated111
- AsecurityFlicenseBqualityEnables interaction with Retell AI's voice and chat agent platform. Build, deploy, and manage AI phone agents, configure conversation flows, handle calls/chats, and manage phone numbers through natural language.Last updated52
- AsecurityFlicenseBqualityVoice Mode for Claude CodeLast updated134109
- AsecurityAlicense-qualityEnables users to convert text into high-quality audio by accessing the OpenAI Text-to-Speech API. It supports customizable model selection and voice options for synthesized speech generation via the MCP protocol.Last updated1MIT
- AsecurityFlicenseBqualityEnables downloading videos from platforms like YouTube and converting them to text using OpenAI Whisper and ffmpeg. It supports multiple output formats including TXT, JSON, SRT, and VTT for transcriptions.Last updated22

spekoai-mcpofficial
-securityAlicense-qualityA Model Context Protocol server that provides voice-AI session management and usage tracking tools for SpekoAI. It enables creating, retrieving, and ending voice sessions while monitoring usage through MCP clients like Claude Desktop.Last updatedMIT- AsecurityFlicenseAqualityProvides access to Whissle AI services for speech-to-text, speaker diarization, translation, and text summarization. It enables users to process various audio formats and manage text content through natural language tools.Last updated5
- -securityAlicense-qualityEnables speech-to-text transcription, text-to-speech synthesis, and audio analysis using Deepgram's AI models. Supports features like speaker diarization, sentiment analysis, language detection, and various audio processing capabilities.Last updated2MIT
- -securityAlicense-qualityEnables AI assistants to convert text to speech using Microsoft Edge's Text-to-Speech service with customizable voice options, speech rate, volume, and pitch parameters.Last updatedMIT
- AsecurityFlicenseDqualityFacilitates direct speech generation using Claude for multiple languages and emotions, integrating with a Zonos TTS setup via the Model Context Protocol.Last updated1413
- -securityAlicense-qualityEnables recording audio from a microphone and transcribing it using OpenAI's Whisper model. Works as both a standalone MCP server and a Goose AI agent extension.Last updated6MIT
- -securityAlicense-qualityEnables LLM applications to make voice calls and send SMS messages through the Vonage API, allowing AI assistants to perform real-world telephony operations with support for speech recognition and customizable voice parameters.Last updated11MIT
- -securityAlicense-qualityThis service provides fast and reliable transcriptions for audio/video files and voice memos. It allows LLMs to interact with the text content of audio/video file.Last updated6MIT
- -securityAlicense-qualityA Model Context Protocol server that enables AI assistants like Claude to initiate and manage real-time voice calls using Twilio and OpenAI's voice models.Last updated59MIT
- -securityAlicense-qualityA multi-provider voice notification MCP server that enables Claude Code to provide auditory alerts for task completions, confirmation requests, and errors. It supports various TTS engines including VOICEVOX, Google Cloud TTS, and OpenAI for customizable voice feedback.Last updated7MIT
- -securityAlicense-qualityEnables control of Pollen Robotics Reachy Mini robot through high-level emotional expressions and low-level motor commands, with support for vision, audio, and text-to-speech capabilities.Last updated25MIT
- -securityAlicense-qualityNatural voice conversations for AI assistants that brings human-like voice interactions to Claude, ChatGPT, and other LLMs through the Model Context Protocol (MCP).Last updated974MIT