Best Deepgram MCP Servers
Deepgram is an AI speech platform that provides fast, accurate, and scalable speech recognition technology. Their platform offers transcription services, voice analytics, and AI models to help developers easily build innovative voice applications.
Why this server?
Utilizes Deepgram Nova-3 for real-time speech recognition during AI-powered phone calls facilitated by the server.
AsecurityAlicense-qualityGive AI agents a real phone number and voice. Make calls, send email, manage contacts — 26 tools, 12 resources.Last updated 18 days ago143MITWhy this server?
Provides tools for speech-to-text transcription, speech recognition, and audio intelligence operations through the Deepgram API.

Deepgram MCPofficial
-securityAlicense-qualityProvides AI editors with access to Deepgram's speech-to-text transcription and audio intelligence tools. Dynamically fetches available capabilities from Deepgram's API at runtime, ensuring access to new features without package upgrades.Last updated a day agoMITWhy this server?
Identifies when Deepgram models are the optimal choice for voice and speech processing workflows by scoring them against other providers.
AsecurityAlicenseDqualityDescribe your AI use case in plain English, get ranked model recommendations with cost estimates and tradeoff reasoning. Covers 62 models across 29 providers. Available as a web app (BYOK + guest tier) and as an MCP server for Claude Desktop and Cursor — same recommendation engine, two interfaces.Last updated 13 days ago11MITWhy this server?
Provides access to Deepgram's speech recognition and text-to-speech capabilities, including audio transcription, natural speech generation, audio analysis with sentiment detection, speaker diarization, and language detection across multiple specialized models.
-securityAlicense-qualityEnables speech-to-text transcription, text-to-speech synthesis, and audio analysis using Deepgram's AI models. Supports features like speaker diarization, sentiment analysis, language detection, and various audio processing capabilities.Last updated 7 months ago2MITWhy this server?
Provides text-to-speech capabilities through Deepgram's API, enabling the Reachy Mini robot to speak using the `speak` tool.
-securityAlicense-qualityEnables control of Pollen Robotics Reachy Mini robot through high-level emotional expressions and low-level motor commands, with support for vision, audio, and text-to-speech capabilities.Last updated 3 months ago25MITWhy this server?
Uses Deepgram's speech-to-text API to transcribe audio extracted from videos, with particular effectiveness for Chinese content.
-securityAlicense-qualityA service that extracts and transcribes audio content from videos across 1000+ streaming websites including YouTube, Bilibili, TikTok, and Twitter, supporting multiple transcription providers like Deepgram, Gladia, Speechmatics, and AssemblyAI.Last updated a year ago26MITWhy this server?
Provides real-time speech-to-text and transcription capabilities across multiple languages.
Why this server?
Uses Deepgram's Nova-2 model to provide high-speed transcription of podcast audio files into text.
AsecurityFlicense-qualityAn MCP server that scrapes and transcribes podcast episodes from YouTube or RSS feeds using Deepgram's Nova-2 model. It allows users to track podcasts for new episodes, manage transcripts, and generate personalized summaries through Claude.Last updated 4 months ago101Why this server?
Provides AI-powered transcription of Twitter Spaces audio using Deepgram's speech recognition API, with support for multiple transcript formats including speaker diarization, time-coded transcripts, and summaries.
-securityFlicense-qualityEnables downloading and AI-powered transcription of Twitter Spaces audio recordings with multiple output formats including speaker diarization, time-coded text, and summaries.Last updated 9 months ago