Best Deepgram MCP Servers
Deepgram is an AI speech platform that provides fast, accurate, and scalable speech recognition technology. Their platform offers transcription services, voice analytics, and AI models to help developers easily build innovative voice applications.
Why this server?
Utilizes Deepgram Nova-3 for real-time speech recognition during AI-powered phone calls facilitated by the server.
AlicenseBqualityCmaintenanceGive AI agents a real phone number and voice. Make calls, send email, manage contacts — 26 tools, 12 resources.Last updated43MITWhy this server?
Provides tools for speech-to-text transcription, speech recognition, and audio intelligence operations through the Deepgram API.

Deepgram MCPofficial
Alicense-qualityCmaintenanceProvides AI editors with access to Deepgram's speech-to-text transcription and audio intelligence tools. Dynamically fetches available capabilities from Deepgram's API at runtime, ensuring access to new features without package upgrades.Last updated1MITWhy this server?
Uses Deepgram's Nova-2 model to provide high-speed transcription of podcast audio files into text.
FlicenseAquality-maintenanceAn MCP server that scrapes and transcribes podcast episodes from YouTube or RSS feeds using Deepgram's Nova-2 model. It allows users to track podcasts for new episodes, manage transcripts, and generate personalized summaries through Claude.Last updated101Why this server?
Provides real-time speech-to-text and transcription capabilities across multiple languages.
Why this server?
Identifies when Deepgram models are the optimal choice for voice and speech processing workflows by scoring them against other providers.
AlicenseDqualityCmaintenanceDescribe your AI use case in plain English, get ranked model recommendations with cost estimates and tradeoff reasoning. Covers 62 models across 29 providers. Available as a web app (BYOK + guest tier) and as an MCP server for Claude Desktop and Cursor — same recommendation engine, two interfaces.Last updated11MITWhy this server?
Uses Deepgram's speech-to-text API to transcribe audio extracted from videos, with particular effectiveness for Chinese content.
Alicense-qualityDmaintenanceA service that extracts and transcribes audio content from videos across 1000+ streaming websites including YouTube, Bilibili, TikTok, and Twitter, supporting multiple transcription providers like Deepgram, Gladia, Speechmatics, and AssemblyAI.Last updated26MITWhy this server?
Provides access to Deepgram's speech recognition and text-to-speech capabilities, including audio transcription, natural speech generation, audio analysis with sentiment detection, speaker diarization, and language detection across multiple specialized models.
Alicense-qualityCmaintenanceEnables speech-to-text transcription, text-to-speech synthesis, and audio analysis using Deepgram's AI models. Supports features like speaker diarization, sentiment analysis, language detection, and various audio processing capabilities.Last updated2MITWhy this server?
Provides text-to-speech capabilities through Deepgram's API, enabling the Reachy Mini robot to speak using the `speak` tool.
Alicense-qualityDmaintenanceEnables control of Pollen Robotics Reachy Mini robot through high-level emotional expressions and low-level motor commands, with support for vision, audio, and text-to-speech capabilities.Last updated26MITWhy this server?
Provides AI-powered transcription of Twitter Spaces audio using Deepgram's speech recognition API, with support for multiple transcript formats including speaker diarization, time-coded transcripts, and summaries.
Flicense-qualityCmaintenanceEnables downloading and AI-powered transcription of Twitter Spaces audio recordings with multiple output formats including speaker diarization, time-coded text, and summaries.Last updated