Tools and Services for Converting Speech to Text

Search for:

Tools and Services for Converting Speech to Text

View all MCP Servers

Why this server?
This server extracts and transcribes audio content from videos, which directly addresses the '语音转文本' (speech to text) requirement.
MCP Video Digest
Multimedia Processing Audio Processing Web Scraping
R-lz
A
license
-
quality
D
maintenance
A service that extracts and transcribes audio content from videos across 1000+ streaming websites including YouTube, Bilibili, TikTok, and Twitter, supporting multiple transcription providers like Deepgram, Gladia, Speechmatics, and AssemblyAI.
Last updated 2025-04-03
28
MIT
Why this server?
While primarily for text to speech, managing voices and audio can be relevant if the user wants to convert the text back into speech after processing.
Typecast API MCP Server
Text-to-Speech Audio Processing App Automation
neosapience
F
license
-
quality
B
maintenance
Enables seamless integration with Typecast API through the Model Context Protocol, allowing clients to manage voices, convert text to speech, and play audio in a standardized way.
Last updated 2026-07-24
3
Why this server?
Provides text-to-speech capabilities, useful in scenarios where after converting speech to text, the user wants to convert it back or use it in other audio-related tasks.
Kokoro TTS MCP Server
Text-to-Speech Speech Processing
giannisanni
A
license
D
quality
C
maintenance
Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
Last updated 2026-06-24
1
10
MIT
Why this server?
Offers chat and image analysis, useful if the user has multimodal inputs (audio and images) related to the voice data.
OpenRouter MCP Multimodal
Autonomous Agents Image & Video Processing
stabgan
A
license
B
quality
B
maintenance
Provides chat and image analysis capabilities through OpenRouter.ai's diverse model ecosystem, enabling both text conversations and powerful multimodal image processing with various AI models.
Last updated 2026-07-26
11
268
65
Apache 2.0
Why this server?
May be useful to test that various text transcription tasks for accuracy, but it is not clearly stated. Add to selection for the user to explore.
mcp-otzaria-server
Search Art & Culture RAG Systems
Sivan22
A
license
-
quality
D
maintenance
An MCP server that provides powerful search capabilities for Jewish texts and literature. This server enables Large Language Models to search and reference Jewish texts through a standardized interface.
Last updated 2025-04-17
23
MIT

Tools and Services for Converting Speech to Text

MCP Video Digest

Typecast API MCP Server

Kokoro TTS MCP Server

OpenRouter MCP Multimodal

mcp-otzaria-server