Why this server?
This server extracts and transcribes audio content from videos, which directly addresses the '语音转文本' (speech to text) requirement.
Security: - | License: A | Quality: -
A service that extracts and transcribes audio content from videos across 1000+ streaming websites including YouTube, Bilibili, TikTok, and Twitter, supporting multiple transcription providers like Deepgram, Gladia, Speechmatics, and AssemblyAI.
Last updated a year ago | 26 | MIT
Why this server?
Although this server is primarily for text-to-speech, its voice and audio management can be relevant if the user wants to convert the text back into speech after processing.
Security: - | License: F | Quality: -
Enables seamless integration with Typecast API through the Model Context Protocol, allowing clients to manage voices, convert text to speech, and play audio in a standardized way.
Last updated 2 months ago | 3
Why this server?
Provides text-to-speech capabilities, which are useful when the user, after converting speech to text, wants to convert the text back into speech or use it in other audio-related tasks.
Security: - | License: F | Quality: -
Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
Last updated a year ago | 10
Why this server?
Offers chat and image analysis, useful if the user has multimodal inputs (audio and images) related to the voice data.
Security: A | License: A | Quality: -
Provides chat and image analysis capabilities through OpenRouter.ai's diverse model ecosystem, enabling both text conversations and powerful multimodal image processing with various AI models.
Last updated 12 days ago | 631119 | MIT
Why this server?
May be useful for testing the accuracy of various text transcription tasks, although the listing does not state this clearly. Added to the selection for the user to explore.
Security: - | License: A | Quality: -
An MCP server that provides powerful search capabilities for Jewish texts and literature. This server enables Large Language Models to search and reference Jewish texts through a standardized interface.
Last updated a year ago | 21 | MIT