A service to convert text to ready-to-use audio with download, player, or embed options


Why this server?
Integrates with the ElevenLabs text-to-speech API to convert text into audio.
ElevenLabs MCP Server
Text-to-Speech, Audio Processing
mamertofabian
License: A, Quality: A, Maintenance: F
Integrates with the ElevenLabs text-to-speech API.
Last updated 2025-01-07
6
118
MIT
Why this server?
Provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Speech MCP Server
Text-to-Speech, Audio Processing, Multimedia Processing
hammeiam
License: A, Quality: B, Maintenance: D
A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Last updated 2025-03-28
4
7
1
MIT
Why this server?
A Python server providing access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.
Whissle MCP Server
Speech Processing, Text-to-Speech, Text Summarization
WhissleAI
License: F, Quality: A, Maintenance: D
A Python-based server that provides access to Whissle API endpoints for speech-to-text, diarization, translation, and text summarization.
Last updated 2025-04-11
5
2
Why this server?
Provides voice recognition and text extraction capabilities with support for both stdio and MCP modes, processing audio files or base64 encoded data and returning structured results with language, emotion, and speaker information.
Voice Recognition MCP Service
Speech Processing, Audio Processing
yangsenessa
License: A, Quality: -, Maintenance: D
Last updated 2025-06-17
MIT
Why this server?
A Model Context Protocol server that enables LLMs to extract and use content from unstructured documents across a wide variety of file formats. It may help with audio processing indirectly when audio is embedded in a document.
Unstructured Document
Documentation Access, Developer Tools, Search
MKhalusova
License: F, Quality: B, Maintenance: D
A Model Context Protocol server that enables LLMs to extract and use content from unstructured documents across a wide variety of file formats.
Last updated 2025-03-20
1
11
