Speech MCP Server

Overview Inspect Schema Related Servers Score Discussions

text_to_speech_with_options

Convert text into speech with adjustable speed and selectable voices using a customizable TTS tool. Ideal for generating audio content with personalized settings.

Instructions

Convert text to speech with customizable speed

Input Schema

TableJSON Schema

Name	Required	Description
`speed`	No	Speech rate multiplier (0.5 to 2.0)
`text`	Yes	The text to convert to speech
`voice`	No	The voice to use for speech synthesis (e.g. 'af_bella'). Use list_voices to see available options.

Implementation Reference

src/index.ts:294-307 (handler)
Handler logic for the text_to_speech_with_options tool. Parses arguments, validates text input, generates and plays audio using TTSClient with speed option, and returns success message.
case "text_to_speech_with_options": { const args = request.params.arguments as unknown as TextToSpeechWithOptionsArgs; if (!args.text) { throw new Error("Missing required argument: text"); } await ttsClient.generateAndPlayAudio(args.text, args.voice, args.speed); return { content: [{ type: "text", text: `Successfully generated and played audio${args.voice ? ` using voice: ${args.voice}` : ''} (speed: ${args.speed || 1.0})` }], }; }
src/index.ts:64-85 (schema)
JSON Schema defining the input parameters for the text_to_speech_with_options tool, including text (required), optional voice, and speed with constraints.
inputSchema: { type: "object", properties: { text: { type: "string", description: "The text to convert to speech", minLength: 1, maxLength: 1000, }, voice: { type: "string", description: "The voice to use for speech synthesis (e.g. 'af_bella'). Use list_voices to see available options.", }, speed: { type: "number", description: "Speech rate multiplier (0.5 to 2.0)", minimum: 0.5, maximum: 2.0, }, }, required: ["text"], },
src/index.ts:356-360 (registration)
The textToSpeechWithOptionsTool is registered here in the array returned by the ListToolsRequest handler.
textToSpeechTool, textToSpeechWithOptionsTool, listVoicesTool, getModelStatusTool, ],
src/index.ts:34-37 (schema)
TypeScript interface for tool arguments, defining structure with optional speed and voice.
interface TextToSpeechWithOptionsArgs extends TextToSpeechArgs { speed?: number; voice?: KokoroVoice; }
src/index.ts:230-250 (helper)
Supporting method in TTSClient class that performs the actual text-to-speech generation using KokoroTTS, saves to temp WAV file, and plays it synchronously.
async generateAndPlayAudio(text: string, voice?: KokoroVoice, speed?: number): Promise<void> { await this.waitForInit(); if (!this.ttsInstance) { throw new Error("TTS model not initialized"); } const audio = await this.ttsInstance.generate(text, { voice: voice || DEFAULT_VOICE, // @ts-ignore-line speed: speed || DEFAULT_SPEECH_SPEED, }); const tempFile = join(tmpdir(), `${Date.now()}.wav`); await audio.save(tempFile); await player.play({ path: tempFile, sync: true }); } }

This server cannot be installed

Other Tools

Related Tools

speak_fast
@balloonf/widows_tts_mcp
speak_slow
@balloonf/widows_tts_mcp
batch_synthesize
@samihalawa/advanced-tts-mcp
async_text_to_speech
@mobvoi/mobvoi-tts-mcp
synthesize_speech
@samihalawa/advanced-tts-mcp
mcp_openai_tts
@bigdata-coss/agent_mcp

Latest Blog Posts

Federated Learning with MCP: Building Privacy-Preserving Agents Across Distributed Edges
By Om-Shree-0709 on December 21, 2025.
Secure
mcp
Learning
What Is Context Bloat in MCP?
By Om-Shree-0709 on December 16, 2025.
mcp
Context Bloat
MCP Moves to the Linux Foundation: Neutral Stewardship for Agentic Infrastructure
By Om-Shree-0709 on December 15, 2025.
mcp
anthropic
Linux Foundation

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/hammeiam/koroko-speech-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server