Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
SUPERTONE_API_KEYYesYour Supertone API key
SUPERTONE_OUTPUT_DIRNoDirectory where audio files are saved~/supertone-tts-output/
SUPERTONE_MCP_AUTOPLAYNoAuto-play generated audio on macOS via afplaytrue
SUPERTONE_MCP_VOICE_IDNoDefault voice_id for text_to_speech / predict_durationpreset voice (Aiden, multilingual)
SUPERTONE_MCP_OUTPUT_MODENoOne of `files`, `resources`, `both`. Controls how text_to_speech returns audiofiles

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": false
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
text_to_speechA

Generate natural-sounding speech audio from text. Use this when the user wants to: hear text read aloud, create narration or voiceover, generate voice audio, preview how text sounds when spoken, or convert any writing into spoken audio. Supports 23 languages including Korean, English, and Japanese. Audio is automatically played back on macOS. A default voice is already configured -- just call this tool directly. Only call search_voice if the user explicitly asks to change or browse voices.

search_voiceA

Search the Supertone voice catalog. Filters are optional and combined with AND semantics: name, description, language, gender, age, use_case, style, model. With no filters, returns the full catalog (the v0.1 list_voices behavior). The output is a numbered plain-text list; when any filter is set, the first line shows "Filters applied: ...".

get_voiceA

Fetch full detail for a single voice by voice_id. Returns name, description, age, gender, use_cases, languages, styles, supported models, sample count, and thumbnail URL. Use preview_voice to get the actual sample audio URLs.

get_credit_balanceA

Returns the remaining Supertone credit balance for the current API key. Use this before long TTS calls to confirm you have enough characters left.

preview_voiceA

Fetch sample audio URLs for a voice. Optionally filter samples by language, style, and model. Returns one URL per matching sample. v0.2 does NOT play the audio locally; pass the URL to your client to listen.

predict_durationA

Predict the expected output audio duration in seconds for a given text WITHOUT producing any audio file. Accepts the same parameters as text_to_speech and applies the same 300-character limit. Use this to estimate credit cost before synthesizing — credit usage is proportional to the predicted duration.

clone_voiceA

Create a custom voice from a single local audio file. Constraints: WAV or MP3 only, max 3MB, exactly one file. The returned voice_id can be used immediately in text_to_speech. Path supports ~ expansion (e.g., "~/sample.wav").

search_custom_voiceA

List custom (cloned) voices created by this API key. Optional name and description filters perform partial matching. Pagination is handled internally; v0.2 returns the SDK default page.

edit_custom_voiceA

Update the name and/or description of an existing custom voice. At least one of name or description must be provided.

delete_custom_voiceA

Permanently delete a custom (cloned) voice. THIS IS IRREVERSIBLE — once deleted, the voice cannot be recovered and any saved voice_id referencing it will stop working. Confirm with the user before calling.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/supertone-inc/supertone-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server