Schema | voxcpm-mcp

voxcpm-mcp

Environment variables used to configure the server. Both are optional.

Name	Required	Description	Default
`VOXCPM_PYTHON`	No	Python executable with VoxCPM2 + CUDA (e.g., path to venv python)
`VOXCPM_OUTPUT_DIR`	No	Directory where WAV files are saved	./voxcpm_output
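
As a rough illustration of how a launcher might resolve these two variables, here is a minimal sketch. The helper name `resolve_config` is hypothetical, and the fallback to the current interpreter when `VOXCPM_PYTHON` is unset is an assumption (the table lists no default for it); only the `./voxcpm_output` default comes from the table.

```python
import os
import sys
from pathlib import Path


def resolve_config(env=None):
    """Return (python_executable, output_dir) from the VOXCPM_* variables.

    Hypothetical helper, not part of voxcpm-mcp itself.
    """
    env = os.environ if env is None else env
    # Assumption: with VOXCPM_PYTHON unset, use the interpreter running this script.
    python_exe = env.get("VOXCPM_PYTHON") or sys.executable
    # Default taken from the table: ./voxcpm_output
    output_dir = Path(env.get("VOXCPM_OUTPUT_DIR") or "./voxcpm_output")
    return python_exe, output_dir


if __name__ == "__main__":
    python_exe, output_dir = resolve_config()
    print(f"python: {python_exe}")
    print(f"output: {output_dir}")
```

Passing a plain dictionary instead of `os.environ` makes the lookup easy to check without touching the real environment.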

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": false }
`experimental`	{}
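
In MCP, `"listChanged": false` under `tools` means the server does not announce changes to its tool list, so a client can fetch the list once and reuse it. The sketch below shows one way a client might read that flag; the function name `can_cache_tool_list` is hypothetical and the capabilities object is copied from the table above.

```python
import json

# Capabilities as listed in the table above.
capabilities = json.loads('{"tools": {"listChanged": false}, "experimental": {}}')


def can_cache_tool_list(caps):
    """True if the server offers tools and will not send list-changed notifications."""
    tools = caps.get("tools")
    if tools is None:
        return False  # server exposes no tools at all
    return not tools.get("listChanged", False)


if __name__ == "__main__":
    print(can_cache_tool_list(capabilities))
```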

Functions exposed to the LLM to take actions

Name	Description
synthesize	Synthesize speech from text using VoxCPM2 (2B diffusion TTS, 48 kHz). Returns the path to the output WAV file and its duration.
synthesize_with_clone	Synthesize speech cloning a voice from a reference WAV. The reference WAV sets the speaker identity, prosody, and style. Both reference and output are 48 kHz mono WAV.
preload_model	Load VoxCPM2 into VRAM now (takes ~10 s on RTX 4060 Laptop). Call this before synthesize if you want the first synthesis to be fast.
ping	Check that the VoxCPM2 worker subprocess is alive.
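
The descriptions suggest a natural call order: check the worker, preload the model, then synthesize. The sketch below builds the corresponding MCP `tools/call` JSON-RPC requests. The `tool_call` helper is hypothetical, and the `text` argument name is an assumption, since this page does not list the tools' input schemas; check the server's `tools/list` response for the real parameter names.

```python
import json
from itertools import count

_ids = count(1)  # JSON-RPC request ids


def tool_call(name, arguments=None):
    """Build an MCP tools/call request for the named tool."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments or {}},
    }


requests = [
    tool_call("ping"),           # is the worker subprocess alive?
    tool_call("preload_model"),  # load the model up front so the first synthesis is fast
    # Assumption: the input text is passed as "text".
    tool_call("synthesize", {"text": "Hello from VoxCPM2."}),
]

if __name__ == "__main__":
    for request in requests:
        print(json.dumps(request))
```

An MCP client library would normally build and send these messages for you; the raw form is shown only to make the sequence explicit.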

Interactive templates invoked by user choice

Name	Description
No prompts

Contextual data attached and managed by the client

Name	Description
No resources

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/OLGTX303/voxcpm-mcp'
