Skip to main content
Glama
OLGTX303
by OLGTX303

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
VOXCPM_PYTHONNoPython executable with VoxCPM2 + CUDA (e.g., path to venv python)
VOXCPM_OUTPUT_DIRNoDirectory where WAV files are saved./voxcpm_output

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
synthesizeA

Synthesize speech from text using VoxCPM2 (2B diffusion TTS, 48 kHz). Returns the path to the output WAV file and its duration.

synthesize_with_cloneA

Synthesize speech cloning a voice from a reference WAV. The reference WAV sets the speaker identity, prosody, and style. Both reference and output are 48 kHz mono WAV.

preload_modelA

Load VoxCPM2 into VRAM now (takes ~10 s on RTX 4060 Laptop). Call this before synthesize if you want the first synthesis to be fast.

pingA

Check that the VoxCPM2 worker subprocess is alive.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/OLGTX303/voxcpm-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server