preload_model
Loads the VoxCPM2 model into VRAM to reduce latency for the first text-to-speech synthesis request.
Instructions
Load VoxCPM2 into VRAM now (takes ~10 s on RTX 4060 Laptop). Call this before synthesize if you want the first synthesis to be fast.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||