gpu_optimize
Automatically free GPU VRAM by offloading largest models to CPU until target free memory is reached. Specify target_free_mb or use auto mode.
Instructions
Automatically free VRAM by offloading models to CPU. Offloads largest models first until threshold is met. Set target_free_mb to specify how much VRAM to free, or 0 for auto.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| target_free_mb | No |