gpu_offload_model
Free up GPU memory by offloading a model from GPU to CPU. Specify which model to offload using its index, or offload the largest model with -1.
Instructions
Move a specific model from GPU to CPU by index. Use gpu_memory_map to see model indices. Pass -1 to offload the largest model.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model_index | No |