gpu_auto_manage
Automatically offloads models from GPU VRAM when usage exceeds a threshold to prevent out-of-memory errors.
Instructions
Enable or disable automatic VRAM management. While enabled, the optimizer offloads models whenever VRAM usage exceeds the threshold.
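The behavior described above can be pictured with a minimal sketch. It is not the tool's actual implementation: the `AutoManager` class, its `threshold` expressed as a fraction of total VRAM, and the oldest-first eviction order are all assumptions made for illustration, and VRAM usage is simulated with plain numbers rather than read from a GPU.

```python
from dataclasses import dataclass, field


@dataclass
class AutoManager:
    """Hypothetical stand-in for the optimizer (illustrative only)."""
    threshold: float = 0.9   # assumed: fraction of total VRAM allowed in use
    enabled: bool = False    # mirrors the tool's `enabled` input
    loaded: list = field(default_factory=list)  # (name, size_gb), oldest first

    def set_enabled(self, enabled: bool) -> None:
        # Corresponds to calling gpu_auto_manage with enabled=True/False.
        self.enabled = enabled

    def used_gb(self) -> float:
        return sum(size for _, size in self.loaded)

    def enforce(self, total_gb: float) -> list:
        """Offload oldest models until usage is at or below the threshold."""
        offloaded = []
        if not self.enabled:
            return offloaded  # management is off: nothing is offloaded
        while self.loaded and self.used_gb() > self.threshold * total_gb:
            name, _ = self.loaded.pop(0)  # assumed policy: evict oldest first
            offloaded.append(name)
        return offloaded


manager = AutoManager(threshold=0.8)
manager.loaded = [("model_a", 10.0), ("model_b", 8.0), ("model_c", 5.0)]

disabled_result = manager.enforce(total_gb=24.0)  # disabled, so no offloading
manager.set_enabled(True)
enabled_result = manager.enforce(total_gb=24.0)   # 23 GB used vs. 19.2 GB limit
print(disabled_result, enabled_result)
```

In this sketch, disabling management leaves every model resident even above the threshold, which is when out-of-memory errors become possible; enabling it trades that risk for the cost of reloading an offloaded model later.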
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| enabled | No | | |