Server Configuration
Describes the environment variables required to run the server.
| Name | Required | Description | Default | 
|---|---|---|---|
| OLLAMA_KEEP_ALIVE | No | Keep models loaded for 24 hours | 24h | 
| OLLAMA_NUM_PARALLEL | No | Handle parallel requests | 8 | 
| OLLAMA_KV_CACHE_TYPE | No | High-quality 8-bit cache | q8_0 | 
| OLLAMA_FLASH_ATTENTION | No | Enable efficient attention mechanism | 1 | 
| OLLAMA_MAX_LOADED_MODELS | No | Keep models in memory simultaneously | 4 | 
Schema
Prompts
Interactive templates invoked by user choice
| Name | Description | 
|---|---|
No prompts  | |
Resources
Contextual data attached and managed by the client
| Name | Description | 
|---|---|
No resources  | |
Tools
Functions exposed to the LLM to take actions
| Name | Description | 
|---|---|
| qwen3_code_review | Review code using Qwen3-Coder  | 
| qwen3_code_explain | Explain code using Qwen3-Coder  | 
| qwen3_code_generate | Generate code using Qwen3-Coder  | 
| qwen3_code_fix | Fix bugs in code using Qwen3-Coder  | 
| qwen3_code_optimize | Optimize code using Qwen3-Coder  |