llama_chat
Generate chat responses using a local LLM via llama.cpp. Supports OpenAI-compatible parameters for customization.
Instructions
Chat completion (OpenAI-compatible format)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| messages | Yes | Chat messages | |
| max_tokens | No | Maximum tokens to generate | |
| temperature | No | Sampling temperature (0-2) | |
| top_p | No | Nucleus sampling threshold | |
| stop | No | Stop sequences | |
| seed | No | Random seed for reproducibility |