spraay_compute_text_inference
Send chat messages to run text inference using multiple LLMs. Choose from 11 models priced between $0.003-$0.10 USDC per request.
Instructions
Run LLM text inference via Spraay Compute. 11 models across Chutes, Replicate, OpenRouter (DeepSeek, Llama, Qwen, Gemma). Costs $0.003-$0.10 USDC depending on model.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model | No | Model ID (e.g. 'deepseek-ai/DeepSeek-V3-0324', 'auto' for cheapest). Use spraay_compute_models to list all. | auto |
| messages | Yes | Chat messages array | |
| max_tokens | No | Maximum tokens to generate | |
| temperature | No | Sampling temperature (0-2) |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| ok | Yes | True when the gateway call succeeded; false when it returned an error. | |
| data | No | The gateway response payload on success. The exact shape depends on the tool (see the tool description and the JSON in the text content block). | |
| error | No | Human-readable error message, present only when ok is false. |