volt_recommend_route
Find optimal AI model providers by comparing cost, latency, and reliability to reduce compute expenses with personalized recommendations.
Instructions
Get the optimal provider recommendation for a model, optimized for cost, latency, reliability, or a balanced mix. If you supply your current cost, the result also shows the estimated savings.
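The savings comparison can be illustrated with simple arithmetic. The sketch below is an assumption about how such an estimate might be computed, not the tool's documented formula; `estimate_savings` and `recommended_cost_per_million` are hypothetical names introduced for illustration.

```python
from __future__ import annotations


def estimate_savings(
    current_cost_per_million: float,
    recommended_cost_per_million: float,
) -> tuple[float, float]:
    """Hypothetical helper: return (absolute savings, percent savings).

    Both prices are in the same currency per million tokens. The formula
    is an assumption; the tool may compute savings differently.
    """
    if current_cost_per_million <= 0:
        raise ValueError("current_cost_per_million must be positive")
    absolute = current_cost_per_million - recommended_cost_per_million
    percent = absolute / current_cost_per_million * 100
    return absolute, percent


# Example: paying 2.00 per million tokens today, recommended offering costs 1.50.
absolute, percent = estimate_savings(2.00, 1.50)
print(f"Savings: {absolute:.2f} per million tokens ({percent:.1f}%)")
```

A negative result would mean the recommended offering costs more than the current one, which can happen when optimizing for latency or reliability rather than cost.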
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model | Yes | Model name or partial match to filter offerings (e.g. "llama-70b", "gpt-4o") | |
| optimize | No | Optimization target: cost, latency, reliability, or balanced | balanced |
| current_cost_per_million | No | Your current price per million tokens (average of input and output prices), used to estimate savings | |
| min_quality | No | Minimum acceptable quality score, from 0 to 1 | 0.7 |
| max_latency_ms | No | Maximum acceptable P95 latency, in milliseconds | 5000 |
| blocked_providers | No | Provider IDs to exclude from recommendations | |
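As a rough illustration of the schema above, the following sketch assembles and validates an arguments object before it is sent to the tool. `build_route_arguments` is a hypothetical helper, the accepted `optimize` strings are inferred from the description rather than confirmed, and `"example-provider"` is a placeholder ID. How the arguments are actually delivered depends on your client and is not shown.

```python
from __future__ import annotations

from typing import Any, Optional

# Targets named in the tool description; the exact accepted strings are an assumption.
OPTIMIZE_CHOICES = {"cost", "latency", "reliability", "balanced"}


def build_route_arguments(
    model: str,
    optimize: str = "balanced",
    current_cost_per_million: Optional[float] = None,
    min_quality: float = 0.7,
    max_latency_ms: int = 5000,
    blocked_providers: Optional[list[str]] = None,
) -> dict[str, Any]:
    """Hypothetical helper: build an arguments dict for volt_recommend_route."""
    if not model:
        raise ValueError("model is required")
    if optimize not in OPTIMIZE_CHOICES:
        raise ValueError(f"optimize must be one of {sorted(OPTIMIZE_CHOICES)}")
    if not 0.0 <= min_quality <= 1.0:
        raise ValueError("min_quality must be between 0 and 1")
    if max_latency_ms <= 0:
        raise ValueError("max_latency_ms must be positive")

    args: dict[str, Any] = {
        "model": model,
        "optimize": optimize,
        "min_quality": min_quality,
        "max_latency_ms": max_latency_ms,
    }
    # Optional fields are left out when unset so the tool applies its own handling.
    if current_cost_per_million is not None:
        args["current_cost_per_million"] = current_cost_per_million
    if blocked_providers:
        args["blocked_providers"] = list(blocked_providers)
    return args


example_args = build_route_arguments(
    "llama-70b",
    optimize="cost",
    current_cost_per_million=0.90,
    blocked_providers=["example-provider"],  # placeholder provider ID
)
print(example_args)
```

Validating on the client side is optional; it simply surfaces out-of-range values such as a `min_quality` above 1 before the request is made.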