check_backend_health
Verify backend reachability and latency, with optional cache bypass, to diagnose routing decisions before long batch calls.
Instructions
On-demand ping of one backend's API endpoint to verify reachability and capture latency. Results are cached for 5 minutes; pass force:true to bypass the cache. SAB already runs periodic auto-health checks in the background, so most callers don't need this — reach for it when troubleshooting a routing decision that failed unexpectedly or before kicking off a long batch call against a particular backend. Read-only: makes one HTTP request to the backend's health endpoint. Returns: {status:'online'|'offline'|'degraded', latency_ms, backend, last_check_iso, error?, models?:[{id, name, size, quantization}] (router only), available_presets? (router only), cache_status:'HIT'|'MISS', total_check_time}.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| backend | Yes | Backend name to check (local, gemini, nvidia_deepseek, nvidia_qwen, openai, groq) | |
| force | No | Bypass cache and force fresh check |