probe
Check health of LLM API endpoints by measuring time to first token, latency, and throughput. Reports status for each configured model.
Instructions
Run a health check against configured LLM API endpoints. Returns TTFT, latency, throughput, and health status for each model defined in the config file.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| config | No | path to probes.yml config file |