probe_all
Probe all configured LLM API endpoints and return health metrics for each model: time to first token, total latency, throughput, and status.
Instructions
Probe all configured LLM API endpoints. Returns TTFT (ms), total latency (ms), throughput (tokens/sec), and health status for every model in the config file.
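As a rough illustration of how these metrics relate to each other, the following sketch derives them from a single streamed response. It is not the tool's actual implementation: `measure_stream`, the `clock` parameter, the `"healthy"`/`"unhealthy"` labels, and the choice to compute throughput as tokens divided by total latency are all assumptions made for this example.

```python
import time
from typing import Callable, Iterable, Optional


def measure_stream(
    tokens: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> dict:
    """Consume a token stream and derive TTFT, total latency, and throughput.

    Hypothetical helper: names, status labels, and the throughput definition
    are assumptions, not taken from the probe_all source.
    """
    start = clock()
    first_token_at: Optional[float] = None
    count = 0
    for _ in tokens:
        if first_token_at is None:
            # Timestamp of the first token, used for time to first token.
            first_token_at = clock()
        count += 1
    total_s = clock() - start

    return {
        "ttft_ms": None if first_token_at is None else (first_token_at - start) * 1000.0,
        "total_latency_ms": total_s * 1000.0,
        # Assumed definition: tokens received divided by total wall-clock time.
        "tokens_per_sec": count / total_s if total_s > 0 else 0.0,
        # Assumed rule: a response with no tokens counts as unhealthy.
        "status": "healthy" if count > 0 else "unhealthy",
    }
```

In practice `tokens` would be the streaming iterator returned by an API client, and the default `time.perf_counter` clock would be used. The injectable `clock` only exists so the arithmetic can be checked with fixed timestamps.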
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| config | No | Path to the probes.yml config file | |