probe_model
Probe any LLM by provider and model name to get time to first token (TTFT), total latency, throughput, and health status without a config file.
Instructions
Probe a single LLM by provider and model name. Use this for ad-hoc checks when no config file is available. Returns TTFT (ms), total latency (ms), throughput (tokens/sec), and health status.
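
To make the returned metrics concrete, here is a minimal sketch of how they could be derived from a streamed response. It is not the tool's actual implementation: `measure_stream` is a hypothetical helper, each streamed chunk is counted as one token, throughput is assumed to be tokens divided by total latency, and "healthy" is assumed to mean that at least one token arrived.

```python
import time
from typing import Callable, Iterable, Optional


def measure_stream(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> dict:
    """Consume a token stream and report timing metrics (hypothetical helper)."""
    start = clock()
    first_token_at: Optional[float] = None
    token_count = 0

    for _ in chunks:
        if first_token_at is None:
            # Timestamp of the first streamed chunk, used for TTFT.
            first_token_at = clock()
        token_count += 1

    end = clock()
    total_s = end - start

    ttft_ms = None if first_token_at is None else (first_token_at - start) * 1000.0
    # Assumption: throughput is measured over the whole request, not only
    # the generation phase after the first token.
    throughput = token_count / total_s if token_count and total_s > 0 else 0.0

    return {
        "ttft_ms": ttft_ms,
        "total_latency_ms": total_s * 1000.0,
        "throughput_tokens_per_sec": throughput,
        "healthy": token_count > 0,  # assumed health criterion
    }
```

Passing a custom `clock` makes the helper deterministic in tests; in real use the default `time.perf_counter` would wrap the provider's streaming iterator.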
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| provider | Yes | provider name (openai, anthropic, google, azure, bedrock) | |
| model | Yes | model identifier (e.g. gpt-4o, claude-sonnet-4-20250514) | |
| api_key_env | Yes | name of the environment variable that holds the API key | |
| base_url | No | optional base URL for OpenAI-compatible endpoints (e.g. http://localhost:8000) | |
| label | No | optional display name for the endpoint (e.g. vllm-local) | |
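
As an illustration of the schema, the sketch below builds one set of arguments and checks it against the required and optional fields listed in the table. `validate_probe_args` is a hypothetical helper, not part of the tool, and `OPENAI_API_KEY` is an assumed variable name; the point is that `api_key_env` carries the variable's name rather than the secret.

```python
from typing import List

# Field names taken from the input schema table.
REQUIRED_FIELDS = ("provider", "model", "api_key_env")
OPTIONAL_FIELDS = ("base_url", "label")


def validate_probe_args(args: dict) -> List[str]:
    """Return a list of problems found in the arguments (empty if none)."""
    problems = []
    for name in REQUIRED_FIELDS:
        if not args.get(name):
            problems.append(f"missing required field: {name}")
    for name in args:
        if name not in REQUIRED_FIELDS + OPTIONAL_FIELDS:
            problems.append(f"unknown field: {name}")
    return problems


# Example arguments for an OpenAI-compatible local endpoint.
example_args = {
    "provider": "openai",
    "model": "gpt-4o",
    "api_key_env": "OPENAI_API_KEY",  # assumed variable name, not the key
    "base_url": "http://localhost:8000",
    "label": "vllm-local",
}

print(validate_probe_args(example_args))
```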