list_evaluations
List evaluations of trained models against benchmark datasets. Optionally filter by status (queued, running, succeeded, failed, canceled) and cap the number of results returned. Useful for monitoring model quality.
Instructions
List model evaluations. Evaluations run your trained models against benchmark datasets using various evaluators to measure quality.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| status | No | Filter by status: queued, running, succeeded, failed, canceled | |
| limit | No | Max results | 20 |
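
The schema above can be exercised with a small helper that builds the arguments for a call. This is a minimal sketch, not the tool's own client: the helper names `build_list_evaluations_args` and `build_tool_call` are hypothetical, the JSON-RPC `tools/call` envelope assumes the tool is exposed over an MCP-style transport, and the positive-integer check on `limit` is an assumption (the source only states the default of 20).

```python
import json

# Status values listed in the Input Schema table above.
ALLOWED_STATUSES = {"queued", "running", "succeeded", "failed", "canceled"}


def build_list_evaluations_args(status=None, limit=None):
    """Return an arguments dict for list_evaluations, omitting unset fields."""
    args = {}
    if status is not None:
        if status not in ALLOWED_STATUSES:
            raise ValueError(f"unknown status: {status!r}")
        args["status"] = status
    if limit is not None:
        # Assumption: limit must be a positive integer; the source only
        # documents the default (20), not the accepted range.
        if isinstance(limit, bool) or not isinstance(limit, int) or limit < 1:
            raise ValueError("limit must be a positive integer")
        args["limit"] = limit
    return args


def build_tool_call(arguments, request_id=1):
    """Wrap arguments in a JSON-RPC 2.0 'tools/call' envelope (assumed transport)."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "list_evaluations", "arguments": arguments},
    }


if __name__ == "__main__":
    # Example: ask for at most five failed evaluations.
    request = build_tool_call(build_list_evaluations_args(status="failed", limit=5))
    print(json.dumps(request, indent=2))
```

Both parameters are optional, so the helper leaves unset fields out of the arguments entirely and lets the server apply its own default for `limit`.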