list_evaluations
List past evaluation runs in a directory to access metadata such as timestamp, dataset, models, pass/fail results, and cost.
Instructions
List past evaluation runs in a directory. Returns metadata for each run: timestamp, dataset, models, pass/fail, and cost.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| results_dir | Yes | Directory containing evaluation result files. |