tool_get_eval_summary
Extract comprehensive evaluation summaries from log files, including task details, model configurations, scoring results, token usage, and performance metrics.
Instructions
Get comprehensive evaluation summary including header, results, and stats.
Returns full metadata about an evaluation run: task info, model configuration, scoring results, token usage, duration, and revision info.
Args: log_file: Path to log file (absolute or relative to log_dir) log_dir: Optional log directory for relative paths
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| log_file | Yes | ||
| log_dir | No |