Skip to main content
Glama

tool_compare_runs

Compare metrics between two evaluation runs to analyze task/model performance, score differences, token usage, and duration changes.

Instructions

Compare metrics between two evaluation runs.

Shows side-by-side comparison of two evaluation runs including task/model info, sample counts, score differences, token usage differences, and duration.

Args: log_file_a: Path to first log file log_file_b: Path to second log file log_dir: Optional log directory for relative paths

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
log_file_aYes
log_file_bYes
log_dirNo

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/PranshuSrivastava/inspect-logs-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server