replay
Identify regressions by comparing a test's current behavior to its baseline using a side-by-side HTML viewer of tool calls, parameters, outputs, and score changes.
Instructions
Open a trajectory diff viewer for a specific test, or for the latest one if no test name is given. The viewer shows a side-by-side HTML comparison of baseline and current behavior: tool calls, parameters, outputs, and score changes. It opens automatically in the browser unless no_browser is set.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| test_name | No | Name of the test to replay. If omitted, the latest is shown. | |
| test_path | No | Path to the test directory. | 'tests/' |
| no_browser | No | Don't auto-open the HTML report in the browser. | false |
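
As a rough illustration of the schema above, the sketch below assembles an arguments object for replay in Python. The helper name `build_replay_args` and the test name `checkout_flow` are hypothetical and used only for illustration. Only the three parameter names and their defaults come from the table, and how the arguments are actually sent to the tool depends on your client.

```python
import json
from typing import Optional


def build_replay_args(
    test_name: Optional[str] = None,
    test_path: str = "tests/",
    no_browser: bool = False,
) -> dict:
    """Build an arguments object for the replay tool (hypothetical helper)."""
    # test_path and no_browser carry the defaults listed in the table.
    args = {"test_path": test_path, "no_browser": no_browser}
    # test_name is optional; when it is left out, the tool shows the latest.
    if test_name is not None:
        args["test_name"] = test_name
    return args


# "checkout_flow" is a made-up test name; no_browser=True skips the auto-open step.
print(json.dumps(build_replay_args("checkout_flow", no_browser=True), indent=2))
```

Leaving `test_name` out of the object, rather than sending an empty string, is a design choice in this sketch so that the "latest if omitted" behavior applies.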