verify_equivalence
Run two commands and compare their stdout and exit code. Verifies functional equivalence before measuring energy or token usage.
Instructions
Run two commands and compare stdout + exit code. The gate before any energy/token comparison.
With no stdin_inputs it's a single-input smoke test. Pass stdin_inputs (a list of strings fed to each program's stdin) to run an input BATTERY — all inputs must match. Stronger evidence still comes from running the project's own test suite.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| command_a | Yes | ||
| command_b | Yes | ||
| stdin_inputs | No |