reverify_run
Re-hashes raw artifacts from a test run to verify metrics reproduce before allowing promotion. Blocks benchmark-gaming by ensuring claimed results are genuine.
Instructions
Deep re-verification: re-hash every raw artifact behind a full test and confirm the claimed metrics reproduce. Promotion is blocked until this passes (anti benchmark-gaming / baseline-tampering).
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| runId | Yes | ||
| testId | No | ||
| hypothesisId | No |