run_ab_comparison
Compare two model variants (A and B) on the same dataset using precision, recall, F1, and AUC to determine the winner with a statistical delta.
Instructions
Compares two model variants (A and B) on the same dataset using precision, recall, F1, and AUC. Returns a winner recommendation and statistical delta.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model_a | Yes | Predictions from model A | |
| model_b | Yes | Predictions from model B | |
| threshold | No | Classification threshold (default 0.5) |