compare_models
Evaluate and compare multiple trained models on the same dataset, using statistical significance testing to identify the best performer.
Instructions
Compare multiple trained models on the same dataset with statistical significance testing
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model_paths | Yes | Dictionary mapping model names to file paths | |
| dataset_name | No | Name of the loaded dataset for model comparison | |
| dataset_path | No | Path to the evaluation dataset file - alternative to dataset_name | |
| target_column | Yes | Name of the target/label column | |
| cv_folds | No | Number of cross-validation folds | |
| enable_statistical_tests | No | Perform statistical significance tests | |
| significance_level | No | Significance level for statistical tests | |
| scoring_metrics | No | List of scoring metrics for model comparison | |
| generate_learning_curves | No | Generate learning curves | |
| learning_curve_train_sizes | No | Training sizes for learning curves | |
| detailed_metrics | No | Calculate detailed metrics and reports | |
| save_results | No | Save comparison results |