split_dataset
Split a dataset into training, validation, and test sets using methods like random, stratified, time-series, or group-based partitioning.
Instructions
Split a dataset into train, validation, and test sets.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| dataset_name | Yes | Name of the dataset to split | |
| split_method | Yes | Method for splitting the dataset | |
| test_size | No | Proportion of data for test set | |
| val_size | No | Proportion of data for validation set (creates 70/20/10 split by default) | |
| target_column | No | Target column for stratified splitting | |
| time_column | No | Time column for time-series splitting | |
| group_column | No | Group column for group-based splitting | |
| random_state | No | Random seed for reproducibility | |
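
The tool's internals are not documented here, so the following is only a minimal pure-Python sketch of how the parameters above could map onto the four split methods. The function name `split_rows`, the method strings (`"random"`, `"stratified"`, `"time_series"`, `"group"`), and the defaults `test_size=0.2` and `val_size=0.1` (one reading of the 70/20/10 default) are assumptions made for illustration, not the tool's confirmed behavior.

```python
import random
from collections import defaultdict


def split_rows(rows, split_method="random", test_size=0.2, val_size=0.1,
               target_column=None, time_column=None, group_column=None,
               random_state=None):
    """Hypothetical helper: return (train, val, test) lists of row dicts."""
    if test_size < 0 or val_size < 0 or test_size + val_size >= 1:
        raise ValueError("test_size and val_size must be >= 0 and sum to < 1")
    rng = random.Random(random_state)  # seeded for reproducibility

    def cut(items):
        # Slice an ordered list into train / val / test by proportion.
        n_test = round(len(items) * test_size)
        n_val = round(len(items) * val_size)
        n_train = len(items) - n_val - n_test
        return (items[:n_train],
                items[n_train:n_train + n_val],
                items[n_train + n_val:])

    if split_method == "random":
        shuffled = list(rows)
        rng.shuffle(shuffled)
        return cut(shuffled)

    if split_method == "time_series":
        # No shuffling: earliest rows train, latest rows test.
        return cut(sorted(rows, key=lambda r: r[time_column]))

    if split_method == "stratified":
        # Split each class separately so class ratios are preserved.
        by_class = defaultdict(list)
        for row in rows:
            by_class[row[target_column]].append(row)
        train, val, test = [], [], []
        for members in by_class.values():
            rng.shuffle(members)
            tr, va, te = cut(members)
            train += tr
            val += va
            test += te
        return train, val, test

    if split_method == "group":
        # Proportions apply to groups, so a group never spans two sets.
        by_group = defaultdict(list)
        for row in rows:
            by_group[row[group_column]].append(row)
        keys = list(by_group)
        rng.shuffle(keys)
        parts = cut(keys)
        return tuple([r for k in part for r in by_group[k]] for part in parts)

    raise ValueError(f"unknown split_method: {split_method!r}")


# Example: 100 rows with a binary label, a timestamp, and 10 user groups.
rows = [{"label": i % 2, "t": i, "user": i % 10} for i in range(100)]
train, val, test = split_rows(rows, "stratified", target_column="label",
                              random_state=42)
```

In this sketch `target_column`, `time_column`, and `group_column` are each read only by their matching method, which mirrors how the table marks them as optional. For the group method the proportions are counted in groups rather than rows, so row counts can drift from the requested sizes when groups are uneven.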