load_dataset
Load a CSV file into memory and return column names, row count, and categorical distinct values to prepare for driver analysis.
Instructions
Load a CSV or ZIP-compressed CSV file into memory under a given name. Call this ONCE before find_drivers or explain_segment — you do not need to reload the same file again within the same session.
Returns column names, row count, and for categorical columns their distinct values. Use this metadata to:
Identify the target variable and target class for find_drivers.
Spot columns that are direct encodings or duplicates of the target (e.g. a numeric "survived" column when the target is "alive") — pass those in the
attributesexclusion list so find_drivers does not pick them as trivial drivers.
Args: name: Short label to refer to this dataset in later calls (e.g. "accidents") path: Absolute or relative path to a CSV or ZIP-compressed CSV file separator: Column delimiter — use "\t" for tab-separated files, default is "," encoding: File encoding, default "utf-8" (use "cp1250" for Windows Eastern European files)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| name | Yes | ||
| path | Yes | ||
| separator | No | , | |
| encoding | No | utf-8 |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |