get_column_summary
Get detailed summary statistics for any column, auto-detecting its type to compute metrics like five-number summary, outlier count, mode, or date range.
Instructions
Return full summary statistics for a single column. The column type is auto-detected and the appropriate statistics are computed:
continuous/discrete: five-number summary, mean, std, skewness with plain english label, kurtosis with label, outlier count (IQR method), zero count, infinite count, normality test (scipy normaltest p-value and result)
categorical: mode, top 10 value counts with percentages
binary: mode, top value counts, class balance ratio with imbalance flag (flagged if majority:minority ratio exceeds 3:1)
temporal: min/max date, date range in days, gap count, most common year and month
high_cardinality: flagged as likely ID or free text with sample values only
Use this to investigate a specific column in depth after calling load_dataset to identify columns of interest.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| file_path | Yes | ||
| column | Yes | ||
| table | No |