get_statistics
Compute descriptive statistics for numerical columns including count, mean, standard deviation, min/max values, and percentiles to analyze data distribution and quality.
Instructions
Get comprehensive statistical summary of numerical columns.
Computes descriptive statistics for all or specified numerical columns including count, mean, standard deviation, min/max values, and percentiles. Optimized for AI workflows with clear statistical insights and data understanding.
Returns: Comprehensive statistical analysis with per-column summaries
Statistical Metrics: 📊 Count: Number of non-null values 📈 Mean: Average value 📉 Std: Standard deviation (measure of spread) 🔢 Min/Max: Minimum and maximum values 📊 Percentiles: 25th, 50th (median), 75th quartiles
Examples: # Get statistics for all numeric columns stats = await get_statistics("session_123")
# Analyze specific columns only
stats = await get_statistics("session_123", columns=["price", "quantity"])
# Analyze all numeric columns (percentiles always included)
stats = await get_statistics("session_123")AI Workflow Integration: 1. Essential for data understanding and quality assessment 2. Identifies data distribution and potential issues 3. Guides feature engineering and analysis decisions 4. Provides context for outlier detection thresholds
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| columns | Yes | List of specific columns to analyze (None = all numeric columns) |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| success | No | Whether operation completed successfully | |
| statistics | Yes | Statistical summary for each column | |
| total_rows | Yes | Total number of rows in the dataset | |
| column_count | Yes | Total number of columns analyzed | |
| numeric_columns | Yes | Names of numeric columns that were analyzed |