DataDistributionTool
Analyzes data distribution patterns and field value frequencies in OpenSearch indices to detect anomalies, assess data quality, and identify trends. Supports comparative analysis between time periods.
Instructions
Analyzes data distribution patterns and field value frequencies within OpenSearch indices. Supports both single dataset analysis for understanding data characteristics and comparative analysis between two time periods to identify distribution changes. Automatically detects useful fields, calculates value distributions, groups numeric data, and computes divergence metrics. Useful for anomaly detection, data quality assessment, and trend analysis. We can use this tool to analyze the distribution of failures over time
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| index | Yes | Target OpenSearch index name | |
| selectionTimeRangeStart | Yes | Start time for analysis period | |
| selectionTimeRangeEnd | Yes | End time for analysis period | |
| timeField | Yes | Date/time field for filtering(requied) | |
| baselineTimeRangeStart | No | Start time for baseline period (optional) | |
| baselineTimeRangeEnd | No | End time for baseline period (optional) | |
| size | No | Maximum number of documents to analyze |