ch_data_quality
Identify data quality issues in ClickHouse tables: nulls, duplicate rows, hourly gaps, and missing market coverage for a given date.
Instructions
Run data quality checks: nulls, duplicates, gaps, and market coverage.
Checks for a specific date:
Null/empty values per column
Duplicate rows by primary key
Hourly data gaps (missing time windows)
Market coverage (are all 5 coins present?)
Data freshness
Args: table: Table to check (default: crypto_trades) database: Database name (default: cdc_pipeline) check_date: Date to check in YYYY-MM-DD format (default: today)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| table | No | crypto_trades | |
| database | No | cdc_pipeline | |
| check_date | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |