llm_quality_guard
Track rolling average judge scores for all routed models and receive degradation alerts if any score drops below 0.7 with sufficient samples.
Instructions
Show quality scores per model with degradation alerts (v6.2).
Displays rolling average judge scores for all routed models over the past N days.
Alerts if any model's score < 0.7 with sufficient samples (quality degradation).
Args:
days: Number of days of history to analyze (default 7).
Returns:
Formatted table with model scores, trend arrows, and alerts.Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| days | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |