llm_quality_guard
Monitor rolling average quality scores per model over past N days; alerts trigger when any model's score falls below 0.7 with sufficient samples, indicating degradation.
Instructions
Show quality scores per model with degradation alerts (v6.2).
Displays rolling average judge scores for all routed models over the past N days. Alerts if any model's score < 0.7 with sufficient samples (quality degradation).
Args: days: Number of days of history to analyze (default 7).
Returns: Formatted table with model scores, trend arrows, and alerts.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| days | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |