loop_regression
Automates the regression loop: test all cases, score responses, analyze failures, improve the prompt, and repeat until all scores meet the threshold or max iterations are reached.
Instructions
Run the full regression loop: test all cases → score → improve → repeat.
Stops when BOTH conditions are met:
- Overall pass rate >= threshold
- Every individual test case score >= threshold

Or when max iterations are exhausted.
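The two success conditions can be read as a single predicate. The following is a minimal sketch in Python, not part of the tool itself: the function name `is_success` and the shape of its arguments (a pass rate on the same 0-100 scale as the threshold, plus a list of per-case scores) are assumptions made for illustration.

```python
from typing import Sequence


def is_success(pass_rate: float, scores: Sequence[float], threshold: float) -> bool:
    """Return True only when BOTH stop conditions hold.

    Assumption: pass_rate and every score share the 0-100 scale of threshold.
    """
    overall_ok = pass_rate >= threshold
    every_case_ok = all(score >= threshold for score in scores)
    return overall_ok and every_case_ok
```

A single low-scoring case keeps the loop running even if the overall pass rate already meets the threshold.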
Loop:
1. Run all test cases, score responses, call post_test_result for each.
2. Call get_regression_status.
3. If pass rate >= threshold AND all individual scores >= threshold → SUCCESS.
4. If iteration >= maxIterations → EXHAUSTED. Report best result.
5. Analyse failures, write improved prompt, call post_prompt_suggestion + apply_suggestion.
6. Go to 1.
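The control flow of the steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not the tool's implementation: the three callables are hypothetical stand-ins for the real tool calls (running and scoring cases with post_test_result, reading the pass rate from get_regression_status, and improving the prompt via post_prompt_suggestion and apply_suggestion), and "best result" is taken to mean the highest pass rate seen, which the source does not define.

```python
from typing import Callable, List, Tuple


def run_regression_loop(
    run_and_score: Callable[[], List[float]],       # hypothetical: run and score all cases
    get_pass_rate: Callable[[], float],             # hypothetical: read the overall pass rate
    improve_prompt: Callable[[List[float]], None],  # hypothetical: analyse failures, apply new prompt
    threshold: float = 70,
    max_iterations: int = 5,
) -> Tuple[str, int, float]:
    """Return (outcome, iterations used, pass rate to report)."""
    best_pass_rate = float("-inf")
    iteration = 0
    while True:
        iteration += 1
        scores = run_and_score()
        pass_rate = get_pass_rate()
        best_pass_rate = max(best_pass_rate, pass_rate)

        # SUCCESS needs both the overall and the per-case condition.
        if pass_rate >= threshold and all(s >= threshold for s in scores):
            return ("SUCCESS", iteration, pass_rate)

        # EXHAUSTED: report the best pass rate seen (assumed meaning of "best result").
        if iteration >= max_iterations:
            return ("EXHAUSTED", iteration, best_pass_rate)

        # Otherwise hand the failing scores to the improvement step and repeat.
        improve_prompt([s for s in scores if s < threshold])
```

Checking for success before checking the iteration count means a run that passes on its final allowed iteration is still reported as SUCCESS rather than EXHAUSTED.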
After the loop: call pull_ui_history and save results locally.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| workspaceId | Yes | | |
| threshold | No | Pass score, 0–100 | Workspace goal or 70 |
| maxIterations | No | Max iterations | Workspace goal or 5 |
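One plausible reading of "workspace goal or N" is a three-level fallback: an explicitly passed value wins, then the workspace goal, then the built-in default. The sketch below illustrates that reading in Python; the helper name `resolve_setting` and the precedence order are assumptions, not documented behaviour.

```python
from typing import Optional


def resolve_setting(
    explicit: Optional[float],
    workspace_goal: Optional[float],
    fallback: float,
) -> float:
    """Pick the first value that is set: explicit, then workspace goal, then fallback."""
    # Compare against None so that a legitimate value of 0 is not skipped.
    if explicit is not None:
        return explicit
    if workspace_goal is not None:
        return workspace_goal
    return fallback


threshold = resolve_setting(None, None, 70)      # no argument, no goal: built-in default
max_iterations = resolve_setting(None, 8, 5)     # no argument: workspace goal applies
```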