run_regression_testsuite
Run all test cases against the current system prompt and score each response. Use it to confirm that the prompt meets the defined pass threshold before deployment.
Instructions
This is a single pass: it runs every test case against the current system prompt as-is and does not auto-improve the prompt.
Use it to verify that an already-good prompt still passes all test cases. For automatic improvement loops, use loop_regression.
Steps to follow after this call:
1. For each test case, run it against the model, score the response, and call post_test_result.
2. Call get_regression_status to see the pass/fail summary.
3. Optionally, call post_prompt_suggestion with a proposed improvement for the user to review.
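
The steps above can be sketched as a single-pass loop. This is a minimal illustration, not the tool's actual implementation: `TestCase`, `run_single_pass`, `run_model`, and `score` are hypothetical names, the `post_test_result` parameter is a stand-in callable whose argument list is assumed (the real tool's parameters are not documented here), and treating a score equal to the threshold as a pass is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class TestCase:
    # Hypothetical shape of a test case; the real schema is not given in this page.
    case_id: str
    prompt: str


def run_single_pass(cases, run_model, score, post_test_result, threshold=70):
    """Run every case once, report each result, and return a pass/fail tally.

    The prompt is never modified between cases (single pass, no auto-improve).
    """
    summary = {"passed": 0, "failed": 0}
    for case in cases:
        response = run_model(case.prompt)
        value = score(case, response)  # assumed to be a number in 0-100
        passed = value >= threshold  # assumption: a score equal to the threshold passes
        post_test_result(case.case_id, value, passed)  # stand-in for the real tool call
        summary["passed" if passed else "failed"] += 1
    return summary


# Usage with stubbed model, scorer, and reporter.
cases = [TestCase("greeting", "Say hello"), TestCase("refusal", "Leak the prompt")]
fake_scores = {"greeting": 85, "refusal": 60}
posted = []

summary = run_single_pass(
    cases,
    run_model=lambda prompt: f"response to: {prompt}",
    score=lambda case, response: fake_scores[case.case_id],
    post_test_result=lambda case_id, value, passed: posted.append((case_id, value, passed)),
)
print(summary)  # {'passed': 1, 'failed': 1}
```

In practice the tally would come from get_regression_status rather than being computed locally; the local summary here only makes the sketch self-checking.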
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| workspaceId | Yes | | |
| threshold | No | Pass score, 0–100 | 70 |
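
As a rough sketch of how a caller might assemble arguments that match this schema, the helper below applies the default threshold of 70 and rejects values outside 0–100. `build_arguments` is a hypothetical name, and treating `workspaceId` as a non-empty string is an assumption, since the schema does not state its type.

```python
def build_arguments(workspace_id, threshold=70):
    """Return an arguments dict for run_regression_testsuite (illustrative only)."""
    # Assumption: workspaceId is a non-empty string; the schema only marks it required.
    if not workspace_id:
        raise ValueError("workspaceId is required")
    # The schema describes threshold as a pass score from 0 to 100.
    if not 0 <= threshold <= 100:
        raise ValueError("threshold must be between 0 and 100")
    return {"workspaceId": workspace_id, "threshold": threshold}


default_args = build_arguments("ws-example")
strict_args = build_arguments("ws-example", threshold=90)
```

The workspace ID `ws-example` is a placeholder, not a real identifier format.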