ai_red_team
Generate adversarial test cases for an AI system prompt. Produces a structured runbook with attack categories to identify vulnerabilities.
Instructions
Generate adversarial test cases for an AI system prompt (SEC-005).
Produces a red team runbook: specific adversarial inputs tailored to the system prompt, organized by attack category. Returns a structured framework for the host to generate the test cases.
Args: system_prompt: The system prompt to red team. num_test_cases: Number of test cases to generate (default 10, max 30).
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| system_prompt | Yes | ||
| num_test_cases | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |