guard_action
Evaluate proposed agent actions before irreversible spending, sending, or destructive commands. Returns a fail-closed decision (allow, review, block) with risk score, reasons, and safer alternative to prevent misuse.
Instructions
THE MONEY-LINE GATE. Decide whether a proposed agent ACTION should proceed — use this right before the agent SPENDS, SENDS, or commits something irreversible (a payment, an outbound message, a destructive command, a data share).
Fail-closed: it will not 'allow' what it cannot justify as safe — uncertainty escalates to review, real red flags block. Returns: decision (allow | review | block), an honest 0-1 risk, concrete reasons, specific concerns, and a safer alternative. This independent, signed verdict is the thing no free local check can give the agent at the moment money or irreversibility is on the line. (Maps to OWASP ASI02 Tool Misuse / ASI08 Cascading Failures — produces an audit artifact.)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| policy | No | ||
| context | No |