Run Evidence Judge Pass
session_evidence_judge_passEvaluate open checklist items against a draft using an LLM judge peer. Promotes items verified as satisfied, or run in shadow mode to collect judgment data without state changes.
Instructions
v2.9.0 LLM-based satisfied detection for the Evidence Broker. The configured judge peer reads each currently-open checklist item against the supplied draft and returns a structured judgment (satisfied + confidence + rationale). The runtime promotes only items where satisfied=true AND confidence='verified'; everything else stays open. Terminal operator statuses (satisfied/deferred/rejected) and items already addressed by resurfacing-inference are NEVER touched. Items per pass are capped via CROSS_REVIEW_EVIDENCE_JUDGE_MAX_ITEMS_PER_PASS (default 8). Optional item_ids filter narrows the pass to specific items; omit for all-open. The judge_peer is the LLM that performs the judgment — choose any peer with a configured API key. v2.10.0: optional shadow_mode (default false) routes the pass through a non-mutating path that emits session.evidence_judge_pass.shadow_decision events without touching checklist state — operators use it to collect empirical judgment-quality data before relying on active mutation.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| draft | Yes | ||
| round | No | ||
| caller | No | operator | |
| item_ids | No | ||
| judge_peer | Yes | ||
| session_id | Yes | ||
| shadow_mode | No | ||
| review_focus | No | ||
| response_format | No | json |