eval_discover
Retrieve a complete catalog of available evaluators, trap families, and test suites to plan your evaluation strategy without guessing tool names.
Instructions
Return the full machine-readable capability catalog.
Useful as a first call at session start — an agent can plan its evaluation strategy against the actual available evaluators rather than guessing or hallucinating tool names.
Returns: A dict with three top-level keys:
- ``evaluators``: every available multivon-eval evaluator,
with its tier, what inputs it needs, and (when shipped)
calibrated default thresholds per judge model.
- ``traps``: every pdfhell trap family, the failure mode each
elicits, and the expected_failure_mode metadata.
- ``suites``: every named pdfhell suite, the (trap_family,
seed_count) breakdown, and the suite_hash for the canonical
version.Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||