Multi-step Hypothesis Test
test_hypothesisRun predefined live checks to test a hypothesis and get a mechanical verdict: supported, refuted, or partially supported.
Instructions
Run a small verification plan made of concrete live checks and summarize whether a hypothesis is supported. Use this when one conclusion depends on multiple simple checks such as endpoint reachability, npm search counts, or whether a page contains an exact substring. This is a coordination tool, not an open-ended research agent: every test must be explicitly defined in advance, and tests run in order with no branching or early exit. The final verdict is mechanical: all tests passing => SUPPORTED, zero passing => REFUTED, otherwise PARTIALLY SUPPORTED. Use verify_claim when you already have evidence URLs, estimate_market for category sizing, and compare_competitors when you already know exact package names.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| hypothesis | Yes | Claim to test, for example 'there are fewer than 50 MCP email servers on npm'. | |
| tests | Yes | Ordered list of one to ten checks to run. Each test object uses only the fields required by its type. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| hypothesis | Yes | Hypothesis that was evaluated. | |
| tests | Yes | Per-test execution results in input order. | |
| verdict | Yes | High-level verdict for the hypothesis. |