Experiment Manage Tool
experiment_manage
Create, start, pause, resume, retry, or kill experiments. Validate state transitions and view costs, steps, or share public tokens.
Instructions
Experiments are the platform's core unit of work. Each experiment runs a workflow (DAG) through a 20-state machine. The main path is: Draft → Scoring → Planning → Building → AwaitingApproval → Approved → Executing → CollectingMetrics → Evaluating → (Iterating | Completed). Lifecycle transitions are validated by ExperimentTransitionMap; call valid_transitions to discover which transitions are allowed from the experiment's current state.
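The main path named above can be modelled as a small transition table. This is a minimal sketch that assumes only the forward transitions listed in this section; the contents of the real ExperimentTransitionMap are not shown here, and `HAPPY_PATH`, `allowed_next`, and `is_valid_transition` are hypothetical names used for illustration.

```python
# Hypothetical sketch: only the transitions named in the text above.
# The platform's ExperimentTransitionMap defines more states than shown here
# (paused, killed, and failure states are not modelled).
HAPPY_PATH = {
    "Draft": ["Scoring"],
    "Scoring": ["Planning"],
    "Planning": ["Building"],
    "Building": ["AwaitingApproval"],
    "AwaitingApproval": ["Approved"],
    "Approved": ["Executing"],
    "Executing": ["CollectingMetrics"],
    "CollectingMetrics": ["Evaluating"],
    "Evaluating": ["Iterating", "Completed"],
}


def allowed_next(state: str) -> list[str]:
    """Return the next states listed for `state`, or [] if none are listed."""
    return HAPPY_PATH.get(state, [])


def is_valid_transition(current: str, target: str) -> bool:
    """True if `target` is a listed successor of `current`."""
    return target in allowed_next(current)
```

In practice the valid_transitions action should be treated as the source of truth, since it reflects the full state machine rather than this partial table.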
Actions:
list (read) — optional: status, workflow_id, limit.
get (read) — experiment_id.
create (write) — name, hypothesis; optional workflow_id (else uses default workflow).
start (write) — experiment_id. Transitions Draft → Scoring; reserves budget.
pause / resume (write) — experiment_id. Pause holds at the current stage.
retry (write) — experiment_id. Re-runs the failed stage.
retry_from_step (write) — experiment_id, step_id. Graph-aware BFS reset of step + downstream.
kill (DESTRUCTIVE) — experiment_id. Terminal; cannot resume.
valid_transitions (read) — experiment_id. Allowed next states for current state.
cost / steps / share (read) — experiment_id. Cost breakdown / step list / public share token.
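The "graph-aware BFS reset" described for retry_from_step can be illustrated as a breadth-first walk over the workflow DAG that collects the chosen step and everything downstream of it. This is a sketch under stated assumptions, not the platform's implementation: the adjacency-dict representation, the function name `steps_to_reset`, and the sample step IDs are all hypothetical.

```python
from collections import deque


def steps_to_reset(edges: dict[str, list[str]], start: str) -> list[str]:
    """BFS from `start`, returning it and every step reachable downstream."""
    seen = {start}
    order = [start]
    queue = deque([start])
    while queue:
        step = queue.popleft()
        for child in edges.get(step, []):
            if child not in seen:  # a step with several parents is reset once
                seen.add(child)
                order.append(child)
                queue.append(child)
    return order


# Hypothetical DAG: a -> b, a -> c, b -> d, c -> d
dag = {"a": ["b", "c"], "b": ["d"], "c": ["d"]}
print(steps_to_reset(dag, "b"))  # ['b', 'd']
```

Retrying from `b` leaves `a` and `c` untouched, which is the point of a graph-aware reset: only the failed step and its dependents are re-run.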
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | Action to perform: list, get, create, start, pause, resume, retry, retry_from_step, kill, valid_transitions, cost, steps, share | |
| deadline_ms | No | Optional: max wall-clock time (ms) the tool may spend. If exceeded during the call, returns a DEADLINE_EXCEEDED error. Minimum 100 ms. Leave unset for no deadline. | |
| status | No | Filter by status: draft, scoring, planning, building, executing, completed, killed, paused, etc. | |
| limit | No | Max results to return (default 10, max 100) | 10 |
| experiment_id | Yes | The experiment UUID | |
| title | Yes | Experiment title | |
| thesis | No | Experiment thesis/hypothesis | |
| track | No | Experiment track: growth, retention, revenue, engagement (default: growth) | growth |
| budget_cap_credits | No | Budget cap in credits (default: 10000) | 10000 |
| step_id | Yes | The playbook step UUID to retry from | |
| reason | No | Reason for killing the experiment | |
| show_costs | No | Whether to show cost data in the public view (for update action) | |
| show_stages | No | Whether to show pipeline stages in the public view (for update action) | |
| show_outputs | No | Whether to show stage outputs in the public view (for update action) | |
| expires_at | No | ISO8601 expiry datetime after which the share link is invalid. Pass null to remove expiry. (for update action) | |
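
A client can check the documented constraints before sending a call. The sketch below assumes the arguments are passed as a flat dictionary keyed by the parameter names in the table; `build_arguments` is a hypothetical helper, and the lower bound of 1 on `limit` is an assumption (the table states only the maximum). Field requirements for `create` are deliberately left unchecked, because they depend on details this sketch does not assume.

```python
ACTIONS = {
    "list", "get", "create", "start", "pause", "resume", "retry",
    "retry_from_step", "kill", "valid_transitions", "cost", "steps", "share",
}
# Actions that the Actions list documents as taking experiment_id.
NEEDS_EXPERIMENT_ID = ACTIONS - {"list", "create"}


def build_arguments(action: str, **params) -> dict:
    """Validate and assemble an arguments dict for experiment_manage (sketch)."""
    if action not in ACTIONS:
        raise ValueError(f"unknown action: {action}")
    if action in NEEDS_EXPERIMENT_ID and "experiment_id" not in params:
        raise ValueError(f"{action} requires experiment_id")
    if action == "retry_from_step" and "step_id" not in params:
        raise ValueError("retry_from_step requires step_id")
    deadline = params.get("deadline_ms")
    if deadline is not None and deadline < 100:
        raise ValueError("deadline_ms must be at least 100")
    limit = params.get("limit")
    # Assumption: limit must be positive; the table documents only max 100.
    if limit is not None and not 1 <= limit <= 100:
        raise ValueError("limit must be between 1 and 100")
    return {"action": action, **params}
```

For example, `build_arguments("list", status="draft", limit=25)` returns a dictionary ready to send, while `build_arguments("get")` raises because `experiment_id` is missing.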