delimit_prompt_drift
Tracks how a prompt's performance changes over time per AI model. Record results, check drift, or rank models for specific task categories.
Instructions
Detect prompt drift across Claude / Codex / Gemini for the same task.
When to use: to track per-model prompt performance over time, or to rank models for specific task categories on your codebase. When NOT to use: to run a multi-model deliberation (use delimit_deliberate instead); drift tracks each model's behaviour separately.
Sibling contrast: delimit_deliberate runs a single question across several models; this tool tracks how a known prompt's results drift for each model.
Side effects: action="record" writes a result to the prompt-drift store via ai.prompt_drift.record_result. "check" and "rank" are read-only.
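To make the write/read split above concrete, here is a minimal in-memory sketch. Everything in it is an assumption for illustration: the `DriftStore` class, its method signatures, and the drift metric (later-half success rate minus earlier-half success rate) are hypothetical and are not the behaviour of ai.prompt_drift.record_result or of the real store, whose internals this page does not document.

```python
from collections import defaultdict


class DriftStore:
    """Hypothetical stand-in for the prompt-drift store (illustration only)."""

    def __init__(self):
        # (prompt, model) -> list of (task_type, success) in recording order
        self._results = defaultdict(list)

    def record(self, prompt, model, success, task_type=""):
        """Write path: the only action that mutates the store."""
        self._results[(prompt, model)].append((task_type, success))

    def check(self, prompt, model):
        """Read-only: later-half success rate minus earlier-half success rate."""
        outcomes = [ok for _, ok in self._results.get((prompt, model), [])]
        if len(outcomes) < 2:
            return 0.0  # not enough history to compare two halves
        mid = len(outcomes) // 2
        early, late = outcomes[:mid], outcomes[mid:]
        return sum(late) / len(late) - sum(early) / len(early)

    def rank(self, task_type):
        """Read-only: models ordered by success rate for one task category."""
        totals = defaultdict(lambda: [0, 0])  # model -> [successes, attempts]
        for (_, model), rows in self._results.items():
            for kind, ok in rows:
                if kind == task_type:
                    totals[model][0] += int(ok)
                    totals[model][1] += 1
        return sorted(totals, key=lambda m: totals[m][0] / totals[m][1], reverse=True)


store = DriftStore()
for ok in (True, True, False, False):
    store.record("Refactor this function", "claude", ok, "refactoring")
store.record("Refactor this function", "gemini", True, "refactoring")

print(store.check("Refactor this function", "claude"))  # -1.0
print(store.rank("refactoring"))  # ['gemini', 'claude']
```

In this sketch a negative value from `check` means the prompt succeeded less often in its recent runs than in its earlier ones for that model; the real tool may define and report drift differently.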
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| action | No | "record", "check" (default), or "rank". | check |
| prompt | No | Prompt text (for record / check). | |
| model | No | AI model name (required for record). | |
| result_summary | No | Brief description of the result (for record). | |
| success | No | "true" / "false" — whether the result was good. | true |
| task_type | No | Task category: "refactoring", "testing", "debugging", "docs". | |
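The table above can be turned into a small argument builder. This is a sketch under stated assumptions: `build_drift_args` is a hypothetical helper, not part of the tool, and how the resulting dictionary is sent to delimit_prompt_drift depends on your client. It encodes only what the table states: the default action is "check", model is required for "record", and success is passed as the string "true" or "false".

```python
VALID_ACTIONS = {"record", "check", "rank"}


def build_drift_args(action="check", prompt="", model="", result_summary="",
                     success=True, task_type=""):
    """Assemble an arguments dict for delimit_prompt_drift (hypothetical helper)."""
    if action not in VALID_ACTIONS:
        raise ValueError(f"action must be one of {sorted(VALID_ACTIONS)}")
    if action == "record" and not model:
        raise ValueError('model is required when action="record"')

    args = {"action": action}
    optional = {
        "prompt": prompt,
        "model": model,
        "result_summary": result_summary,
        "task_type": task_type,
    }
    # Send only the fields the caller actually filled in.
    args.update({key: value for key, value in optional.items() if value})

    if action == "record":
        # The schema documents success as the strings "true" / "false".
        args["success"] = "true" if success else "false"
    return args


record_args = build_drift_args(
    action="record",
    prompt="Refactor this function",
    model="claude",
    result_summary="Extracted helper, tests pass",
    success=True,
    task_type="refactoring",
)
rank_args = build_drift_args(action="rank", task_type="debugging")
```

Failing fast on a missing model keeps an incomplete "record" call from reaching the store, which matters because "record" is the only action with a side effect.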
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |