chimera_csm
Optimizes user input, estimates token cost, and proposes a budget for cost-effective AI responses.
Instructions
CALL FIRST on every message. The tool optimizes the input, estimates the token cost, and proposes a budget. Show proposal_text to the user for approval. After approval, constrain the response to max_output_tokens and use optimized_prompt as the effective input.
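The call order above can be sketched as follows. This is a minimal illustration, not the tool's actual client code: `handle_message`, `call_tool`, `ask_user`, and `generate` are hypothetical names supplied by the caller, and only `proposal_text`, `optimized_prompt`, and `max_output_tokens` come from the instructions. Returning `None` when the user declines is also an assumption, since the instructions do not say what happens in that case.

```python
from typing import Callable, Optional


def handle_message(
    prompt: str,
    call_tool: Callable[[str, dict], dict],  # hypothetical: invokes a tool by name
    ask_user: Callable[[str], bool],         # hypothetical: True if the user approves
    generate: Callable[[str, int], str],     # hypothetical: (input, token cap) -> reply
) -> Optional[str]:
    # Step 1: call chimera_csm before doing anything else with the message.
    result = call_tool("chimera_csm", {"prompt": prompt})

    # Step 2: show the proposal to the user and wait for approval.
    if not ask_user(result["proposal_text"]):
        # Assumption: a declined proposal produces no response.
        return None

    # Step 3: answer from the optimized prompt, capped at the proposed budget.
    return generate(result["optimized_prompt"], result["max_output_tokens"])
```

Passing the three collaborators in as callables keeps the sketch independent of any particular client library or model API.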
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | The user's raw input text to optimize and cost-estimate. | |
| messages | No | Conversation history as a list of {role, content} objects, used to count tokens for the full context. | |
| model | No | Model used for pricing. | claude-sonnet-4-6 |
| task_complexity | No | Controls the output token estimate. auto = detect from prompt keywords; simple = brief factual answer; moderate = explanation or how-to; complex = code, build, or design. | auto |
| focus | No | Optional task focus/query. Defaults to prompt when omitted. | |
| algorithm | No | Optimization algorithm. quantum = query-aware compression. classic = legacy rewrite-only compression. | quantum |
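As an illustration of the schema above, the following sketch builds an arguments object with the documented defaults. The helper name `build_arguments` is hypothetical, and the client-side checks are an assumption: the schema lists the allowed values for `task_complexity` and `algorithm` but does not say how the tool itself treats other values.

```python
from typing import Optional

# Allowed values as listed in the input schema.
TASK_COMPLEXITIES = {"auto", "simple", "moderate", "complex"}
ALGORITHMS = {"quantum", "classic"}


def build_arguments(
    prompt: str,
    messages: Optional[list] = None,
    model: str = "claude-sonnet-4-6",
    task_complexity: str = "auto",
    focus: Optional[str] = None,
    algorithm: str = "quantum",
) -> dict:
    """Hypothetical helper: assemble chimera_csm arguments from the schema."""
    # Assumption: reject values the schema does not list before calling the tool.
    if task_complexity not in TASK_COMPLEXITIES:
        raise ValueError(f"unknown task_complexity: {task_complexity!r}")
    if algorithm not in ALGORITHMS:
        raise ValueError(f"unknown algorithm: {algorithm!r}")

    args = {
        "prompt": prompt,
        "model": model,
        "task_complexity": task_complexity,
        "algorithm": algorithm,
    }
    # Optional fields are sent only when provided; per the schema,
    # focus defaults to prompt when omitted.
    if messages is not None:
        args["messages"] = messages
    if focus is not None:
        args["focus"] = focus
    return args
```

For example, `build_arguments("Explain TCP slow start", task_complexity="moderate")` yields a dictionary with the four always-present keys and no `messages` or `focus` entries.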