set_agent_budget
Define monthly inference budget for an agent. Set hard cap to block requests when exceeded, soft cap for warnings, and choose enforcement action.
Instructions
Set or update the monthly inference budget for an agent. When the agent exceeds the hard cap, the LLM proxy blocks further requests (429).
Args: agent_id: The agent identifier. monthly_limit_usd: Monthly spend limit in USD. soft_cap_pct: Percentage at which a warning is emitted (default 80). hard_cap_pct: Percentage at which requests are blocked (default 100). action: Enforcement action at hard cap — "reject" (default) or "warn".
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| action | No | reject | |
| agent_id | Yes | ||
| hard_cap_pct | No | ||
| soft_cap_pct | No | ||
| monthly_limit_usd | Yes |