get_ai_kpis
Runs all AI cost health metrics in one call, including prompt cache savings, context window waste, model sprawl, and prompt efficiency, delivering estimated monthly savings and specific remediation advice.
Instructions
Full AI cost health dashboard with actionable KPIs.
Runs all AI cost health metrics in one call:
Prompt cache hit rate and estimated savings (Anthropic)
Context window utilisation per model (are you paying for 200K context but using 2K?)
Model sprawl score (Herfindahl index of model concentration)
Peak usage day-of-week and weekend vs weekday patterns
Prompt efficiency (output/input token ratio, flags verbose or wrong-model usage)
Error spend estimate (tokens wasted on failed requests)
AI vs infrastructure spend ratio (benchmark: healthy SaaS = 5–15%)
Each finding includes an estimated monthly savings amount and specific remediation advice.
Args: days: Lookback window in days (default 30). infra_total_usd: Your total cloud infrastructure spend for the same period. Pass this to get AI-vs-infra ratio benchmarking.
Examples: - "Show me our AI cost health dashboard" - "What's our prompt cache hit rate?" - "Are we using the right AI models?" - "How efficient are our AI prompts?" - "What AI cost optimisations should we prioritise?"
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | ||
| infra_total_usd | No |