model_routing_recommendations
Finds opportunities to replace expensive models like Claude Sonnet 4 with cheaper Gemini 2.5 Flash for extraction or GPT-4o with GPT-4o-mini for short chats, providing 30-day cost savings.
Instructions
Suggestions to route some volume to cheaper providers/models — e.g., claude-sonnet-4 → gemini-2.5-flash for extraction-style work, gpt-4o → gpt-4o-mini for short chats. Each rec includes 30d estimated savings.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| window_hours | No | Window over which to find candidates (default 720 = 30 days) |