Score a prompt's quality across 8 dimensions BEFORE sending it to an expensive
model. Returns a 0-80 score, an A-F grade, the per-dimension breakdown
(clarity, specificity, context, constraints, output_format, role_definition,
examples, cot_structure), and the weakest dimension.
USE WHEN:
- The user is workshopping a prompt and asks "is this good?" / "will this
work?" / "should I add more detail?"
- The user is about to send a long or expensive prompt to GPT-4, Claude Opus,
or any frontier model, especially in a batch or automation context where
rework is costly.
- The user mentions iterating on a prompt that produced poor output and wants
to diagnose what's missing.
- The user pastes a prompt and asks for feedback on it.
DO NOT USE WHEN:
- The user is asking you to write a prompt for them (write it yourself first,
then optionally call score_prompt to verify).
- The prompt is conversational chat (this scores task-shaped prompts).
COST: Free, no API key required. Rate-limited per IP: 5/min, 10/day, 100/month.
If the user exceeds the limit, the response will include a structured upgrade
path with subscribe and account URLs.
LATENCY: ~2 seconds.