Skip to main content
Glama

kopern_list_grading_runs

Read-only

List grading runs to view score history, pass rates, and version changes over time for a suite. Track performance without incurring LLM costs.

Instructions

List grading runs for a suite. Shows score history, pass rates, and versions over time. No LLM cost.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
agent_idYesThe agent ID or name
suite_idYesThe grading suite ID
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true. Description adds value by specifying what historical data is shown (score history, pass rates, versions) and that there's no LLM cost, which is beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, zero wasted words. Front-loaded with action and resource, then details and a key differentiator.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequately covers purpose and a key trait (no cost). For a simple list tool without output schema, it's sufficient; lacks pagination/sorting info but not critical.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. Description only mentions 'for a suite' (suite_id) but adds no extra meaning for agent_id or parameter formats.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the action (list), resource (grading runs for a suite), and what it shows (score history, pass rates, versions). Distinguishes from siblings like kopern_run_grading and kopern_get_grading_results by noting 'No LLM cost'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides a hint about no LLM cost, suggesting it's a cheaper alternative, but does not explicitly state when to use this vs. other tools or mention exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/berch-t/kopern'

If you have feedback or need assistance with the MCP directory API, please join our Discord server