macalc
Server Details
The most comprehensive everyday calculator MCP server — 501 tools across 22 categories covering 8 countries' tax systems (FR, BE, CH, CA, US, UK, MA, SN). Finance, health, math, science, construction, conversions, education, sport, cooking, travel, and more. Free, no API key required. Streamable HTTP transport.
- Status
- Healthy
- Last Tested
- Transport
- Streamable HTTP
- URL
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 3.6/5 across 446 of 446 tools scored. Lowest: 1.8/5.
Many tools overlap in purpose, e.g., multiple cooking time calculators, several area/volume converters, and duplicate finance tools for different countries. An agent would struggle to choose the correct tool among similar options.
Most tools follow a calculate_verb_noun pattern, but some use convert_ and a few (list_bundles, get_bundle_tools) break the pattern. Inconsistencies like 'bpm_to_ms' vs 'convert_temperature' reduce predictability.
With 446 tools, the server is extremely bloated. It covers too many domains, making navigation and selection impractical. A typical agent would be overwhelmed and likely misselect tools.
Despite the huge number of tools, many subdomains have gaps (e.g., missing update/delete operations, incomplete CRUD for personal finance). The broad scope prevents any focused completeness.
Available Tools
446 toolscalculate_1rm_tableAInspect
Generate a full 1RM-to-12RM repetition table from a known lift using Epley formula. Returns: {input_weight, input_reps, estimated_1rm}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| reps | Yes | Repetitions performed at that weight | |
| weight | Yes | Weight lifted in kg or lbs |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It mentions the Epley formula and return fields but fails to disclose formula limitations (e.g., typical validity up to 12 reps) or assumptions about weight units. This gap reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences, front-loading the purpose. Every sentence adds value without unnecessary details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no annotations but existing output schema and sibling references, the description is fairly complete. However, it lacks explicit usage constraints (e.g., max reps for formula accuracy) and could better integrate with the schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the two parameters, and the description adds little beyond the schema's descriptions. The schema already explains weight and reps, so the description adds marginal value, earning a baseline 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: generating a 1RM-to-12RM repetition table using the Epley formula. It specifies the return fields and distinguishes itself from siblings like calculate_one_rep_max (single value) and list_bundles (related calculators).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a clear context for use and directs users to list_bundles for related 'sport' calculators, aiding in tool selection. However, it does not explicitly state when not to use this tool or provide specific exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_add_hoursAInspect
Add two time durations and return the total in hours and minutes. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| hours1 | Yes | First duration — hours | |
| hours2 | Yes | Second duration — hours | |
| minutes1 | Yes | First duration — minutes | |
| minutes2 | Yes | Second duration — minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. States core function but omits details like rounding, overflow handling, or error conditions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single concise sentence with action first, plus a reference for related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema present, so return values are documented. Lacks usage guidelines and behavioral details, but tool is simple and purpose is clear.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% description coverage, baseline 3. Description adds context about return format but no additional parameter-specific meaning.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clear verb ('Add'), resource ('time durations'), and return format ('total in hours and minutes'). References list_bundles for related tools, aiding differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or alternatives guidance. Only mentions 'See list_bundles for related calculators', which is vague.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ageCInspect
Calculate exact age in years, months and days from a birth date. Returns: {today, age_years, age_months, age_days, total_days_lived, days_to_next_birthday}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| birth_date | Yes | YYYY-MM-DD — Date of birth |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description must fully disclose behavioral traits. It states the output but omits details like read-only nature, error handling (e.g., invalid date), and any side effects. For a calculator tool, this is a gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences effectively convey purpose and output. The cross-reference to list_bundles is slightly tangential but not excessive. The description is front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple tool with one parameter and an output schema, the description covers basic functionality and return values. However, it lacks context on edge cases (e.g., future dates) and does not differentiate from the similar sibling calculate_age_in_units.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the single parameter, and the schema already provides format details. The description adds only the phrase 'from a birth date', which is redundant. No meaningful enhancement beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates exact age in years, months, and days from a birth date, and lists return fields. However, it does not explicitly distinguish from similar sibling tools like calculate_age_in_units, relying on a cross-reference to list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool vs. alternatives. The mention of list_bundles for 'temps-rh' calculators hints at related tools but does not provide criteria for choosing this one.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_age_in_unitsAInspect
Calculate exact age in multiple units from birth date. Returns: {weeks, hours, minutes, seconds}. See list_bundles for related 'fun' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| birth_date | Yes | Birth date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description bears full burden. It states a read-only calculation returning an object, but lacks detail on edge cases, errors, or required permissions. Minimal but adequate for a simple calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first covers purpose and output, second directs to related tools. No filler, front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Complete for a simple tool with one parameter and an output schema hinted in the description. Lacks mention of error conditions but sufficient for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema description coverage, baseline is 3. The description adds the return format but no extra insight into the birth_date parameter beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Calculate exact age in multiple units from birth date' and specifies the output format. It distinguishes from siblings like 'calculate_age' by mentioning 'fun calculators' and referencing 'list_bundles'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance. It points to 'list_bundles' for related calculators, but does not compare with direct sibling 'calculate_age' or other age tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_alcohol_unitsBInspect
Compute alcohol units (UK) and grams of pure alcohol in a drink. Use for health tracking and limit awareness. Inputs: volume mL, ABV %. Returns UK units, grams of pure alcohol, and standard-drink equivalent. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| drinks | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It states what the tool computes but does not explicitly disclose read-only behavior or other traits. Since it's a calculator, the description is adequate but lacks explicit transparency claims.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences and efficiently conveys purpose, usage, and input/output summary. It is front-loaded but could be slightly improved by detailing the input structure.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple calculator nature and presence of an output schema, the description provides adequate context. However, it fails to fully describe the input array format and the 'type' field, which are essential for correct use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, so the description must compensate. It mentions 'volume mL, ABV %' but does not describe the array structure, the required 'type' field, or that multiple drinks can be input. This leaves significant ambiguity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes UK alcohol units and grams of pure alcohol for health tracking. It is specific about the resource and action, but does not explicitly distinguish from sibling tools like 'calculate_blood_alcohol' or 'calculate_bac_points'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says to use it for health tracking and limit awareness, giving a clear usage context. However, it does not mention when not to use it or provide alternatives among sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_altitude_sicknessBInspect
Assess altitude sickness risk and recommend acclimatization schedule. Returns: {risk_level, risk_color, recommended_acclimatization_days, symptoms_to_watch, recommendations}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| altitude_m | Yes | Target altitude in meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses return fields but does not mention any side effects, permissions, or constraints beyond the schema. Adequate for a simple read-like calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first sentence states purpose and return, second points to related tool. No fluff or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple structure (1 param, no annotations) and presence of an output schema (though not shown), the description is adequate. However, it lacks context on when altitude_sickness is relevant versus other altitude or health tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Only one parameter with 100% schema coverage. The description adds no extra meaning beyond the schema's 'Target altitude in meters'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it assesses altitude sickness risk and recommends acclimatization schedule, with specific return fields. Verb 'assess' and resource 'altitude sickness' are explicit. Sibling tools are numerous but distinguished by subject matter.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use or not use this tool, nor any mention of alternatives. The reference to list_bundles for related voyage calculators is minimal and not about usage conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_angle_convertBInspect
Convert angles between degrees, radians, gradians and turns. Returns: {original}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Angle value | |
| to_unit | Yes | Target unit | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Describes the conversion operation adequately, but the return format is ambiguous with '{original}' placeholder.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, efficient. The '{original}' placeholder is unclear and could be improved, but overall concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given low complexity and existence of output schema, description is adequate but the placeholder for return value is incomplete and confusing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema descriptions cover all parameters (100% coverage). Description adds enumeration of units but that's already in the enum. No additional semantic value beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it converts angles between four specified units. However, it does not differentiate itself from the sibling convert_angle tool.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implied usage for angle conversion, but no explicit guidance on when to use this vs convert_angle. Reference to list_bundles suggests related conversion calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_annuity_paymentBInspect
Calculate periodic payment amount for a loan or annuity. Returns: {monthly_payment_eur, total_paid_eur, total_interest_eur}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rate | Yes | Annual interest rate percent | |
| periods | Yes | Number of payment periods (months) | |
| principal | Yes | Principal amount EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It discloses return values but does not mention any behavioral traits such as compounding assumptions, precision, or rate limits. For a calculation tool, this is adequate but not exceptional.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states purpose, second provides a pointer to related tools. No unnecessary words, efficiently communicates key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (3 parameters, output schema exists), the description minimally covers return values but lacks details on edge cases, calculation method, or parameter relationships. It is sufficient but not thorough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline applies. The description adds no extra meaning beyond schema field definitions, e.g., explaining how rate or periods are used in the calculation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates periodic payment for a loan or annuity, and lists return fields. However, it does not differentiate from similar sibling tools like calculate_loan_payment or calculate_mortgage.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only suggests seeing list_bundles for related calculators, providing no explicit guidance on when to use this tool versus alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_anythingAInspect
Universal AI-powered calculator — handles any calculation not covered by specialized tools. Requires premium subscription.
| Name | Required | Description | Default |
|---|---|---|---|
| query | Yes | Calculation request in natural language (English or French) | |
| context | No | Optional context: units, constraints, domain |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description only mentions AI-powered and premium requirement, lacking behavioral details like accuracy, speed, or error handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with no unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema covers return values; description is brief but could provide more context given the many sibling tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Adds value beyond schema by noting query language (English/French) and context purpose; schema coverage is 100%.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it's a universal calculator for any calculation not covered by specialized tools, distinguishing it from hundreds of siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies use when no specialized tool exists, but lacks explicit exclusion of alternatives; mentions premium subscription as a constraint.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_aquarium_volumeCInspect
Compute aquarium water volume in L and US gallons. Use for fishkeeping, dosing, and stocking decisions. Inputs: shape (rectangular/cylindrical/bow-front), L×W×H or radius×height in cm. Returns liters and gallons. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_cm | Yes | ||
| height_cm | Yes | ||
| length_cm | Yes | ||
| substrate_cm | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It states returns liters and gallons, but it does not mention any permissions, data integrity, rate limits, or other behavioral aspects. The input description is inconsistent with the schema (shape/radius vs. dimensions), which undermines trust.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences, each serving a purpose: purpose, usage, inputs/returns, and reference to related calculators. It is relatively concise but could be more efficient by directly aligning with the schema.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 4 parameters, no schema descriptions, and no annotations, the description should be thorough. Instead, it introduces non-existent inputs (shape, radius) and omits the substrate parameter entirely. This makes the tool unusable without guessing, so completeness is very poor.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 0% description coverage, so the description must compensate. It mentions 'L×W×H or radius×height in cm' but the schema only has length, width, height—no radius or shape. This adds confusion. It does specify 'in cm' which is helpful, but overall it fails to clarify the parameters' meanings correctly.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes aquarium water volume in L and US gallons, specifying the domain (fishkeeping, dosing, stocking). The verb 'compute' and resource 'aquarium water volume' are specific and distinguish it from generic volume calculators like calculate_volume or calculate_pool_volume.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for fishkeeping, dosing, and stocking decisions,' providing context. However, it describes inputs as 'shape (rectangular/cylindrical/bow-front), L×W×H or radius×height in cm,' but the actual schema only includes length_cm, width_cm, height_cm, and substrate_cm—no shape or radius parameter. This discrepancy is misleading and fails to guide proper use. No alternatives or exclusions are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_areaBInspect
Compute area for common 2D shapes (rectangle, triangle, circle, trapezoid, etc.). Use for geometry, real estate, or paint estimates. Inputs: shape + dimensions. Returns area in input-units squared. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| d1 | No | Diagonal 1 for rhombus | |
| d2 | No | Diagonal 2 for rhombus | |
| side | No | Side for hexagon | |
| shape | Yes | Shape type | |
| width | No | Width | |
| height | No | Height | |
| length | No | Length or base | |
| radius | No | Radius | |
| semi_major | No | Semi-major axis for ellipse | |
| semi_minor | No | Semi-minor axis for ellipse |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description bears full responsibility for behavioral disclosure. It states the tool returns area but does not discuss rate limits, mutation (though read-only is implied), potential errors if required dimensions are missing, or how multiple shapes are handled beyond the enum. The description is too brief to fully inform an AI agent of behavioral nuances.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is exceptionally concise: one sentence for purpose, two short statements for inputs/outputs, and a pointer to related tools. Every sentence contributes value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (10 parameters, 7 shapes), the description covers the main purpose, input structure, and output units. The schema handles parameter-shape associations. It lacks details on error handling or dimension requirements, but an output schema exists to supplement return value clarity. Overall, it is mostly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters having descriptions. The description adds 'Inputs: shape + dimensions' but does not provide additional meaning beyond what the schema already offers. Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes area for common 2D shapes, lists examples (rectangle, triangle, circle, trapezoid), and mentions use cases (geometry, real estate, paint estimates). It does not explicitly differentiate from sibling tools like calculate_perimeter or calculate_volume, but the purpose is specific and well-defined.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage via examples and mentions 'See list_bundles for related math calculators,' but it does not provide explicit guidance on when to use this tool versus alternatives, nor does it state when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_auto_entrepreneurAInspect
Calculate French auto-entrepreneur (micro-enterprise) net income and social charges. Returns: {social_charges_rate_pct, social_charges, abatement_fiscal_pct, taxable_income_approx, net_before_tax, cfe_estimate_eur}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| revenue | Yes | Annual revenue (chiffre d'affaires) in euros | |
| category | No | Activity category: vente (sales), service_bic, service_bnc, liberal | service_bnc |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It describes the calculation and return format well but does not disclose side effects (e.g., external API calls, rate limits, or authentication requirements).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: the first clearly states purpose and output, the second directs to related tools. No wasted words, front-loaded with essential information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists (though not shown), the description is largely complete. It references related tools via list_bundles, but could briefly explain the category enum values for clarity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds value by explaining the output structure beyond the schema, but does not elaborate on parameter semantics beyond what's in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates French auto-entrepreneur net income and social charges, and lists the specific output fields. This distinguishes it from the many sibling calculation tools by specifying the exact French micro-enterprise context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for French auto-entrepreneur calculations and directs to list_bundles for related calculators, but does not explicitly state when to use vs. alternatives or provide exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_averageCInspect
Compute simple, weighted, geometric, or harmonic mean. Use for grade averages, returns, or rates. Inputs: values list, optional weights, mode. Returns mean and detail. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| values | Yes | Array of numbers | |
| weights | No | Optional weights for weighted average |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It minimally states it returns 'mean and detail' but does not disclose behavioral traits such as handling of empty arrays, missing values, default modes, or any side effects. The tool likely has more behavioral nuance (e.g., how weights are applied, mode defaults) that is not described.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences plus a reference) and front-loads the core purpose. However, it includes unnecessary words ('Compute simple, weighted, geometric, or harmonic mean') and could be leaner. The structure is adequate but not exceptional.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the large number of sibling tools and the presence of an output schema, the description should distinguish this tool more clearly (e.g., from calculate_statistics or calculate_percentage). It does not explain return value structure beyond 'mean and detail', and the missing 'mode' parameter indicates incompleteness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the two parameters (values, weights), but the description introduces a 'mode' parameter that does not exist in the schema, causing confusion. The description restates basic schema info (values list, optional weights) without adding significant meaning beyond what the schema provides. The misleading mode mention lowers the score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes multiple types of means (simple, weighted, geometric, harmonic) and gives concrete use cases like grade averages, returns, or rates. It uses a specific verb ('compute') and resource ('mean'). However, the mention of a 'mode' parameter not present in the schema slightly impairs clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides use cases ('grade averages, returns, or rates') and suggests seeing list_bundles for related calculators, offering some context. But it lacks explicit guidance on when not to use this tool or how it differs from siblings like calculate_statistics or calculate_grade_average.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bac_pointsAInspect
Estimate French Baccalauréat final score from grades and coefficients. Use to track lycée results before official scores. Inputs: grades by subject with coefficients. Returns weighted average and mention. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| grand_oral | Yes | Grand oral (/20, coeff 10) | |
| philosophy | Yes | Philosophy (/20, coeff 8) | |
| specialty1 | Yes | Specialty 1 (/20, coeff 16) | |
| specialty2 | Yes | Specialty 2 (/20, coeff 16) | |
| french_oral | Yes | French oral exam (/20, coeff 5) | |
| french_written | Yes | French written exam (/20, coeff 5) | |
| continuous_control | Yes | Continuous assessment score (/720 = 40%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must cover behavioral traits. It states inputs and return (weighted average and mention) but does not disclose behavior for invalid inputs, edge cases, or any potential pitfalls. It adequately describes core behavior but misses transparency around error handling or assumptions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with purpose. Every sentence contributes: purpose, usage, input/output summary, and a pointer to related tools. No wasted words, efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has an output schema, so the description's mention of return values (weighted average and mention) is helpful but not strictly required. It covers the core functionality adequately for 7 required parameters. Could mention the grading scale or if it handles different Bac series, but overall complete enough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with each parameter described (subject, max score, coefficient). The description adds little beyond summarizing inputs as 'grades by subject with coefficients'. Given high schema coverage, the description adds marginal value, meeting the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates the French Baccalauréat final score from grades and coefficients. It specifies the target exam (Bac) and that it's for tracking lycée results. However, it does not explicitly differentiate from sibling tools like calculate_brevet_points or calculate_parcoursup_points, though the context of Bac is distinct.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context ('track lycée results before official scores') and hints at related calculators via 'See list_bundles'. It lacks explicit guidance on when not to use this tool or clear alternatives, relying on implied usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_baking_altitudeAInspect
Adjust baking recipe for high altitude (less leavening, more liquid, higher temp). Use for mountain cooking. Inputs: altitude m, ingredients. Returns adjusted recipe. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| altitude_m | Yes | Altitude in meters | |
| flour_cups | Yes | Flour in cups | |
| sugar_cups | Yes | Sugar in cups | |
| liquid_cups | Yes | Liquid in cups | |
| oven_temp_c | Yes | Oven temperature °C |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description carries full burden. It discloses key behavioral traits (adjusting leavening, liquid, temperature) but does not elaborate on whether the adjustment is proportional or absolute, nor does it mention the structure of the returned recipe.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Very concise, front-loading the main action and context. Every sentence adds value, with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers purpose, usage, and references. With 5 parameters and an output schema, the description is mostly complete, though it could briefly explain the adjustment method or mention that altitude is in meters (already in schema).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so descriptions already provide meaning. The tool description adds only a generic 'ingredients' mention, which adds minimal value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states verb 'adjust', resource 'baking recipe', and specific context 'high altitude'. Differentiates from sibling calculators by mentioning 'mountain cooking' and referencing 'cuisine' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for mountain cooking', providing clear context. Directs to list_bundles for related calculators, but does not explicitly state when not to use this tool or name specific alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_baking_conversionCInspect
Convert baking measurements between cups, tablespoons, grams, and milliliters for common ingredients. Use for translating recipes across regions. Inputs: ingredient, value, from-unit, to-unit. Returns converted quantity. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| ingredient | Yes | ||
| quantity_cups | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description is misleading: it lists 'value, from-unit, to-unit' as inputs, but the actual schema only includes 'ingredient' and 'quantity_cups'. This mismatch undermines transparency and could cause incorrect usage. The description does not clarify behavior like exact conversion, rounding, or density assumptions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences) and front-loaded with the purpose. However, the inclusion of a reference to 'list_bundles' adds minor noise. It is concise but omits critical details, making it incomplete rather than efficiently informative.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the mismatch between description and schema, and the omission of crucial parameter information, the description is highly incomplete. It fails to explain how to use the actual parameters or interpret the output, despite having an output schema. Sibling differentiation is insufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, and the description contradicts the schema by listing parameters ('value, from-unit, to-unit') that do not exist while failing to explain the actual parameters ('ingredient' and 'quantity_cups'). The enum for ingredient is not mentioned either. This provides no meaningful semantic help.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts baking measurements between cups, tablespoons, grams, and milliliters for common ingredients, with a specific use case of translating recipes across regions. This distinguishes it from generic conversion tools like 'convert_cooking' and 'calculate_cooking_conversion' among siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a usage guideline ('Use for translating recipes across regions'), which gives context for when to use the tool. However, it does not specify when not to use it or mention alternative tools like 'convert_cooking' for general cooking conversions, limiting its guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_base_converterCInspect
Convert numbers between bases (binary, octal, decimal, hexadecimal, any base 2-36). Returns: {input, decimal_value, result}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Number as string | |
| to_base | Yes | Target base | |
| from_base | Yes | Source base |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description should fully disclose behavioral traits. It only states the return structure (input, decimal_value, result) but omits information on input validation, error handling, supported formats (e.g., negative numbers, fractions), or performance considerations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief at two sentences plus a returns note. It is front-loaded with the core purpose, but the added 'see list_bundles' note is tangential and could be omitted.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with three parameters and an output schema (not shown but mentioned). The description partially explains return fields, but lacks differentiation from the very similar sibling 'calculate_number_base_convert', and does not cover edge cases or expected input examples.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage for all three parameters. The description does not add any extra meaning beyond the schema, such as expected string format for 'value' or base ranges, which are already in the schema constraints.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts numbers between bases 2-36, listing common bases. However, it does not distinguish itself from the sibling tool 'calculate_number_base_convert', which likely serves a similar purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool vs alternatives such as 'calculate_number_base_convert'. The only note is to 'see list_bundles', which is generic and not specific to this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_beam_loadAInspect
Calculate max bending moment and shear for a beam under uniform distributed load. Returns: {load_kN_per_m, max_moment_kNm, max_shear_kN, note}. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| span_m | Yes | Beam span in meters | |
| beam_type | No | Support type | simply_supported |
| load_kg_per_m | Yes | Distributed load in kg/m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present, so the description must convey behavioral traits. It describes the output fields (load_kN_per_m, max_moment_kNm, max_shear_kN, note) and the input parameters implicitly. For a calculation tool, this is sufficient as it has no side effects or destructive actions. The description does not contradict any annotations (none exist).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences. The first sentence encapsulates the tool's purpose and output format. The second provides a pointer to related tools. No fluff or redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists and the input schema fully describes parameters, the description is largely complete. It mentions the output fields and references related tools. A slight gap is that it could clarify that the load input is in kg/m for clarity, but the schema already states that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, with each parameter having a clear description (e.g., 'Beam span in meters'). The tool description does not add additional parameter semantics beyond what the schema provides. A score of 3 is appropriate as the schema already handles parameter documentation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate max bending moment and shear for a beam under uniform distributed load.' It specifies the resource (beam) and the specific calculations. The sibling guidance ('See list_bundles for related construction calculators') further distinguishes it from other calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions related calculators via 'See list_bundles for related construction calculators', which implies a grouping but does not explicitly state when to use this tool versus others. No conditions or alternatives are provided, leaving the agent to infer usage context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_car_advantageAInspect
Calculate Belgian benefit-in-kind for company car (avantage de toute nature voiture). Returns: {co2_reference, benefit_rate_pct, annual_taxable_benefit, monthly_taxable_benefit, estimated_monthly_tax_impact}. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| co2 | Yes | CO2 emissions in g/km | |
| fuel_type | No | Fuel type | essence |
| catalog_value | Yes | Catalog value of the vehicle (HTVA) in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so description carries full burden. It discloses return structure and implies a read-only computation. Does not mention authentication or rate limits, but for a simple calculation tool, the disclosure is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states purpose and output, second references related tool. No fluff, information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no explicit output schema, the description covers return values. Lacks edge cases or default behavior details, but for a straightforward calculation tool with 3 parameters, it is largely complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. Description adds no additional semantics beyond the schema's field descriptions. Does not explain formulas or relationships between parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it calculates Belgian benefit-in-kind for company car, specifying the calculator and listing return fields. The name and description align, and it is distinct among many sibling 'calculate' tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides minimal guidance: mentions related 'finance-belgique' calculators via list_bundles. No explicit when-to-use or when-not-to-use, but the tool's specialized nature makes usage somewhat obvious.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_donationBInspect
Compute Belgian donation tax (droits de donation) by region (Bruxelles/Flandre/Wallonie). Use for estate planning in Belgium. Inputs: region, amount, recipient relation. Returns tax due and effective rate. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Donation amount in euros | |
| relationship | Yes | Relationship: direct_line (parents/children), between_spouses (or cohabitants), others |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It does not disclose any behavioral traits such as whether the tool is read-only, destructive, or requires authentication. It only mentions returned values.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. The key action and context are front-loaded. It efficiently communicates purpose and provides a cross-reference.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Even though an output schema exists, the description omits critical information: the region parameter that is mentioned is not in the schema, causing incompleteness. Additionally, no guidance on prerequisites or edge cases is provided.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'region' as an input, but the schema does not include a region parameter. This is misleading and contradicts the schema. The description does not add helpful semantic meaning beyond the schema; instead, it introduces confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute Belgian donation tax (droits de donation) by region', using a specific verb and resource. It distinguishes itself from numerous Belgian tax siblings by focusing specifically on donation tax.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises 'Use for estate planning in Belgium', providing clear context. It also recommends 'See list_bundles for related calculators', guiding the agent to an alternative for related tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_income_taxBInspect
Calculate Belgian personal income tax (IPP/PB) using 2026 progressive brackets. Returns: {income, tax_free_amount, taxable_base, income_tax, effective_rate_pct, marginal_rate_pct, ...}. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income | Yes | Annual taxable income in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided. The description does not disclose behavioral traits such as idempotency, safety, or side effects. As a calculator, it likely is read-only, but this is not stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first covers purpose, brackets year, and output fields; second directs to related tools. No fluff, efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with one parameter. The description covers purpose and output, but lacks usage guidelines and behavioral transparency. It is adequate for basic invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the schema already describes the parameter as 'Annual taxable income in euros'. The description adds no extra meaning beyond that, so the baseline of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates Belgian personal income tax using 2026 progressive brackets, and specifies the output fields. This distinguishes it from other tax tools, though it doesn't explicitly contrast with siblings like calculate_belgian_salary.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The mention of list_bundles is a weak hint but does not provide exclusion criteria or conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_pensionBInspect
Estimate Belgian retirement pension based on career years and average salary. Use for retirement planning in Belgium. Inputs: years of contribution, average salary, status (employee/self-employed). Returns monthly pension estimate. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| career_years | Yes | Number of career years | |
| average_salary | Yes | Average annual salary in euros | |
| household_type | No | Pension type: single rate (60%) or household rate (75%) | single |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It states the tool returns a monthly pension estimate but does not disclose assumptions, accuracy limitations, or any side effects. It does not mention that the tool is a safe read operation or if any data is saved. Lacks behavioral detail expected for a financial estimation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at 4 sentences, front-loading the purpose and listing inputs. It is well-structured without extraneous information. The only minor issue is the parameter mismatch which could have been clarified.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists (not shown but indicated), the description does not need to detail the return format. However, it omits explaining the impact of the household_type parameter (e.g., single rate vs household rate) which is important for users. The pointer to related calculators is helpful but not fully complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the descriptions in the schema already document each parameter. The description adds 'status (employee/self-employed)' which does not match the schema's 'household_type' (single/household). This adds some value by implying a real-world concept but introduces confusion due to mismatch. Overall, marginal additional meaning.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates Belgian retirement pension, lists inputs (career years, average salary, status), and mentions output. However, it inaccurately describes the third parameter as 'status' (employee/self-employed) whereas the schema uses 'household_type' with enum values single/household. This minor mismatch slightly reduces clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for retirement planning in Belgium' which implies its context. It does not provide explicit when-to-use or when-not-to-use guidance, nor distinguishes from the many sibling calculation tools. The mention of 'list_bundles' is a vague pointer to related calculators but no direct alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_salaryAInspect
Convert Belgian gross monthly salary to net salary (approximation). Returns: {gross_monthly, social_cotisations_13_07pct, special_social_contribution, professional_withholding_tax, net_monthly, net_annual, ...}. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gross_monthly | Yes | Gross monthly salary in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It discloses the approximation nature and partially lists outputs, but does not cover all behavioral aspects like performance, resource consumption, or full output structure. The '...' indicates incompleteness.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two clear sentences front-load the main purpose and efficiently cover output hints and sibling reference. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of salary calculations and lack of an output schema (only referenced), the description is incomplete. It lists some output fields but ends with '...' and does not explain each component. Adequate but leaves gaps for an agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% for the single parameter. The description adds no additional semantic value beyond restating the parameter's role. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: converting Belgian gross monthly salary to net salary. It specifies the input and lists output fields. It distinguishes from siblings by referencing a related bundle, but could better differentiate from very similar tools like calculate_belgian_income_tax.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The mention of 'See list_bundles for related calculators' hints at grouping but does not provide clear decision rules or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_social_contributionsAInspect
Calculate Belgian self-employed social contributions (cotisations INASTI). Returns: {annual_income, tier1_up_to_73850_at_20_5pct, tier2_73850_to_108785_at_14_16pct, tier3_above_108785, total_contributions, effective_rate_pct}. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_income | Yes | Annual net professional income in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description compensates by detailing the return structure (fields and tiers). However, it does not disclose side effects, auth requirements, or rate limits, though the nature of the tool suggests read-only calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the main action and return structure, and ends with a reference to related tools. No wasted words, efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple single-parameter input and explicit return fields listed, the description is largely complete. It references a bundle for related tools, but could briefly mention prerequisites or typical usage scenarios.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema already provides a clear description for the single parameter 'annual_income'. The description adds context by explaining the calculation's purpose and tiers but does not add significant semantics beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action 'Calculate' and the specific resource 'Belgian self-employed social contributions (cotisations INASTI)'. It mentions the return fields and references a related bundle, distinguishing it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for Belgian self-employed social contributions but does not explicitly state when to use it versus other calculators. It only hints at related tools via 'See list_bundles', lacking clear exclusion or alternative guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_belgian_vatBInspect
Calculate Belgian VAT — convert between HT and TTC. Returns: {amount_ht, amount_ttc, vat_amount, vat_rate}. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | Input mode: ht=before tax, ttc=after tax | ht |
| rate | No | VAT rate: 6%, 12% or 21% | 21 |
| amount | Yes | Amount in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It mentions conversion between HT and TTC and the return object, but does not disclose any behavioral traits such as precision, edge cases, or that this is a standard deterministic calculation. The return format is redundant if an output schema exists.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose and return fields. No unnecessary words; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and complete schema coverage, the description is adequate but lacks usage guidelines and behavioral transparency. With an output schema present, the return object mention is redundant, but overall it covers the basics.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds minimal extra meaning beyond the schema (e.g., hints at mode via 'convert between HT and TTC'), but no further detail on parameter constraints or usage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates Belgian VAT and converts between HT and TTC. It names the specific operation and return fields, distinguishing it from other VAT calculators through the explicit 'Belgian' qualifier.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus sibling VAT calculators (e.g., calculate_french_vat, calculate_vat_generic). The only hint is a reference to list_bundles for related tools, but no explicit when-to-use or when-not-to-use information.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_biorhythmAInspect
Compute physical, emotional, and intellectual biorhythm cycles for a date based on birth date. Use for self-tracking enthusiasts (pseudoscience). Inputs: birth date, target date. Returns 3 cycle values (-100 to +100) and zone. See list_bundles for related 'fun' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| birth_date | Yes | Birth date YYYY-MM-DD | |
| target_date | Yes | Target date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses that it returns 3 cycle values (-100 to +100) and zone, but does not mention side effects, authorization needs, or determinism. For a calculation tool, this is adequate but could be more explicit about its read-only nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, each serving a clear purpose: stating the computation, providing audience context, and listing inputs/outputs. It is front-loaded and contains no filler, making it highly concise and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (context signal), the description adequately covers purpose, inputs, and output summary. It references list_bundles for related tools, which adds completeness. Slight deduction for not mentioning additional context like typical use cases or limitations beyond pseudoscience.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% as both parameters have descriptions in the schema. The description merely restates 'Inputs: birth date, target date.' without adding further meaning, validation, or format details. Baseline 3 is appropriate since schema already provides sufficient semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes physical, emotional, and intellectual biorhythm cycles. It uses a specific verb 'Compute' and resource 'biorhythm cycles', and distinguishes it from sibling tools by labeling it as pseudoscience for self-tracking enthusiasts, hinting at list_bundles for related fun calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for self-tracking enthusiasts (pseudoscience)', which provides clear context for when to use this tool. It suggests seeing list_bundles for related calculators, offering an alternative. However, it does not explicitly state when not to use it or exclude other scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_blood_alcoholBInspect
Estimate blood alcohol content (BAC) using Widmark formula. Returns: {bac_percent, legal_status_fr, estimated_sober_in_hours}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sex | Yes | Biological sex | |
| drinks | Yes | Number of standard drinks (1 drink = 14g pure alcohol) | |
| weight_kg | Yes | Body weight in kilograms | |
| hours_drinking | Yes | Hours elapsed since first drink |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. Description only mentions formula and output format, but does not disclose behavioral traits like safety, privacy, or destructiveness.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences covering purpose and output, no unnecessary information. Could be slightly more structured but very efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Adequate for a simple calculator with full schema coverage and output schema presence, but lacks mention of prerequisites or assumptions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage 100% with descriptions for all parameters. Description adds no additional parameter semantics beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Specifically states estimation of blood alcohol content using Widmark formula, clearly identifying the verb, resource, and method.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implied usage for estimating BAC but lacks explicit when-to-use vs alternatives. Mentions related calculators but does not guide selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bmiCInspect
Calculate Body Mass Index (BMI) and weight category. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height_cm | Yes | Height in centimeters | |
| weight_kg | Yes | Weight in kilograms |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as side effects, safety, or statelessness. The agent cannot infer whether this is a pure calculation or has side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (one sentence plus a useful reference) and front-loaded with the action. It earns its place without wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple 2-param tool with output schema, the description covers the basic purpose and output (BMI and weight category). It is adequate but minimal, lacking any behavioral context due to missing annotations.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema descriptions already cover the parameters (height_cm, weight_kg) with min values. The tool description adds no extra parameter context but leverages the output schema implicitly. Baseline 3 given 100% schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates BMI and weight category, providing specific verb and resource. However, it does not explicitly differentiate from similar siblings like calculate_body_fat or calculate_bmr, relying only on a reference to list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool vs alternatives. The reference to list_bundles hints at related calculators but does not give explicit exclusions or usage context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bmrAInspect
Calculate Basal Metabolic Rate using Mifflin-St Jeor equation. Returns: {bmr_kcal}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | Age in years | |
| sex | Yes | Biological sex | |
| height_cm | Yes | Height in centimeters | |
| weight_kg | Yes | Weight in kilograms |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must disclose behavioral traits. It reveals the calculation method (Mifflin-St Jeor) and return format ({bmr_kcal}), which is transparent for a deterministic computation. However, it omits details like error handling or assumptions beyond input schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with zero waste: first states purpose and equation, second provides return format and a pointer to related tools. Perfectly front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (compute BMR) and the presence of an output schema, the description is complete. It covers purpose, method, and output, and references related calculators for further context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters documented. Description adds no new meaning to parameters beyond what the schema already provides, meeting the baseline for high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Calculate Basal Metabolic Rate using Mifflin-St Jeor equation', specifying the verb, resource, and formula. It distinguishes from sibling calculators by naming the exact equation and referencing related 'sante' calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives (e.g., calculate_tdee). The mention of list_bundles implied context but lacks direct differentiation or exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_body_fatAInspect
Estimate body fat percentage from BMI, age and sex using Deurenberg equation. Returns: {body_fat_pct}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | Age in years | |
| bmi | Yes | Body Mass Index | |
| sex | Yes | Biological sex |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description carries the burden. It mentions the equation and return value but doesn't disclose that the tool is read-only or any side effects. For a simple calculation, this is adequate but not exceptional.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: first states the purpose, second mentions the return format and related tools. No wasted words; properly front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the low complexity and existence of an output schema, the description is sufficient. It names the equation and return key. However, it does not explain the output schema structure (but that's the schema's job).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and each parameter already has a clear description. The tool description adds no extra meaning beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses a specific verb ('Estimate') and resource ('body fat percentage'), and distinguishes from the sibling 'calculate_body_fat_navy' by naming the Deurenberg equation. The result format is also mentioned.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description clearly states the inputs (BMI, age, sex) and points to related tools via 'list_bundles'. It implicitly differentiates from the Navy method, but could be more explicit about when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bpm_to_msAInspect
Convert BPM tempo to millisecond delay times for different note values. See list_bundles for related 'musique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bpm | Yes | Tempo in beats per minute | |
| note_value | Yes | Musical note value to convert |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It states the conversion function but does not mention safety (read-only), authentication needs, or error cases. For a simple calculation, this is acceptable but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words: first sentence states purpose, second sentence points to related tools. Perfectly concise and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of a simple conversion tool with an output schema, the description is mostly complete. It lacks behavioral context due to missing annotations but covers the core functionality and relationship to other tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% and both parameters have clear descriptions in the schema. The tool description adds no new meaning beyond 'for different note values', which is already implied. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly specifies the verb 'Convert' and the resource 'BPM tempo to millisecond delay times for different note values'. It differentiates from siblings by referencing related 'musique' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description directs users to list_bundles for related calculators, implying when to find alternatives. However, it does not explicitly state when to use this tool over others or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_braking_distanceBInspect
Compute reaction + braking distance by road condition (dry/wet/icy). Use for driver safety education. Inputs: speed km/h, reaction time s, road type. Returns total stopping distance m. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| condition | No | Road | dry |
| speed_kmh | Yes | Speed km/h |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries the full burden. It clearly indicates the tool computes reaction and braking distance and returns total stopping distance. However, it does not disclose assumptions, limitations, or potential errors, but for a calculator, the behavioral disclosure is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences and front-loads the purpose. However, the inaccuracy regarding inputs detracts from its effectiveness. The structure is adequate but compromised by the error.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description claims inputs that are not in the schema, making it incomplete for correct usage. While it mentions the output and refers to list_bundles, the critical error in parameter description leaves gaps in understanding what the tool actually requires.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'reaction time s' as an input, but the input schema does not include a reaction_time parameter. This misrepresents the parameter set and could mislead the agent. Despite 100% schema coverage, the description adds incorrect information, severely undermining parameter semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes reaction and braking distance based on road condition, with specific inputs (speed, road condition) and output (total stopping distance). The mention of driver safety education provides context, and the reference to list_bundles for related calculators helps differentiate from the many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use for driver safety education but does not explicitly state when to use this tool over alternatives. The reference to list_bundles hints at related calculators but lacks clear guidance on when not to use this tool or what alternatives exist.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_braquetCInspect
Compute cycling gear ratio (braquet) and development per pedal turn. Use for road cycling gear analysis. Inputs: chainring teeth, sprocket teeth, wheel diameter mm. Returns ratio and meters per pedal turn. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cog_teeth | Yes | Number of teeth on the rear cog | |
| chainring_teeth | Yes | Number of teeth on the front chainring |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It mentions inputs and outputs but contradicts the schema by including 'wheel diameter mm' as an input, which is not an actual parameter. This inconsistency reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and front-loaded with the core purpose. The reference to list_bundles is slightly extraneous but does not detract significantly. However, the misleading input mention is a flaw.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description lacks completeness due to the erroneous input reference and does not cover all necessary context, such as the exact output format beyond 'ratio and meters per pedal turn'. With an output schema presumably present, the description should align accurately.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Although schema coverage is 100% with descriptions for both parameters, the description adds 'wheel diameter mm' as an input that is not in the schema, which is misleading. The description does not enhance meaning beyond the schema and introduces confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes cycling gear ratio and development per pedal turn, with a specific use case for road cycling. However, it does not distinguish from similar sibling tools like calculate_gear_ratio or calculate_velo_development, which could cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description specifies 'Use for road cycling gear analysis', providing clear context. It does not include when not to use or explicit alternatives, though it references list_bundles for related calculators, which is minimal guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bra_sizeAInspect
Calculate bra size in FR, US or UK system from underbust and bust measurements (cm). Returns: {band_size, size, cup_diff_cm}. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| system | Yes | ||
| bust_cm | Yes | ||
| underbust_cm | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool calculates bra size from measurements, which is a simple computation with no side effects. It does not disclose any additional behavioral traits such as edge case handling or precision, but for a calculator tool this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loading the purpose and input/output format. The second sentence provides a pointer to related tools. Every sentence adds value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the low complexity (three parameters, simple calculation), the description is fully complete. It explains what the tool does, the required inputs, the system options, and the return format. The output schema is effectively described inline, and a reference to related tools is provided.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema has 0% description coverage, so the description must compensate. It specifies that measurements are in cm and lists the system options (FR/US/UK), adding meaning beyond the schema's raw field names and constraints. However, it does not provide details on decimal handling or acceptable ranges beyond the schema's 'exclusiveMinimum'.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates bra size in FR, US, or UK systems from underbust and bust measurements in cm, and specifies the return format. It references 'list_bundles' for related calculators, but does not explicitly differentiate from 'calculate_bra_size_convert', which is a sibling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal guidance on when to use this tool. It mentions 'See list_bundles for related textile-mode calculators' but does not explicitly state when to choose this tool over alternatives like 'calculate_bra_size_convert' or other specific calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bra_size_convertAInspect
Convert bra size between FR, US, UK and EU systems. Returns: {FR, US, UK, EU}. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cup | Yes | Cup letter (A, B, C, D, DD, E, F) | |
| band_size | Yes | Band size in source system (numeric) | |
| from_system | Yes | Source sizing system |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It specifies the return format ({FR, US, UK, EU}) implying a read-only conversion, but lacks details on error handling, edge cases, or any computational assumptions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading the core purpose and output, then directing to related resources. Every sentence is useful and there is no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple conversion tool with complete schema and output schema, the description adequately covers inputs, outputs, and related tools. Minor omission: return type structure is implied but not formally specified.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds no additional meaning beyond the schema; it merely repeats the system names already present in the enum.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the verb 'Convert' and resource 'bra size' between systems, and mentions the return format. However, it does not directly differentiate from sibling 'calculate_bra_size' which may be a related but distinct tool.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description suggests seeing list_bundles for related calculators but provides no explicit guidance on when to use this tool versus alternatives like calculate_clothing_size_convert or calculate_shoe_size_convert.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_bread_hydrationAInspect
Compute baker's hydration % = water/flour×100. Use for bread recipe analysis. Inputs: flour g, water g. Returns hydration % and dough type (firm/standard/wet). See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| flour_grams | Yes | Flour weight grams | |
| water_grams | Yes | Water weight grams |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present, so the description carries the full burden. It discloses return values (hydration % and dough type) and implies no side effects via 'compute', but does not explicitly state read-only nature or other behavioral traits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first defines purpose and formula, second lists inputs and outputs with a pointer to related tools. No extraneous words; every sentence is valuable.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given low complexity and an existing output schema (not shown but implied), the description covers the essential computation and return types. A minor gap is no explanation of dough type categories, but likely covered by output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions already explaining the parameters (flour and water grams). The description ('Inputs: flour g, water g') adds minimal new meaning beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes baker's hydration percentage using the formula water/flour×100, specifies its use for bread recipe analysis, and distinguishes from siblings by referencing related cuisine calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides a clear use case ('bread recipe analysis') and indirectly guides to alternatives by mentioning list_bundles for related calculators, but lacks explicit when-not-to-use or exclusionary guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_break_evenBInspect
Compute break-even point in units and revenue. Use for business plans and pricing decisions. Inputs: fixed costs, price/unit, variable cost/unit. Returns break-even units and revenue. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| fixed_costs | Yes | Total fixed costs | |
| price_per_unit | Yes | Selling price per unit | |
| variable_cost_per_unit | Yes | Variable cost per unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the outputs (break-even units and revenue) but does not disclose potential limitations, edge case behavior, or any side effects. For a computation tool, it is minimally informative.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose and output, and ends with a related bundle reference. No wasted words or unnecessary details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the existence of an output schema, the description covers the essential purpose, inputs, and outputs. However, it lacks details on edge cases (e.g., zero or negative profit scenarios) which could affect interpretation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description reiterates the inputs in text, adding little meaning beyond what the schema already provides. It does not explain how parameters relate to the calculation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes break-even point in units and revenue, with a specific business context (business plans and pricing decisions). It references a related bundle for similar calculators, which slightly differentiates siblings, though not explicitly.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides a use case ('Use for business plans and pricing decisions') but lacks explicit when-not-to-use guidance or alternative tools. The mention of list_bundles is a weak alternative reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_breeding_due_dateBInspect
Compute due date for animal breeding given mating date and species gestation period. Use for breeders. Inputs: species (dog/cat/horse/rabbit/cow), mating date. Returns due date and gestation milestones. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| animal | Yes | ||
| mating_date | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool returns due date and gestation milestones. However, it does not disclose date constraints (e.g., mating date must be in past), whether any side effects occur (likely none), or how unsupported species are handled. The species list inconsistency could confuse agents about valid inputs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at 24 words and front-loads the purpose. It has a clear structure: purpose, target user, inputs, outputs, and reference to related tools. Almost no wasted words. Could be slightly improved with bullet points for clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and presence of an output schema, the description adequately covers the core functionality. However, it does not mention date validation, optional behavior, or handle the species discrepancy. It references list_bundles for related tools, which is good. Overall adequate but with gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, so the description must add meaning. It lists the two inputs: species and mating date, and mentions acceptable species values (though with error). It explains their roles but omits format for mating date (schema provides pattern) and constraints. Provides basic semantics but incomplete.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: compute due date for animal breeding given mating date and species. It uses specific verbs and resources. However, there is a discrepancy: the description lists species including horse and cow, but the schema enum only includes dog, cat, rabbit, hamster. This inconsistency slightly undermines clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only says 'Use for breeders,' which is minimal. No guidance on when to use this tool versus the numerous species-specific siblings like calculate_dog_pregnancy or calculate_cat_pregnancy. No when-not-to-use or alternatives are provided. The reference to list_bundles is vague.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_brevet_pointsAInspect
Estimate French Brevet (DNB) score from grades and continuous-control marks. Use for collège students forecasting their result. Inputs: grades by subject, continuous control. Returns total points and mention. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| math | Yes | Math exam score (/100) | |
| oral | Yes | Oral exam score (/100) | |
| french | Yes | French exam score (/100) | |
| science | Yes | Science score (/50) | |
| history_geo | Yes | History-Geography score (/50) | |
| socle_commun | Yes | Socle commun points (50-400) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses that the tool returns total points and mention, but lacks details on side effects, authentication, rate limits, or error handling. For a stateless calculation tool, this is adequate but not exemplary.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with the primary purpose. Every sentence adds value: purpose, usage suggestion, and pointer to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a calculation tool with 6 parameters and an output schema (mentioned but not shown), the description covers the basics: inputs, purpose, and return type. It could expand on calculation logic or edge cases, but it is sufficient for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the input schema already documents all parameters with descriptions. The description adds a general statement about inputs but does not provide additional semantics beyond what the schema offers. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates French Brevet (DNB) scores from grades and continuous-control marks, specifying the educational context (collège students). It distinguishes itself from siblings like calculate_bac_points by mentioning 'education' calculators and referencing list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear guidance ('Use for collège students forecasting their result') and directs users to list_bundles for related calculators. However, it does not explicitly exclude alternative tools like calculate_bac_points or specify when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_brick_countAInspect
Compute bricks or blocks needed for a wall including waste margin. Use for masonry projects. Inputs: wall dimensions, brick size, waste %. Returns brick count and pallets. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| type | No | Brick type | parpaing |
| height_m | Yes | Wall height m | |
| length_m | Yes | Wall length m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden; it clearly states the tool computes and returns results, implying read-only behavior with no side effects, meeting transparency needs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded with purpose, but the listing of inputs that don't match the schema reduces clarity; overall efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists and description mentions return values, but it omits details like default waste margin and how brick type affects sizing, leaving gaps for a complete understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions, so baseline is 3, but the description mentions 'brick size' and 'waste %' as inputs, which are not in the schema, causing confusion and detracting from value added.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes bricks or blocks for a wall including waste margin, and directs to list_bundles for related calculators, distinguishing it from numerous sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description specifies 'Use for masonry projects' and suggests seeing list_bundles for other construction calculators, providing context but not explicit exclusions or alternative tool names.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_buoyancyAInspect
Compute buoyancy force, displaced volume, or floating analysis (Archimedes). Use for physics or shipping. Inputs: object mass/volume, fluid density. Returns buoyant force and float/sink verdict. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| volume_m3 | Yes | Object volume m³ | |
| object_mass | Yes | Object mass kg | |
| fluid_density | No | Fluid density kg/m³ |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Describes inputs and output (buoyant force and float/sink verdict) but does not disclose additional behavioral traits like side effects or limits. Since annotations are absent, the description carries the burden and is adequate but not exhaustive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences with no superfluous words. Front-loaded with purpose and key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema, the description sufficiently explains inputs, output, and usage context. Also references a related bundle for completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. The description summarizes inputs but adds no significant meaning beyond the schema. Meets baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it computes buoyancy force, displaced volume, or floating analysis using Archimedes principle, specifying domains physics or shipping. Distinguishes from other calculate_ tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Suggests usage for physics or shipping and references list_bundles for related science calculators. Lacks explicit when-not-to-use or alternatives but provides sufficient context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_burn_rateAInspect
Compute startup monthly burn rate and runway. Use for fundraising or expense control. Inputs: cash on hand, monthly expenses, monthly revenue. Returns burn, runway in months, profitability date. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cash_balance | Yes | Cash in bank EUR | |
| monthly_revenue | No | Monthly revenue EUR | |
| monthly_expenses | Yes | Monthly expenses EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool computes and returns values (burn, runway, profitability date), implying read-only behavior. However, it does not explicitly declare it as non-destructive or safe, which is important for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, with two sentences and a line listing inputs/outputs. It is front-loaded with the main action. Slightly more structure (e.g., separating inputs/outputs) could improve readability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with an output schema, the description covers purpose, inputs, and outputs adequately. It does not explain edge cases (e.g., negative cash balance) but is sufficient for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers all three parameters (cash_balance, monthly_revenue, monthly_expenses) with descriptions, achieving 100% coverage. The description restates inputs but adds minimal new information. The mention of return values (burn, runway, profitability date) is helpful but does not enhance parameter understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: 'Compute startup monthly burn rate and runway.' It specifies the verb 'compute,' the resource 'burn rate and runway,' and provides usage context ('Use for fundraising or expense control'). It distinguishes from siblings by referencing related calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear use cases ('fundraising or expense control') and lists inputs. However, it lacks explicit when-not-to-use guidance or alternative tools. The mention of list_bundles for related calculators offers some context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cable_sectionAInspect
Compute electrical cable cross-section (mm²) per NF C 15-100. Use for residential wiring. Inputs: power kW, voltage, length, max voltage drop %. Returns required section. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| power_w | Yes | Power W | |
| voltage | No | Voltage | |
| length_m | Yes | Cable length m | |
| max_drop_pct | No | Max voltage drop % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears the burden. It mentions the NF C 15-100 standard, adding regulatory context, but does not disclose assumptions (e.g., copper vs aluminum, temperature) or the exact calculation method. The return value is simply 'required section' without further detail.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief with two main sentences plus a reference. It front-loads the core purpose. However, the unit mismatch and minor repetition reduce clarity slightly.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given an output schema exists, the description need not detail return values but only says 'Returns required section.' It references list_bundles for related tools but lacks deeper context on calculation prerequisites or edge cases. Completeness is adequate for a straightforward calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with parameter descriptions. The tool description lists inputs but misstates power_w as 'power kW' when the schema expects watts (W). This inconsistency could confuse the agent. The description adds little meaning beyond the schema entries.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes electrical cable cross-section in mm² per the NF C 15-100 standard, for residential wiring. This specific verb and resource with a recognized standard distinguishes it from sibling tools like 'calculate_cable_section_electrical' that may not specify the standard.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises using for residential wiring and references list_bundles for related calculators, but does not explicitly state when not to use it or mention alternatives. The context (French standard) implies scope but lacks exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cable_section_electricalCInspect
Calculate cable section from power, voltage, distance and max voltage drop. Returns: {current_a, allowed_drop_v, calculated_section_mm2, recommended_mm2}. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| power_w | Yes | Power in watts | |
| voltage | No | Voltage (default 230V) | |
| distance_m | Yes | One-way cable distance in meters | |
| max_drop_pct | No | Max voltage drop % (default 3) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses the return structure but fails to mention important behavioral details such as assumptions (single-phase vs three-phase, AC/DC, cable material, temperature), side effects, or required permissions. This leaves significant gaps in understanding the tool's behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with only two sentences. It front-loads the primary action and includes the return structure. However, the pointer to list_bundles adds a minor detour. Overall, it earns its keep with minimal waste.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of electrical calculations, the description lacks critical context about assumptions (e.g., cable type, system configuration) and limitations. It does not mention output schema details beyond a brief tuple. For a tool with many siblings, more context is needed to ensure correct usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes all parameters with types, defaults, and descriptions. The description adds no additional semantic information about the parameters beyond what is in the schema, so baseline score 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates cable section from power, voltage, distance, and voltage drop. It distinguishes from siblings by mentioning it's a 'science' calculator and referencing list_bundles, but could be more explicit on what makes it unique among many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives. The mention of list_bundles for related calculators is vague and doesn't help an agent decide when to use this specific tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cac_ltv_ratioAInspect
Compute Customer Acquisition Cost vs Lifetime Value ratio. Use for SaaS unit economics analysis (target ≥3.0). Inputs: total CAC, LTV. Returns ratio and health verdict. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cac | Yes | Customer acquisition cost EUR | |
| ltv | Yes | Customer lifetime value EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It states it computes ratio and returns ratio and health verdict, which is transparent. No hidden side effects are implied. The behavioral traits are adequately disclosed for a simple calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with three sentences covering purpose, usage context, and related tools. Every sentence carries value with no redundancy or fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (implied by 'Has output schema: true'), the description need not detail return values but still mentions 'ratio and health verdict'. It provides useful context like SaaS unit economics and target ratio. Minor missing details (e.g., parameter constraints) are already covered by the schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with both parameters (cac, ltv) already described in the schema. The description simply reiterates 'Inputs: total CAC, LTV' without adding new semantic details beyond the schema, meeting the baseline expectation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute Customer Acquisition Cost vs Lifetime Value ratio', a specific verb+resource. It adds context for SaaS unit economics and provides a target benchmark (≥3.0), distinguishing it from the many sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for SaaS unit economics analysis' and even provides a target ratio. It directs to list_bundles for related finance calculators, offering a pathway to alternative tools. However, it does not explicitly state when not to use, but the context is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_caffeine_clearanceAInspect
Compute remaining caffeine in body over time using 5-hour half-life. Use for sleep planning. Inputs: caffeine mg, time since intake. Returns remaining mg and clearance forecast. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mg | Yes | Caffeine mg consumed | |
| hours | No | Hours since consumption |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses the 5-hour half-life, inputs (mg, hours), and outputs (remaining mg, clearance forecast). This is sufficient for a non-destructive calculator, though lacks details on accuracy or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences: function, use case, inputs/outputs, related tools. No fluff, every sentence adds value. Well front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the existence of an output schema and 100% schema coverage, the description is fairly complete. It identifies inputs and outputs and points to related calculators. It could elaborate on 'clearance forecast' but likely covered in output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline 3. The description restates parameters ('caffeine mg, time since intake') but adds no new meaning beyond the schema's descriptions. It does contextualize the purpose but not the parameters themselves.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes remaining caffeine over time using a 5-hour half-life, which is a specific verb+resource. It distinguishes itself from siblings like calculate_caffeine_half_life by focusing on clearance computation for sleep planning.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for sleep planning,' providing a clear use case. It also directs users to related 'sante' calculators via list_bundles, though it does not explicitly exclude other scenarios or compare with direct siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_caffeine_half_lifeAInspect
Calculate remaining caffeine in body after time elapsed. Returns: {hours_to_below_25mg, safe_to_sleep}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| hours_since | Yes | Hours since consumption | |
| mg_consumed | Yes | Caffeine consumed mg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description should disclose behavioral traits. It states the calculation and return fields but does not mention that it is a safe, read-only operation or any assumptions (e.g., standard half-life). Basic transparency but missing details.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, no fluff. Front-loaded with action and returns, then a pointer to related tools. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers purpose and output format. Given no annotations and no output schema provided (though context says it exists), it is adequate for a simple calculator. Could mention the underlying model (e.g., standard half-life of 5 hours) but not critical.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters (mg_consumed, hours_since). The description adds return field context but does not enhance parameter meaning beyond schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Calculate remaining caffeine in body after time elapsed' and specifies return fields (hours_to_below_25mg, safe_to_sleep). It distinguishes from a large set of sibling calculators by its specific topic and hints at categorization ('sante' calculators via list_bundles).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies a health context by referencing 'sante' calculators but does not explicitly state when to use this tool versus alternatives or any prerequisites. No guidance on when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_caffeine_intakeAInspect
Track caffeine intake against the safe daily limit (400 mg adult). Use for monitoring coffee/tea/soda consumption. Inputs: list of drinks (type, quantity). Returns total mg, % of limit, time-of-day distribution. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| drinks | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. It describes the output (total mg, % of limit, time-of-day distribution) but lacks details about prerequisites, side effects, or data storage. It implies a non-destructive calculation but is not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences with no fluff. It front-loads the purpose, then gives usage, inputs/outputs, and a referral. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (not shown), the description need not detail return values, but it introduces 'time-of-day distribution' which is not supported by the input schema (no time field). It also lacks clarity on units. These gaps reduce completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 0% schema description coverage, the description compensates well by explaining the input as a list of drinks with type and quantity, and tying it to the safe limit. It adds context beyond the schema (e.g., 400 mg limit, output values), though it omits units for quantity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool tracks caffeine intake against a safe limit and lists specific beverages (coffee, tea, soda). It distinguishes itself from siblings like calculate_caffeine_clearance by focusing on consumption monitoring.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for monitoring coffee/tea/soda consumption' and directs users to list_bundles for related calculators. However, it does not specify when not to use this tool, such as for metabolic tracking.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_calories_burnedBInspect
Estimate calories burned during physical activity using MET values. Returns: {calories_burned}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| activity | Yes | Type of activity | |
| weight_kg | Yes | Body weight in kilograms | |
| duration_minutes | Yes | Duration in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the use of MET values for estimation, implying a standard calculation method. However, it does not detail limitations (e.g., accuracy, assumption of average demographics, or required units beyond schema). No annotations exist to provide additional safety or behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at two sentences, but the return placeholder '{calories_burned}' lacks units or structure, reducing clarity. It is efficient but not optimally informative.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple estimation tool with 3 required parameters and no nested objects, the description covers the basic function and return. However, lacking output schema details (units, format) and usage examples leaves it slightly incomplete for fully autonomous agent invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers 100% of parameters with descriptions, so the description adds minimal value. The reference to MET values provides semantic context for the activity parameter, but overall param meanings are well-documented in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates calories burned using MET values, with a specific verb and resource. However, it does not explicitly differentiate from sibling tools like calculate_bmr or calculate_dog_walking_calories, though the mention of 'sante' bundle provides some context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives. The reference to list_bundles is vague and does not specify selection criteria or scenarios where this tool is preferable.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_canada_combined_taxAInspect
Calculate combined Quebec + federal income tax with the Quebec federal abatement (16.5%). Returns: {income_cad, quebec_provincial_tax, federal_gross_tax, federal_abatement_qc, federal_net_tax, total_combined_tax, ...}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income_cad | Yes | Annual income in CAD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, placing full burden on the description. It mentions return fields but does not disclose whether the tool is idempotent, has side effects, or any limitations such as rate limits or authentication needs. For a calculation tool, this is a significant gap in transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences covering purpose, key detail (abatement percentage), return fields, and a pointer to related tools. It is efficient but could benefit from better structuring of the return fields list.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity (single parameter, no nested objects), the description is mostly complete. It explains what it calculates, lists return fields, and references a bundle for related calculators. It does not elaborate on edge cases or format, but the provided schema covers the input constraints.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a single parameter 'income_cad' described in the schema. The description does not add any additional meaning or context beyond what the schema already provides, meeting the baseline score as per rules.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates combined Quebec and federal income tax with the Quebec federal abatement (16.5%). It differentiates from sibling tools like 'calculate_canada_federal_tax' and 'calculate_quebec_income_tax' by specifying the combination and abatement percentage, and references a related bundle.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The purpose is clear for when to use this tool (combined tax calculation). It references a bundle for related calculators, providing context. However, it does not explicitly state when not to use it or list alternatives, leaving some ambiguity for an agent deciding between this and the separate federal/Quebec tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_canada_eiBInspect
Calculate Canadian Employment Insurance (EI) premiums for Quebec and non-Quebec residents. Returns: {gross_annual_cad, max_insurable_earnings, insurable_earnings, employee_rate_pct, employee_premium}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| province | No | Province: QC (Quebec rate) or other (standard rate) | QC |
| gross_annual_cad | Yes | Gross annual insurable earnings in CAD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as read-only nature, idempotency, or dependency on current rates. It only states the computation and return fields, leaving the agent to infer safety.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose and scope, concise list of return fields, and a pointer to related tools. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
While the output schema covers return values, the description lacks details on calculation assumptions (e.g., annual max insurable earnings, rate validity). It is adequate but minimal for a financial calculation tool with many siblings.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds no new meaning beyond the schema for parameters; it merely lists return fields. Baseline 3 is appropriate as schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates Canadian EI premiums for Quebec and non-Quebec residents, and lists return fields, which is specific and unambiguous. However, it does not explicitly differentiate from sibling tools like other Canadian tax calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use (for EI premium calculation by province) and references list_bundles for related calculators, but lacks explicit when-not-to-use guidance or comparison to alternatives like calculate_canada_federal_tax or calculate_canada_rrq.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_canada_federal_taxBInspect
Calculate Canadian federal income tax (CRA) with basic personal amount deduction. Returns: {income_cad, basic_personal_amount, taxable_income, federal_tax, effective_rate_pct, marginal_rate_pct, ...}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income_cad | Yes | Annual income in Canadian dollars (CAD) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears full responsibility for behavioral disclosure. It states the tool includes the basic personal amount deduction and lists some return fields, but does not disclose assumptions, limitations, tax year applicability, or behavior for edge cases like negative income. This is insufficient for a financial calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise, using only two sentences. The first sentence clearly states the purpose and key feature, and the second lists return fields with a cross-reference. Every sentence is meaningful and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity and the presence of an output schema (mentioned), the description provides a reasonable overview. It lists key return fields and hints at related tools. However, it lacks explicit mention of the tax year and does not fully differentiate from provincial or combined tax tools, making it adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already provides a description for the only parameter (income_cad), achieving 100% coverage. The description adds no additional semantic value beyond the schema, maintaining the baseline score of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: calculate Canadian federal income tax with the basic personal amount deduction. It uses a specific verb ('Calculate') and resource ('Canadian federal income tax (CRA)'), and adds a distinguishing detail ('with basic personal amount deduction') that helps differentiate from related tax calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not provide explicit guidance on when to use this tool versus its siblings (e.g., calculate_canada_combined_tax, calculate_quebec_income_tax). It only vaguely references 'list_bundles' for related calculators, which is insufficient for an agent to determine the appropriate tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_canada_rrqBInspect
Calculate Quebec Pension Plan (RRQ) contributions for employee. Returns: {gross_annual_cad, rrq_base_earnings, rrq_contribution_tier1, rrq_additional_earnings, rrq_contribution_tier2, total_rrq_contribution, ...}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gross_annual_cad | Yes | Gross annual earnings in CAD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It only states the calculation action and output fields, but does not disclose whether it is read-only, requires authentication, has rate limits, or any side effects. For a calculation tool, being read-only is implied but not confirmed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: one sentence for purpose and one for related tools. It is front-loaded with the core action. The second sentence is informative but not essential.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter) and presence of an output schema (implied by listed fields), the description covers the basic purpose and output. However, it lacks usage context and behavioral details, making it minimally complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents 'gross_annual_cad' with its description. The tool description adds no additional semantic value beyond what the schema provides, achieving the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates Quebec Pension Plan (RRQ) contributions for an employee, which is a specific verb+resource combination. This distinguishes it from the many other calculate_ tools via the specific reference to 'Canada RRQ' and 'Quebec Pension Plan'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lacks explicit when-to-use or when-not-to-use guidance. It only hints at related calculators via 'See list_bundles for related finance-afrique-quebec calculators', but does not compare with sibling tools like calculate_canada_combined_tax or calculate_canada_ei.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_capital_gains_propertyBInspect
Compute French property capital gains tax (plus-value immobilière) including duration abatements. Use for sellers of secondary residence. Inputs: purchase price, sale price, years held, work cost. Returns tax due, social charges, net gain. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sale_price | Yes | Sale price in euros | |
| years_held | Yes | Number of years the property was held | |
| purchase_price | Yes | Original purchase price in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It mentions inputs and outputs but introduces a critical inconsistency: describes 'work cost' as input, which is not in the input schema. This misrepresents behavior and reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Concise and front-loaded, two sentences plus reference. However, the inclusion of incorrect parameter info (work cost) undermines efficiency. Each sentence should be accurate; this one is not.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Mentions return values (tax due, social charges, net gain), but missing work cost parameter in schema leads to incomplete context. Output schema exists, so return details aren't fully required, but parameter inconsistency harms completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. However, description adds no useful meaning beyond schema and actually introduces a false parameter (work cost). This reduces reliability and adds confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool computes French property capital gains tax with duration abatements, specifies use case for sellers of secondary residence, and lists inputs/outputs. Distinguishes from siblings by referencing list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context: 'Use for sellers of secondary residence.' Does not explicitly exclude primary residence or other scenarios, nor list alternative tools directly, but references list_bundles for related calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_carbon_footprintAInspect
Estimate annual personal carbon footprint (tCO₂e). Use for sustainability awareness. Inputs: housing, transport, diet, lifestyle. Returns total emissions and breakdown. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| kwh | No | Electricity kWh/year | |
| km_car | No | Car km/year | |
| km_plane | No | Flight km/year | |
| meat_kg_week | No | Meat kg/week |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It states it 'estimates' and 'returns total emissions and breakdown,' implying a read-only calculation. However, it fails to explicitly confirm no side effects, idempotence, or prerequisites, leaving uncertainty about its behavioral profile.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with purpose and unit, then inputs and outputs. Efficient and no redundancy. Could include one more sentence on behavior without losing conciseness, but current structure is very good.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists, the description adequately covers purpose, input categories, and output summary. It also references related calculators. However, missing explicit behavioral transparency (e.g., read-only) leaves a gap, making it moderately complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with individual descriptions, so baseline is 3. The description adds value by grouping parameters into categories (housing, transport, diet, lifestyle) and mentioning the output breakdown, enriching the semantics beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates annual personal carbon footprint in tCO₂e, specifies the verb 'Estimate' and resource 'carbon footprint', and provides unit. It distinguishes from sibling tools by mentioning 'sustainability awareness' and pointing to 'list_bundles' for related calculators, leaving no ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for sustainability awareness,' giving a clear context. However, it does not explicitly state when not to use this tool or provide direct alternatives among siblings (e.g., 'calculate_carbon_sequestration'). The pointer to 'list_bundles' is indirect guidance, not a clear exclusion.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_carbon_sequestrationBInspect
Estimate CO2 sequestration by trees over their lifetime. Returns: {age_factor, annual_kg_co2_per_tree, annual_kg_co2_total, lifetime_kg_co2, lifetime_tonnes_co2, equivalent_cars_off_road_1yr}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| count | No | Number of trees (default 1) | |
| age_years | Yes | Age of the trees in years | |
| tree_type | Yes | Species of tree |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It mentions output fields but does not disclose assumptions, data sources, limitations, or any behavioral traits beyond basic estimation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences front-loaded with purpose and output fields. Every sentence adds value; no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists (not shown but noted), so return values are covered. However, description lacks usage context, assumptions, or accuracy notes for a complex environmental calculation. Adequate but not thorough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage for all 3 parameters with descriptions. The description adds no additional meaning beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the verb 'estimate' and resource 'CO2 sequestration by trees' over lifetime, with specific output fields listed. It distinguishes itself from the many other calculate_* sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance. The only hint is a reference to list_bundles for related calculators, but no clear alternatives or context for selecting this tool over others.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_card_draw_probabilityBInspect
Calculate hypergeometric probability of drawing specific cards from a deck. Returns: {odds_one_in}. See list_bundles for related 'jeux-probabilites' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| deck_size | No | Total number of cards in the deck (default 52) | |
| draw_count | Yes | Number of cards drawn | |
| target_cards | Yes | Number of target cards wanted in the draw | |
| cards_in_deck_matching | Yes | Number of target cards in the deck |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It fails to mention that the tool is read-only, has no side effects, or requires no special permissions. The return format hint is helpful but insufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two efficient sentences: first defines purpose, second points to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists but is not shown; description mentions return key 'odds_one_in'. However, it lacks details on edge cases (e.g., invalid probabilities) and interpretation. Adequate for a simple calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameters with descriptions, so the description adds marginal value. It does provide the return format '{odds_one_in}', which aids understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it calculates hypergeometric probability for card draws, distinguishing it from related probability tools like dice or poker. Mentions a bundle for further context, but could be more explicit about its unique scope.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives (e.g., binomial probability, dice probability). Only a reference to the 'jeux-probabilites' bundle, which is indirect.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_car_depreciationAInspect
Calculate car residual value: Y1:-25%, Y2:-15%, Y3:-10%, Y4-5:-8%, Y6+:-5%. Returns: {residual_value, total_dep, pct_lost}. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age_years | Yes | Age in years | |
| purchase_price | Yes | Original price |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses the depreciation percentages and return object structure. No annotations exist, so description carries full burden; it adds appropriate behavioral context for a non-destructive read-like calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, both informative. No filler. Front-loaded with critical depreciation rates.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with 2 parameters and an output schema, the description covers all essential aspects: input constraints (age, price), calculation logic (rates), and return fields.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers both parameters with descriptions. Description adds value by explaining the depreciation schedule and output format, going beyond parameter names.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clear verb 'Calculate' and specific resource 'car residual value', with explicit yearly depreciation rates. Distinguishes from numerous sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides a reference to 'list_bundles' for related calculators, giving context. No explicit when-not or alternative selection guidance, but sufficient for a straightforward financial calculator.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_car_lease_vs_buyAInspect
Compare leasing vs buying a car over the same period. Use for automotive purchase decisions. Inputs: car price, lease monthly cost, loan rate, ownership years. Returns total costs and recommendation. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| car_price | Yes | Car purchase price EUR | |
| loan_rate | Yes | Loan annual rate percent | |
| loan_months | Yes | Loan duration months | |
| lease_months | Yes | Lease duration months | |
| lease_monthly | Yes | Monthly lease payment EUR | |
| residual_value | Yes | Car residual value at lease end EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must fully disclose behavior. It mentions inputs like 'ownership years,' but the schema actually requires 'lease_months' and 'loan_months' (both in months). It omits 'residual_value' entirely and does not clarify that both lease and loan durations are separate parameters. This mismatch undermines transparency for a tool with 6 required parameters.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief (three sentences) and front-loads the core purpose. Each sentence serves a clear role: purpose, use context, inputs/outputs, and pointer to related tools. No filler or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite the tool having 6 required parameters, the description only covers 4 of them and uses inaccurate terms ('ownership years' instead of months). It fails to mention critical inputs like 'residual_value' and 'loan_months.' While an output schema exists, the description is incomplete for guiding parameter entry.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description adds marginal value by listing main inputs in prose and stating that the tool 'Returns total costs and recommendation.' However, it inaccurately mentions 'ownership years' instead of months and omits 'residual_value,' reducing its usefulness beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Compare leasing vs buying a car over the same period.' It specifies the domain ('automotive purchase decisions') and the verb 'compare' accurately captures the action. There is no ambiguity, and it distinguishes from siblings like calculate_car_depreciation by focusing on lease vs buy comparison.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use the tool ('Use for automotive purchase decisions') and explicitly directs users to 'See list_bundles for related 'finance-universal' calculators,' suggesting alternative tools for broader financial calculations. However, it does not explicitly state when not to use it or provide direct comparisons to specific siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_carpet_flooringAInspect
Compute flooring cost including waste margin. Use for renovation budget. Inputs: surface m², product price/m², waste %. Returns total cost and m² to order. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_m | Yes | Width m | |
| length_m | Yes | Length m | |
| price_m2 | No | EUR/m² | |
| waste_pct | No | Waste % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It describes inputs and outputs but does not disclose any behavioral traits such as idempotency, side effects, or read-only nature, which is typical for a calculator but still implicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three efficient sentences: purpose, usage context, input/output summary. No redundant words; every sentence is valuable and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with 4 params (2 required) and an output schema exists. The description covers the main use case and return values. Minor gap: missing details on error handling or limits, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear param descriptions. The description adds minimal value by summarizing inputs, but does not explain meanings or constraints beyond what the schema already provides. Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes flooring cost with waste margin for renovation budget. The verb 'Compute' and resource 'flooring cost' are specific. However, it does not explicitly differentiate from many similar 'calculate_*' sibling tools beyond referencing list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides context ('use for renovation budget') and references list_bundles for related calculators, but lacks explicit guidance on when not to use this tool versus alternatives like other flooring or cost calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cat_ageAInspect
Convert cat age to human-equivalent years (15+9+4×). Use for feline health. Inputs: cat age years. Returns human-equivalent age and life stage. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cat_years | Yes | Cat age in years |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses the formula (15+9+4×), inputs, and returns (human-equivalent age and life stage), providing useful behavioral context beyond the schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences plus a reference to list_bundles, front-loaded with purpose, no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter, output schema exists), the description provides purpose, formula hint, input/output, and a pointer to related tools, which is fully sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and the description merely restates 'Inputs: cat age years' without adding examples, constraints, or additional semantic details beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Convert cat age to human-equivalent years' with a specific formula hint, and distinguishes from siblings like calculate_dog_age by specifying 'feline health'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for feline health' and references list_bundles for related 'animaux' calculators, implying context but not explicitly stating when not to use or alternatives like calculate_dog_age.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cat_foodAInspect
Calculate daily cat food quantity based on weight, age and lifestyle. Returns: {kcal_per_day}. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | ||
| indoor | No | ||
| weight_kg | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It describes a simple calculation returning kcal_per_day, which implies a read-only operation. However, it does not explicitly state that it is safe, deterministic, or free of side effects. The description is minimal but adequate for a calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. It front-loads the core purpose and return value, then efficiently directs to related calculators. Extremely concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists (though not shown), the description covers the essential purpose, parameters, and return value. However, it lacks details such as calculation methodology, data sources, or edge-case behavior. It is adequate but not complete for a tool with three parameters and no annotation support.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'based on weight, age and lifestyle', mapping to the three parameters (weight_kg, age, indoor). However, it does not explain units (e.g., weight_kg is in kilograms), the default for indoor (true), or provide details beyond the parameter names. With 0% schema coverage, the description adds some meaning but is insufficiently detailed.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The tool name 'calculate_cat_food' and description clearly state it calculates daily cat food quantity based on weight, age, and lifestyle. It mentions the return value 'kcal_per_day' and references 'list_bundles' for related animal calculators, distinguishing it from siblings like 'calculate_dog_food'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for cat food calculation based on specified parameters, but it does not explicitly state when to use this tool over alternatives. It only mentions 'See list_bundles for related 'animaux' calculators', which provides indirect guidance. No explicit exclusions or alternative tools are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cat_pregnancyAInspect
Compute cat due date from mating date (gestation 63-67 days). Use for breeders. Inputs: mating date. Returns due date window and milestones. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mating_date | Yes | Mating date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, but the description outlines that it returns a due date window and milestones, which is sufficient behavioral insight for a non-destructive calculator. No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences delivering all necessary information without fluff. Efficiently front-loads purpose and gestation details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-input calculator with an output schema, the description covers purpose, gestation range, target user, return value summary, and related tools. Completeness given complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter already described as 'Mating date YYYY-MM-DD'. The description merely repeats 'mating date' without adding new meaning, so baseline score applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes cat due date from mating date with gestation range 63-67 days, specifically for breeders. It distinguishes from sibling tools like dog pregnancy or human due date calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Indicates target users (breeders) and points to related tools via list_bundles, but does not explicitly state when to avoid using this tool or compare to alternatives like calculate_breeding_due_date.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cheque_repasAInspect
Calculate Belgian meal voucher (cheque-repas / maaltijdcheque) benefit. Returns: {face_value_per_voucher, employer_contribution_per_voucher, employee_contribution_per_voucher, monthly_total_vouchers, monthly_employer_cost, monthly_employee_contribution, ...}. See list_bundles for related 'finance-belgique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| days_per_month | No | Working days per month | |
| employee_contribution | No | Employee contribution per voucher (min 1.09 EUR) | |
| employer_contribution | No | Employer contribution per voucher (max 6.91 EUR) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose whether the tool is read-only, requires authentication, or has side effects. For a calculator, it likely has no destructive actions, but this is not stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence that front-loads the purpose and includes a helpful cross-reference, with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with three parameters and an output schema, the description sufficiently explains what the tool does and what it returns. The reference to list_bundles adds context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description does not need to add much. It briefly mentions returned fields but does not elaborate on parameters beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate' and the specific resource 'Belgian meal voucher (cheque-repas / maaltijdcheque) benefit'. It distinguishes itself from sibling calculators by targeting a niche Belgian financial product.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal guidance; it mentions related tools via 'See list_bundles' but does not explicitly state when to use this tool over other calculators or specify prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_child_supportAInspect
Estimate French child support (pension alimentaire) based on income, custody and number of children. Returns: {income, rate_pct, monthly_support, annual_support}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income | Yes | Net monthly income of the paying parent in euros | |
| custody | No | Custody type: full (garde principale), alternating (alternee), reduced (visite et hebergement) | full |
| children_count | Yes | Number of children (1-6) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool estimates child support, lists inputs and returns, but does not disclose assumptions, formula, or edge cases. Adequate for a straightforward calculator but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose and inputs, followed by a reference to related tools. Every sentence is necessary and no extra fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 3 parameters, an output schema, and is a calculation tool, the description covers purpose, inputs, outputs, and related resources. No critical information is missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with full parameter descriptions. The description adds value by listing return fields (income, rate_pct, monthly_support, annual_support) and clarifying the French context, going beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates French child support based on income, custody, and number of children. It lists return fields and references a related tool for other calculators, distinguishing it from siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions related 'finance-france' calculators via list_bundles, providing some context, but does not explicitly specify when to use this tool versus alternatives or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_chinese_zodiacAInspect
Determine Chinese zodiac animal and element from birth year. Returns: {full_sign}. See list_bundles for related 'fun' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| birth_year | Yes | Birth year |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Since no annotations are provided, the description bears full responsibility for behavioral disclosure. It states the tool returns a result but lacks details on side effects, permissions, or response structure beyond the placeholder '{full_sign}'. The transparency is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise with two short sentences that convey the essential purpose and a pointer to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple one-parameter calculator with an output schema, the description is largely complete. It could elaborate on the returned 'full_sign' structure, but the existence of an output schema reduces the need.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a well-described parameter. The description adds no additional meaning beyond the schema, meeting the baseline expectation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the verb 'Determine' and resource 'Chinese zodiac animal and element from birth year', and mentions the return value format. Distinguished from sibling calculators by its specific focus and reference to list_bundles for related fun calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context by stating the input (birth year) and directs to list_bundles for related fun calculators. However, no explicit exclusions or when-not-to-use guidance is given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_churn_rateAInspect
Compute customer or revenue churn rate over a period. Use for SaaS retention analysis. Inputs: starting customers, churned, period length. Returns churn %, retention %, and annualized rate. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| period_months | No | Period in months | |
| lost_customers | Yes | Customers lost | |
| start_customers | Yes | Customers at period start |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Discloses return values: churn %, retention %, annualized rate. Does not mention side effects, but tool is read-only computation. Adequate transparency for a calculator with no destructive actions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four front-loaded sentences, no wasted words. First sentence states core function, second gives usage context, third lists inputs/outputs, fourth points to related tools. Efficient and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given tool's simplicity, full input schema with descriptions, and output schema available, description is complete. Adds sufficient context for an agent to understand purpose, parameters, and related functionality without missing critical details.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with property descriptions. Description adds value by listing inputs concisely: 'starting customers, churned, period length' and mapping to output semantics. Bridges schema fields to business meaning beyond raw property names.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description explicitly states it computes customer or revenue churn rate for SaaS retention analysis. It clearly identifies the verb (compute), resource (churn rate), and domain (SaaS). Distinguishes from sibling tools by referencing 'finance-universal' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
States 'Use for SaaS retention analysis' which clearly indicates when to use. Directs user to list_bundles for related calculators, providing context for alternatives. Lacks explicit exclusions (when not to use) but sufficient for a calculation tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_clothing_size_convertAInspect
Convert clothing size between EU, US and UK systems. Returns: {original}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sex | Yes | Sex | |
| size | Yes | Size number in source system | |
| garment | Yes | Type of garment | |
| from_system | Yes | Source system |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It states the conversion operation but does not disclose edge cases, accuracy, or limitations. The placeholder 'Returns: {original}' is vague but does not contradict annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise with one sentence. However, the phrase 'Returns: {original}' appears to be a template placeholder, which slightly reduces clarity. Otherwise, it is well-structured and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity (4 basic parameters, output schema exists), the description is mostly complete. It covers the core conversion purpose but does not explain the output format, relying on the output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and all parameters are described in the schema. The description adds no additional meaning beyond what the schema already provides, so the baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts clothing sizes between EU, US, and UK systems. The verb 'convert' and resource 'clothing size' make the purpose specific, and it distinguishes from sibling tools like convert_shoe_size.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. It references list_bundles for related calculators but does not provide when-to-use or when-not-to-use criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_compost_volumeCInspect
Compute compost volume produced from kitchen and garden waste over a year. Use for compost bin sizing. Inputs: household size, garden size m². Returns L/year and recommended bin volume. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| depth_cm | No | Compost layer depth in centimeters (default 5cm) | |
| surface_m2 | Yes | Surface area in square meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. However, it does not disclose any behavioral traits such as read-only nature, authentication needs, or side effects. For a calculator tool, it is safe to assume no mutation, but the description should explicitly state it.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences, front-loads the purpose, and avoids unnecessary details. However, the inaccuracy regarding parameters reduces efficiency.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description provides basic purpose, inputs, and output hints, but lacks behavioral transparency and accurate parameter listing. An existing output schema is implied but not detailed, which would have compensated.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, but the description incorrectly mentions 'household size' as an input, which does not exist in the schema (only surface_m2 and depth_cm). This introduces confusion and misleads the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes compost volume from kitchen and garden waste over a year for bin sizing. It specifies the purpose and scope, though it does not differentiate from many sibling 'calculate_*' tools explicitly.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using the tool for compost bin sizing and mentions related calculators via list_bundles, but it does not provide explicit when-to-use or when-not-to-use guidance or alternatives beyond a general reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_compound_interestAInspect
Compute compound interest growth A=P(1+r/n)^(nt). Use for savings, retirement projections, investment forecasting. Returns final amount, total interest, and yearly breakdown. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | Yes | Investment duration in years | |
| principal | Yes | Initial amount | |
| annual_rate | Yes | Annual interest rate in % | |
| compounds_per_year | No | Compounding frequency per year |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses that the tool returns a final amount, total interest, and yearly breakdown, indicating it is a read-only computation with no side effects. This is sufficient for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences front-load the formula and purpose, then explain outputs and related tools. No redundancy or unnecessary detail.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists, the description need not detail return values but does so anyway. It provides adequate context for a straightforward calculation tool. However, with many siblings, a brief note on when NOT to use this tool (e.g., for monthly compounding specifically) would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameters are already well-documented. The description adds the formula and output summary but does not clarify parameter details beyond the schema. Baseline 3 is appropriate since no significant additional semantic value is provided.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes compound interest using the formula A=P(1+r/n)^(nt). It specifies use cases (savings, retirement, investment) and distinct outputs (final amount, total interest, yearly breakdown), making it easy to distinguish from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests when to use the tool (savings, retirement projections, investment forecasting) and directs users to list_bundles for related calculators. However, it does not explicitly contrast with similar tools like calculate_compound_interest_monthly or simple interest calculators, missing an opportunity to prevent misuse.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_compound_interest_monthlyAInspect
Compute compound interest with monthly contributions (savings plan). Use for systematic savers. Inputs: initial amount, monthly contribution, annual rate %, years. Returns final value, total contributed, total interest. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | Yes | Number of years | |
| principal | Yes | Initial capital EUR | |
| annual_rate | Yes | Annual interest rate percent | |
| monthly_contribution | Yes | Monthly contribution EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Discloses return values: 'final value, total contributed, total interest.' Could mention it is a read-only computation, but the output description adds useful context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences with front-loaded purpose. No unnecessary words; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given an output schema exists, the description need not explain return values but does so briefly. Mentioning the related bundle enhances completeness. No obvious gaps for a simple computation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. Description briefly lists parameter roles and adds output semantics, providing slight additional value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clear verb-resource pair: 'Compute compound interest with monthly contributions (savings plan).' Distinguishes from siblings like calculate_compound_interest by specifying the monthly contribution feature.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
States 'Use for systematic savers' and points to related 'finance-universal' calculators via list_bundles. Does not explicitly state when not to use, but provides clear usage context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_concrete_mixBInspect
Compute cement, sand, gravel, and water for a given concrete volume (NF DTU 21). Use for construction projects. Inputs: volume m³, mix ratio. Returns weights of each ingredient. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| volume_m3 | Yes | Volume in m³ |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry this burden. It states inputs and outputs ('Returns weights of each ingredient'), but does not disclose behavioral traits such as whether it is idempotent, requires authentication, or has rate limits. The output schema exists but covers return values.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences) and front-loads the core purpose. However, it includes an inaccurate detail ('mix ratio') which slightly reduces efficiency.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, the description does not need to explain return values, but it still contains a significant error (missing mix ratio parameter) and omits critical context like default mix ratio or standard. The tool has only one parameter, yet the description is misleading, making it incomplete for reliable agent use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description adds a non-existent parameter 'mix ratio' which contradicts the input schema that only defines volume_m3. Schema coverage is 100%, so the schema already fully describes the parameter. The description introduces confusion rather than clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes concrete mix ingredients (cement, sand, gravel, water) for a given volume, referencing NF DTU 21. This distinguishes it from sibling tools like calculate_concrete_stairs and other construction calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for construction projects' but provides no guidance on when not to use it or alternatives. It mentions list_bundles for related calculators, but does not explicitly exclude other concrete tools. The mention of 'mix ratio' as an input is inaccurate and could mislead an agent into thinking it's a parameter.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_concrete_stairsBInspect
Calculate concrete stair dimensions, volume and materials using Blondel's formula. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_m | No | Stair width in meters (default 0.9m) | |
| height_m | Yes | Total stair height to climb in meters | |
| num_steps | Yes | Number of steps | |
| thickness_cm | No | Slab thickness under each tread in cm (default 15cm) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the burden. It mentions Blondel's formula, implying it's a calculation tool, but does not disclose any behavioral aspects like being read-only or requiring permissions. It is adequate for a simple calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description consists of two concise sentences, with the purpose front-loaded and no extraneous information. Every word earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, the description need not detail return values. It mentions dimensions, volume, and materials, which aligns with expected output. It could be slightly more complete by noting that it strictly follows Blondel's formula, but it is sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for all 4 parameters. The description adds no additional meaning beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates concrete stair dimensions, volume, and materials using Blondel's formula. It references related calculators via list_bundles, but does not directly differentiate from the many similar calculate_* siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only suggests seeing list_bundles for related calculators, providing no explicit guidance on when to use this tool versus alternatives, nor when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_condominium_chargesBInspect
Compute one owner's share of condominium charges from the budget and tantièmes. Use for syndic or owner verification. Inputs: total budget, tantièmes-owned, total tantièmes. Returns annual and monthly share. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| total_charges | Yes | Total annual condominium charges EUR | |
| ownership_share_pct | Yes | Ownership share (tantièmes) percent |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It states the tool computes and returns annual and monthly shares, implying a read-only calculation. However, it does not explicitly state the tool is stateless, requires no permissions, or has no side effects. Missing details like validation of inputs (e.g., exclusiveMinimum) are absent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief (two sentences plus a see-also), effectively communicating core purpose. However, the inconsistency between described and actual inputs slightly reduces clarity. It earns points for being non-verbose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple two-parameter schema and existence of an output schema, the description adequately covers the computation purpose and output (annual and monthly share). Yet, the input inconsistency and lack of safety transparency leave gaps. An output schema is mentioned but not detailed in the description.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for both parameters, so baseline is 3. However, the description lists three inputs ('total budget, tantiemes-owned, total tantiemes') while the schema only has two parameters, creating a mismatch and potential confusion. This inconsistency undermines the clarity of parameter meaning despite schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('Compute one owner's share') and the domain (condominium charges). It also specifies the use case (syndic or owner verification). However, the mention of three inputs (total budget, tantiemes-owned, total tantiemes) conflicts with the schema's two parameters (total_charges, ownership_share_pct), causing minor confusion. No explicit differentiation from sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a usage context ('syndic or owner verification') and points to 'list_bundles' for related calculators. However, it does not specify when to use this tool over alternatives, nor does it give any exclusions or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_coneAInspect
Compute cone volume V=(1/3)πr²h and lateral/total surface area. Use for geometry or container design. Inputs: radius, height. Returns volume and areas. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height | Yes | Height | |
| radius | Yes | Base radius |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It does not mention safety, idempotency, or side effects, but for a calculator tool, the behavior is inherently safe and read-only. Still, it does not add any behavioral context beyond what is obvious.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two short sentences, front-loaded with the core computation, uses mathematical formula, and references a sibling tool. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists (context signals indicate has output schema), the description does not need to detail return values. It adds helpful context: use cases and sibling tool reference. Adequate for a simple calculator tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description only repeats 'radius, height' without adding new semantics like expected units or constraints, which are already in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes cone volume and surface areas with explicit formula V=(1/3)πr²h, and specifies use for geometry or container design. This distinguishes it from the many sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides concrete use cases ('geometry or container design') and directs to sibling tool 'list_bundles' for related calculators. No explicit when-not-to-use instructions, but context is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_confidence_intervalAInspect
Compute confidence interval for a sample mean. Use for statistics, A/B test results, or polling. Inputs: mean, std dev, sample size, confidence (90/95/99%). Returns CI lower/upper bounds. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| std_dev | Yes | Standard deviation | |
| confidence | No | Confidence level | 95 |
| sample_mean | Yes | Sample mean | |
| sample_size | Yes | Sample size |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so description carries the burden. It states inputs and outputs but omits assumptions (e.g., normal distribution), error handling, or return format details.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise, front-loaded sentences cover purpose, inputs, and outputs without redundancy or superfluous detail.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present, return values need not be explained. Inputs are well covered, but the description lacks assumptions or limitations (e.g., normal distribution requirement).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter. The description lists inputs but adds no new semantic detail beyond what the schema provides, keeping it at baseline 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly identifies the tool's action ('Compute confidence interval for a sample mean') and lists specific use cases (statistics, A/B test results, polling), differentiating it from many sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides explicit usage context ('Use for statistics...') and directs to list_bundles for related calculators, but lacks explicit when-not-to-use or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cooking_conversionCInspect
Convert recipe quantities between cups, ml, grams, oz, tbsp, tsp. Use for international recipe translation. Inputs: value, from, to, ingredient (for density). Returns: {original}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Amount to convert | |
| to_unit | Yes | Target unit | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully describe behavior. It mentions an 'ingredient' input for density, but the schema does not include this parameter, creating confusion. The return is vaguely described as '{original}', and no other behavioral details (e.g., error handling, precision) are given.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is relatively short and front-loaded with purpose, but it contains an inaccurate 'Inputs' list that includes a non-existent parameter. Each sentence should be accurate and necessary.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the many sibling calculators, the description lacks differentiation. It doesn't explain unique features or how this tool differs from 'convert_cooking' or 'calculate_baking_conversion'. The mention of ingredient density conversion is incomplete without the corresponding schema parameter.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3, but the description introduces an 'ingredient' parameter not present in the schema and uses different param names ('value' vs 'amount'). This misalignment reduces the added value and can mislead the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts recipe quantities between common cooking units and mentions international recipe translation, which gives a specific verb and resource. However, it doesn't differentiate from closely related siblings like 'convert_cooking' or 'calculate_baking_conversion'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for international recipe translation', which provides context but lacks explicit when-not-to-use or alternatives. It briefly references 'list_bundles' for related calculators but doesn't offer direct sibling differentiation.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cooking_timeBInspect
Estimate cooking time for meat, fish, or vegetables based on weight, method, and doneness. Use for kitchen planning. Inputs: food type, weight g, cooking method (oven/grill/sous-vide), doneness. Returns time min and target internal temp. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| food | Yes | ||
| method | Yes | ||
| weight_kg | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description provides basic behavioral info: returns time and target internal temp. However, it misses edge cases, error handling, and does not fully align with schema (e.g., weight unit mismatch).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, no fluff. First sentence states purpose, second context, third lists inputs/outputs. Could be slightly more structured but effective.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given moderate complexity and presence of output schema (assumed), description covers purpose and basic returns but lacks behavioral details and contains contradictions with schema (weight unit, method). Incomplete for accurate tool usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Description adds meaning beyond schema (e.g., doneness parameter) but contradicts schema: states 'weight g' while schema uses 'weight_kg', and mentions 'sous-vide' method not in enum. Parameters not fully explained.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states verb 'Estimate' and resource 'cooking time' for meat, fish, or vegetables. It distinguishes from sibling tools like 'calculate_meat_cooking_time' by including vegetables and fish, and mentions doneness.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description suggests use 'for kitchen planning' and references related calculators via list_bundles, but does not explicitly state when to use this vs. alternatives like calculate_meat_cooking_time or calculate_cooking_conversion.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cost_per_useAInspect
Compute the cost-per-use of a purchase to evaluate value. Use for buying decisions on durable goods or subscriptions. Inputs: purchase price, expected uses or years. Returns cost per use and break-even use count. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| item_price | Yes | Item purchase price | |
| expected_uses | Yes | Expected number of uses |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It states 'Returns cost per use and break-even use count,' which is clear. However, it does not explicitly state that the tool is read-only or non-destructive, but given the nature of a calculator, this is acceptable though not ideal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with purpose and usage, and includes inputs, outputs, and a reference to related tools. It is efficient with no unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 params, output schema exists), the description covers purpose, usage, inputs, outputs, and a pointer to related bundles. The only gap is the unclear mention of 'years' and lack of explanation for output schema, but output schema is present, so it's mildly incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with minimal descriptions. The description adds context about 'expected uses or years' but introduces ambiguity since schema only has expected_uses. Overall, the description adds little beyond schema, so baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes cost-per-use for purchase evaluation. However, it mentions 'expected uses or years' while the schema only includes 'expected_uses' (number), which may confuse. Despite this, the overall purpose is clear and distinct from most sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for buying decisions on durable goods or subscriptions' and references 'list_bundles for related calculators.' While it provides context, it lacks explicit exclusions or differentiation from other similar calculators like calculate_car_lease_vs_buy.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cost_priceBInspect
Calculate unit cost price from raw materials, labor, and overhead. Returns: {total_cost}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| labor | Yes | Labor cost | |
| overhead | Yes | Overhead/indirect costs | |
| quantity | Yes | Number of units produced | |
| raw_materials | Yes | Raw material cost |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full burden of behavioral disclosure. It only states the purpose and return value ('Returns: {total_cost}') but does not mention side effects, destructive actions, authentication needs, rate limits, or any potential impacts beyond computation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description consists of two concise sentences with no wasted words. It efficiently communicates the tool's purpose and a pointer to related tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the core purpose and return value, but for a 4-parameter tool with no annotations, it lacks details such as the formula used or assumptions. The schema parameters are well-documented, but the description could provide more context about how the cost is calculated.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema coverage is 100%, so the baseline is 3. The description does not add additional meaning beyond the schema; it only mentions the parameters in a general list without providing new context or syntax details.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Calculate unit cost price from raw materials, labor, and overhead.' It identifies the specific verb 'Calculate' and resource 'unit cost price'. It also references sibling tool 'list_bundles' for related calculators, but does not explicitly differentiate from other cost calculators like 'calculate_cost_per_use' or 'calculate_unit_price'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a reference to 'list_bundles for related finance-universal calculators' but does not give explicit guidance on when to use this tool versus alternatives, nor does it mention prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_crop_factorAInspect
Calculate camera crop factor and equivalent focal length based on sensor width. See list_bundles for related 'photographie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sensor_width_mm | Yes | Camera sensor width in millimeters (full frame = 36mm) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations are absent, so the description must provide behavioral transparency. It does not mention side effects, permissions, or return format, but for a simple calculator tool, the description is adequate. However, it could clarify that the crop factor is relative to full frame (36mm).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. It front-loads the purpose and then directs to related tools. Excellent conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (single parameter, deterministic calculation), the description is fully complete. It states the function, input, and a pointer to related tools. No gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with the parameter well-described. The description adds no additional parameter information beyond 'based on sensor width,' which is redundant. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates camera crop factor and equivalent focal length based on sensor width, with a specific verb and resource. It distinguishes itself from sibling calculators by referencing related 'photographie' calculators in list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for photography calculations but does not explicitly state when to use this tool versus alternatives. It only points to list_bundles for related calculators, lacking clear exclusions or context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_crypto_profit_lossAInspect
Compute crypto trading profit/loss including fees. Use for crypto investors tracking realized P&L. Inputs: buy price, sell price, quantity, buy fee, sell fee. Returns net P&L, ROI %, break-even price. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| quantity | Yes | Quantity of cryptocurrency traded | |
| buy_price | Yes | Purchase price per unit in fiat currency | |
| sell_price | Yes | Sale price per unit in fiat currency | |
| buy_fee_pct | No | Buy transaction fee percentage (default 0.1%) | |
| sell_fee_pct | No | Sell transaction fee percentage (default 0.1%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully transparently describes what the tool computes (profit/loss including fees) and its outputs (net P&L, ROI%, break-even price). As a calculator, it has no side effects or destructive behavior, so this is sufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long with a third sentence for related tools, all front-loaded and to the point. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (calculator with no side effects) and the presence of an output schema, the description covers the purpose, inputs, and outputs completely. It is self-contained and informative.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description lists inputs (buy price, sell price, quantity, buy fee, sell fee) but adds minimal value beyond the schema's own descriptions. It does mention defaults for fees, but that's already in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute crypto trading profit/loss including fees' with a specific verb and resource. It is differentiated from siblings like 'calculate_impermanent_loss' and 'calculate_crypto_tax_fr' by focusing on realized P&L from trades.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for crypto investors tracking realized P&L', providing clear context. However, it does not explicitly state when not to use it or list alternatives, though it mentions 'See list_bundles for related crypto calculators' as a hint.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_crypto_tax_frAInspect
Calculate French flat tax (30% PFU) on cryptocurrency capital gains at withdrawal. Returns: {gain_ratio}. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| total_gains_eur | Yes | Total unrealized gains in the portfolio in EUR | |
| withdrawal_amount_eur | Yes | Amount being withdrawn/sold in EUR | |
| total_portfolio_value_eur | Yes | Total current portfolio value in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It mentions the tax rate and return value {gain_ratio}, but does not disclose potential errors, permissions, or side effects. For a calculation tool, this is adequate but lacks extra context like constraints on input values.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. It front-loads the core purpose and includes a pointer to related tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator tool, the description covers the essential purpose, input context, and output return value. However, without an output schema, it could be more explicit about the exact structure of the return. The mention of {gain_ratio} helps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% (all parameters have descriptions), so baseline is 3. The description adds the 30% PFU context but does not significantly enhance understanding of individual parameters beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates French flat tax (30% PFU) on cryptocurrency capital gains at withdrawal, and distinguishes it from siblings by referencing list_bundles for related crypto calculators. The verb 'calculate' and resource 'cryptocurrency capital gains' are specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description specifies the context 'at withdrawal' and directs to list_bundles for related calculators. However, it does not explicitly state when not to use this tool or provide alternatives beyond the bundle reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_currency_cross_rateAInspect
Calculate cross exchange rate between two currencies via USD. Returns: {cross_rate_a_to_b, cross_rate_b_to_a}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rate_a_usd | Yes | Units of currency A per 1 USD | |
| rate_b_usd | Yes | Units of currency B per 1 USD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description must fully disclose behavioral traits. It only states the basic calculation and output, omitting details about data freshness (e.g., live vs. fixed rates), assumptions, or error handling. This lack of transparency is a significant gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, with two sentences that efficiently convey the purpose, method, output structure, and a pointer to related tools. No extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity, schema coverage, and existence of an output schema, the description is nearly complete. It explains the calculation method and return keys, and references related tools. Minor gaps exist (e.g., data source), but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described as 'Units of currency A per 1 USD' and similarly for B. The description adds context ('via USD') but does not provide additional meaning beyond the schema, warranting a baseline score of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: 'Calculate cross exchange rate between two currencies via USD.' It specifies the output structure with keys like 'cross_rate_a_to_b' and 'cross_rate_b_to_a', and distinguishes itself from sibling calculators by referencing list_bundles for related conversions.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context by explaining the calculation method (via USD) and directs users to 'list_bundles' for related 'conversions' calculators. However, it does not explicitly state when to avoid using this tool or compare it to other currency-related siblings like calculate_currency_exchange.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_currency_exchangeAInspect
Calculate currency exchange with bank margin and show fees lost. Returns: {amount_source, from_rate_vs_usd, to_rate_vs_usd, mid_market_amount, amount_after_margin, fees_lost, ...}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Amount to exchange in source currency | |
| to_rate | Yes | Target currency rate vs USD (e.g. JPY=150) | |
| from_rate | Yes | Source currency rate vs USD (e.g. EUR=1.08) | |
| bank_margin_pct | No | Bank/exchange margin percentage (default 2.5%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It adequately describes the tool as a calculator that applies a margin and returns fees lost, but does not explicitly state it is read-only or non-destructive. For a calculation tool, this is acceptable.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences that communicate the purpose, key outputs, and a pointer to related tools. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a calculation tool with 4 parameters (all schema-described) and an output schema partially listed, the description covers the essential functionality. It lacks a brief usage scenario or explicit comparison to sibling exchange tools, but is largely complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline 3. The description does not elaborate on individual parameters beyond what the schema provides; it only adds context about the bank_margin_pct and the return fields. The return fields list adds some semantic value but not on parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates currency exchange with bank margin and shows fees lost. It mentions return fields and references related 'voyage' calculators via list_bundles, but does not explicitly differentiate from close siblings like calculate_currency_cross_rate or calculate_exchange_margin.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for exchange with bank margin and fees, but does not provide explicit when-to-use, when-not-to-use guidance or alternatives. The mention of list_bundles is only a link to related tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_curtain_fabricAInspect
Compute fabric meters needed for curtains, including hems and pattern repeat. Use for sewing. Inputs: window dimensions, fullness, pattern repeat. Returns fabric length to buy. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| num_panels | No | Number of curtain panels | |
| fullness_ratio | No | Fullness ratio (2 = double fullness) | |
| window_width_cm | Yes | Window width cm | |
| window_height_cm | Yes | Window height cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It mentions inclusion of hems and pattern repeat, which adds some behavioral context beyond the schema. However, it does not disclose any potential side effects, permissions, or return format, which is acceptable for a calculation tool but leaves some gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise, consisting of three short sentences with no filler. Every sentence adds value: purpose, inputs, output, and related tools. Front-loaded with the core purpose. Ideal conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists, the description does not need to detail return values. It covers inputs, output type, and context (including hems, pattern repeat). It also references related tools. The only minor gap is that the tool is one of many calculators, but the description is sufficiently complete for its complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'pattern repeat' as an input, but the schema does not include such a parameter. This creates confusion. While schema coverage is 100%, the description adds misleading information, failing to enhance understanding beyond the schema. It partially compensates by summarizing key inputs but introduces inaccuracy.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes fabric meters for curtains, including hems and pattern repeat. It uses specific verb 'Compute' and resource 'fabric meters needed for curtains', making the purpose unambiguous. Though it doesn't explicitly differentiate from numerous siblings, the mention of 'curtains' and 'use for sewing' provides enough specificity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The phrase 'Use for sewing' gives context for when to use this tool. The reference to 'list_bundles' for related 'textile-mode' calculators implies alternatives but does not provide explicit guidance on when to choose this over siblings like calculate_curtain_width or calculate_fabric_needed. Lack of exclusion criteria reduces the score.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_curtain_widthAInspect
Compute total curtain width using a fullness factor (1.5-3.0×). Use for window dressing. Inputs: window width m, fullness ratio. Returns total fabric width. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| fullness | No | Fullness | standard |
| window_cm | Yes | Window width cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It describes a compute operation with no side effects, but does not disclose any behavioral traits such as authentication, rate limits, or reversibility. The description is adequate but minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two short sentences plus a reference. It front-loads the action and inputs, with no unnecessary words. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool is a simple calculator with two parameters and an output schema (present but not shown), the description covers purpose, inputs, and output adequately. However, the unit mismatch (m vs cm) and lack of enum explanation detract slightly from completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds context about fullness factor range (1.5-3.0×) but has a mismatch: it says 'window width m' while schema uses cm, and 'fullness ratio' while schema uses enum (flat, standard, generous). This could confuse but overall adds some value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool computes total curtain width using a fullness factor, with specific verb 'compute' and resource 'total curtain width'. It provides key details (fullness factor range 1.5-3.0×) and mentions inputs and output, distinguishing it from the many other calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for window dressing', providing basic context, but does not explicitly differentiate from sibling tools like calculate_curtain_fabric. It references list_bundles for related calculators but does not specify when to use this one over alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cycling_powerBInspect
Estimate cycling power output considering gradient, speed and total mass. Returns: {power_watts, watts_per_kg}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| speed_kmh | Yes | Speed in km/h | |
| weight_kg | Yes | Rider weight in kilograms | |
| gradient_pct | Yes | Road gradient in percent (positive = uphill) | |
| bike_weight_kg | No | Bike weight in kilograms |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility for behavioral disclosure. It states the tool 'estimates' power output, implying uncertainty, but does not clarify assumptions (e.g., ignores wind, drivetrain losses) or any side effects. The return format is given, which is useful, but overall transparency is minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, each earning its place: the first states the core purpose, the second gives the return format and a helpful pointer to related tools. There is no redundant or extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description provides the return format and a bundling hint, which adds value beyond the schema and annotations (nonexistent). However, it lacks details on the underlying model, precision, or data sources. Given the existence of an output schema, the description is adequate but not comprehensive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers all parameters with descriptions (100% coverage), so baseline is 3. The description adds the concept of 'total mass', which maps to weight_kg and bike_weight_kg, but does not explain the interaction or the default bike weight. It provides the output structure, which supplements parameter understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates cycling power using gradient, speed, and mass. It is specific to cycling power, distinguishing it from other calculators in the sibling list. However, it does not explicitly differentiate from similar tools like calculate_bmi or calculate_calories_burned, but the name and context make the purpose clear.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives. It mentions 'See list_bundles for related sport calculators' but does not specify criteria for choosing this tool over others. There are no prerequisites, exclusions, or use case examples provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_cylinderAInspect
Compute cylinder volume V=πr²h and surface area A=2πr(r+h). Use for tanks, pipes, or containers. Inputs: radius, height. Returns volume and areas. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height | Yes | Height | |
| radius | Yes | Radius |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description says 'Returns volume and areas' but lacks detail on return format or behavior for invalid inputs. The presence of an output schema partly mitigates this, but the description could be more specific about the computed values.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first defines the computation, second provides context and reference. No redundant words; front-loaded with the core purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers what the tool does, inputs, and output types. Points to related tools via list_bundles. Lacks notes on edge cases or precision, but for a simple calculator with output schema, this is nearly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions 'Height' and 'Radius'. The description adds 'Inputs: radius, height' but does not significantly enhance meaning beyond the schema's own parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it computes cylinder volume and surface area with explicit formulas (V=πr²h, A=2πr(r+h)). It distinguishes from numerous sibling calculator tools by being specific to cylinders, with use cases like tanks, pipes, or containers.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context ('Use for tanks, pipes, or containers') and references related 'math' calculators via list_bundles. However, it does not explicitly state when not to use this tool or describe alternatives for other shapes.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_daily_proteinAInspect
Calculate recommended daily protein intake based on weight and fitness goal. Returns: {rate_g_per_kg, daily_protein_g, calories_from_protein}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| goal | Yes | Fitness goal | |
| weight_kg | Yes | Body weight in kilograms |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses behavior: a calculation with inputs and output structure. No side effects or destructive actions are implied. The return format is given, adding transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first covers purpose and inputs, second covers return shape and cross-reference. No redundant information, efficient and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with two parameters and an output schema mentioned, the description is sufficient. It could briefly explain the formula range (e.g., 1.2-2.2 g/kg) but this is not essential for invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description restates parameters but adds no new meaning or examples beyond what the schema provides. No additional context like units or enum values is needed, but also not provided.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates daily protein intake using weight and fitness goal. However, it does not explicitly distinguish itself from other nutrition calculators like calculate_daily_vitamins, though it references list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies the tool is for protein intake based on weight and goal, but lacks explicit guidance on when to use this vs. alternatives. Mentioning list_bundles for related calculators provides some context but no exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_daily_vitaminsCInspect
Check daily vitamin and mineral intake against RDA recommendations. Use for nutrition tracking. Inputs: list of foods with quantities. Returns % RDA per nutrient and deficiencies. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | ||
| sex | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must fully disclose behavior. It describes the operation as checking against RDA and returning percentages/deficiencies, but does not state whether it is safe/read-only, requires authentication, or has side effects. Given it's a calculation tool, the lack of explicit read-only hint is a gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but contains a critical contradiction (foods vs age/sex). It is not concise because it introduces confusion. Ideally, it should either match the schema or be updated.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (two undocumented parameters, no output schema shown), the description is incomplete. It fails to explain how the tool works with the given schema, and the input mismatch makes it unreliable for an agent to use correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 0% and the description does not explain the two actual parameters (age, sex). Instead, it mentions 'list of foods with quantities', which is not present in the schema. This mismatch severely misleads the agent about required inputs.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states it checks daily vitamin and mineral intake against RDA, which is clear for a general purpose. However, it does not explicitly differentiate from the sibling tool 'calculate_daily_protein', and the mismatch between described inputs (foods) and actual schema (age/sex) undermines clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for nutrition tracking' and mentions 'See list_bundles for related cuisine calculators', which provides some guidance. But it lacks explicit when-to-use vs alternatives, and the confusing input mismatch reduces usefulness.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_data_transfer_timeAInspect
Calculate file transfer time at a given connection speed. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| speed_mbps | Yes | Connection speed in Mbps | |
| file_size_gb | Yes | File size in GB |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It transparently states the tool calculates transfer time based on file size and speed, implying a safe read-only calculation. No side effects or limitations are mentioned, but for a simple calculation tool this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise with only two sentences. The first sentence immediately states the purpose, and the second provides a pointer to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with two parameters and an output schema (not shown but present), the description is complete. It covers the core function and directs to related tools for other calculations.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for both parameters ('file_size_gb' and 'speed_mbps'). The tool description does not add additional meaning beyond what the schema provides, so baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate' and the resource 'file transfer time' with the condition 'at a given connection speed'. It also distinguishes from siblings by referencing 'list_bundles' for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides guidance on when to use this tool by saying 'See list_bundles for related conversions calculators', pointing to alternatives. However, it does not explicitly state when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_day_of_weekAInspect
Find the day of the week for any date (Zeller's congruence). Use for historical dates or birthday checks. Inputs: day, month, year. Returns weekday name. See list_bundles for related 'fun' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| date | Yes | Date in YYYY-MM-DD format |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must cover behavioral traits. It mentions the algorithm and return value but doesn't confirm read-only nature or prerequisites. Adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise with no wasted words. Purpose is front-loaded, and every sentence provides useful information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool, the description covers purpose, usage, algorithm, and alternative reference. It does not detail the output schema but provides the return format. Lacks a note on read-only nature, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'Inputs: day, month, year' which contradicts the actual single 'date' parameter in YYYY-MM-DD format. This could mislead an agent. Since schema coverage is full, the description adds no value and introduces confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it finds the day of the week for any date using Zeller's congruence. While it doesn't explicitly differentiate from sibling calculators, the purpose is unambiguous and the name itself is self-explanatory.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides specific use cases ('historical dates or birthday checks') and directs users to 'list_bundles' for related calculators. Does not explicitly state when not to use the tool, but the guidance is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_days_betweenAInspect
Calculate days, weeks, approximate months and working days between two dates. Returns: {weeks, months_approx, working_days}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| end_date | Yes | YYYY-MM-DD — End date | |
| start_date | Yes | YYYY-MM-DD — Start date |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided; the description relies on the tool's name and stated calculations to imply it is a safe read-only operation. It adds the return structure but does not mention side effects, date validation, or error handling, which would be helpful for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences: first declares the function and return fields, second references related tools. It is front-loaded with essential information and contains no fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, output schema exists), the description covers the main purpose and return values. It also directs to related tools. However, it omits edge cases like date order or exclusion/inclusion of boundary days, which limits completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Both parameters are fully described in the input schema with format YYYY-MM-DD. The description does not add extra meaning or constraints beyond the schema's documentation, so the baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates days, weeks, approximate months, and working days between two dates, specifying the return object structure. This is specific and distinguishes from many sibling calculate_* tools by focusing on date intervals and naming the output fields.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The only guidance is 'See list_bundles for related temps-rh calculators,' which is vague and does not indicate when to use this tool over alternatives like calculate_working_days or calculate_age. No explicit when-to-use or when-not-to-use information is provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_debt_capacityAInspect
Calculate maximum loan capacity using French HCSF 35% debt ratio rule. Returns: {max_monthly_payment, max_loan, note}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rate | No | Annual interest rate in % (default 3.5) | |
| duration_years | No | Loan duration in years (default 25) | |
| existing_debts | No | Existing monthly debt payments in EUR (default 0) | |
| monthly_income | Yes | Net monthly income in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the rule (35% debt ratio) and output fields, but does not state whether the tool is read-only, requires authentication, or what side effects exist. For a calculation tool, it likely has no side effects, but this is not confirmed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loading the purpose and rule. It is concise, but could be slightly improved by structuring the output fields or adding a brief example.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema and 100% schema coverage, the description adequately conveys the core functionality and reference to related tools. However, it does not explain the 'note' output field, which would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with all four parameters already having descriptions. The description adds no additional meaning beyond what the schema provides, achieving the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Calculate maximum loan capacity using French HCSF 35% debt ratio rule', which specifies the verb (calculate), resource (maximum loan capacity), and distinguishing rule (French HCSF 35% debt ratio). This differentiates it from sibling tools like calculate_debt_to_income or calculate_loan_payment.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related immobilier calculators', hinting at related tools but not explicitly stating when to use this tool versus alternatives. It lacks explicit context on when to choose this over similar siblings like calculate_debt_service_ratio or calculate_loan_capacity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_debt_service_ratioBInspect
Calculate debt-to-income ratio and maximum additional loan capacity. Returns: {ratio_pct, max_monthly_debt_35pct, max_additional_loan_payment, status}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| monthly_debts | Yes | Existing monthly debt payments EUR | |
| monthly_income | Yes | Net monthly income EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description bears the full burden of disclosure. It describes the return object but does not state whether the tool is read-only, requires authorization, or has any side effects. It only says 'calculate,' which is insufficient for behavioral transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise, with two sentences that front-load the purpose and output. Every word adds value, and there is no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity and the presence of an output schema explaining return values, the description is fairly complete. It includes the output structure and a reference to related calculators. Minor gaps like prerequisites are covered by the schema's required parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description adds minimal value over the schema by providing context about the calculation purpose, but it does not elaborate on parameter semantics beyond what the schema already defines.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates debt-to-income ratio and maximum additional loan capacity, specifying the resource and outputs. However, it does not explicitly differentiate from sibling tools like 'calculate_debt_capacity' or 'calculate_debt_to_income', though the mention of 'maximum additional loan capacity' adds specificity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives. The mention of 'list_bundles' for related calculators is a weak reference and does not provide clear usage context or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_debt_to_incomeAInspect
Compute debt-to-income (DTI) ratio. Use for mortgage qualification or financial health checks. Inputs: monthly debt, monthly gross income. Returns DTI % and risk category. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| monthly_debt | Yes | Total monthly debt payments | |
| monthly_income | Yes | Gross monthly income |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully carries the burden. It declares the tool is a calculation, lists inputs (monthly debt, income), and outputs (DTI %, risk category), which is sufficient transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief (two sentences) and front-loaded with the core purpose. It is efficient, though could be slightly more structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator, the description covers purpose, inputs, outputs, and references related tools. It is complete given the low complexity and the absence of an output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description repeats the parameter names and their categories (monthly debt, monthly gross income) but adds no additional meaning beyond the schema's descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes DTI ratio and specifies its use for mortgage qualification or financial health checks. It is a specific verb-resource pair, but does not explicitly distinguish from siblings like calculate_debt_capacity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear use cases (mortgage qualification, financial health checks) and references list_bundles for related calculators, but does not explicitly state when not to use or provide direct alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_delivery_costAInspect
Estimate shipping cost from weight, distance and service (standard vs express). Returns: {cost_eur, formula, note}. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| type | No | Service level | standard |
| weight_kg | Yes | Package weight kg | |
| distance_km | Yes | Delivery distance km |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses the operation (estimation) and return values, but does not address side effects, auth needs, or data source limitations. This is adequate for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences covering purpose, inputs, outputs, and a related tool. No redundant words, front-loaded, and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given an implicit output schema and full schema coverage, the description covers core aspects. Could mention constraints (e.g., weight range) but sufficient for a calculator tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and description adds minimal detail beyond the schema (e.g., 'standard vs express' already in enum). Baseline 3 is appropriate with no additional semantic value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates shipping cost based on weight, distance, and service type (standard vs express). It specifies the return structure (cost_eur, formula, note), distinguishing it from other calculate tools like calculate_international_shipping.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The tool explains inputs and provides a cross-reference to list_bundles for related calculators. However, it does not explicitly contrast with alternatives like calculate_international_shipping or mention when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_densityAInspect
Compute density, mass, or volume given the other two. ρ=m/V. Use for materials, chemistry, fluid dynamics. Inputs: any 2 of (mass, volume, density). Returns the third. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| density | No | kg/m³ | |
| mass_kg | No | Mass kg | |
| volume_m3 | No | Volume m³ |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry full behavioral transparency. It correctly states the computation and relationship via formula. However, it does not disclose error handling, unit validation, or edge cases (e.g., providing all three or none). The schema already includes unit hints, but the description adds minimal additional behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, each serving a distinct purpose: defines the operation, provides the formula, and gives usage context plus a sibling reference. It is front-loaded with the core purpose and contains no unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity of the tool (3 parameters with 100% schema coverage, output schema exists), the description is fairly complete. It covers purpose, formula, input selection, and related tools. Missing details like error handling or unit consistency are minor given the tool's straightforward nature.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with each parameter having a unit hint. The description adds crucial semantic value: it clarifies that exactly two of the three must be provided, and which will be computed. This relationship is not evident from the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes density, mass, or volume given the other two, with the explicit formula ρ=m/V. It specifies use cases (materials, chemistry, fluid dynamics) and distinguishes itself from sibling calculators by pointing to related 'science' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Inputs: any 2 of (mass, volume, density). Returns the third,' which provides clear guidance on when and how to use. It also suggests contexts (materials, chemistry, fluid dynamics) and references a sibling tool group for further related calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_density_convertAInspect
Convert density between kg/m³, g/cm³, lb/ft³, and lb/gal. Use for engineering, chemistry, fluid mechanics. Inputs: value, from-unit, to-unit. Returns converted density. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Density value | |
| to_unit | Yes | Target unit | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility. It accurately describes a conversion operation with no side effects. While it doesn't explicitly state read-only or safe behavior, the nature of a unit conversion tool is inherently non-destructive and the description is clear.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, with no extraneous words. It front-loads the primary action and includes targeted usage contexts and a pointer to related tools. Every sentence is valuable.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity of a conversion tool and the existence of an output schema, the description is complete. It explains inputs, outputs, and usage context. The mention of list_bundles for related calculators adds helpful navigation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description lists the input fields and units, adding context beyond the schema's enum descriptions, but the schema already defines the parameters clearly. The description's addition is marginal.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts density between specific units (kg/m³, g/cm³, lb/ft³, lb/gal). The verb 'convert' and resource 'density' are explicit, distinguishing it from numerous sibling conversion tools and other calculation tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases ('engineering, chemistry, fluid mechanics') but does not specify when not to use this tool versus alternatives like other conversion calculators. It mentions related tools via list_bundles, but lacks explicit exclusions or when-to-avoid guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_depth_of_fieldAInspect
Calculate depth of field, near/far focus limits and hyperfocal distance for a camera lens. Returns: {near_limit_m, far_limit_m, coc_mm}. See list_bundles for related 'photographie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| aperture | Yes | Lens aperture (f-number, e.g. 2.8) | |
| distance_m | Yes | Subject distance in meters | |
| focal_length_mm | Yes | Lens focal length in millimeters | |
| sensor_width_mm | No | Camera sensor width in mm (default 36 for full frame) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It indicates a read-only computation ('Calculate'), which is sufficient for a calculator tool, but doesn't disclose any potential side effects or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences, front-loaded with purpose and returns, no redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers purpose and return format. With output schema present (though not shown), the description complements it well. Could mention how Circle of Confusion is derived, but not critical.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All parameters have descriptions in the schema (100% coverage), and the description adds minimal extra meaning beyond what's already there. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool calculates depth of field, near/far focus limits, and hyperfocal distance, with specific return fields listed. It distinguishes itself from the sibling 'calculate_hyperfocal_distance' by offering a more comprehensive calculation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implied usage for depth of field calculations, but lacks explicit guidance on when to use this tool versus alternatives like 'calculate_hyperfocal_distance' or prerequisites like knowing sensor width.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dew_pointAInspect
Compute dew point temperature using Magnus formula. Use for HVAC, weather, comfort analysis. Inputs: temperature °C, relative humidity %. Returns dew point °C and comfort class. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| temp_c | Yes | Temperature °C | |
| humidity_pct | Yes | Relative humidity % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the inputs (temperature, humidity) and outputs (dew point, comfort class), and mentions the Magnus formula. However, it does not explicitly declare safety (e.g., read-only) or discuss any limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: two sentences plus a reference to the bundle. No superfluous information; front-loaded with the main purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with two parameters and an output schema, the description covers the essential inputs, outputs, and usage context. It references related tools via bundle, which helps. Lacks mention of edge cases or precision, but is sufficient for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description adds limited value beyond what the schema already provides. It mentions the formula and outputs but does not elaborate on parameter syntax or constraints.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes dew point temperature using the Magnus formula, and specifies application areas (HVAC, weather, comfort analysis). It mentions a related bundle but does not directly differentiate from specific sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives usage context ('Use for HVAC, weather, comfort analysis') but lacks explicit guidance on when not to use or alternatives. It references a bundle for related calculators but does not provide clear exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dice_probabilityAInspect
Calculate dice roll probability for exact values, minimum or maximum targets. Returns: {probability}. See list_bundles for related 'jeux-probabilites' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target | Yes | Target value to calculate probability for | |
| num_dice | Yes | Number of dice to roll | |
| num_sides | No | Number of sides on each die (default d6) | |
| comparison | Yes | Comparison type: exact match, at least target, or at most target |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It states the return format '{probability}' but does not disclose if the calculation is exact, assumes fair dice, or handles edge cases (e.g., target out of range). It lacks details on performance or side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long. The first sentence is clear and front-loaded. The second sentence references list_bundles, which is helpful but not essential. It is fairly concise, though the return note could be integrated.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of dice probability with multiple comparison types and parameters, the description is minimal. It does not explain the calculation method, edge cases (e.g., target > num_dice * num_sides), or the range of probability values. Even though an output schema exists (context signal), the description could be more complete for a tool with no annotations.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, so each parameter is already documented. The description does not add any additional meaning beyond what the schema provides. It mentions 'exact values...' which mirrors the comparison enum, but no extra details like how target is interpreted relative to num_dice.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates dice roll probability for exact values, minimum, or maximum targets. It specifies the verb (calculate), resource (dice roll probability), and scope (comparison types). It distinguishes from other probability tools like calculate_card_draw_probability by focusing on dice.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description tells users to see list_bundles for related 'jeux-probabilites' calculators, giving a hint about related tools. However, it does not explicitly state when to use this tool versus alternatives like calculate_probability_binomial or calculate_card_draw_probability. There is no when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dilutionAInspect
Compute dilution C1·V1=C2·V2. Solve for any unknown. Use for chemistry, lab work, pharmacy. Inputs: any 3 of (C1, V1, C2, V2). Returns the fourth. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| c1 | No | Initial concentration | |
| c2 | No | Final concentration | |
| v1 | No | Initial volume | |
| v2 | No | Final volume |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses the behavior: solves any unknown from three inputs using the dilution formula. It is transparent for a simple calculator, though it does not mention constraints like non-negativity or units.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences: formula+domain, usage, inputs+output, related tools. Front-loaded and concise, with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description is adequate for a simple calculator, but lacks units specification and does not differentiate from similar sibling 'calculate_solution_dilution'. Output schema exists, reducing need for return info.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers parameter names and descriptions (100% coverage), but the description adds the key semantic constraint that exactly three of the four parameters must be provided, plus the underlying formula.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes dilution using C1·V1=C2·V2 for any unknown, specifying the domain (chemistry, lab, pharmacy). However, it does not differentiate from the similar sibling 'calculate_solution_dilution', preventing a top score.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides usage context ('Use for chemistry, lab work, pharmacy') and mentions 'list_bundles' for related calculators, but no explicit guidance on when not to use or how to choose among similar siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_discountAInspect
Calculate discounted price with optional successive discounts. Returns: {original_price, price_after_first, effective_discount_pct}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| discount_pct | Yes | First discount percentage | |
| discount2_pct | No | Optional second successive discount | |
| original_price | Yes | Original price |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses the tool's behavior: it applies optional successive discounts and returns three fields. It does not mention potential edge cases, rounding, or precision, but for a simple calculation tool, the transparency is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the purpose, and includes return information and a referral. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with an output schema, the description covers the essential aspects: purpose, optional successive discounts, return fields, and a pointer to related tools via list_bundles.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema has 100% parameter description coverage. The description adds context by explaining 'optional successive discounts', indicating the order of application, which adds value beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate discounted price with optional successive discounts.' It specifies the action (calculate), resource (price), and feature (optional successive discounts). It also distinguishes from siblings by referencing list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a referral to list_bundles for related 'finance-universal' calculators, but does not explicitly define when to use this tool versus alternatives or when not to use it. The guidance is implied rather than explicit.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_discount_effectiveAInspect
Compute the effective discount when stacking multiple promotions. Use for promo design or shopping comparisons. Inputs: original price, discount % list. Returns final price and effective single-discount equivalent. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| discount_1_pct | Yes | First discount % | |
| discount_2_pct | No | Second discount % | |
| original_price | Yes | Original price |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden for behavioral disclosure. It states inputs and returns but does not clarify important behavioral traits such as whether discounts are applied sequentially or simultaneously, or any calculation details. This is adequate but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with three sentences. The first sentence states the purpose, the second gives use cases, and the third lists inputs/outputs and references related tools. Every sentence earns its place with no superfluous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, the description appropriately omits detailed return-value explanations but still mentions 'final price and effective single-discount equivalent.' It covers purpose, inputs, outputs, use cases, and references related tools. Lacks only explicit mathematical ordering details, but overall sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description mentions 'original price, discount % list' but adds little beyond the schema's descriptions ('First discount %', 'Second discount %', 'Original price'). The phrase 'discount % list' could imply more discounts than the two accepted, but overall minimal added value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb (Compute) and resource (effective discount when stacking multiple promotions). It specifies use cases ('promo design or shopping comparisons') and distinguishes from siblings by pointing to list_bundles for related calculators. This meets the criteria for a specific verb+resource with sibling differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states when to use the tool ('Use for promo design or shopping comparisons') and references alternatives via list_bundles. However, it does not provide explicit when-not-to-use guidance or direct comparisons with similar discount calculators, which prevents a score of 5.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_distance_2dBInspect
Compute Euclidean distance between two 2D points. Use for geometry, mapping. Formula: √((x2−x1)²+(y2−y1)²). Inputs: x1,y1,x2,y2. Returns distance. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| x1 | Yes | X1 | |
| x2 | Yes | X2 | |
| y1 | Yes | Y1 | |
| y2 | Yes | Y2 |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It only states 'Returns distance' and the formula, offering no additional behavioral details like error handling, input validation, or return format beyond what the schema implies.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with a formula, front-loading the purpose. Every sentence is meaningful and there is no redundant content.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity of the tool and the presence of an output schema, the description covers the essential aspects: purpose, formula, inputs, and a reference to related tools. It is sufficient for an agent to invoke the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% but descriptions are minimal ('X1', etc.). The description lists the parameters and provides the formula, adding context that they are coordinates. This meets the baseline for high coverage but does not exceed it significantly.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that it computes Euclidean distance between two 2D points, with formula and resource identification. It distinguishes from siblings implicitly by specifying '2D', but does not explicitly differentiate from calculate_distance_3d, which is a sibling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides usage context ('Use for geometry, mapping') but no explicit when-not-to-use or alternative tool references. The mention of 'See list_bundles' is vague for specific alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_distance_3dAInspect
Compute Euclidean distance between two 3D points. Use for 3D modeling, physics. Formula: √(Δx²+Δy²+Δz²). Inputs: x1,y1,z1,x2,y2,z2. Returns distance. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| x1 | Yes | ||
| x2 | Yes | ||
| y1 | Yes | ||
| y2 | Yes | ||
| z1 | Yes | ||
| z2 | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It does not disclose any behavioral traits such as side effects, auth requirements, or rate limits. The description only states the formula and inputs, lacking safety or operational context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is three sentences, front-loaded with purpose and formula. It is fairly compact but could be more concise by removing the formula or input listing. However, it is not verbose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, return values need not be described. The description covers purpose, formula, and references a sibling tool. For a simple math tool, it is modestly complete, but lacks usage boundaries and edge case handling.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 6 parameters with zero description coverage. The description simply lists the parameter names (x1,y1,z1,x2,y2,z2), which adds no meaning beyond the schema. It does not explain units, ranges, or typical values.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool computes Euclidean distance between two 3D points, with a specific verb and resource. It distinguishes from sibling tools like calculate_distance_2d by specifying 3D.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description suggests use for 3D modeling and physics, and references list_bundles for related calculators. However, it does not explicitly exclude when not to use or compare to alternatives like calculate_distance_2d, but the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_distance_securiteAInspect
Calculate safe following distance using the 2-second rule (French highway code). Returns: {safety_distance_2s_m, highway_3s_m, note}. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| speed_kmh | Yes | Vehicle speed in km/h |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
As a calculation tool, no side effects are expected. Description discloses calculation rule and output format. Given no annotations, behavior is transparent enough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences cover purpose, output, and related resources. No filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-parameter calculator with output schema, the description fully explains what it does, what it returns, and how to find related tools. Complete for its complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter description. The tool description adds context (2-second rule, French highway code) but does not add specific parameter semantics beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it calculates safe following distance using the 2-second rule (French highway code). Output keys are listed, distinguishing it from similar tools like calculate_braking_distance.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Describes when to use (for following distance per French code) and suggests list_bundles for related calculators. However, it does not explicitly compare with siblings or state when not to use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_distance_to_horizonAInspect
Calculate the distance to the horizon from a given height. Returns: {distance_km, distance_miles, distance_nautical_miles}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height_m | Yes | Observer height above ground in metres |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Description indicates a simple calculation returning a distance object. No annotations are present, so the description carries burden; it does not disclose any side effects or read-only nature, but the behavior is straightforward.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: one states purpose and output, another references related tools. No unnecessary words, efficient and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with one parameter and output schema, the description adequately covers purpose and output. Lacks formula assumptions but is sufficient given tool complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema fully documents the parameter. The description adds no new information beyond 'given height', meeting baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool calculates distance to horizon from a given height and lists three output units. It also references a related bundle for context, distinguishing it from other calculation tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The mention of a related bundle is indirect and does not provide when-not or alternative tool suggestions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dog_ageAInspect
Convert dog age to human-equivalent years using modern AAHA method. Use for canine health monitoring. Inputs: dog age years, breed size. Returns human-equivalent age. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | No | Dog size | medium |
| dog_years | Yes | Dog age in years |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the calculation method (AAHA) and inputs/outputs. With no annotations, it carries the full burden. It does not explicitly state that the tool is read-only or non-destructive, but this is implied by its nature. More explicit behavioral context would be beneficial.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences plus a reference, front-loading purpose and method. Every sentence adds value: method, usage context, and pointer to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers purpose, method, inputs, and related tools. For a simple calculation tool, this is mostly complete. However, it does not describe the output format or error conditions (e.g., invalid inputs), but the schema handles validation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'dog age years' and 'breed size' as inputs, matching the schema parameters. The schema descriptions are minimal ('Dog size', 'Dog age in years'), so the description adds some context (breed size) but not detailed enum values or default. Given 100% schema coverage, baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it converts dog age to human-equivalent years using the modern AAHA method. The verb 'convert' and resource are specific, and the tool is distinguished from sibling tools like calculate_cat_age and calculate_pet_age by its focus on dogs and reference to AAHA.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for canine health monitoring,' providing context. It also references list_bundles for related calculators, suggesting alternative tools. However, it does not explicitly state when not to use this tool versus similar ones like calculate_pet_age.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dog_foodBInspect
Calculate daily dog food quantity based on weight, age and activity level. Returns: {kcal_per_day}. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | ||
| activity | Yes | ||
| weight_kg | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of behavioral disclosure. It only states the calculation and output, without mentioning permissions, side effects, rate limits, or that it is a safe read-only operation. For a calculator tool, the risk is low, but the lack of transparency is notable.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, with the first sentence stating the core purpose and return value, and the second suggesting a related tool. There is no redundant information, and the structure is front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with three parameters and an output schema, the description covers the essential inputs and output. However, it does not specify the units (kg is in schema) or clarify the age categories. Given the presence of an output schema, the return value detail is acceptable, but a bit more context on parameter values would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, so the description must compensate. The description names the three parameters (weight, age, activity level) but adds no additional semantics about their meaning, units, or the enum values (e.g., what low/medium/high activity means). It provides minimal added value over the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates daily dog food quantity based on weight, age, and activity level, and specifies it returns kcal_per_day. It distinguishes from general calculators but does not explicitly differentiate from sibling tools like calculate_cat_food or calculate_pet_food_portion, which are similar.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use when a dog food amount is needed, but does not provide explicit when-to-use or when-not-to-use guidance. The reference to list_bundles hints at related calculators but does not compare alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dog_pregnancyAInspect
Compute dog due date from mating date (gestation 63 days). Use for breeders. Inputs: mating date. Returns due date window and milestone dates. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mating_date | Yes | Mating date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses that the tool returns a due date window and milestone dates. It is a safe computational tool, and no hidden behaviors are implied.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with the core purpose. Every sentence adds value: purpose, user audience, and reference for related tools. No fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (not shown here but noted in context signals), the description need not detail return values. It mentions 'Returns due date window and milestone dates', which suffices. One parameter, no nested objects, so completeness is high.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a single parameter 'mating_date' already described as 'Mating date YYYY-MM-DD'. The description adds 'Inputs: mating date' which is redundant but confirms the parameter. No new semantics beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description specifies the verb 'compute' and resource 'dog due date', with context 'for breeders'. It clearly distinguishes from sibling tools like calculate_cat_pregnancy by being species-specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
States 'Use for breeders' and directs to list_bundles for related calculators. Provides context but could explicitly mention when not to use, though the specificity implies clear boundaries.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dog_walking_caloriesBInspect
Compute calories burned by dog and human during a walk. Use for pet weight management. Inputs: dog weight, walk duration, pace. Returns calories burned by both. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| pace | Yes | ||
| duration_min | Yes | ||
| dog_weight_kg | Yes | ||
| walker_weight_kg | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It omits any behavioral traits beyond stating it returns calories. No mention of safety, idempotence, or side effects, which is a gap for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences and a pointer, but it is misleading due to the missing parameter. Front-loading is adequate but incomplete.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists, the description does not need to detail return values. However, the omission of the walker_weight_kg parameter leaves the tool incompletely specified for an agent, especially with 0% schema description coverage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description lists only three inputs (dog weight, walk duration, pace) but the schema requires four parameters, including walker_weight_kg. This omission misrepresents the tool's interface and adds no meaningful semantics beyond the schema structure.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes calories burned by both dog and human during a walk for pet weight management, using specific verb 'Compute' and resource 'calories burned'. This distinguishes it from sibling tools like calculate_calories_burned (probably human-only) and other pet calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It specifies 'Use for pet weight management' as a primary use case and points to list_bundles for related calculators. However, it does not explicitly state when not to use this tool or provide direct comparisons with alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dollar_cost_averageAInspect
Calculate DCA portfolio value and performance for recurring crypto investments. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| periods | Yes | Number of investment periods | |
| average_price | Yes | Average purchase price per unit over all periods | |
| current_price | Yes | Current market price per unit | |
| investment_per_period | Yes | Amount invested per period in fiat currency |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It states the tool calculates portfolio value and performance but does not disclose side effects, idempotency, or return format. The output schema exists, but the description adds no behavioral context beyond that.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences, front-loaded with the purpose. It contains no wasted words and efficiently delivers the core information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (4 parameters, output schema present), the description is mostly complete. However, it could briefly mention the formula or standard DCA assumptions, but it's adequate for a straightforward calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, and the input schema provides clear descriptions for all four parameters. The description adds no extra semantics beyond the schema, so it meets the baseline without adding value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate DCA portfolio value and performance for recurring crypto investments.' This is specific with a verb, resource, and scope. It also distinguishes from siblings by referencing 'list_bundles' for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description offers minimal guidance by mentioning 'See list_bundles for related 'crypto' calculators.' This hints at context but does not explicitly state when to use this tool versus other crypto calculators or provide when-not-to-use scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dpe_energy_classBInspect
Determine French DPE energy class from primary energy consumption. Returns: {note}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| kwh_m2_year | Yes | Primary energy consumption in kWh/m2/year |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description only says 'Returns: {note}', which is vague. It does not disclose any behavioral traits (e.g., whether it requires authentication, has side effects, or is read-only). The agent is left guessing what the note contains.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences are used, with the purpose front-loaded. However, the second sentence about list_bundles is tangential to the tool's core function, slightly reducing conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present and a single well-described parameter, the description is adequate but not rich. It lacks explanation of the DPE class system or typical use cases, which would be helpful for a domain-specific calculation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema provides 100% coverage with a clear description of the parameter. The description adds minimal extra meaning beyond restating that it uses 'primary energy consumption'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('Determine'), the specific resource ('French DPE energy class'), and the input ('from primary energy consumption'). It distinguishes itself from the large set of sibling calculators by specifying the DPE context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description hints at related tools via 'See list_bundles for related 'immobilier' calculators', which gives some context but does not explicitly state when to use this tool versus alternatives or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_drain_slopeCInspect
Compute required drain slope (% and cm/m) to ensure proper water flow per plumbing code. Use for plumbing or roof drainage. Inputs: pipe length, pipe diameter, application. Returns slope % and drop in cm. See list_bundles for related 'plomberie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| fixture_type | Yes | Type of sanitary fixture being drained | |
| pipe_diameter_mm | No | Drain pipe diameter in millimeters (default 100mm) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description must carry burden. Returns slope % and drop in cm is stated, but mentions inputs not in schema (pipe length, application), which confuses actual behavior. Lacks details on plumbing code compliance or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Concise with front-loaded purpose, but includes inaccurate input list that wastes space. Could be streamlined to match actual schema.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Mentions output format and references related tools, but fails to accurately describe inputs due to mismatch. No annotations to supplement behavioral info. For a tool with only 2 parameters, it should be more precise.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions, so baseline is 3. However, description adds misleading input references (pipe length, application) not in schema, failing to add value and potentially confusing. Enum values for fixture_type are not explained.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
States 'Compute required drain slope' which is specific, but mentions inputs not in schema (pipe length, application) while omitting actual schema parameter (fixture_type). This mismatch reduces clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for plumbing or roof drainage' and references 'list_bundles' for related tools, providing clear context. Does not specify when not to use, but sufficient for typical cases.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_dress_alterationsCInspect
Estimate dress alteration cost and time by alteration type. Use for tailoring or wedding-dress budgeting. Inputs: alterations needed (hem, sides, sleeves), garment type. Returns total cost estimate and time hours. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target_size | Yes | Target FR dress size | |
| measurement_bust | Yes | Actual bust measurement cm | |
| measurement_hips | Yes | Actual hip measurement cm | |
| measurement_waist | Yes | Actual waist measurement cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavior. It mentions returns of cost and time, but the input mismatch and lack of details on calculation or assumptions reduce transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief but contains inaccurate information about inputs, making it misleading despite its conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has an output schema, but the description fails to align with the actual inputs and provides misleading context. It references only one sibling tool among many, leaving out critical usage details.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions, so the description adds no value; it actually introduces conflicting input information (alterations, garment type) that is not in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states it estimates alteration cost and time by alteration type, but the schema requires only body measurements and a target size, not alteration type or garment type. This mismatch undermines clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It suggests using for tailoring or wedding-dress budgeting and mentions list_bundles as a related tool, but gives no when-not-to-use guidance or differentiation from many other calculate_ tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_due_dateBInspect
Calculate estimated due date using Naegele's rule and return trimester milestone dates
| Name | Required | Description | Default |
|---|---|---|---|
| last_period_date | Yes | YYYY-MM-DD — First day of last menstrual period |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description reveals the tool uses a specific medical rule (Naegele's) and returns milestone dates, which adds behavioral context beyond the schema. However, with no annotations, it doesn't address safety, idempotency, or error conditions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, focused sentence that immediately conveys the tool's purpose. It is appropriately concise without unnecessary fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema, the description adequately covers its function. It mentions the specific rule and output type, making it sufficiently complete for a straightforward calculation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has full coverage (100%) with a clear description of the parameter. The tool description adds value by explaining how the parameter is used (Naegele's rule) and what output to expect (milestone dates), enhancing agent understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates an estimated due date using Naegele's rule and returns trimester milestone dates. It provides a specific verb and resource, but does not explicitly differentiate from similar sibling tools like 'calculate_pregnancy_due_date'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus alternatives. There is no mention of prerequisites, context, or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_earthquake_energyBInspect
Calculate energy released by an earthquake from its magnitude. Returns: {energy_joules, tnt_equivalent_kg}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| magnitude | Yes | Richter/moment magnitude |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must fully disclose behaviors. It only states the return format but no details on safety, auth, rate limits, or assumptions. For a calculation tool, it likely is read-only, but not stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first states purpose, second gives return structure and cross-reference. Front-loaded and no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with an output schema, the description is largely complete. It could mention the magnitude range or formula but is adequate for an agent to use correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a descriptive parameter description. The tool description adds minimal value beyond the schema, only mentioning the return format. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates energy released by an earthquake from its magnitude, using specific verbs and resource. It mentions a bundle for related calculators but does not explicitly differentiate from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives. The only hint is a cross-reference to list_bundles for related calculators, but no explicit when-not or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ects_creditsBInspect
Estimate ECTS credit workload (1 ECTS ≈ 25-30 study hours). Use for university course planning across Europe. Inputs: course hours, credits target. Returns expected workload and balance. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| weeks | Yes | Number of weeks | |
| hours_per_week | Yes | Study hours per week | |
| hours_per_credit | No | Hours per ECTS credit (standard: 25-30) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries full burden for behavioral disclosure. It states it returns 'expected workload and balance' but does not explain that the calculation uses a configurable hours_per_credit (default 27.5) and multiplies weeks by hours_per_week. There is no mention of potential rounding, precision, or side-effect-free nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short with two sentences plus a reference. It is front-loaded with purpose. However, the conciseness sacrifices accuracy in parameter description.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (so return values are documented) and full schema parameter coverage, the description could be adequate if aligned. However, the misalignment and lack of behavioral details make it only moderately complete for an agent to invoke correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Although schema coverage is 100%, the description adds misleading information: it mentions 'course hours, credits target' as inputs, but 'credits target' does not correspond to any parameter. It fails to explain that weeks and hours_per_week multiply to total hours and that hours_per_credit is a conversion factor. This detracts from understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it estimates ECTS credit workload and provides the conversion rate. It distinguishes itself from siblings by mentioning ECTS and university planning. However, the mention of 'credits target' as an input is inconsistent with the actual schema parameters (weeks, hours_per_week, hours_per_credit), causing slight confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description specifies use for university course planning across Europe and directs to list_bundles for related calculators. It does not explicitly state when not to use this tool or provide alternatives, but the context is clear enough for an agent to infer appropriate usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_electrical_powerAInspect
Compute electrical power for single or three-phase circuits. Use for electrical engineering. Inputs: voltage, current, phase, power factor. Returns power kW and apparent power kVA. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| phase | No | Phase | mono |
| cos_phi | No | Power factor | |
| current | Yes | Amps | |
| voltage | Yes | Volts |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It implies a read-only calculation (no side effects) but does not explicitly state that it is non-destructive or safe. The behavior is clear from context, but the description adds no unique transparency beyond the tool name.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise: two sentences covering purpose, domain, inputs, returns, and guidance to related tools. Every part earns its place with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema, the description adequately covers what the tool does, what it returns (power kW and apparent power kVA), and its domain. No missing essential information for an agent to invoke it correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description lists input names ('voltage, current, phase, power factor') but does not add meaning beyond what the schema already provides (types, defaults, descriptions). No additional semantic value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes electrical power for single or three-phase circuits, which is specific and distinguishes it from other calculate_ tools. The verb 'Compute' and resource 'electrical power' are precise, and the mention of 'electrical engineering' provides domain context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using the tool for electrical engineering but does not explicitly state when not to use it or provide alternatives. The reference to 'list_bundles' for related calculators is a weak guideline, lacking direct comparison to siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_electricity_costBInspect
Compute electricity cost for an appliance. Use for energy budgeting. Inputs: power W, daily usage hours, kWh price. Returns daily/monthly/yearly cost. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | Days | |
| power_w | Yes | Watts | |
| hours_day | Yes | Hours/day | |
| price_kwh | No | EUR/kWh |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description only states that it returns daily/monthly/yearly cost, but omits behavioral details like whether it is a pure computation (read-only), any prerequisites, error conditions, or output structure beyond the mention of time periods.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences and a reference, all information is relevant and front-loaded. No extraneous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists, the description sufficiently covers the tool's purpose, inputs, and general output. It references related tools via a bundle. However, it does not mention defaults for 'days' and 'price_kwh' which could affect usage, but these are provided in the schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds minimal value by clarifying units (power W, daily usage hours) but does not elaborate on 'days' or 'price_kwh' beyond what the schema provides, and it omits mention of default values.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes electricity cost for an appliance using power, usage hours, and price, and mentions its use for energy budgeting. However, it does not explicitly differentiate from the sibling tool 'calculate_electricity_cost_appliance', though it provides context about the 'vie-quotidienne' bundle.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description recommends use for energy budgeting and references a bundle for related calculators, but does not provide explicit when-not-to-use scenarios or directly compare with alternative tools among the many sibling calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_electricity_cost_applianceAInspect
Compute annual electricity cost of a household appliance. Use for energy audit and replacement decisions. Inputs: power W, daily hours, kWh price. Returns annual cost and CO₂ kg. See list_bundles for related 'energie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| power_w | Yes | Power in watts | |
| hours_day | Yes | Hours used per day | |
| price_kwh | No | EUR per kWh |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full burden. It states the tool computes annual cost and CO₂ emissions, which implies a read-only calculation. However, it does not explicitly state that the tool has no side effects or require any authorization. For a calculator, this is acceptable but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: three short sentences. The first sentence states the core function, the second gives usage context, and the third provides a cross-reference. No unnecessary words or repetitions.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists (though not shown), the description does not need to detail return values. It covers inputs, outputs (cost and CO₂), and usage context. For a straightforward calculator, this is largely complete, though it assumes knowledge of typical annual usage (e.g., 365 days) without specifying.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. The description merely restates the parameter names (power W, daily hours, kWh price) without adding meaningful detail beyond the schema. Thus, it meets the baseline but adds little extra value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes annual electricity cost of a household appliance, specifying use for energy audits and replacement decisions. It mentions inputs and outputs. However, it does not explicitly differentiate from the sibling 'calculate_electricity_cost' tool, relying on the 'appliance' qualifier.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases ('energy audit and replacement decisions') and directs to list_bundles for related calculators. It does not specify when not to use this tool, but the positive guidance is strong.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ellipseAInspect
Compute ellipse area A=π·a·b and approximate perimeter (Ramanujan). Use for elliptical fields, tracks, or design. Inputs: semi-major a, semi-minor b. Returns area and perimeter. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | Semi-major axis | |
| b | Yes | Semi-minor axis |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool returns area and perimeter, implying no side effects. However, it lacks details on potential limitations (e.g., approximation accuracy) or any behavioral constraints.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (two sentences) and front-loaded with purpose. Every sentence adds value, and the structure is clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the schema (2 parameters, output schema exists), the description adequately covers inputs, outputs, and usage context. It references list_bundles for further exploration, making it self-contained.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds value by specifying the parameters as 'semi-major a' and 'semi-minor b' and including the formulas, which aids understanding beyond the schema's minimum constraints.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes ellipse area and perimeter using specific formulas (A=π·a·b and Ramanujan approximation). It identifies the resource (ellipse) and distinguishes from sibling tools like calculate_area or calculate_perimeter by being geometry-specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It suggests use cases: 'elliptical fields, tracks, or design.' It also directs to list_bundles for related math calculators, providing context. However, it does not explicitly state when not to use this tool versus alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_emergency_fundAInspect
Compute recommended emergency fund target (3-6 months of expenses). Use for personal financial planning. Inputs: monthly expenses, dependents count, income stability. Returns recommended fund and savings timeline. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| dependents | Yes | Number of dependents | |
| job_stability | Yes | Job stability level | |
| monthly_expenses | Yes | Monthly expenses EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description explains it computes a target and returns recommendations and savings timeline. This adequately conveys its read-only nature, but lacks depth on edge cases or behavior beyond basic operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, each serving a purpose: purpose, usage, inputs/outputs, and pointer to bundles. No wasted words, front-loaded with key info.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given three parameters and an output schema, the description covers inputs and return type. It lacks details on error conditions or assumptions, but is sufficient for this simple calculator tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description's listing of inputs (monthly expenses, dependents, income stability) merely echoes schema. It adds no new semantic meaning, thus scoring baseline 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes an emergency fund target (3-6 months of expenses) and specifies it's for personal financial planning. It distinguishes itself from the many calculate_ siblings by its specific purpose, though it could be more explicit about uniqueness.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It advises use for personal financial planning and points to list_bundles for related 'finance-universal' calculators, providing some context. However, it does not explicitly state when to avoid this tool or provide direct alternatives among siblings, leaving some guidance gap.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_employer_cost_frBInspect
Compute total employer cost in France (gross + social charges). Use for hiring budget or freelance vs salary comparison. Inputs: gross salary, status (cadre/non-cadre). Returns employer cost and total charges. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gross_monthly | Yes | Monthly gross salary EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description bears full burden. It mentions inputs (including 'status' not in schema) and outputs, but fails to disclose side effects, prerequisites, or limitations. The input mismatch is misleading.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with action, no fluff. However, the inaccurate mention of 'status' undermines conciseness and reduces trustworthiness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists but description only vaguely mentions 'Returns employer cost and total charges'. Lacks context on permissions, data sources, and the discrepancy with described inputs. Incomplete for reliable use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has one parameter (gross_monthly) with description, so baseline is 3. But description inaccurately adds 'status (cadre/non-cadre)' as an input that does not exist in schema, adding confusion instead of clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Compute total employer cost in France (gross + social charges)', specifying verb and resource. Among many French calculator siblings, it uniquely identifies this tool for employer cost, distinguishing it well.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly suggests use cases: 'hiring budget or freelance vs salary comparison'. Does not list when not to use or direct alternatives, but references 'list_bundles' for related tools, providing partial guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_energy_physicsBInspect
Calculate kinetic (½mv²), potential (mgh), mass-energy (E=mc²), or work (F·d). Returns: {energy_joules, energy_kj, energy_kwh}. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| type | Yes | Energy type | |
| force_n | No | Force N (work) | |
| mass_kg | No | Mass in kg | |
| height_m | No | Height m (potential) | |
| distance_m | No | Distance m (work) | |
| velocity_ms | No | Velocity m/s (kinetic) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Does not disclose behavioral traits like auth requirements, rate limits, or side effects. Only describes basic functionality and return format.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences, front-loaded with purpose. Efficient but could be more structured (e.g., separate sections).
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema is stated, but missing usage guidelines and behavioral transparency. Adequate for basic understanding but incomplete for safe autonomous invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and description adds formulas but does not explain parameter semantics beyond what schema provides. Baseline 3 for high coverage; no significant added value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it calculates kinetic, potential, mass-energy, or work, listing formulas and return values. It distinguishes from sibling tools by referencing list_bundles for related science calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies usage by listing energy types but provides no explicit when-to-use or when-not-to-use. Only mentions see list_bundles for related calculators, lacking specific alternatives or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_equationCInspect
Solve 1st degree (ax+b=0) or 2nd degree (ax²+bx+c=0) equations. Returns: {error}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | Coefficient a | |
| b | Yes | Coefficient b | |
| c | No | Coefficient c (for degree 2) | |
| degree | Yes | Equation degree |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description only mentions 'Returns: {error}', which is vague. It does not describe the success output (e.g., solutions), error scenarios, or any side effects. The tool's behavior is insufficiently disclosed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, consisting of two sentences that front-load the purpose and provide a reference for related tools. No unnecessary words or information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite good schema coverage and a moderate complexity, the description fails to explain the output format. The mention of 'Returns: {error}' is incomplete and does not clarify what success looks like, especially since no output schema is shown. The missing output description is a significant gap.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All parameters have schema descriptions (100% coverage), and the description adds context by showing equation forms. However, it does not provide additional meaning beyond what the schema already conveys, such as constraints or relationships between parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool solves 1st and 2nd degree equations with explicit forms (ax+b=0 and ax²+bx+c=0). However, it does not differentiate itself from sibling tools like calculate_quadratic_equation, which may overlap.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only references 'see list_bundles for related math calculators' without specifying when to use this tool versus alternatives. No explicit when-to-use or when-not-to-use guidance is provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ev_charging_costBInspect
Compute electric vehicle charging cost and time to full. Use for EV trip planning. Inputs: battery kWh, current %, target %, charger kW, kWh price. Returns cost and hours. See list_bundles for related 'energie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| price_kwh | No | Price per kWh | |
| target_pct | No | Target charge % | |
| battery_kwh | Yes | Battery capacity kWh | |
| current_pct | Yes | Current charge % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It lists inputs and outputs, but omits details like default values and calculation method. The mention of 'charger kW' that does not match schema is misleading.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with purpose. No unnecessary words, but the inclusion of an extra parameter (charger kW) reduces clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no annotations and an output schema (though not shown), description covers basic purpose and inputs. However, missing details on defaults, error handling, and the input mismatch detract from completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, baseline 3. However, description introduces 'charger kW' not in schema, causing confusion. It adds little beyond schema and does not explain defaults or constraints.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool computes charging cost and time to full for EV trip planning. It mentions key inputs and outputs, but does not explicitly differentiate from other EV-related tools among many siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides explicit use case ('for EV trip planning') and refers to list_bundles for related calculators. Lacks when-not-to-use guidance, but context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_excavationAInspect
Compute excavation volume (m³) and truck loads needed for a foundation, pool, or trench. Use for construction. Inputs: length, width, depth (m), bulking factor. Returns m³ to remove and 8m³-truck count. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| depth_m | Yes | Depth in meters | |
| width_m | Yes | Width in meters | |
| length_m | Yes | Length in meters | |
| soil_type | No | Soil type (swell: normal=1.25, rocky=1.50, clay=1.30) | normal |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It discloses that the tool computes volume and truck loads, and mentions inputs and outputs. However, it does not specify handling of edge cases (e.g., zero or negative dimensions) or confirm read-only behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences with no wasted words. It front-loads the primary function, then gives context, and ends with output and a related tool reference. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the tool's purpose, inputs, and outputs adequately. An output schema exists, so the description does not need to detail return values. The reference to a bundle of related tools adds completeness for the user.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by explaining the bulking factor (soil_type) and linking it to truck loads, which goes beyond the schema's enum descriptions. It also clarifies the output format.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes excavation volume and truck loads for specific use cases (foundation, pool, trench). It mentions inputs and outputs, and distinguishes from sibling tools by noting it's for construction and pointing to a bundle of related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for construction' and references 'list_bundles' for related calculators, providing some context. However, it lacks explicit guidance on when not to use the tool or alternatives within the tool set.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_exchange_marginAInspect
Detect the hidden margin charged by a money exchanger over the mid-market rate. Use to compare currency exchange offers. Inputs: offered rate, mid-market rate. Returns margin %. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bank_rate | Yes | Bank/bureau rate | |
| market_rate | Yes | Mid-market rate |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present, so the description must carry full behavioral disclosure. It states inputs and output (margin %) but omits the formula, assumptions (e.g., same base currency), or edge case handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with the first stating purpose, second when to use, third listing inputs/outputs. No redundancy; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple 2-parameter tool with output schema, the description covers purpose, usage, inputs, and output. It does not detail the calculation or limitations, but these are minor given the low complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description rephrases parameter names ('offered rate' for bank_rate, 'mid-market rate' for market_rate) adding slight clarity but not beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool detects hidden margin using offered and mid-market rates. However, it does not explicitly differentiate from the sibling 'calculate_exchange_rate_margin', which may cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It suggests using the tool to compare currency exchange offers but does not provide when-not-to-use scenarios or alternative tools. The reference to 'list_bundles' is vaguely related.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_exchange_rate_marginAInspect
Calculate the margin charged on a currency exchange. Returns: {cost_per_1000_eur, rating}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bank_rate | Yes | Rate offered by bank/exchange | |
| mid_market_rate | Yes | Mid-market (real) exchange rate |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden of behavioral disclosure. It reveals the return type and structure, and implies a read-only calculation, but does not discuss side effects, permissions, or other behavioral traits beyond the output.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loaded with the primary purpose, and includes a pointer to related tools without any extraneous information. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, no nested objects, output schema present), the description sufficiently covers purpose, output, and related tools. No significant gaps are present.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already fully documents both parameters. The description adds no further meaning beyond what the schema provides, resulting in a baseline score of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool calculates the margin charged on a currency exchange, using a specific verb-resource pair. It distinguishes itself from sibling tools by mentioning a related 'voyage' calculator in list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage context by mentioning a related tool for 'voyage' calculators, but lacks explicit guidance on when to use this tool versus alternatives, or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_expected_value_betAInspect
Calculate expected value and profitability of a bet or investment decision. Returns: {lose_probability}. See list_bundles for related 'jeux-probabilites' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bet_cost | No | Upfront cost to enter the bet (default 0) | |
| win_amount | Yes | Net amount won if outcome is positive | |
| loss_amount | Yes | Net amount lost if outcome is negative | |
| win_probability | Yes | Probability of winning (0 to 1) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It only mentions returning 'lose_probability' but does not describe side effects, error handling, permissions, or the full output format. This is insufficient for a tool with no annotation safety net.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading the core purpose and return value, with a brief pointer to related tools. Every sentence is necessary and there is no redundancy or verbosity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite a known output schema, the description only mentions 'lose_probability' as output, omitting other likely fields (e.g., expected value, profitability ratio) and the underlying formula. This leaves some context gap for a complete understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents all parameters. The description adds no new parameter semantics beyond 'returns lose_probability', which is a minor extra. Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates expected value and profitability of a bet or investment decision, with a specific verb and resource. It distinguishes itself from sibling probability calculators (e.g., dice_probability, poker_hand_probability) by focusing on expected value.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implicitly indicates use for EV calculations and directs users to 'list_bundles' for related calculators, providing some guidance. However, it lacks explicit when-to-use or when-not-to-use scenarios, and no alternatives are named besides the bundle reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_exposure_triangleAInspect
Calculate the missing exposure value (aperture, shutter speed or ISO) given the other two. Returns: {ev_value, lv_value, shutter_speed_s}. See list_bundles for related 'photographie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| iso | Yes | ISO sensitivity value | |
| aperture | Yes | Aperture f-number | |
| shutter_speed | Yes | Shutter speed in seconds (e.g. 0.004 for 1/250s) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. It discloses return fields (ev_value, lv_value, shutter_speed_s) and the calculation nature, but does not explicitly state whether the tool is read-only, has side effects, or handles invalid inputs. The schema covers parameter constraints, so the description adds some value but is not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: one sentence for purpose, one for return values, and a pointer to related calculators. No fluff, front-loaded with the core action. Every sentence serves a purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the contradiction with schema (required vs. 'missing value'), the description fails to completely define the tool's behavior. The output schema is mentioned but not provided, and there is no explanation of the convention for handling the 'missing' parameter. The reference to siblings is vague. For a tool with clear complexity, the description is incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description claims the tool calculates the 'missing' value given 'the other two', implying only two parameters are needed. However, the input schema requires all three parameters (iso, aperture, shutter_speed) as mandatory. This is a direct contradiction between description and schema, misleading the agent about which parameters to provide. The description adds no clarifying semantics and actively contradicts the structured definition.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: calculate the missing exposure value (aperture, shutter speed, or ISO) given the other two. It identifies the specific resource (exposure triangle) and uses a precise verb. The mention of 'photographie' calculators and reference to list_bundles helps distinguish it from sibling tools, though not explicit contrast.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage context: provide two values to get the third. It references related calculators via list_bundles, offering alternatives. However, it lacks explicit 'when not to use' guidance or prerequisites, but for a simple computation, this is adequate.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fabric_neededCInspect
Compute fabric meters needed for a garment by pattern. Use for sewing. Inputs: garment type, size, fabric width. Returns meters of fabric. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | Yes | Garment size | |
| garment_type | Yes | Garment type | |
| fabric_width_cm | Yes | Fabric roll width cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description is the sole source of behavioral info. It only states it returns meters of fabric, with no mention of side effects, permissions, or operational constraints. Does not disclose if there are any destructive behaviors or rate limits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and to-the-point, with only two sentences. However, the reference to list_bundles could be considered extraneous, but overall concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a tool with 3 required parameters and an output schema (though not shown), the description minimally covers inputs and output. It does not detail the calculation logic or any assumptions, but the tool's purpose is simple enough that this may be adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameter descriptions are already present. The description merely repeats the input names (garment type, size, fabric width) without adding new meaning or constraints beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool computes fabric meters needed for a garment by pattern and is used for sewing. It distinguishes from siblings by mentioning related 'textile-mode' calculators via list_bundles, but could be clearer about specific differentiation from similar tools like calculate_fabric_yardage.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The only hint is to see list_bundles for related calculators, but it does not specify when to choose this one over others.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fabric_yardageAInspect
Calculate fabric needed for a garment in meters (includes 10% for pattern matching). Returns: {meters_needed, note}. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | Yes | ||
| garment | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses that the calculation includes a 10% pattern matching buffer and returns an object with meters_needed and note. This adds value beyond the schema, though it could mention that the tool is read-only and non-destructive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is only two sentences, with the key information front-loaded. Every sentence serves a purpose: stating the function, including a critical detail (10% pattern matching), and providing a cross-reference.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator, the description covers the core purpose, an important behavioral detail (10% addition), and the output format. It lacks parameter explanations but is otherwise complete. The cross-reference to list_bundles adds context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 0% description coverage, and the description does not explain the parameters (garment and size) or their enum values. The agent must rely solely on the schema, missing semantic context like typical usage or examples.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'calculate' and the resource 'fabric needed for a garment' in meters, including a specific detail about a 10% addition for pattern matching. It also mentions the output format and cross-references a related bundle, distinguishing it from sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for garment fabric calculations and directs to list_bundles for related calculators, but it does not provide explicit guidance on when to use this tool over alternatives like calculate_fabric_needed, nor does it mention prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_factorial_permutationCInspect
Calculate factorial, permutations P(n,r), and combinations C(n,r). Returns: {factorial}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| n | Yes | n value | |
| r | No | r value for P(n,r) and C(n,r) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It states a return value format but is incomplete (only mentions factorial). It does not disclose side effects, permissions, or behavior when r is not provided. The missing output schema further reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences, front-loading the purpose. However, the return placeholder is incomplete, and the reference to list_bundles could be more integrated. Still, it efficiently conveys core functionality.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no annotations and the output schema not shown, the description lacks completeness. It does not explain behavior for different parameter combinations (e.g., r > n), error handling, or return structure beyond factorial. The agent has insufficient information to invoke correctly in all cases.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameters are well-defined in the schema. The description adds context that n and r are used for permutations and combinations, but it incorrectly implies returns only factorial, which could mislead agents. It does not clarify that r is only needed for permutations/combinations, reducing semantic value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates factorial, permutations, and combinations. It uses a specific verb ('Calculate') and resource ('factorial, permutations, combinations'), distinguishing it from many sibling calculators. However, it does not explicitly differentiate from other calculators, and the mention of 'Returns: {factorial}' is ambiguous regarding permutations and combinations.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides basic guidance on when to use this tool (for factorial, permutations, combinations) and refers to list_bundles for related tools. However, it lacks explicit when-not-to-use instructions or alternatives, limiting its ability to guide selection among similar calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fertilizer_npkAInspect
Calculate NPK fertilizer quantities needed based on crop type and soil type. Returns: {total_kg}. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| crop_type | Yes | Type of crop to fertilize | |
| soil_type | Yes | Type of soil | |
| surface_m2 | Yes | Surface area in square meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It only states the calculation and return value, but does not mention any side effects, permissions, rate limits, or whether the operation is read-only. For a calculation tool, this is minimal but acceptable; however, more could be added (e.g., 'no data is modified').
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise at two sentences, with the key action and result in the first sentence. Every sentence adds value, and there is no extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the essential purpose and a key output (total_kg). However, it lacks details such as units, precision, or any additional output fields. Given that an output schema exists (mentioned in context), the description could be more complete by referencing the output structure, but it is adequate for a simple calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers all three parameters with descriptions and enums, achieving 100% coverage. The description does not add significant new meaning beyond the schema, so the baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'calculate' and the resource 'NPK fertilizer quantities', and specifies the inputs (crop type, soil type). It also mentions the return value {total_kg}, distinguishing it from siblings like calculate_ anything by referencing a specific category ('jardinage') and directing to list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context for using the tool (calculating fertilizer based on crop and soil) and references list_bundles for related 'jardinage' calculators, implying when alternatives might be appropriate. However, it does not explicitly state when not to use this tool or give clear exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fish_tank_heaterAInspect
Compute aquarium heater wattage needed by tank volume and target temp. Use for aquarium setup. Inputs: tank L, current temp, target temp. Returns wattage W. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| room_temp_c | Yes | Room temperature °C | |
| target_temp_c | Yes | Target water temperature °C | |
| volume_liters | Yes | Tank volume liters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool computes and returns wattage, implying a read-only calculation, but does not explicitly confirm no side effects or mention any prerequisites. For a simple calculator, this is adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three short sentences, front-loaded with purpose. It could omit 'Use for aquarium setup' as slightly redundant, but overall concise and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 required params, output schema present, 100% schema coverage), the description covers inputs, output, and a pointer to related tools. It is sufficiently complete for an agent to use correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds 'tank L, current temp, target temp' which paraphrases the schema but does not add new meaning beyond what the property descriptions already provide.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute aquarium heater wattage needed by tank volume and target temp,' with a specific verb and resource. It distinguishes itself from the many sibling calculate_* tools by specifying aquarium heater sizing, which is unique among them.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for aquarium setup,' providing basic context, but does not explicitly mention when not to use this tool or point to alternatives like calculate_aquarium_volume for volume calculations. The reference to list_bundles is indirect guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_flight_distanceAInspect
Calculate great-circle distance between two coordinates. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| lat1 | Yes | Departure latitude | |
| lat2 | Yes | Arrival latitude | |
| lon1 | Yes | Departure longitude | |
| lon2 | Yes | Arrival longitude |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Only mentions 'great-circle distance', but does not disclose assumptions (e.g., spherical Earth), output units, or formula used. Significant gap for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states purpose, second directs to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists, so return value explanation not needed. However, description lacks details on output units and potential limitations. Adequate for a simple tool but not fully complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage with clear parameter descriptions (departure/arrival lat/lon). Description adds no extra meaning beyond schema, so baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clear verb 'calculate' and resource 'great-circle distance between two coordinates'. Distinguishes from siblings by pointing to list_bundles for voyage calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implicitly indicates when to use (distance between coordinates) and directs to list_bundles for related tools, but lacks explicit when-not-to-use or alternative comparisons.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_floor_areaBInspect
Calculate total floor area and Carrez habitable area from a list of rooms. Returns: {rooms, total_area_m2, carrez_area_m2, note}. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rooms | Yes | Rooms with length and width in meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as assumptions (e.g., rectangular rooms), rounding behavior, or error handling. It only mentions the return format, lacking critical transparency for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences and a return format, front-loading the purpose. It efficiently covers the main functionality and a related tool reference, though minor improvements could be made.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter) and the presence of an output schema (return format mentioned), the description is adequate but lacks behavioral transparency and usage guidance, leaving gaps in completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage, describing the rooms parameter. The description adds the context of 'from a list of rooms' but does not elaborate on parameter details beyond the schema. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: calculating total floor area and Carrez habitable area from a list of rooms. This distinguishes it from siblings like calculate_area or calculate_surface_carrez by combining both calculations and requiring a list of rooms.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides some context by referencing list_bundles for related construction calculators, but does not explicitly state when to use this tool versus alternatives or any exclusions. It implies use when room dimensions are available.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_flow_rate_convertAInspect
Convert flow rate between L/s, L/min, L/h, m³/h, gpm, cfm. Use for plumbing, HVAC, or industrial design. Inputs: value, from-unit, to-unit. Returns converted flow rate. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Flow rate value | |
| to_unit | Yes | Target unit | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Only states 'Returns converted flow rate' without details on return format, precision, or edge cases. Minimal behavioral info beyond conversion.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose and use case. Efficient but could include an example for clarity. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given a simple conversion tool with 3 parameters and output schema, description covers what it does, when to use, and inputs. Lacks explanation of return format but output schema likely covers it.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter. Description merely lists parameter names without adding meaning. Baseline of 3 appropriate as schema already documents parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description states specific verb 'Convert' and resource 'flow rate', lists units. Clearly distinguishes from sibling conversion tools and calculators. Includes use case context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly suggests usage for plumbing, HVAC, or industrial design. Points to list_bundles for related calculators. Does not explicitly exclude alternatives but context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_food_cost_per_servingAInspect
Compute food cost per serving from ingredient costs. Use for restaurants, meal-prep services. Inputs: list of ingredients with cost and quantity used, servings. Returns cost per serving and total. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| servings | Yes | ||
| ingredients | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must carry the burden. It states the tool 'Returns cost per serving and total', which is minimally informative about behavior. Beyond that, no side effects or permission requirements are disclosed. The description is adequate but not rich.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, no filler, immediately front-loads the purpose. Every word contributes to understanding. Exemplary conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has two parameters (one nested array) and an output schema (though not shown in input schema). The description minimally covers the return values ('cost per serving and total'). Given the complexity, it could detail the output structure or provide examples. It is adequate but not comprehensive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, so the description must compensate. It explains inputs as 'list of ingredients with cost and quantity used, servings', which clarifies the two top-level parameters. However, it does not elaborate on subfields like total_quantity or used_quantity, which remain somewhat self-explanatory from their names. The description adds moderate value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute food cost per serving from ingredient costs', which is a specific verb-resource pair. It also indicates applicability to restaurants and meal-prep services, distinguishing it from the many other 'calculate_' sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear usage context ('for restaurants, meal-prep services') and points to an alternative tool ('See list_bundles for related cuisine calculators'). It lacks explicit exclusion criteria but is sufficient for typical use cases.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_forceAInspect
Compute force using Newton's second law F=m·a. Use for physics problems. Inputs: mass kg, acceleration m/s². Returns force in newtons. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| force_n | No | Newtons | |
| mass_kg | No | Mass kg | |
| acceleration | No | m/s² |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose any behavioral traits beyond the calculation itself—no side effects, authentication needs, or limits. It merely restates the inputs and output.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading purpose with no wasted words. Every sentence serves a purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (not shown but indicated), the description need not detail return values. The description is sufficient for a simple physics calculator with no apparent gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by explicitly labeling which parameters are inputs (mass_kg, acceleration) and which is output (force_n), and explaining the formula, thereby clarifying roles beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes force via Newton's second law, specifying the verb 'compute' and resource 'force'. It is distinct from the many sibling calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for physics problems,' providing clear context. It points to list_bundles for related science calculators, but lacks explicit when-not-to-use or exclusion of alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fractionAInspect
Perform fraction operations: add, subtract, multiply, divide, simplify. Returns: {input, result, decimal}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| den1 | Yes | Denominator 1 | |
| den2 | No | Denominator 2 | |
| num1 | Yes | Numerator 1 | |
| num2 | No | Numerator 2 | |
| operation | Yes | Operation |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It mentions return format ({input, result, decimal}) but does not disclose any behavioral traits such as side effects, whether it is read-only, permissions needed, or rate limits. For a calculation tool, being non-destructive is expected but not explicitly stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences with no filler. The first sentence immediately states purpose and operations. The second provides return format and a pointer to related tools. Information is front-loaded and every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 5 parameters (all described in schema), an output schema, and is a straightforward calculation, the description covers the essential points (operations, return format). It lacks detail on edge cases (e.g., simplify with one fraction) but does not need to duplicate schema. The reference to 'list_bundles' compensates for context about siblings. Overall, it is adequate for a well-structured tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds minimal value beyond the schema: it confirms the operations (already in enum) and output format, but does not clarify parameter meaning or relationships (e.g., how default values for den2 and num2 affect operations like simplify). The description neither harms nor significantly enhances understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Perform fraction operations: add, subtract, multiply, divide, simplify.' This directly identifies the verb (perform) and resource (fraction operations), listing all supported operations. It also differentiates from related tools by directing users to 'list_bundles' for other math calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage through the listed operations but does not explicitly state when to use this tool versus alternatives. References to 'list_bundles' provide a path to find related calculators but no specific exclusion criteria or prerequisites. No guidelines on when not to use this tool are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fraction_operationsBInspect
Add, subtract, multiply, or divide two fractions and return the simplified result. Use for math homework. Inputs: num1/den1, op, num2/den2. Returns result fraction (lowest terms). See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| den1 | Yes | Denominator of first fraction | |
| den2 | Yes | Denominator of second fraction | |
| num1 | Yes | Numerator of first fraction | |
| num2 | Yes | Numerator of second fraction | |
| operation | Yes | Operation to perform |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions returning a simplified result in lowest terms, but fails to disclose critical behaviors such as handling of zero denominators, division by zero, negative numbers, or the exact output format (e.g., string versus object). These gaps limit the agent's ability to anticipate tool behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise (three sentences) with the core functionality front-loaded. Every sentence contributes: operation, use case, input format, output format, and a pointer to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has moderate complexity (5 params, output schema, many siblings). Output schema exists, so return format details are covered. However, the description omits edge cases (e.g., division by zero) and does not explain how the operation string maps to mathematical operations. It references a bundle but no direct sibling comparison.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description only echoes the parameter roles (num1/den1, op, num2/den2) without adding new meaning. It does not clarify constraints like non-zero denominators or valid operation values beyond the schema enum. Minimal value added.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool performs addition, subtraction, multiplication, or division of two fractions and returns a simplified result. It specifies the input format (num1/den1, op, num2/den2) and output ('lowest terms'). However, it does not distinguish from the sibling 'calculate_fraction' tool, which may have overlapping functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for math homework', providing a general usage context. It references 'list_bundles' for related math calculators, which gives a clue for alternatives but does not explicitly state when not to use this tool or compare with specific siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_freezer_durationAInspect
Return maximum recommended freezer storage duration for a food type. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| food_type | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the function returns a duration, which is a read-only operation, but does not disclose potential edge cases or output details. Given the presence of an output schema, the brevity is acceptable.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise (two sentences) and front-loaded with the primary purpose. Every word is necessary, and the reference to list_bundles is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple lookup tool with one enum parameter and an output schema, the description is nearly complete. It could mention units (e.g., days, months) or provide an example, but the core functionality is clear.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 0% description coverage, and the description provides no additional meaning about the 'food_type' parameter beyond the enum values. The enum is clear, but the description misses an opportunity to explain how food types map to durations or provide examples.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses a specific verb ('Return') and clearly identifies the resource ('maximum recommended freezer storage duration') and scope ('for a food type'). It also distinguishes itself from siblings by referencing related 'cuisine' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly directs users to list_bundles for related calculators, providing helpful context on when this tool is appropriate. However, it does not provide explicit when-not-to-use instructions or details on prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_freezer_thaw_timeAInspect
Estimate thawing time for frozen food by weight and method. Returns: {safe_temp_check, tip}. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| method | Yes | Thawing method | |
| weight_kg | Yes | Food weight kg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It implies a read-only computation by mentioning 'estimate' and 'returns', but lacks details on side effects, permissions, limits, or prerequisites such as food type. It is not misleading but minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first defines purpose, second adds output details and cross-reference. No redundant information; each sentence serves a clear function. Well-structured for quick parsing.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple 2-parameter calculator with an output schema, the description covers the basic purpose and return format. However, it lacks usage guidelines and deeper behavioral context, which are needed for a complete understanding despite the tool's simplicity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, so the description adds little beyond restating 'weight and method'. The mention of return fields (safe_temp_check, tip) provides some extra context, but no elaboration on parameter values like enum meanings or units.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Estimate', the resource 'thawing time for frozen food', and the inputs 'weight and method'. It also mentions the return fields, making the purpose unambiguous and distinct from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a pointer to list_bundles for related 'cuisine' calculators but does not explicitly state when this tool should or should not be used versus alternatives. No exclusions or specific use cases are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_french_income_taxAInspect
Calculate French income tax (IR) for 2026 using progressive brackets per Article 197 CGI with family quotient system. Returns: {income, family_quotient, total_tax, effective_rate_pct, marginal_rate_pct, brackets}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| parts | No | Number of fiscal shares (1=single, 2=married, +0.5 per child) | |
| income | Yes | Annual net taxable income in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden of disclosure. It describes the basic operation (progressive brackets, family quotient system) and lists return fields, but it omits details about safety (e.g., read-only nature), authentication requirements, rate limits, or behavior on invalid inputs. For a calculation tool, more transparency about its dependencies or assumptions would be expected.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, with two sentences: the first clearly states the purpose, and the second lists return fields and points to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of French income tax, the description covers the core purpose and return structure. However, it does not explain the family quotient system or bracket methodology, which might be needed for deeper understanding. Since an output schema exists, the return field list is adequate but not exhaustive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described in the schema. The description adds no additional meaning beyond the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate French income tax (IR) for 2026', specifies the relevant legal article (Article 197 CGI) and the family quotient system, and lists the return fields. This distinguishes it from the many other country-specific tax calculators in the sibling list.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies that this tool is for French income tax calculations and mentions related 'finance-france' calculators via list_bundles, but it does not explicitly state when to use this tool versus other tax calculators or when not to use it. No exclusion criteria or alternative tool names are given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_french_salaryAInspect
Convert French gross salary to net salary for 2026 (cadre, non-cadre, or civil servant). Returns monthly/annual net, social contributions, employer cost. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| status | No | Employment status | cadre |
| gross_monthly | Yes | Gross monthly salary in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses return values and that conversion uses 2026 rates. While it doesn't explicitly state read-only, the nature of a calculator implies no side effects. Good transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, no wasted words. First sentence delivers purpose and output, second provides navigation to related tools. Strong front-loading.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 100% schema coverage and output schema present, description adds key context: year 2026, employee categories, return values, and guidance to related calculators. Complete for a conversion tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. Description does not add significant meaning beyond repeating enum values and gross_monthly purpose. Baseline 3 appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool converts French gross salary to net salary for 2026, specifies employee types (cadre, non-cadre, civil servant), and lists return values (monthly/annual net, social contributions, employer cost). This distinguishes it from sibling tools like calculate_belgian_salary.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description explicitly states when to use (for French salary conversion) and points to list_bundles for related calculators. However, it does not explicitly state when not to use or exclude other scenarios, but the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_french_vatAInspect
Calculate French VAT (TVA) — convert between HT (before tax) and TTC (after tax). Supports all 4 French VAT rates. Returns: {amount_ht, amount_ttc, vat_amount, vat_rate}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | Input mode: ht=before tax, ttc=after tax | ht |
| rate | No | VAT rate percentage | 20 |
| amount | Yes | Amount in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description explains the calculation purpose and return fields. As a read-only calculation tool with no annotations, it does not need to disclose destructive actions, but it could explicitly state it is non-destructive. No contradictions with annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loading the core purpose and adding return info plus a cross-reference. Every word adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with full schema descriptions and a stated output structure, the description is largely complete. It could mention the currency (euro) but that is in the schema. The reference to list_bundles helps with sibling tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema covers all parameters with descriptions (100% coverage). The description adds contextual understanding (e.g., conversion between HT and TTC) but does not significantly deepen parameter semantics beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates French VAT (TVA) and converts between HT and TTC, supporting all 4 French VAT rates. This specific verb+resource combination distinguishes it from sibling VAT tools for other countries.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related finance-france calculators' but does not explicitly differentiate from other VAT calculators like calculate_vat_generic or country-specific ones. Usage context is implied by the name, but no explicit when/when-not guidance is given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_frequency_noteAInspect
Calculate the frequency of a musical note based on equal temperament tuning. See list_bundles for related 'musique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| note | Yes | Note name in chromatic scale | |
| octave | Yes | Octave number (A4 = concert pitch reference) | |
| tuning_reference | No | Tuning reference frequency in Hz (default A4=440Hz) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full responsibility for behavioral disclosure. It only states the method (equal temperament) but omits details on output format, rounding behavior, or constraints, leaving the agent uninformed about side effects or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences) with no unnecessary words. The first sentence states the purpose, the second provides a pointer to related tools. It is efficient but could benefit from slightly more structure.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, the description need not detail return format, but it fails to mention edge cases (e.g., out-of-range octaves), accuracy, or how the equal temperament formula is applied. It is minimally adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers 100% of parameters with descriptions, so the tool description adds little beyond mentioning 'equal temperament tuning'. The baseline of 3 is appropriate as the schema already documents parameters adequately.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('calculate the frequency') and the specific resource ('a musical note based on equal temperament tuning'), and distinguishes the tool by referencing a sibling tool ('list_bundles' for related 'musique' calculators).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context for when to use the tool (calculating musical note frequency) and suggests where to find related calculators (list_bundles), but does not explicitly exclude scenarios or compare with siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fuel_consumptionAInspect
Calculate fuel consumption in L/100km and MPG from distance and fuel used. Returns: {l_100km, mpg_us, mpg_uk, co2_g_km_petrol}. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| distance_km | Yes | Distance in km | |
| fuel_liters | Yes | Fuel consumed in liters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It lists inputs and outputs but does not disclose edge cases, error handling, or any side effects. For a read-only calculation, this is adequate but not exhaustive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. First sentence states purpose and output units, second lists return fields and references related tools. Very efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the input schema and the self-documented output schema in the description, the tool is well-documented for its simple purpose. The return object is explicitly listed, making it complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with param descriptions already present. The description adds no extra meaning beyond 'from distance and fuel used', providing no additional context beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates fuel consumption in L/100km and MPG from distance and fuel used. It specifies the exact return object, distinguishing it from sibling tools like calculate_fuel_cost or calculate_fuel_economy_conversion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives the basic context (inputs are distance and fuel used) but does not explicitly state when to use this tool versus alternatives like calculate_fuel_economy_conversion or convert_fuel_consumption. It only hints at related tools via 'see list_bundles'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fuel_costAInspect
Compute fuel cost for a journey. Use for trip budgeting or company expense. Inputs: distance km, consumption L/100km, fuel price €/L. Returns total cost and L used. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| fuel_price | Yes | Price/liter | |
| consumption | Yes | L/100km | |
| distance_km | Yes | Distance km |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must disclose behavior. It states returns 'total cost and L used', but doesn't mention safety (e.g., read-only), limits, or side effects. For a calculation tool, the lack of explicit non-destructive assurance is a minor gap, but overall adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first defines purpose, second lists inputs/outputs and hints at related tools. Every sentence adds value; no fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given simplicity (3 required parameters, simple calculation), description adequately explains inputs and outputs. No output schema provided, but description covers return values. Lacks edge-case handling, but not needed for basic understanding. Fairly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter. The description reiterates parameter meanings with units (km, L/100km, €/L) but adds no new semantic information beyond the schema. Baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses a specific verb 'Compute' and resource 'fuel cost for a journey', clearly stating the function. It lists the exact inputs and outputs, and points to a related bundle for other auto-transport calculators, helping differentiate from siblings like calculate_fuel_consumption.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear use cases: 'trip budgeting or company expense.' References list_bundles for alternative calculators, implying when to use other tools. However, lacks explicit when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_fuel_economy_conversionAInspect
Convert between fuel economy units: L/100km, mpg-US, mpg-UK, km/L. Use for car comparisons across regions. Inputs: value, from-unit, to-unit. Returns converted economy. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Fuel economy value to convert | |
| to_unit | Yes | Target unit of fuel economy | |
| from_unit | Yes | Source unit of fuel economy |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description alone must inform behavior. It only says 'returns converted economy' without details on rounding, precision, error handling, or side effects, which is insufficient for a mutation-free operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with the core purpose, and contains no superfluous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, the description does not need to detail return values. It covers units, use case, and references a related tool. It could mention that conversions use standard formulas, but is otherwise complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers all 3 parameters with descriptions (100% coverage). The description adds value by mapping enum codes to human-readable units (L/100km, mpg-US, etc.), improving clarity beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it converts fuel economy units and lists the supported units. However, it does not differentiate from the sibling tool 'convert_fuel_consumption', which may serve a similar purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It suggests using the tool for car comparisons across regions, providing some context, but does not give explicit when-to-use or when-not-to-use guidance relative to siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_future_valueBInspect
Compute the future value (FV) of a present sum at a given interest rate. Use for savings projections. Inputs: present value, annual rate %, years, compounding frequency. Returns FV and total interest. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rate | Yes | Annual rate percent | |
| years | Yes | Number of years | |
| present_value | Yes | Present value EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description should disclose behavioral traits. It mentions returning 'FV and total interest', implying a read-only computation, but does not explicitly state safety, auth needs, or side effects. Adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise but contains a factual error (extra parameter), undermining its reliability. The structure is front-loaded but the misinformation reduces effectiveness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description omits the compounding frequency parameter, which is critical for future value calculations. It is incomplete given the tool's complexity and the presence of an output schema, which could clarify return values.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description incorrectly lists 'compounding frequency' as an input, which is not in the schema. This misleads the agent. The description adds little beyond the schema's own parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Compute' and resource 'future value', and specifies the use case 'savings projections'. It distinguishes from siblings like calculate_present_value by indicating directionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for savings projections' and references list_bundles for related calculators, providing context. However, it lacks explicit exclusions or direct comparison to similar tools like calculate_compound_interest.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_garden_soilCInspect
Compute soil volume (m³) and number of bags needed for a garden bed. Use for gardening. Inputs: area, depth, bag volume. Returns m³ and bag count. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_m | Yes | Width m | |
| depth_cm | Yes | Depth cm | |
| length_m | Yes | Length m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description carries the full burden. It correctly indicates a read-only computation but does not elaborate on side effects or safety. For a calculator, this is minimal but acceptable.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded, but the misleading mention of bag volume reduces its efficiency. It could be more precise without extra fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
While the tool is simple and an output schema exists, the description omits the bag volume parameter and does not explain the calculation assumptions. This leaves the agent without key information for correct invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'bag volume' as an input, but the schema has no such parameter, only length, width, depth. This contradiction undermines the parameter semantics and could mislead the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes soil volume and bag count for a garden bed, distinguishing it from sibling tools like calculate_raised_bed_soil. However, it mentions 'area' as input while schema uses length and width, causing slight ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives a weak cross-reference to list_bundles but no explicit when or when-not to use. It does not differentiate from related tools such as calculate_raised_bed_soil, leaving the agent without usage guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_garden_sunlight_hoursBInspect
Estimate effective daily sunlight hours for a garden based on latitude, month and orientation. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| month | Yes | Month number (1=January, 12=December) | |
| latitude | Yes | Latitude in degrees (-90 to 90) | |
| orientation | Yes | Garden orientation / aspect |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the transparency burden. It correctly indicates the tool is an estimator (non-destructive calculation), but does not disclose assumptions, limitations, or what 'effective' means. Adequate for a simple calculator but could add context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences. The first sentence front-loads the purpose and inputs, and the second sentence usefully points to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 scalar parameters, all required, output schema exists), the description is sufficiently complete. It defines the operation and inputs; the output schema handles return values. No major gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema provides descriptions for all three parameters (100% coverage), so the description adds no additional meaning. It merely repeats the parameter names, which does not enhance understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates effective daily sunlight hours based on latitude, month, and orientation. It identifies the specific resource and inputs, but does not explicitly differentiate from sibling tools like calculate_sun_exposure.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related jardinage calculators,' which hints at alternatives but does not provide explicit when-to-use or when-not-to-use guidance. No criteria for choosing this tool over others are given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_garden_water_needsBInspect
Compute weekly water needs for a garden by area and plant type. Use for irrigation planning. Inputs: garden m², climate, plant mix. Returns L/week and watering frequency. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| season | Yes | Current season | |
| plant_type | Yes | Type of plants in the garden | |
| surface_m2 | Yes | Garden surface area in square meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool 'Returns L/week and watering frequency,' which is helpful. It does not mention any side effects, but as a computation tool, it is likely non-destructive. The description lacks details on prerequisites or error conditions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with the main action. It efficiently conveys purpose and directs to related tools. Minor improvement could be replacing 'climate' with 'season' for accuracy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists, the description does not need to explain return values. It covers the main use case and provides context. However, the mention of 'climate' instead of 'season' is an inconsistency that could mislead an agent. It is adequate but not fully complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'garden m², climate, plant mix' as inputs, but the schema has surface_m2, plant_type, and season. 'Climate' is not a parameter, causing a mismatch. The description adds little beyond the schema's descriptions, which already cover the parameters (100% coverage).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Compute weekly water needs for a garden by area and plant type,' which clearly identifies the verb, resource, and key inputs. It distinguishes from sibling tools like calculate_garden_soil and calculate_garden_sunlight_hours. However, it mentions 'climate' as an input while the schema uses 'season,' causing slight ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for irrigation planning,' providing context. It also suggests seeing list_bundles for related calculators. However, it does not explicitly state when not to use this tool or offer alternatives among the many garden calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_gas_fee_ethBInspect
Calculate Ethereum transaction gas fee in ETH and USD. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gas_limit | No | Gas limit for the transaction (default 21000 for simple transfer) | |
| eth_price_usd | No | Current ETH price in USD (default 3000) | |
| gas_price_gwei | Yes | Gas price in Gwei |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as whether it uses live data, what the formula is, or any side effects. It only states the calculation result, lacking transparency beyond the basic purpose.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise with a single sentence stating purpose and a cross-reference to list_bundles. No unnecessary words, front-loaded with the main action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (which is not shown but indicated as true), the description does not need to explain return values. However, it lacks context about the formula, data sources, or that it's an estimate, which would be helpful for an agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the parameters are already documented in the schema. The description adds no additional meaning beyond the schema, thus baseline score of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates Ethereum transaction gas fee in ETH and USD, using specific verb and resource. However, it does not explicitly differentiate from other crypto calculator siblings, though the 'crypto' reference provides some context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for Ethereum gas fee calculation and points to list_bundles for related calculators, but gives no explicit when-to-use, when-not-to-use, or alternatives among the many calculate_ siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_gcd_lcmBInspect
Calculate GCD (PGCD) and LCM (PPCM) of two integers using Euclidean algorithm. Returns: {gcd, lcm}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | First integer | |
| b | Yes | Second integer |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility for behavioral disclosure. It mentions the Euclidean algorithm and return format, but does not reveal error handling, edge cases (e.g., negative numbers, zero), or whether the operation is safe (read-only). This is insufficient for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and front-loaded, stating the main action and algorithm in the first sentence. It includes the return format and a reference to related tools. It could be slightly more structured, but overall it is well-sized with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity of the tool (2 parameters, 100% schema coverage, output schema implied), the description covers the essential aspects: action, algorithm, and output. It omits edge case handling but is otherwise complete for the tool's complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with clear descriptions for parameters 'First integer' and 'Second integer'. The description adds no additional meaning beyond what the schema already provides. Thus, the baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate', the resource 'GCD and LCM of two integers', and the method 'using Euclidean algorithm'. It also distinguishes from siblings by directing to list_bundles for related math calculators. This provides a specific and unambiguous purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not explicitly state when to use this tool versus its many siblings. The only guidance is 'See list_bundles for related math calculators', which is too vague and does not help an agent decide between gcd/lcm and other math functions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_gear_ratioAInspect
Compute gear ratio and torque/speed multiplication. Use for mechanical engineering, cycling, automotive. Inputs: driver teeth, driven teeth. Returns ratio and torque multiplier. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| driven_teeth | Yes | Driven gear teeth | |
| driving_teeth | Yes | Driving gear teeth |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must stand alone. It adequately describes the tool as a computational function with no side effects. However, it does not elaborate on any potential limitations or edge cases.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences: purpose, use cases and inputs, then outputs and a helpful cross-reference. Every sentence adds value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with an output schema indicated, the description covers inputs, outputs, and use cases sufficiently. No additional behavioral or contextual details are necessary.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions. The description echoes the inputs ('driver teeth, driven teeth') and adds output info ('Returns ratio and torque multiplier'), providing value beyond schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states 'Compute gear ratio and torque/speed multiplication' with specific applications (mechanical engineering, cycling, automotive), clearly defining the tool's purpose and distinguishing it from numerous sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases: mechanical engineering, cycling, automotive. It also directs to 'list_bundles' for related science calculators, providing context. However, it lacks explicit when-not-to-use instructions or alternative tool names.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_glycemic_loadCInspect
Calculate glycemic load (GL) per food and total for a meal. Returns: {thresholds}. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| foods | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It only states the calculation and returns, omitting any details about safety, permissions, or side effects. It gives no indication that it is a read-only operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at two sentences, but the placeholder '{thresholds}' detracts from clarity. It could be more structured by explaining the output format.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite a simple single-parameter input, the description lacks essential details about the output (thresholds) and does not guide the agent on using the tool correctly. The incomplete output description and missing parameter explanations make it insufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, meaning the description must add meaning to the parameters. The description does not explain the 'foods' array structure, the meaning of 'gi', 'carbs_g', 'portion_g', or how they relate to the calculation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates glycemic load per food and total for a meal, which is a specific verb+resource. It also mentions returning thresholds, but does not explicitly distinguish from other nutrition calculators among the many siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description hints at related cuisine calculators via list_bundles, suggesting when to use alternatives, but does not explicitly state when to use this tool vs. other nutrition or calculator tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_gpa_frenchBInspect
Convert French school grades (out of 20) to GPA and academic mention. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| note_1 | Yes | Grade 1 out of 20 | |
| note_2 | Yes | Grade 2 out of 20 | |
| note_3 | No | Grade 3 out of 20 (optional) | |
| note_4 | No | Grade 4 out of 20 (optional) | |
| note_5 | No | Grade 5 out of 20 (optional) | |
| coeff_1 | No | Coefficient for grade 1 | |
| coeff_2 | No | Coefficient for grade 2 | |
| coeff_3 | No | Coefficient for grade 3 (0 if unused) | |
| coeff_4 | No | Coefficient for grade 4 (0 if unused) | |
| coeff_5 | No | Coefficient for grade 5 (0 if unused) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It fails to disclose behavioral traits such as how grades are averaged, the role of coefficients, the output GPA scale, or mention types. Basic functionality is stated but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single sentence plus cross-reference to list_bundles. Front-loaded and efficient, but could be more structured. Every part earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given complexity (10 params, output schema exists), description adequately conveys purpose. Output schema compensates for missing return value explanation. However, some behavioral context (e.g., coefficient defaults) is absent from description.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with clear parameter descriptions (min, max, defaults). The description adds no additional parameter-level meaning beyond schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts French school grades (out of 20) to GPA and academic mention, using specific verb and resource. It distinguishes from siblings by naming the French grading system, though no direct sibling differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool vs alternatives. The description merely references list_bundles for related calculators, but does not specify when this tool is appropriate or when to choose another calculator.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_grade_averageAInspect
Compute simple or weighted grade average. Use for school report cards. Inputs: grades list, optional weights/coefficients. Returns weighted average and missing-grade-needed forecast. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| grades | Yes | Array of grades | |
| coefficients | No | Optional array of coefficients/weights |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must disclose behavior. It mentions outputs ('weighted average and missing-grade-needed forecast') but does not state whether the tool is read-only, has side effects, or handles edge cases like empty inputs or mismatched coefficients. This is adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (three sentences) and front-loaded with the core action. It efficiently covers purpose, usage, inputs, and outputs. Minor structural improvements (e.g., bullet points) could enhance readability but are not necessary.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description covers essential aspects: purpose, usage, inputs, and outputs. With both input and output schemas present, it is fairly complete. It lacks information on error handling or behavior with invalid inputs, but this is not critical for a calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description paraphrases the parameters ('grades list, optional weights/coefficients'), adding minimal value beyond the schema. It does not clarify constraints like coefficient length matching or grade ranges.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes simple or weighted grade averages, specifying the verb ('Compute') and resource ('grade average'). It also provides a usage context ('Use for school report cards'), which distinguishes it from sibling tools like 'calculate_average' that compute generic averages.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises using it for school report cards and directs users to 'list_bundles' for related education calculators, implying when to use it. However, it does not explicitly exclude cases or contrast with specific siblings (e.g., 'calculate_average').
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_grade_neededAInspect
Calculate the grade needed on remaining exams to reach target average. Returns: {error}. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| exams_done | Yes | Number of exams completed | |
| exams_total | Yes | Total number of exams | |
| target_average | Yes | Target final average | |
| current_average | Yes | Current average out of 20 |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden for behavioral disclosure. It only mentions 'Returns: {error}' which is vague and does not explain error conditions, side effects (e.g., no state changes), or any constraints beyond the schema. This is insufficient for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences, the first stating the core purpose and the second referencing related tools. Every sentence is essential and front-loaded, with no waste.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (4 params, no nested objects, output schema exists), the description is adequate but lacks details on return format beyond '{error}', and does not explain validation constraints (e.g., exams_done < exams_total) that are hinted in the schema. It provides minimal context for an agent to fully understand behavior.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents all four parameters with descriptions. The tool description adds no additional meaning to the parameters beyond what the schema provides, resulting in a baseline score of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates the grade needed on remaining exams to reach a target average, using specific verbs and resource. It distinguishes from siblings like calculate_grade_average by focusing on remaining exams, and references list_bundles for related calculators, providing grouping context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when a user has current average, target average, and exam counts, and references list_bundles for related calculators. However, it does not explicitly state when not to use it or compare directly to the similar sibling calculate_grade_average, leaving slight ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_gravel_quantityAInspect
Compute gravel volume (m³) and weight (tonnes) for a surface and depth. Use for paths, foundations, drainage. Inputs: area, depth, gravel density. Returns volume and weight. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_m | Yes | Width m | |
| depth_cm | Yes | Depth cm | |
| length_m | Yes | Length m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of behavioral disclosure. It only states that the tool computes and returns values, but lacks details on whether it modifies state, requires authentication, has rate limits, or any other behavioral traits. For a calculation tool, the absence of side-effect disclosure is a notable gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loading the core function and essential inputs/outputs. Every sentence adds value: the first defines the tool, the second specifies use cases, and the third directs to related tools. There is no waste.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (calculation with three parameters) and the presence of an output schema, the description adequately covers the key aspects: what it computes, inputs, outputs, and related tools. The discrepancy about gravel density is a minor completeness issue, but overall it provides sufficient context for an agent to select and use the tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers all three parameters with descriptions, achieving 100% coverage. The description adds mention of 'gravel density' as an input, but this parameter is not present in the schema, creating a discrepancy. While the schema itself is clear, the description's extra detail confuses the parameter set, preventing a higher score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses specific verbs ('Compute') and clearly identifies the resource (gravel volume and weight) and application context (paths, foundations, drainage). It distinguishes from siblings by referencing the 'list_bundles' tool for related calculators, making its unique purpose unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear use cases ('Use for paths, foundations, drainage') and directs users to 'list_bundles' for related calculators, offering context for when to use this tool. However, it does not explicitly state when not to use it or compare it directly to alternative tools, which would strengthen guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_grocery_unit_comparisonBInspect
Compare unit prices of grocery items — normalizes g→kg, mL/cL→L. Returns: {best_value, savings_vs_priciest}. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| items | Yes | Items: name, price, quantity, unit (kg/g/L/mL/cl/unit) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden for behavioral disclosure. It describes unit normalization (g→kg, mL/cL→L) and return fields, which covers some behavior. However, it does not address error handling, unit limitations, or other edge cases.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences and no fluff. It front-loads the main action and quickly follows with return format and a pointer. However, the pointer to list_bundles could be seen as minor noise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (nested input, many siblings, no annotations), the description covers the core purpose and normalization but lacks details on output schema, error cases, and when to choose this tool over similar ones. It is minimally adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description adds context about unit normalization but does not elaborate on individual parameter meanings beyond what the schema provides. No examples or clarifications for edge cases.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly specifies the verb 'Compare unit prices' and resource 'grocery items', and explains the normalization. The mention of return values adds clarity. However, it does not explicitly differentiate from sibling tools like calculate_unit_price, though the multi-item comparison is implied.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal usage guidance. It mentions 'See list_bundles for related vie-quotidienne calculators' but does not state when to use this tool vs alternatives or when not to use it. No explicit context for selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_harvest_dateCInspect
Estimate harvest date for vegetables based on sowing date and region. Returns: {harvest_date, days_to_harvest}. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| region | Yes | Growing region: north (+10 days), south (-10 days), mediterranean (-15 days) | |
| plant_type | Yes | Type of vegetable | |
| sowing_date | Yes | Sowing date in ISO format (YYYY-MM-DD) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description must fully disclose behavior. It only states the input and output, but omits details like accuracy, assumptions, data sources, or error handling for invalid dates or regions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the main action and return type, and includes a helpful pointer to related bundles. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a straightforward calculation tool with a clear schema and output, the description covers the basics. However, it lacks usage guidelines and behavioral transparency, making it minimally complete but not comprehensive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter descriptions already informative (e.g., enum values with day offsets). The description adds minimal value beyond the schema, mentioning sowing date and region in prose. Baseline 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates harvest date for vegetables based on sowing date and region, and specifies the return object. However, it does not explicitly differentiate from sibling tools, though the unique purpose is evident among many calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives, no exclusions, and no contextual hints such as prerequisites. It only implies usage when harvest date is needed.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_hat_sizeAInspect
Calculate hat size in FR/EU, US/UK systems and standard S/M/L/XL from head circumference (cm). Returns: {head_circumference_cm, FR_EU, US_UK, standard_size}. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| head_circumference_cm | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It states what the tool does and what it returns, but does not disclose any behavioral traits such as input validation, side effects, or performance characteristics. The return structure is specified, which adds value.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: first states purpose and output, second provides a useful cross-reference. It is efficient, front-loaded, and contains no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with one parameter and an output schema (present but not shown), the description appropriately explains the return fields and cross-references related tools. It could include edge case handling or valid input ranges, but is mostly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema has 0% description coverage, so the description must add meaning. It adds the unit 'cm' and context that the parameter is a head measurement, but this is already implied by the parameter name. The description does not add significant new semantics beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates hat size in multiple systems (FR/EU, US/UK, S/M/L/XL) from head circumference in cm. It distinguishes from sibling calculators by referencing list_bundles for related 'textile-mode' calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for converting head circumference to hat sizes in various systems. It provides a cross-reference to find related calculators, but does not explicitly state when not to use or list alternatives directly.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_heart_rate_zonesAInspect
Calculate heart rate training zones Z1-Z5, optionally using Karvonen method. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| max_hr | Yes | Maximum heart rate in bpm | |
| resting_hr | No | Resting heart rate for Karvonen method (bpm) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the burden of behavioral disclosure. It adequately indicates a non-destructive calculation operation, but lacks explicit statements about side effects, permissions, or data access. The calculator nature makes destructive behavior unlikely, so score is adequate but not exemplary.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is exceptionally concise with only two sentences. The first sentence delivers the core purpose, and the second provides a helpful cross-reference. No redundant or unnecessary information is present.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema (not shown), the description is reasonably complete. It specifies the calculation type and optional method, and references related calculators. It could mention the output format (e.g., heart rate ranges for each zone), but the output schema likely covers that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with both parameters already documented. The description adds that the Karvonen method is optional, which is already implied by the optional resting_hr parameter. No additional semantic value beyond the schema is provided.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates heart rate training zones Z1-Z5, optionally using the Karvonen method. It specifies the verb (calculate) and resource (heart rate zones), distinguishing it from similar sibling tools like calculate_max_heart_rate or calculate_training_zones_running.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions the optional Karvonen method but does not provide explicit guidance on when to use this tool versus alternatives. The reference to list_bundles for related calculators implies some connection, but no when-to-use or when-not-to-use criteria are given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_heat_indexAInspect
Calculate the apparent temperature (heat index) from temperature and humidity. Returns: {heat_index_c, feels_warmer_by_degrees, risk_level}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| humidity_pct | Yes | Relative humidity in percent | |
| temperature_c | Yes | Air temperature in degrees C |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It explicitly states the return object structure ('heat_index_c, feels_warmer_by_degrees, risk_level'), providing behavioral insight beyond the input schema. No contradictions found.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is concise (two sentences) with front-loaded purpose. First sentence covers action and output, second provides a useful pointer to related tools. No unnecessary content.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of a simple calculator with output schema, the description adequately covers purpose and return values. Missing edge cases like input validation ranges (e.g., temperature limits) but overall sufficient for a basic tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with both parameters having descriptions in the schema. The description adds no additional meaning beyond what schema already provides, so baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool calculates heat index from temperature and humidity, using specific verb 'Calculate' and resource 'apparent temperature (heat index)'. It distinguishes from siblings like other weather calculators by naming the specific output.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidelines. The mention of 'list_bundles' suggests related calculators but does not differentiate this tool from alternatives like calculate_dew_point or calculate_wind_chill, which are among the many sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_heat_pump_copDInspect
Compute heat pump Coefficient of Performance (COP). Use for HVAC efficiency analysis. Inputs: heat output kW, electric input kW. Returns COP and seasonal SCOP estimate. See list_bundles for related 'energie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| indoor_temp | No | Indoor target °C | |
| outdoor_temp | Yes | Outdoor temperature °C |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears full responsibility for behavioral disclosure. It only states the tool computes COP and returns a SCOP estimate, but does not mention any side effects, access requirements, or limitations (e.g., read-only nature). The description fails to add meaningful behavioral context beyond the basic operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is relatively short and front-loaded with the purpose, but the inclusion of contradictory input information makes it unreliable. While brevity is generally positive, the inaccuracy reduces the effectiveness of the structure.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema, the description partially compensates by mentioning 'Returns COP and seasonal SCOP estimate'. However, the input mismatch is a critical gap, and the description does not clarify that outdoor_temp is required or how the parameters relate to COP. Overall, the description is incomplete and misleading.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema provides 100% coverage with clear descriptions ('Indoor target °C', 'Outdoor temperature °C'), so parameter semantics are clear from the schema alone. However, the tool description explicitly lists 'heat output kW' and 'electric input kW' as inputs, which does not match the schema. This contradiction severely misleads the agent, adding confusion rather than value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Compute heat pump Coefficient of Performance (COP)' which provides a clear verb and resource. However, it then specifies 'Inputs: heat output kW, electric input kW' which directly contradicts the actual input schema (indoor_temp and outdoor_temp). This mismatch undermines purpose clarity, as the agent cannot correctly identify the inputs.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description vaguely suggests use for HVAC efficiency analysis and references list_bundles for related calculators, but does not provide explicit guidance on when to use this tool versus alternatives. No exclusions or prerequisites are mentioned, leaving the agent with insufficient context for selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_horse_weightAInspect
Estimate horse weight using Carroll formula from heart girth and body length. Use for vets, feed dosing. Inputs: heart girth cm, body length cm. Returns weight kg. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| body_length_cm | Yes | Body length cm | |
| heart_girth_cm | Yes | Heart girth circumference cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavior. It states the formula (Carroll), inputs (heart girth cm, body length cm), and output (weight kg). However, it does not detail any limitations, accuracy, or edge cases, which would be helpful for a complete understanding.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, consisting of two sentences. The first sentence states the core purpose and formula, and the second provides use context and a reference to related tools. No extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that the tool is a simple calculator with 2 parameters and an output schema (assumed), the description adequately covers its functionality. It mentions the formula and use cases, and references a related bundle for broader context. Could be slightly more detailed about output format but it's sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema coverage is 100% with clear descriptions for both parameters. The description merely repeats the parameter names and units without adding further semantic value beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates horse weight using the Carroll formula from heart girth and body length. It specifies the formula and the intended use cases (vets, feed dosing), and is distinct from the many sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly mentions when to use the tool ('Use for vets, feed dosing') and points to related calculators via list_bundles. It does not explicitly state when not to use or provide alternatives, but the guidance is clear for the intended domain.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_hourly_costBInspect
Compute fully-loaded hourly cost-to-company. Use for project pricing or freelance rate. Inputs: monthly salary, social charges %, billable hours/month. Returns true hourly cost. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| work_days | No | Working days/year | |
| charges_pct | No | Employer charges % | |
| annual_gross | Yes | Annual gross salary EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden for behavioral disclosure. It mentions the tool 'Returns true hourly cost' and lists inputs, but the input description is inaccurate (says 'monthly salary' while the schema requires 'annual_gross'). It does not disclose rounding, default behaviors (e.g., defaults for work_days, charges_pct), or prerequisites. This mismatch and lack of detail reduce transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is succinct, with three sentences covering purpose, usage, inputs/outputs, and related tools. It is well-structured and front-loaded. However, the inclusion of an inaccurate input description detracts from efficiency.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that this is a financial calculation with an output schema, the description does not need to detail return values, but it should accurately describe inputs. The description's mismatch (monthly vs annual) makes it incomplete for correct usage. It also does not clarify currency or what 'true hourly cost' encompasses. For a tool with no annotations, more accurate and complete context is needed.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema parameter descriptions already cover 100% of parameters, so baseline is 3. The description adds contextual meaning by explaining the purpose of inputs (e.g., 'social charges %'), but it also introduces inconsistency (calls annual_gross 'monthly salary') which could mislead agents. Overall, it adds marginal value but with potential confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('Compute fully-loaded hourly cost-to-company') and the resource (hourly cost). It provides a usage context ('Use for project pricing or freelance rate'). However, it does not explicitly differentiate this tool from many sibling calculators (e.g., calculate_french_salary, calculate_auto_entrepreneur), only vaguely referencing 'list_bundles' for related tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives a clear use case ('Use for project pricing or freelance rate'), indicating when to use the tool. However, it lacks explicit guidance on when NOT to use it or which alternatives to consider. The reference to 'list_bundles' for related calculators is weak and does not provide concrete exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_housing_aidBInspect
Estimate French housing aid (APL — Aide Personnalisee au Logement). Returns: {rent, rent_ceiling, estimated_apl, note}.
| Name | Required | Description | Default |
|---|---|---|---|
| rent | Yes | Monthly rent in euros | |
| city_zone | No | City zone: 1 (Paris/IDF), 2 (large cities), 3 (rural) | 2 |
| household_size | No | Number of people in household (1-6) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It says 'estimate' implying approximation, but does not explain the estimation logic, limitations (e.g., rent ceiling not specified), or any special cases. The return object fields are listed but not described.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: one sentence plus a list of return fields. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the existence of an output schema (described inline) and complete parameter schema, the description provides the core purpose and return structure. However, it lacks context like currency (euros), geographic applicability, and any usage notes that would help an agent understand when results are valid.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents all parameters. The description adds no extra parameter meaning; it only lists output fields. Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates French housing aid (APL) and specifies the return structure. It uses a specific verb ('Estimate') and resource ('French housing aid'), which distinguishes it from the many sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus alternatives or any prerequisites (e.g., user must be in France). The description only states what it does, not when it's appropriate.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_housing_loan_comparisonAInspect
Compare multiple mortgage offers sorted by total cost. Returns: {offers_count, best_offer, comparison}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| offers | Yes | List of mortgage offers to compare | |
| loan_amount | Yes | Loan amount in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It states the tool compares offers and returns a comparison, but lacks explicit behavioral details like idempotency, side effects, or limitations beyond what is implied.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, no fluff. First sentence states purpose, second provides return structure and pointer to related tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present, the description need not explain return values beyond what it does. It mentions return fields and points to related calculators. Could clarify what 'total cost' includes (e.g., interest, insurance).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds no additional parameter meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compare multiple mortgage offers sorted by total cost', which is a specific verb-resource combination. It also mentions the return fields, distinguishing it from single mortgage calculators like calculate_mortgage.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by referencing list_bundles for related calculators, implying alternatives. However, it does not explicitly state when to use this tool versus others or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_hydrationBInspect
Compute recommended daily fluid intake by weight, activity, and weather. Use for athletes and outdoor workers. Inputs: weight kg, activity hours, temperature °C. Returns L/day and electrolyte recommendation. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| climate | No | Climate/environment | temperate |
| weight_kg | Yes | Body weight in kilograms | |
| activity_minutes | No | Daily exercise duration in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavior. It describes returning L/day and electrolyte recommendation but inaccurately lists inputs (says 'temperature °C' and 'activity hours', while schema uses 'climate' and 'activity_minutes'). This mismatch could mislead agents about required parameters.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences, covering purpose, use cases, inputs/outputs, and related tools. It is front-loaded with the main function. Minor issue: the input list format differs from the schema, but overall it is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the moderate complexity and presence of an output schema, the description covers purpose, use cases, inputs, and outputs. However, the input inaccuracies (temperature vs climate, hours vs minutes) reduce completeness. It does not detail the output schema structure, which is acceptable since an output schema exists.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds misleading information: it mentions 'temperature °C' and 'activity hours', while the schema uses 'climate' (enum) and 'activity_minutes'. It does add context about outputs (electrolyte recommendation) but fails to accurately convey parameter semantics beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes recommended daily fluid intake, specifying inputs (weight, activity, weather) and outputs (L/day, electrolyte recommendation). It mentions target users (athletes, outdoor workers) and references related calculators via list_bundles, providing some sibling differentiation. However, it does not explicitly distinguish from similar tools like calculate_water_intake.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises use for athletes and outdoor workers and points to list_bundles for related 'sante' calculators, indicating alternatives. However, it does not explicitly state when not to use or compare directly to siblings like calculate_water_intake, leaving some ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_hydraulic_pressureAInspect
Compute hydraulic system pressure P=F/A. Use for hydraulic design. Inputs: force N, area m². Returns pressure in Pa, kPa, bar, psi. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| force_n | Yes | Force N | |
| area_cm2 | Yes | Piston area cm² |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description is the sole source of behavioral info. It states the formula and return units, which is adequate for a read-only calculator. However, it lacks explicit mention of side effects, permissions, or safety, which is understandable for a calculation tool but could be improved.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise, with only two sentences that convey the essential purpose, inputs, and outputs. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the formula, inputs, and output units. With an output schema present, it need not detail return structure. However, it misses constraints (e.g., area in cm² as per schema) and does not address edge cases or precision, making it somewhat incomplete for a tool with a parameter inconsistency.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Although the schema has 100% coverage, the description introduces a contradiction: it says 'area m²' but the schema uses 'area_cm2' (cm²). This mismatch can mislead the agent. The description does add return units, but the error reduces its value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes hydraulic pressure using P=F/A, specifies the context (hydraulic design), and lists inputs and outputs. It distinguishes itself from siblings like calculate_pressure_convert by its specific formula and application.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for hydraulic design' and points to list_bundles for related calculators. While it doesn't detail when not to use or compare with alternatives, the context is sufficiently clear for a calculator tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_hyperfocal_distanceAInspect
Calculate hyperfocal distance and near/far sharp limits for a lens and aperture. See list_bundles for related 'photographie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| coc_mm | No | Circle of confusion in mm (default 0.03 for full frame) | |
| aperture | Yes | Aperture f-number | |
| focal_length_mm | Yes | Lens focal length in millimeters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It indicates a calculation with no side effects, but does not disclose precision, units, or formulas used. The behavior is straightforward for a calculator, but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences. The first front-loads the primary purpose; the second provides a navigation hint. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given an output schema exists and schema coverage is complete, the description covers the essential purpose. However, it lacks context on output units or interpretation of 'near/far sharp limits,' which could hinder an agent's understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline 3. The description adds no parameter-specific details beyond what the schema provides. Terms like 'coc_mm' and 'aperture' are not explained further.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the specific function: 'Calculate hyperfocal distance and near/far sharp limits for a lens and aperture.' It identifies the resource and action distinctly among many 'calculate_*' siblings, and mentions a related bundle for context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides implicit usage context by mentioning 'photographie' calculators, but does not explicitly state when to use this tool versus alternatives. No when-to-use or when-not-to-use guidance is given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ideal_gasAInspect
Solve PV=nRT. Provide any 3 of: pressure_pa, volume_m3, moles, temperature_k. R=8.314. Returns: {error}. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| moles | No | Amount in mol | |
| volume_m3 | No | Volume in m³ | |
| pressure_pa | No | Pressure in Pa | |
| temperature_k | No | Temperature in K |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the equation and that the tool returns an error, implying a pure calculation. While a calculation tool has minimal side effects, the description could be more explicit about being read-only and idempotent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loaded with the core equation and key usage information. Every sentence provides essential information without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists, the description need not detail return values. It covers the essential input constraints and constant. However, the mention of returning '{error}' is slightly vague and could be clarified.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already describes each parameter. The description adds value by clarifying the constraint of providing exactly three out of four parameters and specifying the gas constant R=8.314, which aids correct usage beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool solves PV=nRT, specifies the exact parameters involved (pressure_pa, volume_m3, moles, temperature_k), and indicates that any three of four are required. This provides a specific verb and resource, distinguishing it from the many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives explicit usage instruction: 'Provide any 3 of: ...' and mentions the constant R=8.314. It also references list_bundles for related calculators. However, it does not explicitly state when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ideal_weightAInspect
Estimate ideal body weight using Lorentz and Devine formulas. Returns: {lorentz_kg, devine_kg, average_kg}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sex | Yes | Biological sex | |
| height_cm | Yes | Height in centimeters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description carries burden. It discloses use of specific formulas and output structure, but lacks detail on formula assumptions (e.g., for adults only) or potential limitations. Adequate but not exhaustive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose, then output format, then related tool. No unnecessary words, efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with two parameters and no output schema, description provides adequate context: purpose, formulas, and return object. Could mention formula applicability or ranges but not critical.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameters with descriptions. Description does not add additional meaning; it merely references formulas that use the parameters. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it estimates ideal body weight using specified formulas (Lorentz and Devine), which is a specific verb+resource. It also distinguishes from sibling 'calculate_ideal_weight_range' by focusing on specific weight formulas.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool vs alternatives. The only hint is 'see list_bundles for related calculators', which is vague and does not provide clear differentiation or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ideal_weight_rangeAInspect
Compute healthy weight range based on BMI 18.5-24.9. Use for nutrition planning. Inputs: height cm. Returns ideal weight range (min, max) in kg and lb. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sex | Yes | Biological sex | |
| frame | Yes | Body frame size | |
| height_cm | Yes | Height in cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses the calculation basis (BMI 18.5-24.9) and that inputs include height, sex, frame. It does not mention side effects, auth requirements, or error handling. While acceptable for a simple calculation tool, it lacks explicit statement of read-only or deterministic behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three short sentences, each serving a clear purpose: purpose, usage, inputs/outputs, and reference. No redundant words. Front-loaded with primary action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity, full schema coverage, and presence of output schema, the description provides all essential information: purpose, usage, inputs, outputs, and a pointer to related tools. It lacks explicit differentiation from 'calculate_ideal_weight' but overall is adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (all parameters described). The description adds minimal beyond schema: it mentions 'height cm' but schema already says 'Height in cm'. It does not elaborate on sex or frame parameter meanings. Baseline 3 is appropriate as description does not significantly enhance schema info.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Compute' and resource 'healthy weight range based on BMI 18.5-24.9'. It specifies the return format (min, max in kg and lb) and differentiates from simple BMI calculators by providing a range. It also mentions usage for nutrition planning, which distinguishes it from other health calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives a context ('Use for nutrition planning') and references related tools via 'See list_bundles for related calculators'. However, it does not explicitly guide when to use this tool versus similar siblings like 'calculate_ideal_weight', nor does it provide when-not-to-use scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_impermanent_lossAInspect
Calculate impermanent loss for a DeFi liquidity pool position when price ratio changes. Returns: {value_in_pool_ratio}. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| price_ratio_change | Yes | Price ratio change multiplier (e.g. 2.0 if token doubled in price, 0.5 if halved) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must carry full burden. It mentions output format but does not state whether the tool is read-only or any side effects. Adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences front-loaded with core purpose, no redundant information. Efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Has output schema so return values are partially covered, but lacks explanation of what impermanent loss conceptually is or formula context. Adequate for users familiar with DeFi.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage 100% with parameter description in schema. Tool description adds no extra meaning beyond 'when price ratio changes.' Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states 'Calculate impermanent loss for a DeFi liquidity pool position when price ratio changes.' Specific verb and resource, distinguishes from sibling calculators by specifying DeFi liquidity pool context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Suggests related tools via 'See list_bundles for related 'crypto' calculators.' but lacks explicit when-to-use or when-not-to-use guidance relative to other calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_inflation_adjusted_valueAInspect
Compute real (inflation-adjusted) purchasing power of a future amount. Use for retirement or savings goal in today's euros. Inputs: nominal future amount, years, average inflation %. Returns real value. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | Yes | Number of years | |
| amount | Yes | Amount in EUR | |
| inflation_rate | Yes | Annual inflation rate percent |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It clearly describes a computational (non-destructive) tool that returns a real value. This is sufficient for a calculator with no side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three concise sentences with no wasted words. The core action is stated first, providing immediate clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator tool with an output schema, the description adequately covers purpose, usage, and related tools. No critical information is missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description adds minimal context ('in today's euros') beyond the schema descriptions, not significantly enhancing parameter understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes inflation-adjusted purchasing power, specifying usage for retirement or savings goals in today's euros. However, it does not explicitly distinguish from similar tools like calculate_inflation_adjustment, so it gets a 4.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises using the tool for retirement or savings goals, providing some context. However, it lacks explicit guidance on when not to use it or alternatives, other than referring to list_bundles for related calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_inflation_adjustmentBInspect
Adjust a nominal amount to a target year using a constant inflation rate. Use for real-value comparisons across time. Inputs: amount, inflation rate %, years. Returns adjusted value and total inflation factor. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | Yes | Number of years | |
| amount | Yes | Original amount | |
| inflation_rate | Yes | Annual inflation rate in % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as whether the tool is read-only, requires permissions, or has any side effects. It mentions returns values but gives no error handling or limits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences, front-loading purpose and usage. All content is relevant, though structure could be improved with clearer separation of sections.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description covers purpose, usage context, inputs, outputs, and related tools. However, it lacks explicit mention of the constant rate assumption and potential limitations.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter. The description merely repeats parameter names and units without adding new meaning beyond what the schema provides, meeting the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it adjusts a nominal amount to a target year using a constant inflation rate, and specifies use for real-value comparisons. However, it does not differentiate from the very similar sibling tool 'calculate_inflation_adjusted_value', leaving ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description advises using the tool for real-value comparisons across time and references list_bundles for related calculators, but lacks explicit when-to-use vs. alternatives or when not to use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_inheritance_taxAInspect
Calculate French inheritance tax (droits de succession) based on relationship and amount. Returns: {amount, abatement, taxable_base, tax_due, effective_rate_pct, marginal_rate_pct}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Inherited amount in euros | |
| relationship | Yes | Relationship to deceased: spouse, child, sibling, other |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as whether the operation is read-only, requires authentication, or has rate limits. It only describes inputs and outputs, leaving safety and behavioral expectations unclear.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise (two sentences) with no unnecessary words. It front-loads the purpose and immediately lists the return structure, making it efficient and easy to parse.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a 2-parameter tool with an output schema, the description covers the essential information: what it calculates, the required inputs, and the return fields. It lacks edge cases or examples, but is complete enough for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents both parameters. The description adds no additional meaning beyond what the schema provides (e.g., format, constraints, or examples), meeting the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates French inheritance tax based on relationship and amount, specifies return fields (amount, abatement, etc.), and distinguishes from siblings by referencing related calculators under 'finance-france'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for French inheritance tax calculations but does not explicitly state when to use or not use it compared to other tax calculators. The inclusion of 'See list_bundles for related calculators' provides limited guidance but no direct alternatives or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_insulation_rAInspect
Compute thermal resistance R = thickness/lambda. Use for insulation specification (RT2020/RE2020). Inputs: thickness m, conductivity λ W/m·K. Returns R-value m²·K/W. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| lambda | Yes | Conductivity W/(m.K) | |
| thickness_mm | Yes | Thickness mm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the calculation formula, input units, and output unit, which implies a pure computation with no side effects. However, it does not explicitly state that the tool is read-only or describe any limitations, permissions, or error conditions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences that front-load the formula and use case. Every word adds value, and there is no wasted text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists, return values are covered. The description mentions the formula, use case, and references a related bundle. However, it lacks details about input ranges, precision, and the unit mismatch reduces completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema covers 100% of parameters with descriptions and units, but the description adds little beyond redundancy. Critically, the description states 'thickness m' while the parameter is 'thickness_mm' (millimeters), creating a unit contradiction that could confuse the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes thermal resistance using the formula R = thickness/lambda, and specifies the regulation context (RT2020/RE2020). However, it does not distinguish itself from the similarly named sibling 'calculate_insulation_r_value', which may cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says to use the tool for insulation specification under RT2020/RE2020, providing a clear use case. It also directs users to 'list_bundles' for related calculators, but does not list alternative tools or conditions when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_insulation_r_valueAInspect
Calculate thermal R-value: R = thickness/lambda. Compare with RE2020 targets. Returns: {lambda_w_mk, r_value_m2KW, re2020_targets, walls_ok, roof_ok, floor_ok}. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| lambda | Yes | Conductivity W/(m·K) — mineral wool ~0.035, polyurethane ~0.025 | |
| thickness_mm | Yes | Insulation thickness in mm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the burden. It clearly discloses the formula, the returned fields, and the comparison with RE2020 targets. For a calculation tool, this is sufficient behavioral context; no side effects or mutability concerns are expected.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences plus a structured return field list. Every element serves a purpose, and the key information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 2 parameters, clear inputs, and described outputs (even though no formal output schema), the description is complete. It also references related tools, providing sufficient context for a simple calculation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema already covers both parameters with descriptions. The description adds value by providing the formula (R = thickness/lambda), typical lambda values (mineral wool ~0.035, polyurethane ~0.025), and explaining units, enriching understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: calculate thermal R-value using the formula R = thickness/lambda and compare with RE2020 targets. It distinguishes itself from siblings by specifying the comparison with regulatory targets and references list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
While the description mentions seeing list_bundles for related calculators, it does not provide explicit guidance on when to use this tool versus similar tools like calculate_insulation_r, nor does it specify prerequisites or scenarios. The guidance is implied but not explicit.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_insurance_estimateAInspect
Estimate annual car insurance from vehicle value, driver age and bonus-malus. Returns: {annual_premium_eur, monthly_eur, note}. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| driver_age | Yes | Driver age | |
| vehicle_value | Yes | Vehicle value EUR | |
| bonus_malus_coefficient | No | Bonus-malus (0.5=best, 3.5=worst) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It does not disclose any behavioral traits such as read-only nature, side effects, or prerequisites (e.g., whether data is stored). The description only states what it returns, not how it behaves.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the purpose, and includes the return structure. Every word is necessary; no fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 primitive parameters, output schema exists), the description is complete enough. It mentions related tools via list_bundles. Could optionally explain the note field, but not critical.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description lists the parameters but adds no extra meaning beyond the schema. No examples or additional constraints are provided.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'estimate', the resource 'annual car insurance', and the key inputs (vehicle value, driver age, bonus-malus). It distinguishes from siblings by referencing list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context for when to use the tool (estimating car insurance) and directs to list_bundles for related calculators. Could explicitly state when not to use it, but the context is sufficient given the sibling list.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_international_shippingCInspect
Estimate international shipping cost and delivery time by carrier and weight. Use for e-commerce or expat shipping. Inputs: from-country, to-country, weight kg, dimensions. Returns cost range and lead time. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| from_zone | Yes | Destination zone | |
| weight_kg | Yes | Actual parcel weight in kg | |
| dimensions_cm | Yes | Parcel dimensions in cm (length, width, height) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility. It states returns 'cost range and lead time' but does not disclose that the tool is read-only, requires no authentication, or any other behavioral traits. The mismatch between 'by carrier' in the description and the actual parameters further obscures behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at three sentences, but the inclusion of ambiguous and contradictory elements (e.g., 'by carrier', country vs. zone) reduces clarity. It could be more tightly focused on what the tool actually does.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 3 parameters (one nested object) and an output schema, the description fails to fully explain the parameter semantics, especially the 'from_zone' enum and the omission of origin country. It does not leverage the output schema to describe return values, but that is acceptable. Overall, the description is incomplete for accurate tool invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Although schema coverage is 100% with descriptions, the description adds misleading information: it says inputs include 'from-country, to-country' but the schema only has 'from_zone' (destination zone) and no origin country parameter. This directly contradicts the schema and provides no additional useful meaning.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates international shipping cost and delivery time, which distinguishes it from the numerous other 'calculate_*' tools. However, the mention of 'by carrier' is not reflected in the schema, and the input description says 'from-country, to-country' but schema only includes 'from_zone' (destination zone), creating slight confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context ('Use for e-commerce or expat shipping') and hints at an alternative ('See list_bundles for related 'voyage' calculators'), but lacks explicit when-not-to-use instructions or clear differentiation from non-voyage calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_inventory_eoqAInspect
Compute Economic Order Quantity (Wilson formula). Use for supply chain optimization. Inputs: annual demand, order cost, holding cost per unit. Returns EOQ and orders/year. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| order_cost | Yes | Cost per order | |
| holding_cost | Yes | Annual holding cost per unit | |
| annual_demand | Yes | Annual demand units |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so description must carry burden. It discloses that it computes EOQ and returns orders/year, which is sufficient for a stateless calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, no waste, front-loaded with purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given an output schema exists and the tool is a simple calculator, the description covers inputs, formula, usage, and references for related tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. The description adds little beyond listing inputs again, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Compute Economic Order Quantity (Wilson formula).' This is a specific verb+resource that clearly distinguishes it from sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It says 'Use for supply chain optimization.' and references list_bundles for related calculators, providing context but no explicit when-not-to-use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_inventory_turnoverAInspect
Compute inventory turnover ratio = COGS / avg inventory. Use for retail efficiency analysis. Inputs: COGS, average inventory value. Returns turnover and days-on-hand. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cogs | Yes | Cost of goods sold | |
| avg_inventory | Yes | Average inventory value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; the description only states the formula and outputs. Does not disclose any behavioral traits like side effects, authorization needs, rate limits, or response specifics. Falls short for a data-mutating or computational tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two efficient sentences; front-loaded with purpose, then context and reference. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (implied), the description mentions return values (turnover and days-on-hand). Adequate for a simple calculator, though edge cases or error handling are absent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers both parameters with descriptions (100% coverage). Description repeats the parameter names (COGS, average inventory) but adds little new semantic meaning beyond the schema, so baseline score applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the formula (COGS / avg inventory) and context (retail efficiency analysis), but doesn't strongly differentiate from the many other calculate_* sibling tools beyond referencing list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for retail efficiency analysis' and directs to list_bundles for related calculators, providing clear usage context. Lacks explicit when-not-to-use or comparisons to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_jet_lag_recoveryAInspect
Estimate jet lag recovery time based on timezone difference and direction of travel. Returns: {direction, tips}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| timezone_diff_hours | Yes | Timezone difference in hours (positive = eastward, negative = westward) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses the input (timezone difference and implied direction) and output format, but does not elaborate on assumptions, accuracy, or edge cases. This is adequate for a simple estimation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states purpose, second gives return type and a pointer to related tools. No redundant information, front-loaded, and every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description is complete for a simple parameterized tool: explains what it does, what it returns, and directs to a bundle of related tools. However, it could mention any limitations or assumptions, but given the simplicity, it is nearly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already fully describes the parameter (timezone_diff_hours) with min, max, and meaning of sign. The description adds no new information beyond what the schema provides, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates jet lag recovery time based on timezone difference and direction of travel, and specifies the return fields (direction, tips). It does not explicitly differentiate from sibling tools like calculate_timezone_convert, but the purpose is unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when one wants to estimate jet lag recovery, but it does not provide guidance on when not to use it or suggest alternatives. The pointer to list_bundles for related calculators is helpful but not a direct alternative.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_kinetic_energyAInspect
Compute kinetic energy KE=½·m·v². Use for physics, vehicle safety analysis. Inputs: mass kg, velocity m/s. Returns kinetic energy in joules. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mass_kg | Yes | Mass in kg | |
| velocity_ms | Yes | Velocity in m/s |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses the formula, required inputs (mass in kg, velocity in m/s), and output in joules. It is transparent about the calculation, though it does not cover edge cases or performance traits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first explicitly states purpose and formula, second contextualizes usage and directs to related tools. No superfluous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with 2 parameters, the description adequately covers inputs, formula, and output. It references related tools via list_bundles. Completeness is sufficient given the tool's simplicity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers both parameters with descriptions. The explanation adds the formula and clarifies units beyond the schema, providing meaningful context for parameter usage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes kinetic energy using the formula KE=½·m·v² and specifies use cases (physics, vehicle safety). It lacks explicit differentiation from the many sibling calculation tools, but references list_bundles for related calculators, indirectly aiding distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises use for physics and vehicle safety analysis, and points to list_bundles for related 'science' calculators, providing alternatives. It does not explicitly state when not to use, but the context is adequate.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_knitting_yarnAInspect
Calculate yarn needed for a knitting project (meters and number of 50g/100m balls). Returns: {meters_of_yarn, balls_50g_100m}. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | Yes | ||
| project | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It reveals the output structure ({meters_of_yarn, balls_50g_100m}) and assumes a standard 50g/100m yarn. It does not mention other constraints, safety, or side effects, but being a calculation tool, the description is moderately transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences. The first sentence conveys purpose and output, and the second provides a reference. No wasted words; information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (two enum parameters, output schema exists), the description covers purpose, output format, and related tools. It assumes a standard yarn type but does not mention limitations. It is adequate for a straightforward calculator, though it could briefly describe parameters for completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%, so the description should compensate by explaining the parameters. It does not describe the 'project' or 'size' parameters beyond their enum values. The only added context is the output unit, which indirectly relates to parameters but does not clarify their semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: calculating yarn needed for a knitting project in meters and number of 50g/100m balls. It specifies the resource (knitting yarn) and distinguishes itself from sibling calculators by referencing list_bundles for related textile-mode calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides guidance on when to use this tool by directing users to list_bundles for related calculators, implying this is the specific yarn calculator for knitting projects. However, it does not explicitly state when not to use this tool or provide alternatives beyond the bundle reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_laundry_costAInspect
Calculate weekly and annual laundry cost (electricity + water + detergent). Returns: {per_load_eur, weekly_eur, annual_eur}. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| loads_per_week | Yes | Loads per week | |
| water_liters_per_load | No | Liters per load (default 50) | |
| detergent_cost_per_load | No | Detergent EUR/load (default 0.30) | |
| electricity_kwh_per_load | No | kWh per load (default 1.2) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided. The description reveals the return structure but does not disclose behavioral traits such as destructive hints, authorization needs, or rate limits. For a calculator tool, the lack of transparency is moderate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences: the first states the core purpose and components, the second mentions the return format and sibling tool. No unnecessary words, and key information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, good schema coverage (100%), and presence of an output schema, the description is largely complete. It includes the return format and a sibling reference. However, it lacks explicit explanation of cost components or assumptions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with each parameter having a description and defaults. The tool description adds minimal parameter meaning beyond the schema, primarily the return format. A baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate weekly and annual laundry cost (electricity + water + detergent).' It specifies the components and mentions the return format. The sibling tool reference helps distinguish it from other calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for laundry cost estimation but does not explicitly state when to use this tool versus alternatives. The reference to 'list_bundles' for related calculators provides indirect context, but no direct guidelines or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_lawn_mowing_frequencyAInspect
Calculate recommended lawn mowing interval based on grass type, season and rainfall. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| season | Yes | Current season | |
| grass_type | Yes | Type of grass: cool_season (fescue/rye), warm_season (bermuda/zoysia), or mixed | |
| weekly_rainfall_mm | No | Average weekly rainfall in mm (default 25mm) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden for behavioral disclosure. It only states what the tool calculates and does not mention any behavioral traits (e.g., no side effects, authentication needs, rate limits, or that it is a pure computation). The minimal description leaves the agent without insight into safety or prerequisites.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with front-loaded purpose. Every sentence adds value: first sentence states functionality, second provides a useful cross-reference. No redundancy or filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description adequately covers what the tool does and inputs. Although output schema exists and is not described, the presence of an output schema reduces the need for return value details in the description. A minor gap: no mention of units for the interval (days/weeks) but output schema likely handles that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with detailed enum descriptions and defaults. The description adds 'based on grass type, season and rainfall' which aligns with parameters but does not significantly enhance understanding beyond what the schema already provides. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states verb 'calculate', resource 'lawn mowing interval', and inputs 'grass type, season and rainfall'. It distinguishes from hundreds of sibling calculator tools by being specific to lawn mowing. The mention of 'list_bundles' for related calculators provides additional grouping context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implied usage: use this tool to calculate lawn mowing intervals based on given parameters. The hint to 'see list_bundles for related calculators' suggests grouping but does not explicitly state when to use this vs alternatives or provide when-not-to conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_lawn_seedAInspect
Compute grass seed quantity (kg) for a lawn area at recommended seeding rate. Use for landscaping. Inputs: area m², seed rate g/m². Returns seed kg. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| area_m2 | Yes | Lawn area m² | |
| rate_g_m2 | No | Seeding rate g/m² |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses inputs, output unit (kg), and computational purpose. For a simple calculator, this is sufficient, though it could mention any data constraints or assumptions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with clear front-loaded purpose, followed by inputs/outputs and cross-reference. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, presence of output schema, and cross-reference to related tools, the description is complete. Could be improved by noting potential edge cases or precision.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameters are fully documented. Description adds some value by restating units and default rate, but does not significantly extend beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it computes grass seed quantity for a lawn area at recommended seeding rate, specifying verb and resource. It distinguishes from construction calculators by referencing list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for landscaping' and directs to list_bundles for construction calculators, providing both use case and exclusion.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_leave_daysAInspect
Calculate French paid leave (congés payés): 2.5 days/month, max 25 working days/year. Returns: {accrual_per_month, accrued_days, capped_at_max, max_annual_days, days_to_max, months_to_max}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| start_date | Yes | YYYY-MM-DD — Employment start date | |
| months_worked | Yes | Months worked in the reference period (1-12) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description carries the full burden. It explains the calculation logic (accrual rate, cap) and lists all return fields. It is a read-only computation tool, and the description implicitly conveys its non-destructive nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first concisely states the purpose and formula, and the second lists return fields and references related tools. It is front-loaded and every word adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (not shown but indicated), the description lists all return fields, making it complete. It provides sufficient context for a straightforward calculator with clear inputs and outputs.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema already describes both parameters with 100% coverage. The description adds context by linking the parameters to the formula (e.g., months_worked drives the 2.5 days/month accrual) and explaining the output fields, which goes beyond basic parameter names.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates French paid leave (congés payés) with a specific formula (2.5 days/month, max 25 days/year). This distinguishes it from the many sibling calculators by specifying the exact resource and scope.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description references 'list_bundles' for related 'temps-rh' calculators, providing some guidance on alternatives. However, it does not explicitly state when to use this tool versus others or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_led_savingsAInspect
Compute energy and money saved by switching from incandescent/halogen to LED. Use for energy audit. Inputs: old wattage, LED wattage, daily hours, kWh price, bulbs. Returns yearly savings. See list_bundles for related 'energie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| led_w | Yes | LED replacement wattage | |
| old_w | Yes | Old bulb wattage | |
| hours_day | No | Hours per day | |
| num_bulbs | No | Number of bulbs | |
| price_kwh | No | EUR/kWh |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It accurately describes that the tool computes yearly savings based on inputs and returns a result. There are no destructive side effects, and the description does not contradict any annotations (none exist). It provides adequate transparency for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first clearly states the purpose, the second gives usage guidance and a pointer to related tools. There is no wasted text, and the key information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema is present (the tool has an output schema), the description does not need to explain return values. It mentions 'Returns yearly savings', which is sufficient. For a simple calculation tool with well-documented inputs and a known output, the description is complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, providing clear descriptions for each parameter. The description adds minimal extra meaning beyond repeating the input list. Per guidelines, baseline is 3 for high coverage with little additional value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Compute' and the resource 'energy and money saved by switching from incandescent/halogen to LED'. It specifies the context 'Use for energy audit' and differentiates from siblings by mentioning 'See list_bundles for related 'energie' calculators'. This provides a specific, actionable definition.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for energy audit', giving a clear usage context. It also directs to list_bundles for related calculators, which helps distinguish when to use this tool vs alternatives. However, it does not explicitly state when not to use it, so it gets a 4 instead of 5.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_life_path_numerologyAInspect
Calculate numerology life path number from birth date. Returns: {life_path_number, meaning}. See list_bundles for related 'fun' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| birth_date | Yes | Birth date in YYYY-MM-DD format |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses that the tool calculates and returns an object, but does not mention any behavioral traits such as side effects, authentication needs, or rate limits. Given it's a calculation, it's likely safe, but the description adds little beyond the purpose.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: one for purpose and one for output and related tools. Every sentence earns its place with no wasted words. Highly succinct and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-parameter calculator, the description covers the essential purpose, output format, and even hints at related tools via list_bundles. While no explicit output schema is provided in structured form, the description mentions the return object structure. This is adequate for the tool's complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% (the birth_date parameter is well-documented in the schema). The description adds that the calculation is 'from birth date', which aligns with the schema. No additional meaning is provided beyond the schema, so baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it calculates the numerology life path number from a birth date, which is a specific verb+resource. The tool name and description distinguish it from siblings by focusing on numerology, a unique topic among many calculation tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description provides some guidance by mentioning the return format and pointing to list_bundles for related fun calculators. However, it does not explicitly state when to use this tool vs alternatives or provide exclusion criteria. It implies usage context but lacks explicit when-not-to-use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_light_yearCInspect
Convert between light-years and km/miles. Use for astronomy. 1 ly = 9.461×10¹² km. Inputs: value, from, to. Returns converted distance. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| unit | Yes | Input unit | |
| value | Yes | Value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description claims inputs 'value, from, to' but the schema only has 'value' and 'unit', creating a direct contradiction. No annotations exist to clarify behavior. This misrepresentation is a serious flaw.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (three sentences) but includes a redundant reference to list_bundles. It could be crisp but sacrifices accuracy for brevity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The output schema exists, so return format is covered, but the description fails to correctly describe the input parameters, leading to incompleteness. The tool is simple but the description's inaccuracies undermine completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description adds incorrect parameter info ('from, to') not reflected in the schema. Schema already has descriptions for both params (100% coverage), so the description only adds confusion and no value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Convert between light-years and km/miles' which indicates a conversion purpose, but the schema includes units like parsec and au not mentioned. The sibling tool 'calculate_light_year_distance' may overlap, but no differentiation is provided. Purpose is clear in a general sense but incomplete and inconsistent.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Only states 'Use for astronomy' and suggests looking at list_bundles for related calculators. No guidance on when to use this vs. alternatives (e.g., calculate_light_year_distance). No exclusion or prerequisite info.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_light_year_distanceAInspect
Convert astronomical distances between light-years, parsecs, AU, km. Returns: {light_years, parsecs, au, km, original}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Distance value | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It only states the conversion purpose and output format, omitting whether the tool is read-only, any permissions needed, or potential side effects. For a conversion tool, read-only nature is implied but not stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences, no extraneous words. The purpose is front-loaded in the first sentence, and the second provides output details and a reference to related tools. Every word serves a purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists (not shown), the description's mention of the return object suffices. It lacks a note about preserving the original unit and could clarify that all conversions are returned, but overall it is reasonably complete for a simple conversion tool with two parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameters are already explained. The description adds value by clarifying the output format ('Returns: {light_years, parsecs, au, km, original}') and the units involved, which the schema does not specify in context.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses specific verbs ('convert') and resources ('astronomical distances between light-years, parsecs, AU, km') and lists sibling tools, making the purpose clear. It distinguishes itself from the many sibling calculators by focusing on astronomical distance conversion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description states what the tool does but does not explicitly guide when to use it over alternatives. The mention of 'See list_bundles for related calculators' hints at related tools but provides no exclusion criteria, especially against the sibling 'calculate_light_year' which may be confused.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_linear_regressionCInspect
Calculate linear regression slope and intercept from summary statistics. Returns: {error}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| n | Yes | Number of data points | |
| sum_x2 | Yes | Sum of (xi-x_mean)² | |
| sum_xy | Yes | Sum of (xi-x_mean)*(yi-y_mean) | |
| x_mean | Yes | Mean of x values | |
| y_mean | Yes | Mean of y values |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description only mentions returning an error object without detailing edge cases, assumptions, or limitations (e.g., required centering of input sums). This is insufficient for behavioral disclosure.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very short and to the point, efficiently stating the core purpose and return format. However, it lacks essential context, so it is not perfectly concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite having an output schema and full parameter descriptions, the description fails to explain how to use the summary statistics together, what the expected inputs represent, or any context for interpreting results. This is insufficient given the tool's mathematical complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions. The tool description adds no extra meaning beyond what the schema already provides, so baseline score applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Calculate linear regression slope and intercept from summary statistics,' which clearly specifies the tool's purpose and output. However, it does not distinguish between similar statistical tools among the many siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives, nor any conditions or prerequisites are mentioned. The reference to list_bundles is only partially helpful.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_lmnp_amortizationBInspect
Calculate LMNP rental property amortization and annual tax deduction (French tax regime). See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rent | Yes | Annual gross rental income in EUR | |
| property_value | Yes | Property purchase price excluding land in EUR | |
| furniture_value | Yes | Furniture and equipment value in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, and description does not disclose behavioral traits like side effects, permissions, or limitations. It only states what it calculates, not how it behaves.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first clearly states purpose, second provides a pointer to related tools. No fluff, front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Adequate for a calculation tool with output schema, but lacks usage guidance and behavioral transparency. Could benefit from more details on prerequisites (e.g., property value excluding land).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and schema descriptions already explain parameters. Description does not add additional meaning beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool calculates LMNP rental property amortization and annual tax deduction, specifying the French tax regime, which distinguishes it from many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Only mentions seeing list_bundles for related calculators, no explicit when to use this tool vs alternatives, nor when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_lmnp_deficitBInspect
Calculate LMNP (non-professional furnished rental) tax deficit. Returns: {total_deductible, deficit, note}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rent | Yes | Annual rental income in EUR | |
| annual_charges | Yes | Annual deductible charges in EUR | |
| depreciation_annual | Yes | Annual depreciation amount in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
As a calculator with no side effects, the description appropriately states the return structure. However, with no annotations, it lacks details on prerequisites, edge cases, or data sources, though the tool's behavior is inherently straightforward.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences with no wasted words. The purpose and output are front-loaded. Additional context is minimal but not missing critically.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Adequate for a simple calculator with full schema coverage and output format hint. However, it lacks usage guidelines and sibling differentiation, which are important given the many calculation tools available.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents parameters. The description adds no extra parameter-level info beyond the return structure, but no compensation is needed.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates LMNP tax deficit with a specific verb and resource. However, it does not differentiate from sibling tax calculators or explain what LMNP entails, leaving agents to infer context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives like calculate_lmnp_amortization or other rental income tools. The mention of list_bundles is generic and does not provide explicit usage criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_loan_early_repaymentAInspect
Calculate interest savings from early partial loan repayment. Returns: {months_saved, new_months_remaining, interest_savings_eur, early_repayment_eur}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| early_amount | Yes | Early repayment amount EUR | |
| monthly_payment | Yes | Current monthly payment EUR | |
| months_remaining | Yes | Months remaining | |
| remaining_capital | Yes | Remaining loan capital EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description must convey behavioral traits. It describes the return object but does not explicitly state that the operation is non-destructive, read-only, or requires no authentication. For a calculator tool, it is reasonably transparent, but could be improved.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences: first for purpose, second for output structure and cross-reference. It is front-loaded and efficient, with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description covers purpose, output structure, and a reference to related calculators. An output schema exists, so detailed return values are not required. It is complete enough for an agent to use effectively.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 100% description coverage, so parameters are well documented. The description does not add additional meaning beyond the schema's field descriptions, thus it meets the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates interest savings from early partial loan repayment. It identifies a specific resource and verb, and distinguishes from related calculators by referencing list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a cross-reference to list_bundles for related 'finance-universal' calculators, giving some guidance on alternatives. However, it does not explicitly state when to use this tool versus other loan calculators like calculate_loan_payment, nor any prerequisites or conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_loan_paymentAInspect
Calculate monthly loan payment for any generic loan. Returns: {principal, monthly_payment, total_cost, total_interest}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| months | Yes | Loan duration in months | |
| principal | Yes | Loan amount | |
| annual_rate | Yes | Annual interest rate in % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It discloses the return fields (principal, monthly_payment, total_cost, total_interest) but does not mention assumptions (e.g., fixed rate, equal payments) or any behavioral constraints like authentication or rate limits. Adequate but could be more transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences, front-loaded with purpose and return fields, no wasted words. Efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description adequately covers usage, return values, and related tools. The schema covers parameters, and output schema is implied. Could mention calculation type (e.g., fixed-rate amortization) for completeness, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameters are already described. The description adds meaning by stating 'any generic loan' and listing return fields, but does not elaborate on parameter usage or format beyond schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates monthly loan payments for any generic loan, using a specific verb and resource. It distinguishes from many sibling 'calculate_*' tools by specifying 'generic loan' and referencing 'finance-universal' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by pointing to list_bundles for related calculators, but does not explicitly state when NOT to use this tool. It implies generic loan calculation but lacks exclusions for specific loan types (e.g., mortgage, student loan) which exist among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_loan_to_valueAInspect
Compute Loan-to-Value (LTV) ratio for mortgage risk. Use for mortgage application or PMI thresholds. Inputs: loan amount, property value. Returns LTV %, risk level, PMI required y/n. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| loan_amount | Yes | Loan amount EUR | |
| property_value | Yes | Property value EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It discloses return values (LTV %, risk level, PMI required) beyond the input-only schema, but doesn't mention side effects, permissions, or rate limits. However, as a calculation tool, these are less critical. The output details add good transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loads the core purpose, and includes all essential information without redundancy. Every sentence is purposeful.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with complete schema and output schema, the description adds output details and references related tools. It is adequately complete given the low complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions already specifying units (EUR). The description repeats 'loan amount, property value' but adds no new semantic information beyond what the schema provides, resulting in minimal added value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes Loan-to-Value (LTV) ratio for mortgage risk, listing specific inputs (loan amount, property value) and outputs (LTV %, risk level, PMI required y/n). It distinguishes from siblings by referencing related 'immobilier' calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for mortgage application or PMI thresholds,' providing clear when-to-use scenarios. It also directs to list_bundles for related calculators, offering an alternative if needed.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_logarithmAInspect
Calculate logarithm in any base (natural, common, binary). Returns: {result}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| base | No | Log base: e=natural, 10=common, 2=binary | e |
| value | Yes | Value to take log of |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions the return format ('Returns: {result}') but does not disclose whether the tool has side effects, rate limits, or any other behavioral traits. The description is vague about the actual behavior beyond the return.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise: one sentence stating the purpose and one sentence pointing to related tools. It is front-loaded with the core purpose and contains no filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, simple math operation) and the presence of an output schema (not shown but indicated), the description is largely complete. It explains what the tool does, its return, and points to related tools. Minor improvement could be more detail on the return structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for both parameters. The description adds context by naming the base variants ('natural, common, binary'), which clarifies the enum options. This provides marginal extra value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates logarithms in any base and lists natural, common, and binary. It directly explains the verb 'calculate' and resource 'logarithm', and the mention of bases distinguishes it from other calculation tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description tells users to see list_bundles for related 'math' calculators, implying a family of tools, but it does not explicitly state when to use this tool over alternatives. No exclusions or specific context are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_lottery_oddsAInspect
Compute the odds of winning a lottery for various prize tiers. Use for awareness, education. Inputs: numbers to pick, total numbers, bonus number config. Returns probability and 1-in-N for each tier. See list_bundles for related 'jeux-probabilites' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bonus_pool | No | Size of the bonus number pool (default 0, no bonus) | |
| bonus_numbers | No | Number of bonus/powerball numbers to match (default 0) | |
| total_numbers | Yes | Total numbers in the main pool | |
| numbers_to_pick | Yes | How many numbers you pick |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavior. It states it computes odds and returns results, implying a read-only calculation. However, it lacks explicit safety or idempotency details, which is acceptable for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with purpose and usage. No redundant words. Efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present, the description need not detail returns. It covers purpose, inputs, output (implicitly), and references sibling tools. For a low-complexity tool, this is complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description paraphrases the parameters (numbers to pick, total numbers, bonus config) but adds no new meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes lottery odds for various prize tiers, using specific verbs and resource. It distinguishes itself from sibling calculator tools by specifying lottery odds and mentions return values.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context ('for awareness, education') and directs to list_bundles for related calculators. It does not explicitly state when not to use, but the guidance is adequate.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_luggage_weightBInspect
Calculate total luggage weight and compare to airline limits (carry-on, economy checked, business checked). Returns: {total_kg, status}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| items | Yes | Array of luggage items with name and weight in kg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden of disclosing behavior. It states the return fields (total_kg, status) and the comparison to airline limits, which is adequate. However, it does not explain what the 'status' values are (e.g., 'overweight') or the specific limits used, leaving some ambiguity.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, with no redundant information. The first sentence covers purpose and return, the second directs to related tools. Every word earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter, no nested objects, output schema), the description is mostly complete. It covers what the tool does, its inputs, and outputs. However, it could be improved by specifying the comparison criteria or status values, but it remains adequate for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes the 'items' parameter thoroughly. The description adds minimal value beyond repeating 'luggage items' and mentioning airline limits, which is not tied to the parameter itself. The baseline score of 3 applies as the schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates total luggage weight and compares it to airline limits (carry-on, economy checked, business checked). It provides a specific verb-resource combination and mentions the return value. While it distinguishes from siblings by being a luggage-specific calculator, it does not explicitly differentiate from other 'calculate' tools, but the purpose is clear.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only points to 'list_bundles' for related calculators but provides no explicit guidance on when to use this tool vs alternatives, nor any when-not-to-use or prerequisite information. The usage context is implied but not stated.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_malus_ecologiqueAInspect
French ecological malus 2026: CO2 g/km based tax on new vehicle registration. Returns: {malus_eur, threshold, max}. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| co2_g_km | Yes | CO2 emissions in g/km |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
In the absence of annotations, the description discloses the return format and that it's a calculation, but does not mention any prerequisites, side effects, or data sources, leaving some transparency gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences that front-load the core purpose, followed by a clear return format and a reference to related tools, with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter) and the existence of an output schema, the description provides enough context including a pointer to related calculators, though it could mention any limitations or assumptions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and the parameter description in the schema already explains the input well; the tool description adds no additional semantic value beyond what is in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates the French ecological malus for 2026 based on CO2 g/km for new vehicle registration, using specific verbs and resource, distinguishing it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for French ecological malus of new vehicles and references list_bundles for related auto-transport calculators, providing some context but lacking explicit exclusion guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_marathon_splitsBInspect
Generate target split times for a marathon, half-marathon, or other race. Use for race-day pacing. Inputs: target finish time, distance km. Returns 5K splits, halfway, and final pace. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target_time_minutes | Yes | Target marathon finish time in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must cover behavior. It claims to accept 'distance km' which conflicts with the input schema that only has target_time_minutes. This inconsistency undermines transparency. It does mention output (5K splits, halfway, final pace) which is helpful.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences: purpose, use case, inputs/outputs. Efficient but includes inaccurate info about distance parameter. Could be improved by aligning with actual schema or removing the incorrect mention.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists, so return values are documented. Description adds helpful output specifics. However, the missing distance parameter in schema contradicts description's claim to support various distances, leaving a significant completeness gap for a tool that should handle different race lengths.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with one parameter well-described. Description adds 'distance km' which is not a parameter, causing confusion. It adds context about race types and outputs, but the misleading extra parameter reduces clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool generates target split times for marathons, half-marathons, or other races, with specific verb 'generate' and resource 'split times'. It distinguishes from siblings like calculate_running_pace by mentioning splits and referencing list_bundles for sport calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for race-day pacing', providing a clear use case. Does not exclude alternatives or specify when not to use, but the sibling reference hints at related tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_markup_marginCInspect
Convert between markup and margin (often confused). Use for pricing decisions or COGS reporting. Inputs: cost and either markup % or margin %. Returns selling price and the other metric. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cost | Yes | Cost price | |
| selling_price | Yes | Selling price |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description claims inputs are cost and either markup % or margin %, but the input schema requires cost and selling_price. This is a serious contradiction about the tool's behavior. Annotations are not provided, so the description carries the full burden, and it fails to accurately describe how to use the tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: three sentences with no wasted words. It front-loads the purpose, then gives usage context and a link to related tools. Structurally efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the contradiction between description and schema, the description is incomplete and misleading. It fails to provide accurate contextual information about what parameters are accepted. The presence of an output schema is noted, but the description's inaccuracies undermine completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
While schema coverage is 100% (cost and selling_price have descriptions), the description adds inaccurate information: it says inputs are cost and either markup/margin %, which contradicts the schema. This misleads the agent about the required parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts between markup and margin, which are often confused. It specifies the verb 'Convert' and the resource 'markup and margin'. It also mentions use cases (pricing decisions, COGS reporting) and points to related tools via list_bundles. However, it does not directly distinguish from the many sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases: pricing decisions or COGS reporting. It also suggests consulting list_bundles for related finance-universal calculators, indicating alternatives. However, it does not specify when not to use this tool or provide exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_maternity_leave_frAInspect
Compute French maternity leave duration and IJSS allowance. Use for HR or expectant parents. Inputs: due date, prior children count, multiple birth. Returns pre/post-birth leave days and allowance estimate. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| twins | No | Multiple birth | |
| existing_children | Yes | Existing children |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries burden. It mentions 'due date' as an input, but the schema does not include this parameter. This misalignment could mislead agents. It correctly notes it returns leave days and allowance estimate, but the false input claim undermines transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose. Includes inputs, outputs, and pointer to related tools. Concise but includes a minor inaccuracy. Overall well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given complexity of French maternity leave rules, description provides essential purpose and outputs. However, missing due date parameter and lack of details on calculation assumptions make it incomplete. Output schema likely covers return values, but description could still add context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, baseline 3. Description adds slight detail by mapping 'prior children count' to existing_children and 'multiple birth' to twins, but incorrectly introduces 'due date' as an input. This extraneous info reduces clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it computes French maternity leave duration and IJSS allowance. Uses specific verb 'Compute' and specifies target users (HR, expectant parents). Distinguishes from sibling tools by being specific to French maternity leave.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for HR or expectant parents', providing context. Mentions related 'finance-france' calculators via list_bundles, hinting at alternatives. Does not explicitly state when not to use, but gives sufficient guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_max_heart_rateAInspect
Estimate maximum heart rate using standard or age-adjusted formulas. Returns: {max_heart_rate}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | Age in years | |
| formula | No | Formula: standard (220-age), tanaka (men), gulati (women) | standard |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It states the tool estimates heart rate and returns an object, but lacks details on safety (e.g., read-only, no side effects) or any constraints beyond the input schema. For a simple calculation tool, this is adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, containing only two sentences. The first sentence clearly states the purpose and method, and the second adds return info and a relevant sibling reference. No unnecessary words or repetition.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the absence of annotations and the fact that the output schema is not provided in the prompt (though context says it exists), the description covers the essential return value. It also references 'list_bundles' for related calculators, aiding discoverability. For a simple calculation tool, this is mostly complete, though it could mention typical output format more explicitly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'standard or age-adjusted formulas', which aligns with the 'formula' parameter, but adds minimal value beyond the schema. Since schema coverage is 100% and the schema already describes both parameters, the description's contribution to parameter understanding is marginal.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates maximum heart rate using standard or age-adjusted formulas, which is a specific verb-resource combination. It also mentions the return structure, making the purpose unambiguous. Although there are many sibling calculators, 'calculate_max_heart_rate' is distinct from others like 'calculate_heart_rate_zones'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a clear usage context: estimating max heart rate. It references 'list_bundles' for related 'sante' calculators, offering some guidance on related tools. However, it does not explicitly state when not to use this tool or compare it with alternatives like 'calculate_heart_rate_zones'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_meat_cookingAInspect
Compute meat cooking time and target internal temperature by cut and doneness. Use for kitchen prep. Inputs: meat type, weight kg, doneness. Returns oven time and target temp. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| meat | Yes | Meat type | |
| doneness | No | Doneness | medium |
| weight_kg | Yes | Meat weight kg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool returns oven time and target temp, which is basic. It does not disclose any potential side effects, authorization needs, or non-obvious behaviors, but for a simple calculator, it is minimally adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, clear, and front-loaded with the core purpose. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the low complexity (3 parameters, no nested objects, output schema present), the description provides adequate context. It mentions inputs, outputs, and related tools via list_bundles.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description repeats the parameter names (meat type, weight kg, doneness) without adding significant new semantics beyond what the schema's enums and descriptions already provide.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes cooking time and target internal temperature based on meat type, weight, and doneness. However, there is a sibling tool 'calculate_meat_cooking_time' with a similar name, which could cause confusion. The description does not explicitly distinguish itself from that sibling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for kitchen prep' and points to related cuisine calculators via list_bundles, but does not explicitly state when to use this tool versus alternatives or mention any prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_meat_cooking_timeAInspect
Compute oven cooking time for meat by cut, weight, and doneness. Use for cooking. Inputs: meat type, weight kg, target doneness. Returns time min and oven temp °C. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| doneness | Yes | Desired doneness | |
| meat_type | Yes | Type of meat | |
| weight_kg | Yes | Meat weight kg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears the full burden. It states inputs and outputs (time and temp) but does not disclose if the tool is read-only, if it has side effects, or any operational constraints. It implies a calculation but lacks full transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise (two sentences) and front-loaded with the purpose. Every sentence adds distinct information without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 parameters, enums, output schema available), the description is adequately complete. It covers purpose, inputs, and outputs, though it could mention error handling or default assumptions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
While schema coverage is 100%, the description introduces 'cut' as a parameter, but the schema has 'meat_type' instead. This inconsistency may confuse users. The description adds value by mentioning return fields (time min, oven temp °C), but the misalignment reduces clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes oven cooking time for meat by cut, weight, and doneness. It uses specific verbs ('compute') and resources, and hints at related tools (list_bundles), distinguishing it from generic cooking tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for cooking' but provides minimal guidance on when to use this versus alternatives. It does not explicitly mention when not to use it or compare with sibling tools like calculate_cooking_time, leaving ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_menstrual_cycleBInspect
Calculate next period, fertile window, and ovulation date. Returns: {next_period, ovulation_date, fertile_window_start, fertile_window_end}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cycle_length | No | Average cycle length days | |
| last_period_date | Yes | Last period start date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It does not mention that the tool is read-only, has no side effects, or any prerequisites beyond the schema. It only describes the calculation and output, lacking transparency on authorization, rate limits, or error behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences. The first sentence states the purpose and the second lists return fields. No extraneous information, and it is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the existence of an output schema (not shown but indicated), the description's summary of return fields is sufficient. It also references a related sibling tool for grouping. However, it could mention that this tool is part of a health bundle, but overall it is complete for its purpose.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes both parameters (last_period_date and cycle_length) with default values and constraints. The description adds no additional meaning beyond what the schema provides, meeting the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates next period, fertile window, and ovulation date. It lists the return values explicitly. However, it does not differentiate from the sibling tool 'calculate_ovulation', which performs a similar calculation, so it loses one point for lack of differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives like 'calculate_ovulation'. It only mentions 'list_bundles' for related calculators, but does not specify conditions for using this tool or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_mining_profitabilityAInspect
Compute crypto mining profitability after electricity costs. Use for miners evaluating ROI. Inputs: hashrate, power W, kWh price, network difficulty, coin price. Returns daily/monthly net profit. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| power_watts | Yes | Mining hardware power consumption in watts | |
| block_reward | No | Block reward in coins (default 3.125 BTC post-halving) | |
| hashrate_mhs | Yes | Mining hashrate in MH/s | |
| coin_price_usd | Yes | Current coin price in USD | |
| network_difficulty | Yes | Current network difficulty | |
| electricity_cost_kwh | Yes | Electricity cost per kWh in fiat currency |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses that the tool returns daily/monthly net profit after electricity costs, which is good. It does not mention side effects (none expected) or authorization needs. While it covers the main behavior, it omits details like default block reward usage or assumptions, but overall it meets expectations for a calculator tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences plus a bullet-like list of inputs. It front-loads the action and purpose. While the list of inputs is somewhat redundant with the schema, it is helpful for quick scanning. No waste is present, but it could be slightly more streamlined.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 6 parameters and an output schema, the description covers purpose, inputs, output (daily/monthly net profit), and points to related tools via list_bundles. It does not detail the output format (handled by output schema) or specify default values, but overall it provides sufficient context for an agent to understand and invoke the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description lists four of the six parameters (hashrate, power W, kWh price, network difficulty, coin price) but misses block_reward (which has a default). It adds a helpful summary but does not provide additional constraints or formatting guidance beyond the schema, resulting in marginal added value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes crypto mining profitability after electricity costs. It identifies the target users (miners evaluating ROI) and distinguishes it from generic crypto calculators by focusing on mining. However, it does not explicitly differentiate from sibling tools like calculate_crypto_profit_loss or calculate_staking_rewards, missing an opportunity for clearer distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a clear when-to-use suggestion ('Use for miners evaluating ROI') and points to list_bundles for related calculators. However, it lacks explicit when-not-to-use guidance or direct alternatives, which would help agents avoid mis-selection among many similar calculator tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moon_phaseAInspect
Compute current moon phase and illumination % for any date. Use for astronomy, agriculture, fishing. Inputs: date. Returns phase name, illumination %, age in days. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| date | Yes | Date in YYYY-MM-DD format |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It describes the tool as a read-only computation, but does not explicitly state idempotence or lack of side effects. Adequate but minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences efficiently convey purpose, uses, inputs, and outputs. Front-loaded and zero waste. Reference to list_bundles is helpful and concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with one parameter, description covers inputs and outputs. Lack of output schema details is compensated by description mentioning return fields. Slight gap in not mentioning that date must be valid.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 100% coverage with clear description for 'date' parameter. Description only repeats 'Inputs: date' without adding new meaning. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool computes moon phase and illumination for any date, specifies use cases (astronomy, agriculture, fishing), and mentions return values (phase name, illumination %, age). This distinguishes it from sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides explicit use cases (astronomy, agriculture, fishing) and points to related calculators via list_bundles. However, does not state when not to use or mention alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moroccan_cnssAInspect
Calculate Moroccan CNSS contributions (employee and employer shares). Returns: {gross_monthly_mad, employee, employer, pension_ceiling_mad}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gross_monthly_mad | Yes | Gross monthly salary in MAD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses the return shape and parameter. For a calculation tool, this is adequate but could explicitly state idempotency or lack of side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first covers purpose and return, second points to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Description explains purpose, parameter, and return fields. Given output schema exists and parameter is simple, it's fairly complete. Could clarify CNSS acronym but sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds the same information as the schema. No additional semantic value beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states 'Calculate Moroccan CNSS contributions (employee and employer shares)' with specific verb and resource. The return object is detailed, and the tool is distinct from many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context for when to use (Moroccan CNSS calculations) and directs to related calculators via 'list_bundles'. However, no explicit exclusions or alternatives mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moroccan_income_taxAInspect
Calculate Moroccan income tax (IR) using DGI progressive brackets with family deductions. Returns: {annual_income_mad, taxable_income, income_tax_mad, effective_rate_pct, marginal_rate_pct, brackets}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| dependents | No | Number of dependents (360 MAD deduction each, max 6) | |
| annual_income_mad | Yes | Annual gross income in Moroccan Dirhams (MAD) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It states the method (progressive brackets with family deductions) and lists return fields but does not disclose assumptions, edge cases, or if it uses current tax year rates. This is adequate but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences: one defines the purpose and method, the second lists return fields and recommends a related tool. No wasted words, easy to parse.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (progressive tax calculation) and the presence of an output schema (per context), the description covers core function and return values. It could mention year applicability or assumptions, but overall it is sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description does not add additional meaning beyond what the schema already provides (e.g., the schema already specifies the 360 MAD deduction per dependent). Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates Moroccan income tax using DGI progressive brackets with family deductions, specifying the verb and resource. The distinct name 'calculate_moroccan_income_tax' differentiates it from numerous sibling tax calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related 'finance-afrique-quebec' calculators,' which provides contextual guidance but does not explicitly specify when to use this tool over alternatives or give exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moroccan_profit_foncierAInspect
Calculate Moroccan property income tax (profit foncier / revenus fonciers). Returns: {annual_rent_mad, taxable_income, income_tax_mad, effective_rate_pct, marginal_rate_pct}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| dependents | No | Number of dependents for family deduction | |
| expenses_pct | No | Deductible expenses as % of rent (default 40%) | |
| annual_rent_mad | Yes | Annual rental income in MAD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It describes the computation and outputs but does not disclose assumptions (e.g., tax year, exchange rates, limitations). As a calculator, it is likely read-only, but this is not stated explicitly.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with the core purpose, and includes return fields and a cross-reference. No unnecessary words. Highly efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 parameters, 1 required, no nested objects), the description covers the essential: what it does, inputs, and outputs. A formal output schema is not provided but the return structure is described. Lacks usage context, but adequate for a calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents the three parameters and their descriptions. The tool description adds no new parameter-level information beyond what the schema provides. It lists return fields, which aids understanding but is not about parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates Moroccan property income tax (profit foncier). It names the resource and action, and distinguishes from other calculators in the sibling list (e.g., Moroccan income tax, VAT). The return fields are listed, adding specificity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The sibling list includes many other tax calculators (e.g., French, Belgian, US), but no comparative cues are provided. The only hint is a reference to list_bundles for related calculators, which is weak.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moroccan_vatAInspect
Compute Moroccan VAT (TVA) — convert between HT and TTC. Use for invoicing in Morocco. Inputs: amount, rate (20/14/10/7), mode (ht/ttc). Returns HT, TTC, tax amount. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | Input mode: ht=hors taxe, ttc=toutes taxes comprises | ht |
| rate | No | VAT rate: 0%, 7%, 10%, 14%, or 20% (standard) | 20 |
| amount | Yes | Amount in MAD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Describes inputs and outputs (HT, TTC, tax amount) but does not explicitly state it is read-only or non-destructive. Adequate for a calculation tool but could be more transparent about safety.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Concise two-sentence description: first sentence states purpose, second covers inputs/outputs and a pointer to related tools. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers purpose, inputs, outputs, and regional context. No output schema, but returns are explained. Slight omission: currency (MAD) is only in schema, not description. Otherwise complete for a calculator with good schema support.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. Description summarizes parameters (amount, rate, mode) and enumerates rates, but adds little beyond the schema. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Compute Moroccan VAT (TVA) — convert between HT and TTC' for invoicing in Morocco. It distinguishes from sibling tools by specifying the regional focus and referencing related calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for invoicing in Morocco', providing clear context. Does not explicitly exclude other uses, but the mention of alternative tools in list_bundles guides the agent appropriately.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_mortgageAInspect
Calculate mortgage/loan monthly payment, total cost, and optional amortization schedule. Returns: {principal, months, monthly_payment, total_interest, total_cost}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | Yes | Loan duration in years | |
| principal | Yes | Loan amount in currency units | |
| annual_rate | Yes | Annual interest rate in % | |
| with_schedule | No | Include first 12 months + last month amortization |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description clearly discloses the return format (a JSON object with specific fields) and the optional amortization schedule feature. It implies a safe, non-destructive calculation tool. No additional behavioral traits like rate limits or auth needs are mentioned, but the transparency is adequate for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: two sentences. The first sentence covers purpose and output, the second provides a cross-reference to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's 4 parameters (3 required, 1 optional boolean) and the existence of an output schema, the description adequately covers the tool's functionality. It explains the output and optional feature. It does not cover edge cases like zero principal or negative rates, but for a standard financial calculator this is acceptable.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage, so parameters are already documented. The description adds minimal extra meaning beyond the schema: it mentions 'optional amortization schedule' for the with_schedule parameter and lists the output fields. This is sufficient but not exceptional.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: calculating mortgage/loan monthly payment, total cost, and optional amortization schedule. It specifies the output fields. The tool's name 'calculate_mortgage' is distinct among many calculate_* siblings, and the description further differentiates by mentioning the 'finance-france' bundle.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a hint to related calculators via 'See list_bundles for related finance-france calculators', but does not explicitly state when to use this tool over other loan or mortgage calculators. There is no guidance on prerequisites or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_mortgage_insuranceBInspect
Calculate mortgage insurance (assurance emprunteur) cost. Returns: {monthly_insurance, annual_insurance, total_insurance}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rate_pct | No | Annual insurance rate in % of loan (default 0.36) | |
| loan_amount | Yes | Loan amount in EUR | |
| duration_years | Yes | Loan duration in years |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It mentions return structure but does not disclose any behavioral traits beyond a simple calculation. Lacks statements about safety, side effects, or prerequisites.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first clearly states purpose, second gives return structure and cross-reference. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given simple calculation tool with full schema and output schema existence, description adequately covers core behavior and return fields. Could add more context about the French insurance system but is sufficient for agent selection.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline 3. Description does not add parameter meaning beyond what schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it calculates mortgage insurance cost. Mentions specific term 'assurance emprunteur' and distinguishes from siblings via name and reference to list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives like calculate_mortgage or calculate_insurance_estimate. Only a brief pointer to list_bundles.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_motor_torqueAInspect
Compute motor torque from power and RPM. T(Nm)=9550·P(kW)/RPM. Use for mechanical sizing. Inputs: power kW, rpm. Returns torque in N·m and lb-ft. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rpm | Yes | RPM | |
| power_w | Yes | Power watts |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses the computation formula, required inputs, and output units. However, it does not mention non-destructive behavior or any side effects, which would be helpful. No contradictions exist.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, front-loading the formula and purpose. Every sentence contributes, but the unit inconsistency detracts from overall efficiency.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (from context signals), the description appropriately omits return value details. It covers primary use case, formula, and output units. It could mention required units (kW vs watts) more clearly but is generally complete for a simple calculation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for parameters (RPM, power_w). The description mentions 'power kW' while the schema uses watts (power_w), creating a unit mismatch that could confuse agents. The formula reference adds value but the discrepancy reduces clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Compute motor torque from power and RPM' with a clear formula and output units. It specifically defines the tool's function and distinguishes it from sibling tools by naming the specific calculation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'Use for mechanical sizing' and references 'list_bundles for related science calculators,' providing some context for when to use. However, it does not explicitly state when not to use this tool or name specific alternatives, leaving implied usage guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moving_cost_detailedBInspect
Estimate detailed moving cost based on volume, distance and floor. Returns: {base_cost, total_cost_eur, note}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| floor | No | Floor number (default 0 = ground floor) | |
| elevator | No | Whether elevator is available (default true) | |
| volume_m3 | Yes | Volume of goods to move in m3 | |
| distance_km | Yes | Moving distance in km |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description discloses return values {base_cost, total_cost_eur, note} but lacks details on side effects, authentication, or limitations beyond functional purpose.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences), front-loaded with purpose, and ends with a useful cross-reference. Efficient but could be more streamlined.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 4 parameters (2 required) and output schema, the description mentions only 3 inputs and does not clarify 'detailed' or explain the note field. Missing elevator parameter mention.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so description adds little beyond functional grouping. It names volume, distance, and floor but omits elevator parameter, which is partially addressed by schema defaults.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Estimate' and resource 'detailed moving cost' with inputs (volume, distance, floor) and output structure. It distinguishes from list_bundles but not directly from calculate_moving_volume, a sibling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It mentions list_bundles for related calculators but does not provide explicit guidance on when to use this tool versus alternatives, nor exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_moving_volumeCInspect
Estimate moving volume (m³) by home type and contents. Use for moving company quotes. Inputs: home size, rooms, furniture density. Returns m³ and truck size recommendation. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| type | Yes | Home type |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It claims to return m³ and truck size recommendation, which is helpful. However, it misleadingly lists inputs (rooms, furniture density) that are not in the schema, creating a discrepancy that harms transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, efficient and front-loaded with purpose and usage. Each sentence adds value: purpose, usage, and related tools. However, the misleading parameter information detracts slightly.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description mentions outputs and usage but fails to correctly represent inputs, creating a gap. The schema covers one parameter well, but the description's extra claims are unsupported. No error handling or edge cases are addressed.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has one parameter (type) with enum and description. The description adds 'home size, rooms, furniture density' which are not in the schema, actively misleading the agent. This contradicts the schema and reduces clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates moving volume by home type and contents, with a specific use case for moving company quotes. The verb 'estimate' and resource 'moving volume' are specific. However, it mentions additional inputs (rooms, furniture density) not present in the schema, causing confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use for moving company quotes,' providing clear context. It refers to list_bundles for related calculators, offering some guidance. However, it does not specify when not to use or provide exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_net_worthAInspect
Calculate net worth and debt ratio from assets and liabilities. Returns: {net_worth_eur, debt_ratio_pct}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| assets_total | Yes | Total assets EUR | |
| liabilities_total | Yes | Total liabilities EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility. It details the calculation and return values but omits behavioral traits such as idempotency, error conditions, or safety considerations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, directly states the tool's action and output, and includes a helpful reference to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with well-specified inputs and outputs, the description is complete. However, the output schema is mentioned as existing but not provided, which slightly limits completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for both parameters. The description does not add additional meaning beyond the schema, meeting the baseline expectation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates net worth and debt ratio from assets and liabilities, specifying return fields. It distinguishes from sibling tools by referencing list_bundles for related finance calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by mentioning related calculators via list_bundles, but does not explicitly state when to use this tool vs alternatives or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_night_shift_payAInspect
Calculate night shift pay (21h-6h) with configurable premium percentage. Returns: {night_hourly_rate, total_pay, premium_earned}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| night_hours | Yes | Number of night hours worked (21h-6h) | |
| premium_pct | No | Night shift premium percentage (default 25%) | |
| base_hourly_rate | Yes | Normal hourly rate in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as side effects, idempotency, or dependencies beyond the return format. For a calculation tool, read-only nature is assumed but not stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description consists of two concise sentences: the first states the purpose and constraints, and the second points to related tools. No filler or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and 100% schema coverage, the description is mostly complete. However, it lacks explicit mention of currency or timezone assumptions for the night hours range, which could improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with clear descriptions for all three parameters. The description adds no additional semantic meaning beyond the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool calculates night shift pay with a specific time range (21h-6h) and configurable premium percentage. The verb 'calculate' and resource 'night shift pay' are explicit, and the tool is distinct among many sibling calculate_ tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description hints at related tools via 'See list_bundles for related 'temps-rh' calculators' but does not explicitly state when to use this tool versus alternatives or when not to use it. Usage context is implied but not directly guided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_notary_feesBInspect
Calculate French notary fees (frais de notaire) for a real estate purchase. Returns: {price, droits_mutation, emoluments_notaire, frais_divers, total_frais, total_pct}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| type | No | Property type: ancien (old) or neuf (new) | ancien |
| price | Yes | Purchase price in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the return fields but does not mention any behavioral traits such as error handling, precision, or side effects. Since no annotations are provided, the description carries the burden of transparency. For a calculator, the lack of detail on how the type affects fees or rounding is a minor gap, so a score of 3 is appropriate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise—two sentences that front-load the purpose, list the return structure, and provide a cross-reference. There is no wasted text, and it is well-organized for immediate understanding.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters with full schema coverage and an output schema implied by the description), the description is almost complete. It covers purpose, return values, and related tools. However, it lacks details like the effect of property type on percentages or rounding behavior, which would be beneficial for a complete understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already provides 100% coverage for both parameters (type and price) with descriptions. The tool description adds no additional semantic information beyond the schema, but it does list the output fields. Baseline score of 3 is fair as the schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates French notary fees for real estate purchases and lists the return fields. However, it does not differentiate from the sibling tool 'calculate_notary_fees_detailed', which suggests a more detailed version exists. The verb 'Calculate' and resource 'French notary fees' are specific, but the lack of distinction reduces clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only briefly mentions seeing list_bundles for related calculators, but provides no explicit guidance on when to use this tool versus alternatives like 'calculate_notary_fees_detailed'. There is no when-to-use or when-not-to-use advice, leaving the agent without sufficient context for selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_notary_fees_detailedAInspect
Detailed breakdown of French notary fees by component (taxes, emoluments, debours). Use for property buyers in France. Inputs: property price, type (new/old), department. Returns total fees and per-component breakdown. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| type | Yes | Property type | |
| department | No | French department code (optional, affects DMTO rate) | |
| property_price | Yes | Property price EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses the tool returns 'total fees and per-component breakdown', and lists inputs. It mentions the tool is for French property, and the department affects DMTO rate. It does not detail calculation assumptions or limits, but for a calculator, this is sufficient transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, consisting of three sentences that front-load the purpose, then cover inputs and output, and end with a related tool reference. Every sentence adds value without wordiness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity as a detailed calculator, the description covers purpose, inputs, output, and related tools. An output schema exists for return structure. One minor gap: the description does not explicitly note that 'department' is optional, though this is in the schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. The description reiterates these inputs ('property price, type (new/old), department') and adds minimal context (e.g., 'affects DMTO rate' for department, already in schema). It does not add significant new semantics beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Detailed breakdown of French notary fees by component (taxes, emoluments, debours)', specifying the verb (breakdown) and resource (French notary fees). It also includes the intended audience ('property buyers in France'), making the purpose distinct from sibling calculators like 'calculate_property_transfer_tax'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for property buyers in France', providing clear context. It also references 'list_bundles' for related calculators, hinting at alternatives. However, it does not explicitly state when not to use this tool or exclude other scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_number_base_convertAInspect
Convert a number between bases 2 (binary), 8 (octal), 10 (decimal), and 16 (hex). Use for programming. Inputs: value, from-base, to-base. Returns converted number. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Number to convert as string | |
| to_base | Yes | Target base | |
| from_base | Yes | Source base |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. While it states the basic operation, it lacks details on edge cases (e.g., handling of invalid inputs, negative numbers, or decimal points) and does not disclose any behavioral traits beyond the core conversion.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded with key information (bases, usage), followed by parameter listing and a reference to related tools. It is efficient but could be slightly more structured for readability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity, complete schema coverage, and existence of an output schema (not shown but noted), the description provides sufficient context. It explains what the tool does, its inputs, and the supported bases. Minor gaps like validation behavior are acceptable for this simplicity level.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description reinforces the supported bases and input format but does not add significant new meaning beyond the schema's property descriptions. It is adequate but not exceptional.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Convert a number between bases 2 (binary), 8 (octal), 10 (decimal), and 16 (hex)' which is a specific verb-resource combination. It distinguishes from sibling tools that are general calculators by emphasizing its programming use case and base conversion focus.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for programming' and directs to list_bundles for related conversion calculators, providing context for when to use this tool. However, it does not explicitly state when not to use it or provide alternative tool names within the same context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ohms_lawAInspect
Compute Ohm's law: V=I·R. Solve for any of V, I, R given the other two. Use for electronics. Inputs: any 2 of (V volts, I amps, R ohms). Returns the third. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| current | No | Amps | |
| voltage | No | Volts | |
| resistance | No | Ohms |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the behavioral burden. It clearly describes that the tool solves for the missing variable given two inputs, and mentions the units. This is sufficient for a simple calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences long with no redundant information. It is front-loaded with the main purpose and formula. Minor repetition between first two sentences, but overall efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with full schema coverage and an output schema, the description provides enough context about operation (any two inputs) and domain (electronics). It also directs to list_bundles for related tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers 100% of parameters with descriptions. The description adds value by explaining the 'any two' logic and the formula (V=I·R), which goes beyond the individual parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes Ohm's law and solves for any of V, I, R given the other two. It specifies 'Use for electronics,' which provides some differentiation from siblings, though it does not explicitly distinguish from other similar calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for electronics' and 'Inputs: any 2 of (V volts, I amps, R ohms),' which implies when to use. It also references list_bundles for related calculators, but does not give explicit when-not-to-use or alternative tool names.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_one_rep_maxBInspect
Estimate 1 repetition maximum from submaximal lift using Epley, Brzycki and Lombardi formulas. Returns: {epley_1rm, brzycki_1rm, lombardi_1rm, average_1rm}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| reps | Yes | Number of repetitions performed | |
| weight_lifted | Yes | Weight lifted in kg or lbs |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present, so the description bears full responsibility. It states the purpose and output fields but does not disclose limitations (e.g., validity of formulas, handling of edge cases like reps>30) or assumptions. It provides basic transparency but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, efficiently stating the function and output. It is front-loaded with the action and results. No superfluous content. Could be slightly improved by structuring the output list more clearly.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, 100% schema coverage, and an output schema implied by the listed fields), the description covers the essential information. Minor gaps include formula assumptions and valid ranges, but overall adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with both parameters already described in the schema. The description adds no extra meaning beyond the schema's definitions. Baseline score of 3 is appropriate since the schema fully documents the parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'estimate' and the resource '1 repetition maximum' using specific formulas. It lists the output fields. However, it does not differentiate from sibling tools such as 'calculate_1rm_table', which may also estimate 1RM.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
There is no guidance on when to use this tool versus alternatives, nor any prerequisites or exclusion conditions. The mention of 'see list_bundles for related calculators' is vague and does not provide actionable usage instructions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_overtime_frBInspect
Compute French overtime pay (heures supplémentaires) per labor code. Use for HR or employee verification. Inputs: hourly rate, normal hours, overtime hours. Returns gross overtime pay with 25%/50% premiums. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| base_hours | No | Base weekly hours | |
| hourly_rate | Yes | Hourly rate EUR | |
| actual_hours | Yes | Actual weekly hours |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. The description states it returns gross overtime pay with premiums, which implies a read-only calculation. However, it does not explicitly state that no data is modified, nor does it mention any other behavioral traits like prerequisites or error conditions. Given the tool's nature as a calculator, the lack of explicit destructive hint is acceptable but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief at two sentences, but the second sentence is somewhat redundant with the schema and includes the misleading 'overtime hours' input. It could be restructured to be more accurate. While concise, it sacrifices clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool, the description covers basic functionality and use case. It references related calculators via list_bundles. However, the param mismatch is a gap, and it does not explain the relationship between base_hours, actual_hours, and overtime. The output schema exists (though not shown) so return details aren't needed, but the description could be more complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description lists inputs as 'hourly rate, normal hours, overtime hours' but the schema has three parameters: base_hours (default 35), hourly_rate, and actual_hours. There is no parameter for 'overtime hours'; overtime is likely derived from actual_hours minus base_hours. This mismatch creates confusion and fails to add value beyond the schema. Schema coverage is 100% but the description introduces inaccuracy.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes French overtime pay per labor code, with specific verb 'compute' and resource 'French overtime pay'. It mentions the return includes 25%/50% premiums. However, it does not differentiate itself from the sibling tool calculate_overtime_pay_fr, which may be identical.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases: 'Use for HR or employee verification.' It also directs users to list_bundles for related calculators. However, it does not specify when NOT to use this tool or list alternatives beyond referencing a bundle.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_overtime_pay_frAInspect
Calculate French overtime pay: first 8h at +25%, beyond 8h at +50% (weekly threshold 35h). Returns: {hours_at_25pct, hours_at_50pct, pay_25pct_zone, pay_50pct_zone, total_overtime_pay, extra_vs_normal}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| overtime_hours | Yes | Total overtime hours worked beyond 35h/week | |
| base_hourly_rate | Yes | Normal hourly rate in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, but the description discloses the calculation logic (two tiers, rates) and the return fields. It does not mention side effects or permissions, which is acceptable for a pure calculation tool. The description adds context beyond the schema by explaining how hours are allocated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: one sentence explaining the calculation, a list of return fields, and a pointer to bundles. It is front-loaded with key information and contains no unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of many siblings, the description covers the calculation rules and output fields. It references a bundle for related tools. It does not include error conditions or prerequisites, but for a calculation tool, this is sufficient. No output schema is given, but the return list compensates.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description does not add parameter details beyond the schema but explains how the parameters are used in the calculation (tiered rates). This adds some meaning but not significantly beyond the schema's descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates French overtime pay with specific rates (first 8h at +25%, beyond at +50%, weekly threshold 35h), distinguishing it from other calculation tools. The verb 'calculate' and resource 'overtime pay' are specific, and the mention of French rules differentiates it from potential siblings like 'calculate_overtime_fr'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states the purpose (French overtime pay) and references 'list_bundles for related temps-rh calculators', suggesting when to look for other HR calculators. However, it does not explicitly state when NOT to use this tool or compare to siblings like 'calculate_overtime_fr', but the specificity implies usage for French overtime scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ovulationBInspect
Calculate ovulation date and fertile window from last period and cycle length. Returns: {lmp, ovulation_date, fertile_window_start, fertile_window_end, next_period}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cycle_length | No | Menstrual cycle length in days | |
| last_period_date | Yes | YYYY-MM-DD — First day of last menstrual period |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose all behavioral traits. It only states the calculation purpose and return fields, but does not disclose that it is a read-only operation, any required permissions, or potential side effects. Minimal transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words. The main purpose is front-loaded, and the return structure is listed concisely. Ideal length for this tool.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with an output schema hinted in context and return fields described, the description is fairly complete. It lacks error handling or edge case info, but is otherwise sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes both parameters adequately. The description adds no additional meaning beyond the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the verb 'Calculate' and the resource 'ovulation date and fertile window', with clear input and output. However, it does not differentiate from closely related siblings like calculate_menstrual_cycle, though it references list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when you have last_period_date and optionally cycle_length. It does not provide explicit when-to-use or when-not-to-use guidance, nor alternatives beyond a vague reference to list_bundles.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_paint_neededAInspect
Compute paint quantity for walls including coats and waste margin. Use for renovation budget. Inputs: room dimensions, coats, openings. Returns paint liters and recommended buying. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| coats | No | Number of coats | |
| area_m2 | Yes | Wall area m² | |
| coverage | No | Coverage m²/liter |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries the full burden. It indicates a pure calculation (compute, returns) with no side effects, which is transparent for a calculator tool. However, it does not explicitly state that no data is stored or that no authentication is needed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise: three sentences that front-load the purpose, usage, and output. No unnecessary information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description mentions returns (paint liters and recommended buying) and references related tools. However, the mention of 'openings' without a parameter is a gap, and it lacks guidance on handling complex scenarios like multiple rooms or irregular walls.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds context by mentioning 'openings', but this does not match any parameter in the schema (only area_m2, coats, coverage). This creates confusion and reduces the value added over the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes paint quantity for walls including coats and waste margin, and lists inputs and outputs. However, it does not differentiate from the sibling 'calculate_paint_quantity' tool, which likely has a similar purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use for renovation budget and directs to list_bundles for related calculators, but does not explicitly state when to use this tool versus alternatives like calculate_paint_quantity, nor provides when-not-to-use conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_paint_quantityAInspect
Compute paint liters needed for a surface with chosen number of coats. Use for painting projects. Inputs: surface m², coats, paint coverage m²/L. Returns liters and number of cans. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| coats | No | Coats | |
| area_m2 | Yes | Area in m² | |
| coverage | No | m²/liter |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden of behavioral disclosure. It states it 'Computes paint liters needed' and returns results, implying a read-only calculation. This is sufficiently clear for a calculator tool, though it does not explicitly state it has no side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences, efficiently covering purpose, usage, inputs, outputs, and a sibling reference. No unnecessary words or repetition.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description covers essential aspects: purpose, inputs, outputs, and related tools. However, it omits mention of default values for coats and coverage, which are present in the schema. For a complete picture, this information would be helpful.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description rephrases the input schema by listing 'surface m², coats, paint coverage m²/L' but adds no new meaning beyond the schema descriptions, which already cover each parameter. With 100% schema coverage, the description provides minimal added value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes paint liters needed for a surface, listing inputs and outputs. However, it does not distinguish from the sibling tool 'calculate_paint_needed', which likely has overlapping functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for painting projects' and references list_bundles for related calculators, providing a general context. But it offers no explicit guidance on when not to use this tool or alternatives, such as the similar 'calculate_paint_needed'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_paper_size_convertCInspect
Get dimensions (mm, in) of standard paper formats (A0-A10, B0-B10, US Letter, Legal, Tabloid). Use for printing. Inputs: format name. Returns dimensions in mm and inches. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| format | Yes | Paper format name |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It states the output units (mm and inches) but omits disclosure of side effects (none expected) and fails to accurately list supported formats, which is a behavioral discrepancy.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise with two sentences and a cross-reference, front-loading the main purpose without extraneous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite having an output schema, the description is incomplete and misleading because it claims support for B0-B10 formats that are not in the input schema. This is a critical omission and makes the tool's capabilities unclear.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers the parameter with an enum and description. The description adds 'format name' which is redundant. More importantly, the description lists formats not in the enum (B0-B10), creating a contradiction that undermines parameter understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it retrieves dimensions of standard paper formats, but it incorrectly lists B0-B10 as supported, which are not present in the input schema enum. This reduces accuracy, though the core purpose is still clear.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'Use for printing' and directs to list_bundles for related calculators, providing some usage context. However, it does not specify when not to use it or compare to alternative tools, and the inaccurate format list could mislead users.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_parcoursup_pointsBInspect
Estimate Parcoursup admission score from bac + lycée grades. Use for French university candidates. Inputs: grades and coefficients per subject. Returns estimated score. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bac_average | Yes | Expected/actual bac average (/20) | |
| option_bonus | No | Bonus points from options |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description should clarify behavior. It mentions 'Estimate' and 'Returns estimated score' but does not disclose the underlying algorithm, accuracy, or that it is a safe read operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences cover purpose, usage, and related tools efficiently. No redundancy, but slightly more front-loading of critical info could improve it.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite having an output schema, the description fails to clarify the parameter mismatch and lacks detail on the estimation method, making it less complete given the many sibling calculators.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% but description says 'grades and coefficients per subject,' implying multiple inputs, while schema only has bac_average and option_bonus, creating a mismatch that could mislead agents.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool estimates Parcoursup admission scores from bac and lycée grades for French university candidates, clearly distinguishing it from siblings like calculate_bac_points.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It indicates the tool is for French university candidates and directs to list_bundles for related calculators, but lacks explicit guidance on when not to use it or how it differs from similar calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_parcoursup_scoreBInspect
Estimate Parcoursup weighted score from French baccalaureate component grades. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| grand_oral_note | Yes | Grand Oral examination grade out of 20 | |
| bac_general_average | Yes | General baccalauréat average out of 20 | |
| specialite_1_average | Yes | First speciality subject average out of 20 | |
| specialite_2_average | Yes | Second speciality subject average out of 20 | |
| controle_continu_average | Yes | Continuous assessment (contrôle continu) average out of 20 |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description should fully disclose behavior. It only says 'estimate', which hints at approximation, but does not mention that it uses standard weighting, requires full baccalaureate grades, or any rate limits/auth needs. The output schema likely covers return values, but side effects and safety are unaddressed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, no wasted words. Purpose is front-loaded, and the second sentence provides a useful pointer to related tools. Highly concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 5 required parameters, 100% schema coverage, and an output schema, the description is minimally adequate. However, it lacks context about the Parcoursup system, the estimated nature of the score, and how it fits with other education tools. Slightly incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All 5 parameters are fully described in the input schema (100% coverage). The tool description adds no extra meaning, such as weighting or calculation method. Baseline 3 is appropriate as schema carries the semantic burden.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates a Parcoursup weighted score from baccalaureate grades. The verb 'estimate' and resource 'Parcoursup weighted score' are specific. However, it does not differentiate from the sibling 'calculate_parcoursup_points', so clarity is slightly reduced.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives like 'calculate_parcoursup_points' or 'calculate_bac_points'. The mention of 'list_bundles' for related calculators is vague and does not specify context or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_part_timeBInspect
Calculate part-time work percentage and optional pro-rata salary. Returns: {percentage, prorata_salary}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| full_salary | No | Full-time salary to pro-rate (optional) | |
| full_time_hours | No | Full-time weekly hours (FR default 35h) | |
| part_time_hours | Yes | Part-time weekly hours |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description is the sole source of behavioral info. It explains the calculations and optional parameters, but does not explicitly state that the tool is read-only or idempotent, which is typical for a calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with the main functionality and a reference to related tools. No wasted words, and the key information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator, the description covers the purpose and return format. However, it omits mentioning the default value for full_time_hours (35) and does not specify that percentage is always returned while prorata_salary is optional. The lack of an output schema is partially compensated by the return description, but completeness could be improved.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already provides descriptions. The description adds the context that full_salary is used for pro-rata calculation, but this is implicit from the function name. No additional parameter details are provided beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates part-time work percentage and optional pro-rata salary, and it references the return format. It also mentions related calculators via list_bundles, which provides some context among the many calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using list_bundles for related 'temps-rh' calculators, but it does not explicitly state when to use this tool versus alternatives, nor does it specify prerequisites or scenarios where it is not applicable.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_passport_validityBInspect
Check if passport is valid for travel (6-month rule). Returns: {note}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| expiry_date | Yes | Passport expiry date YYYY-MM-DD | |
| travel_date | Yes | Planned travel date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full responsibility. It only mentions returning '{note}', which is vague and does not disclose behavioral traits like side effects, output format, or error handling. The 6-month rule check is stated but not elaborated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences. The first sentence front-loads the core purpose, the second adds return format and a cross-reference. Every sentence is purposeful with no filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and presence of an output schema, the description is too sparse. It does not explain the 6-month rule, what the '{note}' return contains, or how to handle invalid dates. The cross-reference to list_bundles is useful but insufficient to make the description complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% (both expiry_date and travel_date have descriptions in the input schema). The description adds no additional parameter information beyond what the schema already provides, so it meets the baseline for high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool checks passport validity for travel using the 6-month rule, with a specific verb and resource. It distinguishes itself from numerous sibling calculate_* tools by naming a related tool (list_bundles) for voyage calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies when to use this tool (checking passport validity) and points to list_bundles for related voyage calculators, but does not explicitly state when not to use it or provide alternatives among the many calculate_* siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pasta_portionsAInspect
Calculate dry pasta, water and salt for a given number of people. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| appetite | Yes | Appetite level | |
| num_people | Yes | Number of people | |
| pasta_type | Yes | Pasta shape |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, and the description only states the calculation function. It does not disclose any behavioral traits such as idempotency, safety, or side effects. For a straightforward calculator, this is minimal disclosure.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words: first sentence states purpose, second provides cross-reference. Well-structured and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Tool has output schema (from context), so return values are covered. Description explains what is calculated and points to related tools. Could mention that it's for dry pasta only, but overall complete for a simple calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 100% description coverage for all three parameters, so the baseline is 3. The description adds overall context but no additional details beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool calculates dry pasta, water, and salt portions for a given number of people, with a specific verb and resource. It also mentions a related tool (list_bundles), distinguishing it from sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies when to use (when pasta portions needed) and points to list_bundles for related cuisine calculators, but lacks explicit exclusions or alternatives. Sibling list is large but no direct comparison.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pendulum_periodAInspect
Compute simple pendulum period T=2π√(L/g). Use for physics homework or clock design. Inputs: length m, gravity m/s² (default 9.81). Returns period in seconds. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gravity | No | Gravity m/s² | |
| length_m | Yes | Pendulum length meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description carries full burden. It discloses the formula, default gravity (9.81), and that returns period in seconds. It does not mention rounding or error handling, but for a simple physics calculation this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two succinct sentences plus a note about list_bundles. Every sentence provides value, no redundancy, and the most important information (purpose, formula) comes first.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For this straightforward tool, the description covers purpose, formula, parameters, return value, and guidance to related tools. The presence of an output schema further reduces the need to describe return values in detail.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds the formula and default gravity value, but does not significantly enhance understanding of parameter meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes the simple pendulum period using the formula T=2π√(L/g), and gives specific use cases (physics homework, clock design). It distinguishes itself from numerous sibling calculator tools by being specific to this physical phenomenon.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for physics homework or clock design' and references list_bundles for related 'science' calculators. It provides clear context but does not explicitly exclude non-simple pendulum scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_percentageAInspect
Calculate percentages: value of total, percentage change, what percent. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | First value | |
| b | Yes | Second value | |
| operation | Yes | of: X% of Y; change: from A to B; what_pct: X is what % of Y |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of behavioral disclosure. However, it does not mention side effects, idempotency, error handling, or return format. For a calculation tool, some transparency about output format or edge cases would be expected.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. The first sentence immediately states the tool's purpose, and the second provides a helpful pointer to related tools. It is appropriately sized for a simple calculator.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is straightforward with three required parameters and an output schema (not shown but noted). The description covers the core functionality. However, it could briefly mention the output type or that results are numeric to aid agent interpretation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for each parameter. The description adds minimal extra value by paraphrasing the operations ('value of total, percentage change, what percent') but does not include examples or formatting tips. Baseline 3 is appropriate given the schema already documents parameters well.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates percentages and specifies three operations: 'value of total, percentage change, what percent'. This maps directly to the input schema's enum values ('of', 'change', 'what_pct'). The mention of 'list_bundles' for related calculators helps distinguish it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only mentions 'See list_bundles for related math calculators' which provides a pointer to alternatives but does not explicitly state when to use this tool versus specific siblings like calculate_percentage_change or calculate_percentile_rank. It lacks clear guidance on usage context or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_percentage_changeAInspect
Compute % change between two values, signed (increase or decrease). Use for performance comparisons, statistics. Inputs: old value, new value. Returns absolute and relative change. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| new_value | Yes | New value | |
| old_value | Yes | Original value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It states 'Returns absolute and relative change', which is helpful. However, it fails to disclose behavior for edge cases like division by zero when old_value=0, nor does it mention handling of negative values or precision. More transparency is needed for a safe and reliable tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is minimal: two sentences plus a reference. The first sentence front-loads the core purpose. Every sentence serves a purpose, with no redundant words. Highly efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema (though not shown), the description covers the main functionality. However, it omits handling of zero old_value and does not explain the output format fully. The reference to list_bundles helps but could be more specific.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions. The tool's description adds 'Inputs: old value, new value', which largely restates the schema. It provides no additional meaning or constraints beyond the schema, meeting the baseline but not exceeding it.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute % change between two values, signed (increase or decrease)'. This provides a specific verb and resource, and distinguishes this tool from siblings like calculate_percentage (which might be unsimplified) by emphasizing sign and return of absolute and relative change.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests 'Use for performance comparisons, statistics', giving clear context. It also directs to list_bundles for related calculators, implicitly guiding the agent to alternatives. However, it does not explicitly state when not to use this tool or name specific alternative tools, which is a minor gap.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_percentile_rankCInspect
Compute the percentile rank of a value within a dataset. Use for benchmarking scores or salaries. Inputs: value, dataset (list of numbers). Returns percentile (0-100). See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Value to rank | |
| total_values | Yes | Total number of values | |
| values_below | Yes | Number of values below |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. The description claims inputs include a dataset list, which is misleading as the tool expects precomputed counts. This undermines transparency about the tool's actual behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded, but the inaccuracy about inputs undermines its effectiveness. It could be concise and accurate, but it's not.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite high schema coverage and output schema, the description fails to explain the actual parameters (values_below, total_values) and misrepresents the input format, leaving the user unprepared.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. However, the description's claim of 'value, dataset (list of numbers)' contradicts the schema and adds confusion rather than value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes percentile rank and gives usage examples. However, it inaccurately describes the inputs as 'dataset (list of numbers)' when the schema requires summary counts, causing confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using for benchmarking scores or salaries and references list_bundles for related calculators, but does not explicitly exclude scenarios or compare with siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_perimeterAInspect
Calculate perimeter/circumference for common shapes. Returns: {shape, perimeter}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| side | No | Side for square/hexagon | |
| shape | Yes | Shape | |
| width | No | Width/side b | |
| length | No | Length/side a | |
| radius | No | Radius | |
| side_c | No | Side c for triangle | |
| semi_major | No | Semi-major for ellipse | |
| semi_minor | No | Semi-minor for ellipse |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It only states the return format but does not disclose behavioral traits such as handling of invalid inputs, edge cases, or required parameters for each shape. This is minimal transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise—two sentences with no filler. It front-loads the purpose and includes a useful pointer to related tools. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists, the description need not explain return values in depth. However, it could be more helpful by summarizing which parameters are required for which shapes. As it stands, it is adequate but not comprehensive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes each parameter. The description adds no new parameter semantics beyond a vague reference to 'common shapes'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates perimeter/circumference for common shapes and specifies the return format. It distinguishes itself from siblings by referencing 'list_bundles' for related math calculators, making the purpose unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for perimeter calculations but provides no explicit guidance on when to use this vs. sibling tools or when not to use it. The reference to 'list_bundles' is vague and does not give clear alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pet_ageAInspect
Convert pet age (dog/cat) to human-equivalent years. Use for pet health monitoring. Inputs: animal type, age years, breed size. Returns human-equivalent age. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | No | ||
| animal | Yes | ||
| age_years | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must convey behavior. It describes input and output but omits details like precision, range, or side effects beyond conversion.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three short sentences with clear first sentence; no unnecessary words. Includes a helpful cross-reference to list_bundles.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With output schema present, description adequately covers purpose and inputs. References related tools via list_bundles. Slightly lacks edge case handling but sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 0% description coverage, so description must compensate. It lists the three inputs (animal type, age years, breed size) but does not elaborate enum values or constraints beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool converts pet age (dog/cat) to human-equivalent years, specifying verb and resource. It distinguishes from siblings like calculate_cat_age and calculate_dog_age by combining both animals.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides context for pet health monitoring and references list_bundles for related calculators, but lacks explicit when-not-to-use or direct comparison to specific sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pet_bmiCInspect
Estimate body condition score proxy (BMI) for dogs and cats. Returns: {thresholds}. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| animal | Yes | ||
| weight_kg | Yes | ||
| body_length_cm | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits, but it only mentions it returns '{thresholds}' without explaining what that means or any side effects. The placeholder is uninformative, leaving the agent without clarity on the tool's behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very short and front-loaded with the purpose, but it includes an unhelpful placeholder '{thresholds}' that reduces clarity. It is concise but sacrifices completeness for brevity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given three parameters, no schema descriptions, no annotations, and an output schema that is not shown, the description is incomplete. It does not explain the output format or how to interpret the BMI proxy, leaving the agent with insufficient information.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 0% description coverage, and the tool description adds no meaning to the parameters. It does not explain the required inputs (animal, weight_kg, body_length_cm) beyond their names, failing to compensate for the schema's lack of documentation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates a body condition score proxy (BMI) specifically for dogs and cats, using the verb 'Estimate' and specifying the resource. It effectively distinguishes itself from related tools like the human BMI calculator by explicitly mentioning pet types.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal usage guidance. It mentions 'See list_bundles for related "animaux" calculators', which hints at alternatives but does not explain when to use this tool versus others or any prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pet_food_portionAInspect
Compute daily food portion (g) for dogs and cats by weight, age, activity. Use for pet feeding. Inputs: animal type, weight, activity, life stage. Returns grams/day and meal split. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| activity | Yes | Activity level | |
| pet_type | Yes | Type of pet | |
| age_years | Yes | Pet age years | |
| weight_kg | Yes | Pet weight kg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It states 'Returns grams/day and meal split' which gives some output clarity, but does not disclose any behavioral traits like safety or edge cases. Since the tool is a calculator, it is read-only by nature, but the description does not explicitly confirm that.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the core purpose, and includes a reference to a related tool. It is concise and avoids extraneous content, but could be slightly more streamlined.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers purpose, inputs, and output format. However, it lacks details on constraints, edge cases, or the exact meaning of 'meal split'. Given the presence of an output schema, the description does not need to explain return values in depth, but it could be more complete about the tool's behavior.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema coverage is 100% (all parameters described). However, the description mentions 'life stage' as an input, while the actual parameter is 'age_years'. This mismatch could confuse the agent. The description does not add significant meaning beyond the schema, and the inaccuracy lowers the score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes daily food portion for dogs and cats, specifying the resource, verb, and scope. It distinguishes from many sibling calculators by naming the specific pet types and includes a reference to related calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for pet feeding' and lists inputs, which implies usage context. However, it lacks explicit when-not-to-use guidance or alternatives. The reference to list_bundles is a weak form of alternative, but not sufficient for a higher score.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pet_medication_doseAInspect
Compute veterinary medication dose by pet weight (mg/kg). Use for medication administration. Inputs: weight kg, dose mg/kg. Returns total mg and tablet count. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| weight_kg | Yes | ||
| dose_mg_per_kg | Yes | ||
| concentration_mg_per_ml | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must carry the burden. It discloses that the tool is a calculator returning total mg and tablet count. However, it omits the optional concentration parameter's effect, which is a significant behavioral gap for a medical dose calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is fairly concise, with only a few sentences. It front-loads the purpose and inputs. However, 'Use for medication administration' adds little beyond the purpose, and the pointer to list_bundles could be shorter.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the basic purpose and inputs but lacks detail on the optional concentration parameter. Despite the presence of an output schema, the description omits important context about the liquid vs. tablet calculation, leaving the tool somewhat incomplete for users.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 0%. The description adds meaning for weight_kg and dose_mg_per_kg by mentioning 'weight kg' and 'dose mg/kg', but ignores the concentration_mg_per_ml parameter. The output description hints at total mg and tablet count, but this does not fully compensate for the undocumented parameter.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states 'Compute veterinary medication dose by pet weight (mg/kg)', which is a specific verb-resource pair. It clearly distinguishes from sibling pet calculators (e.g., calculate_pet_age, calculate_pet_bmi) by focusing on medication dosing.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for medication administration', providing clear context. It also directs to list_bundles for related calculators, offering guidance on finding alternatives. However, it does not explicitly state when not to use this tool or list alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pet_vaccination_scheduleCInspect
Generate upcoming vaccination schedule for a puppy or kitten. Use for pet care planning. Inputs: pet type, birth date, last vaccine date. Returns upcoming dates and vaccines. See list_bundles for related 'animaux' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| pet_type | Yes | Type of pet | |
| birth_date | Yes | Pet birth date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description fully carries the burden. It discloses the tool generates upcoming dates and vaccines, but does not mention readonly status, authentication, or side effects. The lack of annotation coverage reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief (3 sentences) and front-loaded, but the inclusion of an incorrect parameter ('last vaccine date') detracts from clarity. The mention of list_bundles is helpful for discoverability, but the inaccuracy lowers the score.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With no annotations and only 2 parameters, the description should provide more context about the output (e.g., schedule format, time range) and edge cases. It only says 'Returns upcoming dates and vaccines', leaving gaps in understanding the full behavior.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description erroneously lists 'last vaccine date' as an input, which is not in the input schema (only pet_type and birth_date). This misalignment undermines accuracy. For actual parameters, schema already provides adequate descriptions, so the description adds no value and introduces confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Generate upcoming vaccination schedule for a puppy or kitten', specifying the verb (generate) and resource (vaccination schedule). It differentiates from sibling pet calculators by focusing on vaccination planning.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'Use for pet care planning' and advises to 'See list_bundles for related "animaux" calculators', but does not explicitly state when to avoid using this tool or compare it to alternatives, leaving some ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_phAInspect
Compute pH from H+ concentration or vice versa. Use for chemistry or aquarium care. Formula: pH=-log10[H+]. Inputs: pH or [H+] mol/L. Returns the missing value and acidity class. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| ph_value | No | pH | |
| h_concentration | No | H+ mol/L |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses it returns missing value and acidity class, and gives the formula, but could explicitly state it is read-only and has no side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences, front-loading purpose and including essential details without extraneous content.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with full schema coverage and an output schema, the description adequately covers inputs, outputs, formula, and usage context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description restates inputs in natural language and adds formula context, but adds limited new meaning beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes pH from H+ concentration or vice versa, with specific verb and resource. It distinguishes from many sibling calculation tools by focusing on pH.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use for chemistry or aquarium care and references related calculators via list_bundles, but does not explicitly exclude scenarios or compare with similar tools like 'calculate_soil_ph_amendment'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pinel_tax_reductionAInspect
Compute French Pinel rental investment tax reduction (rates 2026). Use to evaluate Pinel real estate investment savings. Inputs: investment amount, duration (6/9/12y). Returns total tax reduction and yearly amount. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| duration | Yes | Rental commitment duration in years: 6, 9 or 12 | |
| investment | Yes | Investment amount in EUR (max 300,000) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must carry the burden. It mentions inputs and outputs (returns total and yearly amount) but does not disclose any behavioral traits like rate limits, authorization needs, or whether the calculation is static (2026 rates). It's adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loaded with the purpose, and each sentence serves a clear function: stating what it does, when to use it, and what it returns. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (calculation with two parameters) and the existence of an output schema (context indicates true), the description covers the key points: purpose, inputs, outputs. It could mention the 2026 rate specificity but overall is complete for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description adds minimal value. It restates that duration is 6/9/12 years and mentions maximum investment (already in schema). It does not provide deeper semantics beyond what the schema already offers.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose with a specific verb ('Compute') and resource ('French Pinel rental investment tax reduction'). It also specifies the year (2026), distinguishing it from other tax calculators. The resource is unique among many 'calculate_' siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use to evaluate Pinel real estate investment savings' and directs to 'list_bundles' for related calculators. It provides context but doesn't explicitly state when not to use it or how it differs from similar tax reduction calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pipe_diameterBInspect
Calculate the minimum pipe diameter required for a given flow rate and maximum velocity. See list_bundles for related 'plomberie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| flow_rate_lpm | Yes | Required flow rate in liters per minute | |
| max_velocity_ms | No | Maximum water velocity in m/s (default 1.5 m/s per DTU norms) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It states what it calculates but discloses no behavioral traits like assumptions (e.g., fluid type, pipe roughness), side effects, required permissions, or error conditions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences. The first delivers core purpose; the second adds helpful cross-reference to related tools. No redundant or wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple two-parameter calculator with output schema, the description is mostly adequate. However, it omits important context like assumptions (e.g., water temperature, standard pipe roughness) and error handling, making it less complete for non-expert agents.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter descriptions. The description adds no new meaning beyond what the schema already provides for 'flow_rate_lpm' and 'max_velocity_ms', so baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states it calculates the minimum pipe diameter given flow rate and velocity, using specific verbs and resources. It distinguishes from sibling 'calculate_pipe_flow_rate' by the inverse direction, and points to a related bundle via 'list_bundles'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance. Only a vague reference to 'list_bundles' for related plumbing calculators, which does not clarify selection criteria or alternatives for this specific tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pipe_flow_rateBInspect
Calculate water flow rate through a pipe using the Hazen-Williams formula. Returns: {C_coefficient}. See list_bundles for related 'plomberie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| length_m | Yes | Pipe length in meters | |
| material | Yes | Pipe material (affects Hazen-Williams C coefficient) | |
| diameter_mm | Yes | Pipe internal diameter in millimeters | |
| pressure_bar | No | Available water pressure in bar (default 3 bar) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose all behavioral traits. It mentions the formula but does not indicate permissions, rate limits, side effects, or any constraints beyond the formula name. The return value description is ambiguous.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded, but the second sentence about 'Returns: {C_coefficient}' is potentially misleading and wastes space. Otherwise, it is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
An output schema exists but is not shown; the description should clarify the return format. It mentions 'water flow rate' but then 'Returns: {C_coefficient}', which is inconsistent. For a calculation tool, more specifics on output units or range would be helpful.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already describes all parameters with 100% coverage. The description adds no additional semantic meaning or guidance about parameter relationships or units.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states the tool calculates water flow rate using the Hazen-Williams formula, which is specific and matches the name. However, it confusingly says 'Returns: {C_coefficient}' which likely refers to the coefficient used, not the result, causing slight ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance. The mention of 'list_bundles for related plomberie calculators' hints at related tools but does not differentiate from siblings like calculate_flow_rate_convert or calculate_pipe_diameter.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_planet_weightAInspect
Compute your weight on other planets using gravity ratios. Use for fun, education, sci-fi. Inputs: weight on Earth (kg). Returns weight on each planet of the solar system. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| planet | Yes | Target planet | |
| earth_weight_kg | Yes | Weight on Earth in kg |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided. The description discloses that the computation uses gravity ratios and returns weights, but it doesn't detail any behavioral traits like idempotency, side effects, or data handling. For a simple calculator, this is minimal but acceptable.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loads the purpose, and includes a reference to related tools. Every sentence adds value, and there is no fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with two parameters and an output schema, the description provides sufficient context: what it does, when to use it, and a pointer to related calculators. It could mention the output format, but the output schema likely covers that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents both parameters. The description adds no extra meaning beyond stating 'Inputs: weight on Earth (kg)', which is redundant. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: computing weight on other planets using gravity ratios. It specifies the resource (your weight) and the action (compute on other planets). It differentiates from sibling tools by its specific fun/educational/sci-fi context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions use cases (fun, education, sci-fi) and points to a related bundle, but it doesn't explicitly state when to use this tool versus alternatives. It lacks clear when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_plasterAInspect
Calculate plaster volume and weight for a given surface and thickness. Returns: {area_m2, volume_m3, weight_kg, bags_25kg}. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| area_m2 | Yes | Surface area in m² | |
| thickness_mm | No | Thickness in mm (default 13) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It mentions the calculation and return fields but does not disclose potential side effects, prerequisites, or error handling. For a simple calculation tool, this is adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two short sentences covering purpose and additional context (‘See list_bundles…’). No redundant information; every sentence is useful.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the low complexity, full schema coverage, and presence of an output schema, the description sufficiently covers what the tool does and its expected outputs. It could optionally mention that default thickness is 13mm, but the schema handles that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description adds value by listing the output fields (area_m2, volume_m3, weight_kg, bags_25kg), providing meaning beyond the input schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states it calculates plaster volume and weight for a given surface and thickness, using a clear verb-resource structure. It distinguishes itself from sibling calculation tools by specifying plaster.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only suggests seeing list_bundles for related construction calculators, without specifying when to use this tool versus alternatives or providing context for exclusion.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_poker_hand_probabilityBInspect
Compute probability of common poker hands (straight, flush, full house, etc.) given a starting hand. Use for poker strategy. Inputs: hole cards, community cards. Returns probability per hand category. See list_bundles for related 'jeux-probabilites' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| hand_type | Yes | Poker hand type to calculate probability for |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so full burden on description. Description states it computes probabilities with given inputs, but does not disclose any behavioral traits (e.g., read-only, side effects). For a calculation tool, more transparency about the calculation scope would be useful.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no redundancy. Front-loaded with purpose and key inputs. Reference to list_bundles is efficient. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of poker probability and the presence of an output schema, the description omits crucial details about how hole/community cards are input (they are not in the schema). The description is incomplete, leading to potential misuse.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage for the only parameter (hand_type) with enum and description, but the description mentions additional inputs (hole cards, community cards) not present in the schema. This discrepancy undermines the value added by the description and could confuse agents.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states compute probability of common poker hands given a starting hand. Verb 'compute' and resource 'probability of common poker hands' are specific. Distinguishes from siblings like calculate_card_draw_probability by focusing on poker hand types.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description mentions 'Use for poker strategy' giving implicit context, but does not explicitly differentiate from sibling tools like calculate_card_draw_probability or when not to use it. The reference to list_bundles for related calculators is helpful but not definitive.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pool_chlorineCInspect
Compute chlorine dosage (g) for pool maintenance based on volume and target ppm. Use for pool care. Inputs: pool volume m³, target chlorine ppm, current ppm. Returns chlorine grams. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target_ppm | No | Target chlorine ppm | |
| current_ppm | No | Current chlorine ppm | |
| volume_liters | Yes | Pool volume liters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden but does not disclose behavioral traits beyond the schema. It omits whether the function is pure, side-effect-free, or requires authentication. The return value is mentioned ('Returns chlorine grams'), but other behaviors are absent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences and front-loads purpose without unnecessary words. The final sentence referencing list_bundles is not about this tool's usage but adds minor bloat; overall it is acceptably concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite an output schema existing, the description fails to explain defaults (target_ppm=1.5, current_ppm=0) and has a unit mismatch (liters vs m³) that could cause incorrect usage. For a simple calculator, this gap is significant.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds the return value and unit hints, but introduces a discrepancy: schema says volume_liters, description says 'pool volume m³', which may mislead. This partial value is offset by the inconsistency.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute chlorine dosage (g) for pool maintenance' with specific inputs and output. However, it does not explicitly differentiate from sibling tools like calculate_pool_volume, though the unique purpose is implied.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'Use for pool care' but provides no guidance on when not to use, prerequisites, or alternatives. It refers to list_bundles for related calculators, but that is vague and does not help an agent decide between this and other pool-related tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pool_volumeAInspect
Compute swimming pool water volume in m³ and liters. Use for pool maintenance dosing. Inputs: shape, dimensions. Returns volume m³ and L. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| shape | Yes | Shape | |
| depth_m | Yes | Avg depth m | |
| width_m | No | Width m | |
| length_m | No | Length m | |
| diameter_m | No | Diameter (round) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses the output (volume in m³ and L) but does not mention other behavioral traits such as whether it is read-only, has side effects, or error conditions. As a calculator, it is safe, but the transparency is minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: two sentences that cover purpose, use case, inputs, outputs, and a related bundle reference. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with an output schema (implied), the description adequately explains what the tool does, its inputs, and its outputs. It also references a related bundle for additional context. No gaps given the tool's simplicity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with each parameter already described. The description adds only a summary ('Inputs: shape, dimensions') without providing additional meaning beyond the schema. Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description explicitly states 'Compute swimming pool water volume in m³ and liters', providing a specific verb and resource. It also mentions the use case 'for pool maintenance dosing', clearly distinguishing it from the many sibling 'calculate_*' tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives context ('Use for pool maintenance dosing') and implies when to use, but does not explicitly state when not to use or compare to alternatives. However, given the sibling list, no other tool computes pool volume, so the guidance is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_portage_salarialAInspect
Estimate net income from portage salarial (freelance via umbrella company). Returns: {portage_management_fee_10pct, social_charges_45pct, net_monthly, net_annual_estimate, net_ratio_pct}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| daily_rate | Yes | Daily billing rate (TJM) in euros | |
| days_per_month | No | Billable days per month (default 20) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It mentions the calculation includes a 10% management fee and 45% social charges, and lists return fields like 'net_monthly' and 'net_annual_estimate.' However, it does not clarify whether these percentages are fixed or illustrative, nor does it address edge cases or assumptions (e.g., tax brackets). The description adds some value but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences. The first sentence clearly states the purpose and key output fields. The second sentence points to related tools. No unnecessary words. It is front-loaded and easy to parse.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, no nested objects, output schema present), the description covers the essential aspects: purpose, return values, and a pointer to related tools. It is mostly complete, though it could mention that the calculation is a simplified estimate and may not account for all tax scenarios.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, meaning both parameters (daily_rate and days_per_month) have descriptions in the schema. The tool description does not add additional semantics beyond what is already in the schema. Baseline is 3 for high coverage, and the description does not improve it.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Estimate net income from portage salarial (freelance via umbrella company).' It specifies the verb 'estimate' and the resource 'net income from portage salarial.' The description also lists the return fields, making it unambiguous. Among many sibling calculate_* tools, this one stands out as specific to French portage salarial.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no explicit guidance on when to use this tool versus alternatives. It only suggests seeing list_bundles for related calculators, but does not explain the tool's scope or limitations. There is no mention of when not to use it or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_portfolio_allocationAInspect
Calculate portfolio allocation amounts by percentage for major crypto asset classes. Returns: {allocation_pct, sum_pct}. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| btc_pct | No | Bitcoin allocation percentage (default 40%) | |
| eth_pct | No | Ethereum allocation percentage (default 30%) | |
| alts_pct | No | Altcoins allocation percentage (default 20%) | |
| total_value | Yes | Total portfolio value in fiat currency | |
| stablecoins_pct | No | Stablecoins allocation percentage (default 10%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description mentions the return format as '{allocation_pct, sum_pct}', hinting at validation behavior. However, with no annotations provided, it does not disclose other behaviors like authentication needs, rate limits, or what happens if percentages don't sum to 100. It adds some context but is not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is exceptionally concise, consisting of two sentences that front-load the purpose and provide a hint for related tools. Every sentence is necessary and there is no extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (implied by 'Returns:') and 100% schema coverage, the description is reasonably complete. It states the return format and directs to related tools. However, it could be more complete regarding edge cases or behavior when inputs are invalid.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description does not add parameter-specific details beyond what the schema already provides (each parameter has a clear description and default). It provides context about 'major crypto asset classes' but this is a minor addition.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate portfolio allocation amounts by percentage for major crypto asset classes.' It specifies a specific verb (calculate), resource (portfolio allocation), and scope (crypto asset classes), distinguishing it from sibling calculators like calculate_crypto_profit_loss.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a suggestion to 'See list_bundles for related 'crypto' calculators,' which offers some guidance on alternatives but does not explicitly state when to use this tool versus others or when not to use it. It implies a context but lacks explicit exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_power_unit_convertAInspect
Convert power values between W, kW, HP, BTU/h, cal/s. Returns: {original}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Power value to convert | |
| to_unit | Yes | Target unit | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears full burden for behavioral disclosure. It only states the conversion and mentions 'Returns: {original}', but fails to explain if conversions are exact, handle edge cases, or any required permissions or constraints. This is a significant gap for a tool with no annotation support.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two short sentences. The first sentence immediately states the core function, and the second provides a helpful pointer to related tools. There is no wasted text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple unit conversion tool with a well-defined schema and an output schema (not needing return value explanation), the description is mostly adequate. However, it lacks any mention of input validation, behavior for invalid values, or the format of the output beyond '{original}', leaving some gap in completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already provides clear descriptions for all three parameters (value, from_unit, to_unit) with 100% coverage. The description adds no further semantic information beyond listing the units, so it meets the baseline but does not exceed it.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool converts power values between specific units (W, kW, HP, BTU/h, cal/s), providing a specific verb and resource. The purpose is unambiguous and distinguishes itself from siblings like convert_pressure or convert_distance.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage by stating the conversion purpose and suggests using list_bundles for related converters, but it does not explicitly state when to use this tool versus alternatives like calculate_power for electrical power calculations or other unit converters.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pregnancy_due_dateBInspect
Calculate due date and current gestational week from last period. Returns: {due_date}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| last_period_date | Yes | Last menstrual period date YYYY-MM-DD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so description carries full burden. It discloses return values briefly but lacks details on prerequisites, side effects, or error conditions. Adequate for a simple calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Very concise: two sentences, first states purpose and input, second gives return format and related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and existence of output schema, the description is mostly complete. However, it fails to differentiate from sibling 'calculate_due_date', leaving a gap.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameter description. The description adds no new meaning beyond schema's 'Last menstrual period date YYYY-MM-DD'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates due date and gestational week from last period. However, the presence of sibling tool 'calculate_due_date' without differentiation slightly reduces clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Only implicit usage from 'last period' is given. No explicit when-to-use vs alternatives (e.g., differentiate from calculate_due_date). The pointer to list_bundles is weak guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_present_valueAInspect
Compute the present value (PV) of a future sum given a discount rate. Use in DCF, NPV, or retirement planning. Inputs: future value, annual rate %, years. Returns PV and discount factor. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rate | Yes | Annual discount rate percent | |
| years | Yes | Number of years | |
| future_value | Yes | Future value EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, but the description explicitly states 'Returns PV and discount factor', which adds behavioral context beyond the input schema. It does not disclose side effects, but for a calculation tool, the key behavioral trait is what it returns.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, each earning its place: purpose, usage, inputs/outputs. No fluff, front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given an output schema exists (reducing need to describe return values), the description covers purpose, context, and basic behavior. Slight gap in not mentioning precision or edge cases, but sufficient for a straightforward financial calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter. Description merely restates parameter names without adding new semantics. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Compute the present value (PV) of a future sum given a discount rate' with specific verb and resource. It also lists use cases (DCF, NPV, retirement planning), which distinguishes it from other calculate_* siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides context on when to use (DCF, NPV, retirement planning) and references list_bundles for related calculators. Lacks explicit when-not-to-use or direct alternative names, but the guidance is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pressure_convertBInspect
Convert pressure between Pa, kPa, MPa, bar, psi, atm, mmHg, mbar, torr. Use for engineering, weather, medicine. Inputs: value, from-unit, to-unit. Returns: {original}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Pressure value | |
| to_unit | Yes | Target unit | |
| from_unit | Yes | Source unit |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits but only vaguely mentions 'Returns: {original}'. It does not clarify side effects, rate limits, or output format beyond a cryptic placeholder.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is relatively concise with three sentences and includes purpose first. However, the sentence 'Inputs: value, from-unit, to-unit.' is redundant given the schema, and the unit list is slightly misleading.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple conversion tool with an output schema, the description is adequate but missing details: the return value description is vague, and the unit mismatch reduces completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage, so the description should add value. Instead, it lists units (kPa, MPa, torr) that are not in the enum, contradicting the schema. This misleads users about available parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it converts pressure between listed units and specifies use cases (engineering, weather, medicine). However, it does not differentiate from the sibling 'convert_pressure' tool.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides some context for when to use (engineering, weather, medicine) but offers no explicit guidance on when not to use or how to choose between this and the similar sibling 'convert_pressure'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_prime_activiteAInspect
Estimate French prime d'activité monthly amount (CAF benefit). Use for low-income workers checking eligibility. Inputs: net monthly salary, household composition. Returns estimated benefit and eligibility note. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| salary | Yes | Net monthly salary in euros | |
| household_size | No | Number of people in household (1-6) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It discloses that the tool returns 'estimated benefit and eligibility note,' which is sufficient for a calculator. It does not describe idempotency or side effects, but for a read-only calculator, this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with purpose, then inputs/outputs, and a reference to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, output schema exists), the description covers purpose, inputs, outputs, and sibling relation. It is fully sufficient for an AI agent to select and invoke correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with descriptions for both 'salary' and 'household_size' in the schema. The description adds 'Inputs: net monthly salary, household composition,' which reinforces schema info but doesn't provide additional meaning beyond what's already in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states 'Estimate French prime d'activité monthly amount (CAF benefit).' It specifies a clear verb (estimate) and resource (prime d'activité), distinguishing it from sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises 'Use for low-income workers checking eligibility' and directs to 'list_bundles for related finance-france calculators,' providing context and alternatives. However, it doesn't explicitly state when not to use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_print_resolutionAInspect
Calculate print DPI quality and maximum print size from image pixel dimensions. Returns: {effective_dpi}. See list_bundles for related 'photographie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| image_width_px | Yes | Image width in pixels | |
| print_width_cm | Yes | Desired print width in centimeters | |
| image_height_px | Yes | Image height in pixels | |
| print_height_cm | Yes | Desired print height in centimeters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of behavioral disclosure. It mentions the return value '{effective_dpi}' but does not describe whether the tool has side effects, handles errors, requires specific input ranges, or other behavioral traits. This is insufficient for a tool with four parameters.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences, no wasted words, and front-loads the core purpose. It efficiently conveys the tool's functionality and a pointer to related tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists (stated in context signals), the description does not need to explain return values in detail, but it does mention the return key. The tool is straightforward, and the input schema covers all parameters. The description could be slightly more complete by mentioning that the effective DPI determines maximum print size, but it remains adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema description coverage, the input schema already provides descriptions for all four parameters. The tool description adds no additional meaning beyond the schema, simply restating that it uses 'image pixel dimensions' and 'print size'. Under the baseline rule, a score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates print DPI and maximum print size from image pixel dimensions, and mentions the return value. The verb 'Calculate' plus the specific resource 'print DPI quality and maximum print size' makes the purpose unambiguous. It also distinguishes from the many sibling calculators by referencing the 'photographie' bundle.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use this tool (for print DPI calculation) and directs users to 'list_bundles' for related photography calculators, but it does not explicitly state when not to use it or compare to alternatives. For a calculator tool, the usage context is implied, but the guideline is minimal.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_probability_binomialAInspect
Calculate binomial probability P(X=k) and cumulative P(X<=k). Returns: {exact_probability, cumulative_probability, std_deviation}. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| k | Yes | Number of successes | |
| n | Yes | Number of trials | |
| p | Yes | Probability of success per trial |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It only mentions the output fields, lacking details on side effects, authentication needs, rate limits, or constraints beyond parameter ranges.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no wasted words. It front-loads the core purpose and adds a helpful cross-reference. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, an output schema exists (not shown), so the description need not detail return values. However, it omits edge cases (e.g., n=0, k>n) and prerequisites. Still, it is largely complete for a basic calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, so the baseline is 3. The description does not add meaning beyond the schema; it only lists return fields, which are not parameter-related.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates binomial probability P(X=k) and cumulative P(X<=k), with specific output fields. It distinguishes itself from sibling tools by being a specific calculator and references related calculators via 'list_bundles'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for binomial probability calculations but does not explicitly state when to use this tool versus alternatives, nor does it provide exclusions or prerequisites. The reference to 'list_bundles' offers minimal guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_profit_marginCInspect
Calculate gross margin, net margin, and markup percentage. Returns: {revenue, cost}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cost | Yes | Total cost | |
| revenue | Yes | Total revenue/selling price |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behaviors. It states the outputs as {revenue, cost}, which is confusing and likely incorrect (inputs repeated). No mention of edge cases (e.g., negative margins) or formatting.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but includes a confusing 'Returns: {revenue, cost}' that wastes space. The cross-reference is helpful but the return statement reduces clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with output schema, the description fails to explain the output format and handling of invalid inputs. The misleading return statement makes it incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for both parameters. The description adds no additional meaning beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates gross margin, net margin, and markup percentage. The verb 'calculate' and specific metrics are provided. However, it does not differentiate among the vast sibling list of calculator tools, missing explicit distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives. It only mentions 'See list_bundles for related finance-universal calculators' but provides no criteria for selection or context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_projectile_motionBInspect
Compute projectile trajectory: range, max height, time of flight. Use for physics or ballistics. Inputs: initial velocity, launch angle, height. Returns range, peak, flight time. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height | No | Initial height m | |
| velocity | Yes | Initial velocity m/s | |
| angle_deg | Yes | Launch angle degrees |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. Description lacks detail on assumptions (e.g., ideal projectile motion ignoring air resistance) or output format. For a calculation tool, it is minimally transparent but could be more explicit about behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with key outputs. No superfluous content, every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Description covers inputs and outputs adequately for a simple calculator. However, it omits assumptions and error conditions. Output schema exists but is not shown; description does not need to repeat return values.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all 3 parameters. Description lists inputs but does not add significant detail beyond schema (units are already in schema). Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Compute projectile trajectory' and lists specific outputs (range, max height, time of flight). It distinguishes itself as a physics/ballistics calculator, but does not explicitly differentiate from other physics calculators among siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Says 'Use for physics or ballistics' and directs to list_bundles for related calculators. Provides implied usage context but no explicit when-not-to-use or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_property_capital_gains_frAInspect
Calculate French property capital gains tax after holding-period abatements. Returns: {raw_gain_eur, taxable_ir_eur, taxable_ps_eur, tax_ir_eur, tax_ps_eur, total_tax_eur, ...}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sale_price | Yes | Sale price EUR | |
| holding_years | Yes | Years held | |
| purchase_price | Yes | Purchase price EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It describes the calculation (post-abatement) and lists return fields, but does not explicitly state it is read-only, nor disclose any potential side effects, authorization requirements, or rate limits. For a calculator, non-disclosure is acceptable but minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first sentence clearly states purpose and return fields; second sentence directs to related tools. No wasted words, but could be structured with more headings or bullet points for readability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description includes return fields, references the related bundle, and covers the core purpose. Although no output schema exists, the description compensates by listing expected output fields. Lacks explanation of abatement formulas, but may be acceptable for a simple calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter descriptions that define purchase_price, sale_price, and holding_years as numbers with constraints. The description adds context about 'holding-period abatements' but does not provide additional semantics beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates French property capital gains tax after holding-period abatements. It names the verb 'Calculate', specific resource 'French property capital gains tax', and distinguishes it from other calculators by referencing 'finance-france' bundle.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description provides clear context: 'Calculate French property capital gains tax after holding-period abatements.' It directs to list_bundles for related tools, but does not explicitly state when not to use it or differentiate from similar tools like calculate_capital_gains_property.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_property_tax_estimate_frCInspect
Estimate French taxe foncière from cadastral value and commune rate. Returns: {estimated_tax_eur, taxable_base, commune_rate_pct}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| commune_rate | Yes | Commune tax rate percent | |
| cadastral_value | Yes | Valeur locative cadastrale EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It does not mention side effects, idempotency, authentication, or rate limits. While the tool is likely a safe read-only calculation, this is not explicitly stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading the purpose and inputs, followed by the output structure and a reference to related tools. It is efficient without wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple two-parameter calculator, the description is minimally viable. It covers purpose, inputs, and outputs. However, it lacks usage guidance and transparency, and does not differentiate from a similarly named sibling, leaving gaps for an agent to fully understand when to invoke it.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already describes both parameters with clear explanations. The description adds context by listing the return fields, but does not enhance parameter semantics beyond the schema. Baseline 3 applies due to 100% schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates French taxe foncière using cadastral value and commune rate. It specifies the output fields. However, it does not differentiate from the sibling tool 'calculate_property_tax_fr', which may cause confusion about which to use.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides the inputs needed but lacks guidance on when to use this tool vs. alternatives (e.g., 'calculate_property_tax_fr'). It mentions related calculators via 'list_bundles' but gives no explicit when-not-to-use or context for selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_property_tax_frAInspect
Compute French taxe foncière (annual property tax). Use for owners of property in France. Inputs: cadastral rental value (valeur locative), commune rate %. Returns annual tax due. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cadastral_value | Yes | Cadastral rental value (valeur locative cadastrale) in EUR | |
| commune_rate_pct | No | Commune tax rate in % (default 25) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears the full burden of disclosing behavioral traits. It states the tool computes a tax and returns the annual due amount, implying a read-only calculation. However, it does not explicitly mention that no side effects occur, nor does it discuss prerequisites (e.g., needing the cadastral value from an official source) or potential limitations (e.g., reliance on user-provided commune rate). This is adequate for a simple calculation tool but could be more transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise at three sentences. The first sentence clearly states the core purpose, the second identifies the target user, and the third lists inputs and output. Every sentence adds value, and there is no wasted prose. This is an exemplary model of conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, clear output schema), the description is nearly complete. It covers the purpose, user, inputs, and output, and even directs to related tools via list_bundles. It lacks only minor details like the default commune rate (though schema covers it) or that the tool does not fetch real-time rates, which might be assumed. Overall, it is sufficient for a developer or agent to understand and invoke the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description lists the two parameters ('cadastral rental value (valeur locative), commune rate %') but adds no additional meaning beyond what the schema already provides. There is no extra context about format, units (already in schema), or typical values, so it does not exceed the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes French taxe foncière, an annual property tax, and specifies it is for property owners in France. The verb 'Compute' and resource 'French taxe foncière' are specific. However, it does not explicitly differentiate from the sibling tool 'calculate_property_tax_estimate_fr', which may serve a similar purpose, so a perfect score is not warranted.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for owners of property in France' which provides a clear context for when to use the tool. However, it does not offer any guidance on when NOT to use it or mention alternatives like the nearby 'calculate_property_tax_estimate_fr'. The reference to 'list_bundles' for related calculators is helpful but not a direct exclusion or alternative recommendation.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_property_transfer_taxCInspect
Compute French property transfer tax (droits de mutation) by department. Use for property buyers. Inputs: property price, department, type (new/old). Returns tax due and effective rate. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| price | Yes | Property price in local currency | |
| country | Yes | Country code: FR/BE/US/UK/DE |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description claims it computes French tax but the schema accepts multiple countries (FR, BE, US, UK, DE), and mentions 'department' and 'type' not in schema. This is misleading. No annotations are provided, so the description must be accurate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences) but includes irrelevant and inaccurate information about parameters not in the schema, so it is not efficiently concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description mentions returns (tax due, effective rate) and references related calculators, but the mismatch with input schema makes it incomplete and unreliable. Output schema exists but is not utilized in description.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description adds parameters 'department' and 'type' that do not exist in the schema, while the actual parameters (price, country) are already described in the schema with 100% coverage. The description introduces confusion rather than adding value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states the tool computes French property transfer tax and mentions inputs, but it says 'by department' and 'type (new/old)' which are not in the schema, causing confusion. It also does not clearly differentiate from other property tax calculators among siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for property buyers' and references list_bundles for related calculators, providing some usage context. However, no explicit when-not or alternatives are given, and the mismatch with schema undermines clarity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ptz_eligibilityAInspect
Check French PTZ (zero-rate loan) eligibility and maximum amount. Returns: {income_ceiling, ptz_max_pct_of_operation, note}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| zone | Yes | Geographic zone of the property | |
| household_size | Yes | Number of people in household (1-5+) | |
| household_income | Yes | Annual household income (revenu fiscal de reference) in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It states the tool checks eligibility and returns data, implying no side effects, but does not explicitly confirm it is read-only or disclose any behavioral traits like rate limits or permission requirements.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences with no extraneous information. The first sentence defines purpose, the second details output and provides a useful cross-reference.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity (3 required params, no nested objects) and presence of output schema (return fields described), the description adequately covers core functionality. Minor gap: no explanation of what the 'note' field contains or error conditions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description does not add additional meaning beyond the schema; it does not reiterate or clarify parameter formats or constraints.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('Check French PTZ eligibility and maximum amount') and the output structure. It distinguishes from siblings by referencing list_bundles for related calculators, but does not explicitly differentiate from other calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides no explicit guidance on when to use this tool vs. other financial calculators. It only directs to list_bundles for related tools, but lacks contextual cues such as prerequisites or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_puissance_fiscaleAInspect
French fiscal horsepower CV = (CO2/45) + (P_kW/40)^1.6. Returns: {cv_raw, cv_fiscaux}. See list_bundles for related 'auto-transport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| co2_g_km | Yes | CO2 g/km | |
| power_kw | Yes | Engine power in kW |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It reveals the calculation formula and output format but does not mention idempotency, side effects, or authentication requirements. As a simple calculator, the intent is clear, but deeper behavioral traits are unaddressed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences. The formula is front-loaded and the output is clearly stated. No superfluous information is present.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description includes the output structure but lacks explanations of what 'cv_raw' and 'cv_fiscaux' represent. Given the tool's straightforward nature and the presence of a formula, it is mostly complete, but the missing definitions reduce completeness slightly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage with min constraints. The description adds meaning by presenting the formula and naming the output fields, which enriches the agent's understanding beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool calculates 'French fiscal horsepower CV' using a precise formula, and specifies the return structure. It clearly distinguishes the tool from siblings by naming the specific formula and mentioning related bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for French vehicle taxation but does not explicitly state when to use this tool versus other calculators. It references list_bundles for related tools but provides no exclusion criteria or context for when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pump_powerCInspect
Compute pump power requirement. P=ρ·g·H·Q/η. Use for fluid system design. Inputs: flow m³/h, head m, fluid density, efficiency. Returns kW. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| head_m | Yes | Head m | |
| flow_m3h | Yes | Flow rate m³/h | |
| efficiency | No | Pump efficiency |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It states inputs, formula, and output (Returns kW) but does not explicitly declare that this is a read-only calculation with no side effects or state changes. For a calculator, this is partially obvious but still lacking explicit clarity.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences: purpose, formula, inputs/output, and pointer to bundles. It is concise but contains an error (density as input) which undermines clarity. Front-loading is good.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no annotations and an output schema (not shown but marked as present), the description should cover return values adequately but does state 'Returns kW'. However, the description mentions a parameter (density) that is not in the schema, indicating incomplete alignment. Efficiency units and default value (0.75) are not explained. Overall, enough for a simple calculator but with a critical error.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage but with minimal descriptions. The description adds that inputs are flow m3/h, head m, fluid density, and efficiency. However, density is not present in the schema, creating a mismatch that could confuse the AI agent. Additional parameter context like acceptable ranges or units beyond what schema provides is missing.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute pump power requirement' and provides the formula. It mentions 'Use for fluid system design' as context. However, it does not explicitly differentiate from other similar physics calculation tools among the many siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises using for fluid system design and points to list_bundles for related science calculators. There are no explicit when-not-to-use instructions or comparisons to alternatives, but the usage context is provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_purchasing_powerAInspect
Compare purchasing power between two years. Use to translate historical prices, salaries, or savings to today's value. Inputs: amount, from-year, to-year, average inflation %. Returns equivalent value. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Amount to compare | |
| to_year | Yes | Target year | |
| from_year | Yes | Starting year | |
| avg_inflation | No | Average annual inflation rate in % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It describes inputs and the return of an 'equivalent value', which is sufficient for a calculator. However, it lacks details on limitations, edge cases, or any behavioral traits beyond the obvious calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences, front-loaded with purpose, then usage, then input/output summary, and finally a sibling reference. Every sentence adds value and there is no waste.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has an output schema, so the description does not need to detail return values. It covers the what, when, and how, and is complete for a simple calculator with 4 parameters. Could be slightly improved by stating the unit of the returned value, but the output schema likely handles that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, so the baseline is 3. The description merely repeats 'Inputs: amount, from-year, to-year, average inflation %' without adding new meaning or constraints beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'compare' and the resource 'purchasing power between two years'. It distinguishes from the massive list of sibling calculate_* tools by focusing on a specific economic comparison, but does not explicitly differentiate from similar 'finance-universal' calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It explicitly says 'Use to translate historical prices, salaries, or savings to today's value', providing clear context. It also directs to 'list_bundles for related finance-universal calculators', implying alternatives, but does not give explicit when-not-to-use scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pyramidBInspect
Compute pyramid volume V=(1/3)·base_area·height. Use for geometry or architecture. Inputs: base area, height (and optional slant for surface area). Returns volume and surface area. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height | Yes | Pyramid height | |
| base_length | Yes | Base side length |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility for disclosing behavioral traits. It only states the computation formula and mentions an optional parameter (slant) that does not exist in the schema. It fails to clarify that the tool is a pure, side-effect-free computation or any access requirements.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very short (two sentences) and front-loaded with the core formula. However, the inclusion of an incorrect optional parameter (slant) diminishes conciseness, as it adds irrelevant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description is mostly complete for a simple calculation tool with few parameters, but the mismatch regarding the optional slant parameter creates a gap in completeness. The existence of an output schema is noted, but the description does not mention it.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for base_length and height. However, the description introduces 'slant' as an optional parameter for surface area, which is not defined in the schema. This contradiction likely confuses an AI agent about available inputs.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool computes pyramid volume using a formula, and it mentions geometry or architecture as use cases. However, it does not explicitly differentiate from other geometry-related sibling tools like 'calculate_cone' or 'calculate_sphere', though the tool name itself distinguishes it.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using the tool for geometry or architecture and references 'list_bundles' for related calculators, providing some context. However, it does not explicitly state when to avoid this tool or compare it to similar pyramid-related tools among the siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_pythagorasCInspect
Find missing side of right triangle using Pythagorean theorem. Returns: {error}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | No | Side a length | |
| b | No | Side b length | |
| c | No | Hypotenuse c length |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
There are no annotations, so the description must carry this burden. It only mentions an error return without specifying conditions or describing any other behavior (e.g., idempotence, safety). The description is insufficient for an agent to understand what happens beyond the basic calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very short (two sentences) and efficient. The first sentence conveys the core purpose. The second sentence suggests related tools, which is helpful but could be more direct. Overall, it earns its place with minimal waste.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists, the description need not detail return values, but it should still provide enough context for effective usage. The description lacks guidance on which two parameters to provide and does not explain the error condition. It feels incomplete for a tool with three optional parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with adequate descriptions for each parameter. The tool description adds no additional meaning beyond the schema, but it does hint at the concept of 'missing side'. However, it does not clarify that exactly two parameters must be provided, which would enhance usability.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose ('Find missing side of right triangle using Pythagorean theorem') with a specific verb and resource. However, it does not explicitly distinguish this from the many sibling calculate tools beyond a brief mention of 'math' calculators in a related-tools suggestion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The mention of 'see list_bundles' implies a grouping but does not provide specific context or exclusions. The tool would benefit from clarifying that it requires exactly two of the three sides.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_quadratic_equationAInspect
Solve quadratic equation ax²+bx+c=0 with discriminant analysis. Use for math homework or physics problems. Inputs: coefficients a, b, c. Returns roots (real or complex), discriminant, and vertex. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | Coefficient a | |
| b | Yes | Coefficient b | |
| c | Yes | Coefficient c |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description returns what the tool outputs (roots, discriminant, vertex) and mentions real or complex roots. It does not detail edge cases like a=0 or behavior with non-numeric inputs, but for a simple calculator, it is reasonably transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise: two sentences plus a helpful pointer to list_bundles. It front-loads the purpose and uses no unnecessary words. Every sentence is valuable.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema, the description adequately summarizes inputs and outputs. It could mention a ≠ 0 or discuss error handling, but for a math utility it is sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with basic descriptions for parameters a, b, c. The description reiterates 'coefficients a, b, c' without adding new meaning beyond the schema. Baseline 3 applies as schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool solves quadratic equations (verb+resource) and mentions discriminant analysis, which distinguishes it from general equation solvers like calculate_equation. However, it does not explicitly differentiate from many sibling tools, so it's clear but not perfect.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use for math homework or physics problems, providing some context, but does not give explicit when-not-to-use or alternatives. It points to list_bundles for related calculators, but lacks direct guidance on choosing this tool over similar ones.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_quebec_income_taxAInspect
Calculate Quebec provincial income tax (Revenu Québec) with basic personal amount deduction. Returns: {income_cad, basic_personal_amount, taxable_income, provincial_tax, effective_rate_pct, marginal_rate_pct, ...}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income_cad | Yes | Annual income in Canadian dollars (CAD) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the output structure (list of fields) but does not mention any side effects, authentication requirements, or read-only nature. Without annotations, more behavioral context is expected.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading the core function and then a cross-reference. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with one parameter and an output schema mentioned. However, the description lacks details such as the applicable tax year, tax rates, or assumptions about the basic personal amount, which would be helpful for a tax calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The sole parameter income_cad is fully described in the schema. The description adds only a mention of the 'basic personal amount deduction', which provides marginal context beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates Quebec provincial income tax with a specific deduction (basic personal amount), distinguishing it from sibling tax calculators for other regions.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only directs to list_bundles for related calculators but provides no explicit guidance on when to use this tool versus alternatives, nor any prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_race_predictionAInspect
Predict race time for a target distance using Riegel formula. Returns: {predicted_time_minutes, predicted_formatted, predicted_pace_min_km}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target_distance_km | Yes | Target race distance in km | |
| reference_distance_km | Yes | Reference race distance in km | |
| reference_time_minutes | Yes | Reference race time in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses the formula and return format, but lacks details on limitations, accuracy, or edge cases. The behavior is adequately described but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences: first for purpose, second for return and cross-reference. No unnecessary words; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple prediction tool with 3 well-documented parameters and an output schema (partially described in the description), the context is complete. The cross-reference to list_bundles adds helpful navigation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with good descriptions for all parameters. The description adds little beyond naming the formula inputs; baseline 3 is appropriate as it does not significantly enhance understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool predicts race time for a target distance using the Riegel formula. It specifies the return object structure, distinguishing it from other calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by mentioning the Riegel formula and cross-referencing 'list_bundles' for related sport calculators. However, it does not explicitly state when not to use this tool or compare it to siblings like 'calculate_running_pace'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_radioactive_decayAInspect
Compute remaining quantity after radioactive decay. Use for physics or carbon dating. Formula: N=N0·(0.5)^(t/half-life). Inputs: initial qty, half-life, elapsed time. Returns remaining qty. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| time | Yes | Time elapsed | |
| initial | Yes | Initial amount | |
| half_life | Yes | Half-life |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavior. It states the calculation formula and input types, and mentions the return value. However, it does not address edge cases (e.g., invalid inputs), unit consistency requirements, or precision of results. The safety profile (read-only/ destructive) is not implied, but for a mathematical tool, the behavior is adequately implied.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is efficient, consisting of four sentences that cover purpose, formula, input explanation, and a pointer to related tools. It is front-loaded with the main action. The inclusion of the formula and the reference to list_bundles are useful, though the latter could be considered slightly extraneous. Overall, it is well-structured and concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (not shown), the description does not need to detail return values. It covers core aspects: purpose, formula, inputs, and domain. However, it lacks guidance on unit consistency (e.g., half-life and time must share units), which is important for correct usage. This gap prevents a higher score.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions, so baseline is 3. The description adds value by providing the formula (N=N0·(0.5)^(t/half-life)), which explains how the parameters interact beyond their individual definitions. This enhances understanding of parameter semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Compute remaining quantity after radioactive decay.' It specifies the domain (physics, carbon dating) and provides the formula, making it distinct from the many other calculate_* tools. The reference to list_bundles for related calculators further aids differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for physics or carbon dating,' indicating appropriate contexts. It also directs users to list_bundles for related science calculators, suggesting boundaries. However, it does not explicitly state when not to use this tool or list specific alternatives, so it falls short of a 5.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rainwater_collectionBInspect
Estimate annual rainwater collection volume from a roof. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| roof_area_m2 | Yes | Roof catchment area in square metres | |
| efficiency_pct | No | Collection efficiency percentage (default 80%) | |
| annual_rainfall_mm | Yes | Average annual rainfall in millimetres |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must carry the burden of behavioral disclosure. It only states 'estimate', which implicitly suggests a read-only calculation, but it does not explicitly confirm that the tool has no side effects, requires no authentication, or has rate limits. The description lacks explicit transparency about non-destructive behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences, the first stating the core purpose. It is front-loaded and free of unnecessary details. The second sentence provides a pointer to related tools, which is marginally useful but not essential. Overall, it is well-structured and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity (3 parameters, simple calculation, output schema present), the description is minimally adequate. It covers the basic purpose but lacks details like the formula or assumptions. The existence of the output schema offsets the need to describe return values. Still, more context could be beneficial.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters described, so the baseline is 3. The description adds no additional meaning beyond the schema—it only mentions 'from a roof', which is already implied by 'roof_area_m2'. Therefore, no extra value is provided.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Estimate annual rainwater collection volume from a roof.' The verb 'estimate' and specific resource 'rainwater collection volume' make the function unambiguous. Among numerous calculate_* siblings, this tool uniquely addresses rainwater collection, so there is no confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus alternatives. The mention of 'list_bundles for related calculators' hints at related tools but does not specify conditions for choosing this tool over other calculate_* tools. There is no explicit when/when-not or alternative directions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_raised_bed_soilBInspect
Compute soil volume needed to fill a raised garden bed. Use for gardening setup. Inputs: length, width, depth (m). Returns soil m³, bag count, mix recommendation (40% compost / 30% topsoil / 30% sand). See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_m | Yes | Raised bed width in meters | |
| depth_cm | No | Raised bed depth in centimeters (default 30cm) | |
| length_m | Yes | Raised bed length in meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses that the tool returns volume, bag count, and mix recommendation, but does not explain behavioral traits like read-only nature, assumptions (e.g., rectangular bed), or edge cases. Adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: one sentence with a clear list of outputs and a referral. It avoids unnecessary fluff. Could be slightly more structured with separate sections for inputs and outputs, but it's efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists and the input schema is complete, the description adequately covers the basics. However, it omits important contextual details like the formula used (length*width*depth), bag size assumptions, and why the mix is 40/30/30. Leaves some gaps for an agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema already describes each parameter with 100% coverage. The description adds context by listing inputs and mentioning bag count, but introduces a unit mismatch (description says 'depth (m)' while schema uses depth_cm). This could cause confusion, so it adds marginal value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states its purpose: compute soil volume for a raised bed. It specifies inputs and outputs (volume, bag count, mix recommendation). However, it does not explicitly differentiate from similar sibling tools like calculate_garden_soil.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only says 'Use for gardening setup' which is vague. It does not provide guidance on when to use this tool versus alternatives or mention any prerequisites. The referral to list_bundles is weak as usage guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ratio_simplifyAInspect
Simplify a ratio to lowest terms using GCD. Use for proportions, mixing, or scaling. Inputs: a, b (and optional c). Returns simplified ratio. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| numerator | Yes | Numerator | |
| denominator | Yes | Denominator |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Describes the use of GCD for simplification and mentions return of simplified ratio. With no annotations, the description adequately covers behavioral traits for a pure calculation tool, but could be more detailed about edge cases or output format.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences and a sibling reference with no wasted words. Front-loaded with purpose and algorithm.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Reasonably complete for a simple tool, covering purpose, algorithm, and usage. However, the inaccuracy about an optional parameter and lack of error/edge-case details reduce completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers both parameters fully, but the description references an 'optional c' that does not appear in the schema, adding confusion. It does not clarify semantics beyond the parameter names.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool simplifies a ratio to lowest terms using GCD, and provides usage contexts like proportions, mixing, or scaling. However, it mentions an 'optional c' input that does not exist in the schema, causing minor confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly suggests use cases (proportions, mixing, scaling) and directs to list_bundles for related math calculators. Lacks explicit when-not-to-use guidance, but the sibling reference helps differentiation.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_reading_timeBInspect
Estimate reading time for a text based on word count. Returns: {hours_minutes}. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| word_count | Yes | Number of words in text | |
| reading_speed_wpm | No | Reading speed words per minute |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the return format and that it is based on word count, but does not disclose details like default reading speed, the calculation formula, or any limitations (e.g., maximum word count). Some behavioral context is missing.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, efficiently conveying purpose and a reference to related tools. It is front-loaded with the core purpose and avoids unnecessary detail. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with full schema coverage and an output schema, the description adequately covers purpose and return format. However, it lacks mention of the optional parameter's default value and does not elaborate on the output structure beyond '{hours_minutes}'. It is slightly incomplete but sufficient for most uses.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents both parameters. The description adds no additional meaning beyond what the schema provides (e.g., no clarification on what constitutes a word or how reading speed affects output). Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Estimate reading time for a text based on word count', which directly conveys the tool's purpose. It mentions the return format '{hours_minutes}', adding specificity. However, it does not mention the optional reading_speed_wpm parameter, which could affect the estimation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only says 'See list_bundles for related education calculators', which hints at alternatives but does not explicitly explain when to use this tool vs. others. No clear when-to-use or when-not-to-use guidelines are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_real_estate_agency_feesAInspect
Calculate French real estate agency fees using sliding scale. Returns: {agency_fees, scale}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sale_price | Yes | Property sale price in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present. The description mentions 'sliding scale' and return fields, but does not detail the scale mechanics or any behavioral constraints (e.g., pure computation, no side effects). It provides basic context but lacks thoroughness.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences convey purpose, method, return structure, and a cross-reference. No extraneous content; front-loaded with core action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-parameter calculator with output schema, the description sufficiently defines input, calculation method, and output fields. The reference to list_bundles aids discoverability. A minor gap: no explanation of the sliding scale thresholds.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with one parameter (sale_price) already described as 'Property sale price in EUR'. The description adds no additional semantics beyond the schema, meeting the baseline for full schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Calculate French real estate agency fees using sliding scale', clearly specifying the verb (calculate), resource (French real estate agency fees), and method (sliding scale). This distinguishes it from many sibling calculators (e.g., tax, loan).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives a hint to 'See list_bundles for related "immobilier" calculators' but does not explicitly state when to use this tool over alternatives (e.g., notary fees, property tax). Usage context is implied but not precise.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_recipe_nutritionCInspect
Compute total calories, protein, carbs, fat for a recipe and per serving. Use for meal planning or nutrition labels. Inputs: list of ingredients with grams, servings count. Returns macro breakdown per serving and total. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| ingredients | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It only states the computation performed, not whether it has side effects, requires authentication, has rate limits, or is read-only. The discrepancy between the described servings count parameter and the actual schema further undermines transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded with key information. It references list_bundles appropriately. However, the incorrect mention of servings count could be removed or clarified to improve precision.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has a single required parameter with nested structure and an output schema exists, the description should clarify the parameter format and the discrepancy about servings. It fails to provide complete context, leaving the agent with incomplete and partially incorrect information.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 0%, so the description must explain parameters. It mentions 'ingredients with grams' and 'servings count', but the schema has no servings parameter, making the description misleading. It also fails to explain the per-100g fields (calories, protein, carbs, fat) required per ingredient.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes total and per-serving nutrition for a recipe, with specific uses like meal planning and nutrition labels. It distinguishes itself from sibling tools by referencing list_bundles for related cuisine calculators, though it doesn't explicitly differentiate from other nutrition calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides explicit use cases (meal planning, nutrition labels) and suggests an alternative via list_bundles. However, it does not state when not to use this tool or provide exclusion criteria, leaving ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_recipe_scaleAInspect
Scale recipe quantities to a new servings count. Use for adjusting recipes. Inputs: ingredients with quantities, original servings, target servings. Returns adjusted quantities. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Original ingredient amount | |
| target_servings | Yes | Target servings | |
| original_servings | Yes | Original servings |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool 'returns adjusted quantities' and implies a read-only calculation, but does not disclose edge cases, validation behavior, or potential side effects. Basic context is provided, but more detail would improve transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loading the purpose and main inputs. It is concise and effective, though it could use structured formatting (e.g., bullet points) for improved clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple scaling tool, the description covers the essential points: purpose, inputs, and output. However, it does not explain the scaling formula (multiply by target/original), and the output schema is not shown (though context indicates it exists). Adequate but not fully complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with each parameter documented in the schema. The description adds little beyond the schema, merely listing inputs generically. Per guidelines, baseline is 3 when coverage is high.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Scale recipe quantities to a new servings count.' It specifies the verb 'scale' and resource 'recipe quantities,' and distinguishes itself from sibling tools like 'calculate_recipe_nutrition' by mentioning 'related cuisine calculators' via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use ('Use for adjusting recipes') and references list_bundles for related calculators, providing some guidance on alternatives. However, it does not explicitly state when not to use this tool or give exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_recipe_scalingBInspect
Scale recipe ingredients up or down by a factor or new servings count. Use for cooking adjustments. Inputs: ingredients list, original servings, new servings. Returns scaled ingredient list. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target | Yes | Target servings | |
| original | Yes | Original servings | |
| ingredients | Yes | Ingredients |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It states the tool returns a scaled ingredient list, implying a pure calculation with no side effects. However, it does not explicitly confirm read-only behavior, statelessness, or any potential restrictions, leaving some ambiguity for a cautious agent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loading the main purpose and then listing inputs/outputs. Every word is necessary; there is no redundancy or irrelevant information. It is a model of conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists (not shown), the description need not detail return values. However, the tool has a very similar sibling ('calculate_recipe_scale') and the description does not clarify the distinction. Also, it lacks guidance on edge cases or prerequisites. For a simple tool, it is adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description's listing of inputs ('ingredients list, original servings, new servings') adds no extra meaning beyond the schema's descriptions. It does not clarify formats, constraints, or relationships; thus it meets baseline but adds no value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool scales recipe ingredients up or down by factor or servings. It uses specific verb 'scale' and resource 'recipe ingredients'. However, it does not differentiate from the sibling tool 'calculate_recipe_scale' which appears very similar, thus lacking full sibling distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only vaguely says 'Use for cooking adjustments' and points to 'list_bundles' for related calculators. It provides no explicit when-to-use or when-not-to-use guidance, nor alternatives for similar tools like 'calculate_recipe_nutrition'. This leaves the agent without clear usage context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_regular_polygonBInspect
Compute area, perimeter, and apothem of a regular n-gon. Use for geometry or tiling. Inputs: number of sides, side length. Returns full geometric properties. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sides | Yes | Number of sides | |
| length | Yes | Side length |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full behavioral burden. It states what it computes but does not disclose return format, error handling, or whether it is read-only. The phrase 'full geometric properties' is vague. For a computation tool, more transparency about output structure or side effects is needed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at three sentences, each serving a purpose: statement of computation, use case, and input/return summary. It is front-loaded with the primary action. No redundant information, but could be slightly more structured (e.g., listing returned properties explicitly).
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (as indicated in context), the description need not detail return fields. However, it does not mention that sides should be integers (schema allows numbers but min 3) or handling of edge cases. For a tool with many siblings, it could better connect to related calculators, but the reference to list_bundles helps. Overall adequate but not thorough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents both parameters. The description merely repeats 'number of sides, side length' without adding new meaning such as expected units, valid ranges beyond schema constraints, or defaults. Thus it adds no semantic value beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes area, perimeter, and apothem of a regular n-gon, with a specific verb ('Compute') and resource ('regular n-gon'). It distinguishes itself from the many sibling 'calculate_*' tools by being a geometry calculator, and references a bundle for related tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It suggests using this tool for geometry or tiling, which provides general context. However, it lacks explicit guidance on when not to use it or alternatives besides referring to list_bundles, which is vague. It does not exclude cases or mention prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rental_profitabilityCInspect
Compute net rental profitability after taxes and charges. Use for real estate investment analysis. Inputs: purchase price, monthly rent, charges, vacancy %, tax bracket. Returns net yield % and cash flow. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_tax | Yes | Annual property tax in EUR | |
| monthly_rent | Yes | Monthly rental income in EUR | |
| purchase_price | Yes | Purchase price in EUR | |
| monthly_charges | Yes | Monthly charges/expenses in EUR | |
| notary_fees_pct | No | Notary fees as % of price (default 8) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It mentions inputs like 'vacancy %' and 'tax bracket' that are not present in the input schema, creating a discrepancy. It does not disclose behavioral traits such as output format details or assumptions, though it mentions returns net yield % and cash flow.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief with three sentences, front-loading the purpose. It efficiently lists inputs and output. However, the inclusion of non-existent parameters detracts from precision, but the structure is otherwise concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists (but not shown), the description should still be complete. It fails to accurately describe all required parameters and introduces spurious ones. For a financial analysis tool, it lacks detail on assumptions and return format beyond yield and cash flow.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All parameters have schema descriptions (100% coverage), so baseline is 3. However, the description lists inputs (vacancy %, tax bracket) that are not actual parameters, causing confusion. It does not add meaningful semantics beyond the schema for the actual parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes net rental profitability after taxes and charges, with specific verb 'Compute' and resource. It distinguishes itself from related real estate calculators by implying a more comprehensive analysis, but does not explicitly differentiate from sibling tools like calculate_rental_yield_net.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only broadly states 'Use for real estate investment analysis' without providing when to use this tool versus alternatives. It suggests seeing list_bundles for related calculators, but offers no specific guidance on when to choose this tool over other rental yield calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rental_yieldAInspect
Calculate gross and net rental yield for a real estate investment. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rent | Yes | Annual rental income in EUR | |
| annual_charges | No | Annual charges/expenses in EUR (default 0) | |
| purchase_price | Yes | Purchase price in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It does not mention whether the tool is read-only, has side effects, or requires any specific permissions. For a calculation tool, this is a notable gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no redundancy. The first sentence states the core function, and the second provides a practical pointer to related tools. Every sentence is warranted.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has an output schema, so return values are covered. Parameters are fully described in schema. The lack of behavioral transparency reduces completeness, but for a simple financial calculation, it is mostly adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents all parameters. The description adds no extra meaning beyond what the schema provides, hence a baseline score of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates gross and net rental yield for real estate, which distinguishes it from many sibling calculators that are specific to gross or net only. It also directs to list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a clear context (real estate investment) and suggests using list_bundles for related calculators, but does not explicitly compare with alternatives like calculate_rental_yield_gross or calculate_rental_yield_net.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rental_yield_grossBInspect
Calculate gross rental yield from property price and monthly rent. Returns: {gross_yield_pct}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| monthly_rent | Yes | Monthly rent in EUR | |
| purchase_price | Yes | Property purchase price in EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full burden. It only mentions the output format (gross_yield_pct) but does not disclose any behavioral details like precision, rounding, error handling, or edge cases.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: the first specifies the core function, the second directs to related tools. No wasted words; front-loaded with essential information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with two parameters and an output schema, the description is nearly complete. It includes inputs, output format, and a link to related tools. Minor missing details (e.g., validation constraints) prevent a perfect score.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with both parameters described in the schema. The description adds no additional semantic information beyond what the schema already provides, so a baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates gross rental yield from property price and monthly rent, with a specific verb and resource. However, it does not differentiate from siblings like calculate_rental_yield or calculate_rental_yield_net, which are closely related.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for gross yield calculation and points to list_bundles for related calculators, but lacks explicit guidance on when not to use or alternatives like net yield tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rental_yield_netAInspect
Calculate net rental yield after charges and vacancy. Returns: {net_yield_pct, effective_annual_rent, net_income}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| monthly_rent | Yes | Monthly rent EUR | |
| vacancy_rate | Yes | Vacancy rate percent | |
| annual_charges | Yes | Annual charges, taxes, insurance EUR | |
| purchase_price | Yes | Property price EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries the full burden. It discloses the returned structure {net_yield_pct, effective_annual_rent, net_income}, indicating a read-only calculation. No contradiction with annotations as none exist.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the purpose, and concise without extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (mentioned in description) and full schema coverage, the description is fairly complete. It covers purpose, return structure, and related tools, leaving few gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters described. The description adds no additional parameter details beyond the schema, meeting the baseline expectation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly specifies 'Calculate net rental yield after charges and vacancy', using a specific verb and resource. It distinguishes itself from sibling tools like calculate_rental_yield_gross by emphasizing 'net' and 'after charges and vacancy'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by mentioning 'net rental yield after charges and vacancy' and directs users to 'list_bundles for related calculators'. While not explicitly comparing to alternatives, it implies when to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rent_increase_irlAInspect
Calculate rent increase allowed by French IRL index. Returns: {new_rent_eur, increase_eur}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| new_irl | Yes | Latest published IRL | |
| old_irl | Yes | IRL at lease start | |
| current_rent | Yes | Current rent EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It states a calculation is performed and returns specific fields, implying no side effects. However, it does not detail any edge cases or assumptions about input limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long: the first states the purpose, the second details the output and points to related tools. It is front-loaded, concise, and every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with full schema coverage and an output description, the description is largely sufficient. It could provide more context about the IRL index or the calculation formula, but it meets the needs for an agent to use the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage for its three parameters. The description does not add any additional meaning or context beyond the schema's own descriptions, so the baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the specific verb and resource: 'Calculate rent increase allowed by French IRL index'. It includes the output structure, distinguishing it from other calculators. The reference to list_bundles provides further context without compromising clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies when to use (French IRL index rent increase) but does not explicitly state when not to use or provide alternatives beyond a reference to list_bundles. No exclusion criteria or prerequisites are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rent_ratioAInspect
Compute the rent-to-income ratio. Use to assess housing affordability (rule of thumb: keep under 33%). Inputs: monthly rent, monthly gross income. Returns rent ratio % and verdict. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rent | Yes | Monthly rent | |
| income | Yes | Monthly income |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states that the tool returns 'rent ratio % and verdict', but the term 'verdict' is vague. It does not disclose whether the tool is read-only or has side effects, though for a calculation tool this is implied. Slightly above average.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise—two sentences that convey purpose, usage, inputs, outputs, and related resources. Every sentence adds value, and it is front-loaded with the core action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given it's a simple two-parameter calculator with an output schema (not shown but mentioned), the description covers purpose, inputs, outputs, and a usage guideline. It could be more detailed about the output format or potential errors, but it is sufficiently complete for the complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions that mostly match the parameter names. The description adds 'gross' to income, clarifying it's gross income, which is a slight improvement. However, the addition is minor, so it meets the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes rent-to-income ratio and provides a specific purpose: assessing housing affordability with a rule of thumb (33%). It uses a specific verb ('compute') and resource ('rent-to-income ratio'), making it highly distinguishable from siblings despite not explicit differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says to use for housing affordability assessment and gives a rule of thumb (under 33%). It also directs to list_bundles for related calculators, providing some guidance on alternatives. However, it could be stronger by explicitly stating when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_retirement_dateAInspect
Estimate retirement date from birth date and country legal retirement age. Returns: {retirement_age, retirement_date, years_remaining, already_retired, note}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| country | Yes | Country: FR=64 years, US=67 years, UK=66 years | |
| birth_date | Yes | YYYY-MM-DD — Date of birth |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It mentions the output includes 'already_retired' and a 'note', but does not disclose how the current date is used, edge cases (e.g., future birth dates), or if the retirement age is fixed. Adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences, front-loaded with purpose and return structure. No extraneous information. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (two parameters, straightforward computation), the description covers the essential: input, output fields, and a pointer to related tools. For a tool with no output schema, listing the return fields is adequate. Minor gap: does not mention how current date is determined for remaining years.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, and both parameters have clear descriptions in the schema (enum values with ages for country, format for birth_date). The description does not add significant meaning beyond the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it estimates retirement date from birth date and country legal retirement age. It specifies the return fields. While there are sibling tools like calculate_retirement_pension, the purpose is distinct and well-defined.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal usage guidance: it references list_bundles for related calculators but does not explain when to use this tool versus alternatives or mention prerequisites. No explicit when-to-use or when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_retirement_pensionAInspect
Estimate French basic retirement pension (retraite de base Assurance Vieillesse). Returns: {average_salary_best25, annual_pension, max_monthly_pension, prorata_pct}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| target_years | No | Target quarters for full pension (default 172 = 43 years) | |
| years_contributed | Yes | Total years of contribution | |
| average_salary_best25 | Yes | Average annual salary of best 25 years in euros |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It states 'Estimate', implying a read-only calculation, but does not explicitly confirm idempotency, side effects, or prerequisites. Adding that it is safe to call multiple times would improve transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states purpose and returns, second directs to related tools. Front-loaded, no unnecessary words. Efficient but could include a brief usage context without being wordy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 3 parameters and an output schema (described in text), the description is adequate but not rich. It omits details like the meaning of 'best 25 years' or prorata calculation. For a French pension tool, more context on what the estimation assumes would be helpful.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description does not add extra meaning to parameters beyond the schema's property descriptions, but the output format is helpful. No additional parameter semantics are provided.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Estimate French basic retirement pension' with a specific verb and resource, and distinguishes from sibling calculators (e.g., Belgian pension) by specifying the French system. It also lists the return fields, adding clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions seeing list_bundles for related calculators, implying a family of tools, but does not explicitly state when to use this tool versus alternatives or provide exclusions (e.g., supplementary pensions). It gives a pointer but lacks direct guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_retirement_savings_gapAInspect
Project retirement savings vs need and identify shortfall. Use for retirement planning. Inputs: current age, retirement age, current savings, monthly contribution, target income. Returns projected balance and gap. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| current_age | Yes | Current age | |
| savings_rate | Yes | Annual return rate percent | |
| monthly_income | Yes | Desired monthly retirement income EUR | |
| retirement_age | Yes | Target retirement age | |
| current_savings | Yes | Current savings EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It states it returns projected balance and gap, and lists inputs, but it lacks disclosure of underlying assumptions (e.g., whether inflation is considered, if returns are nominal/real, or any default values). This is adequate but not rich.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with purpose, followed by usage context and input/output overview. It is concise but has a minor inaccuracy in input listing.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
An output schema exists, so the description need not detail return values. However, for a financial projection tool with 5 parameters and no annotations, the description is minimal: it does not explain the calculation method, assumptions, or limitations. Adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 5 parameters with 100% schema coverage, but the description inaccurately mentions 'monthly contribution' while schema has 'monthly_income' and 'savings_rate'. It omits 'savings_rate' and adds 'target income' ambiguously. This mismatch undermines clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Project retirement savings vs need and identify shortfall'. It uses a specific verb 'project' and resource 'retirement savings', and distinguishes from siblings by referencing 'list_bundles for related finance-universal calculators'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It explicitly says 'Use for retirement planning', which indicates when to use. It also points to an alternative resource (list_bundles) for related calculators. However, it does not provide explicit when-not-to-use scenarios or specific alternative tool names from the vast sibling list.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_reverb_predelayAInspect
Calculate optimal reverb pre-delay based on room size and musical tempo. See list_bundles for related 'musique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bpm | No | Tempo in BPM (used to snap pre-delay to musical grid) | |
| room_length_m | Yes | Room length in meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It describes a calculation but does not mention side effects, idempotency, or authorization needs. For a calculation tool, this is minimally adequate but lacks transparency on potential constraints.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences, no wasted words, and front-loaded with the core purpose. It efficiently communicates the tool's function and directs to related resources.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description is fairly complete. It covers inputs and purpose, and references sibling tools. However, it could mention the return value or unit of pre-delay, though an output schema exists to handle that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description adds the context of 'optimal' calculation and references the parameters ('room size and musical tempo'), but does not extend meaning beyond the schema's existing parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate optimal reverb pre-delay based on room size and musical tempo.' It uses a specific verb ('calculate') and resource ('reverb pre-delay'), and the mention of 'list_bundles' distinguishes it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies the usage context by referencing related 'musique' calculators via 'list_bundles', but does not explicitly state when to use this tool over alternatives or when not to use it. No exclusions or alternatives are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ring_sizeBInspect
Find ring size in EU/US/UK/JP given finger circumference or diameter. Use for jewelry shopping. Inputs: circumference mm or diameter mm. Returns size in EU, US, UK, JP systems. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| from_system | Yes | ||
| circumference_mm | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description must convey behavior. It states the tool returns ring sizes based on input measurements, implying a read-only calculation. However, it does not explicitly state side effects (if any), required permissions, or edge cases. The description is minimally adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief (3 sentences) and front-loads the purpose. The phrase 'Use for jewelry shopping' is slightly redundant but not excessively verbose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 2 parameters and an output schema, the description covers inputs and output value mapping. However, it lacks details on how 'from_system' affects the conversion, what happens with invalid inputs, and whether diameter is actually accepted. The output schema exists but its contents are not shown; the description adds little beyond what the schema likely provides.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema documentation coverage is 0%, so the description should compensate. It mentions both circumference and diameter as inputs, but the schema only includes 'circumference_mm', creating a contradiction. This mismatch reduces clarity for agents.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool finds ring sizes in multiple systems (EU/US/UK/JP) given finger circumference or diameter, with a specific use case (jewelry shopping). However, it does not explicitly distinguish itself from the sibling tool 'calculate_ring_size_convert', which likely converts between systems.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a usage context ('Use for jewelry shopping') but does not specify when not to use the tool or mention alternatives such as 'calculate_ring_size_convert' for conversion tasks.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_ring_size_convertCInspect
Convert ring sizes between EU, US, UK, and Japan systems. Use for international jewelry shopping. Inputs: size, from-system, to-system. Returns equivalent size. See list_bundles for related 'textile-mode' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | Yes | Ring size in source system | |
| from_system | Yes | Source sizing system |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as idempotency, size range constraints, or return format. For a simple conversion tool, the description fails to add meaningful behavioral context beyond the schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but contains an inaccurate parameter listing. While concise, the inaccuracy undermines its utility. Every sentence should be accurate; this one is not.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that the tool has two parameters and no output schema in the input, the description does not fully explain how the conversion works or what the return value structure looks like. The missing 'to_system' parameter is a critical gap.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description mentions 'Inputs: size, from-system, to-system' but the schema does not include a 'to_system' parameter. This is a factual error that misleads the agent. Since schema coverage is 100% and the description introduces false information, it fails to add value and actively harms understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Convert ring sizes between EU, US, UK, and Japan systems' with a specific verb and resource. However, it mentions 'EU, US, UK, and Japan' while the schema uses 'FR, US, UK, EU, JP', causing minor inconsistency. Still, overall purpose is clear.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description says 'Use for international jewelry shopping' and references 'list_bundles for related textile-mode calculators', providing limited context. No explicit when-not-to-use or direct comparison with similar sibling tools like 'convert_shoe_size'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_roiAInspect
Compute Return on Investment as a percentage. Use to evaluate investments, marketing spend, or projects. Inputs: investment cost, return value. Returns ROI %, profit, and multiple. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| investment | Yes | Initial investment amount | |
| return_value | Yes | Final value or total returns |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the burden of behavioral disclosure. It mentions the return values (ROI %, profit, and multiple) but does not discuss side effects, permissions, or edge cases. This is adequate for a simple calculation but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences: purpose, usage context, and a summary of inputs/outputs. No fluff or redundant information. Front-loaded with the core action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that an output schema exists, the description does not need to detail return values, though it does. It lacks discussion of error cases or assumptions, but for a simple tool with constrained inputs, it is largely complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema provides full descriptions for both parameters (investment and return_value). The description adds minimal value by restating 'investment cost, return value' in a human-readable way. Since schema coverage is 100%, baseline is 3; the description does not enhance understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute Return on Investment as a percentage', which is a specific verb and resource. It mentions use cases (investments, marketing spend, projects) but does not explicitly differentiate from sibling tools like profit margin or break-even calculators. The purpose is clear but lacks explicit distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context for when to use ('evaluate investments, marketing spend, or projects') but offers no guidance on when not to use or alternatives. It references list_bundles for related calculators, but that is not a direct exclusion.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_roman_numeralAInspect
Convert between Roman numerals and decimal (1-3999). Returns: {roman_numeral, decimal}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| value | Yes | Decimal number to convert to Roman numeral |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the conversion direction (from decimal to Roman, as input is integer), the range, and the output structure. It does not mention error handling or bidirectional conversion explicitly, but the input schema implies one direction. Overall good context, not perfect.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words. Front-loaded with purpose and range, then a pointer to related tools. Efficient and to the point.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple conversion tool with one parameter, the description covers purpose, input, output format, range, and a reference to related tools. No output schema provided, but the description compensates. Complete for this complexity level.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% for the single parameter 'value', which already describes its type and range. The description adds overall context ('convert between') but does not add new parameter-specific meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Convert' and the resources 'Roman numerals and decimal', with a specific range (1-3999). It distinguishes from sibling tools by directing to list_bundles for related conversion calculators. This is a specific verb+resource with sibling differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for converting between Roman numerals and decimal within the given range. It mentions list_bundles for related conversions, providing context for alternatives. However, it does not explicitly state when not to use this tool or list exclusions, so just short of a 5.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_roof_areaAInspect
Calculate roof surface area from building footprint and slope angle. Returns: {rafter_length_m, net_roof_area_m2, with_5pct_overhang_m2}. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| base_width_m | Yes | Building width in meters | |
| base_length_m | Yes | Building length in meters | |
| slope_degrees | Yes | Roof slope in degrees |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It discloses the return fields (rafter_length_m, net_roof_area_m2, with_5pct_overhang_m2), but does not mention side effects, permissions, or idempotency. As a calculation tool, behavior is mostly implied.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with purpose, no wasted words. Efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 parameters, returns 3 values), the description is adequate. It mentions the return fields, which is helpful since no output schema is shown. Minor gap: no mention of units constraints beyond schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameters already described. The description adds no new meaning beyond 'building footprint and slope angle', so baseline 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates roof surface area from building footprint and slope angle, with a specific verb and resource, and distinguishes it from the many sibling calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description includes a reference to list_bundles for related construction calculators, but does not explicitly specify when to use this tool versus alternatives like calculate_roof_truss.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_roof_trussAInspect
Calculate roof truss dimensions, rafter length and material quantities for a pitched roof. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| span_m | Yes | Total roof span in meters (full width) | |
| load_kg_m2 | No | Total roof load in kg/m² including snow, wind and tiles (default 150) | |
| spacing_cm | No | Distance between trusses/rafters in cm (default 60cm) | |
| pitch_degrees | Yes | Roof pitch angle in degrees |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It describes what the tool calculates but does not disclose behavioral traits such as being a pure calculation with no side effects, which is typical for such tools. The description is adequate but not explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences: the first explains what the tool does, and the second directs to related tools. No unnecessary words, well-front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema and low complexity, the description is nearly complete. It covers the purpose and points to related tools, though it could mention the formula basis or a note about standard truss assumptions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description does not add any additional meaning beyond the schema; it does not explain default values or constraints like minimum pitch.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates roof truss dimensions, rafter length, and material quantities for a pitched roof. It is specific and distinguishes itself from other calculation tools, though it could explicitly differentiate from siblings like calculate_roof_area or calculate_beam_load.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests seeing list_bundles for related construction calculators, providing some guidance. However, it does not specify when to use this tool versus alternatives, nor does it mention prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rule_of_72AInspect
Estimate years to double an investment using the Rule of 72. Returns: {doubling_years_rule72, doubling_years_precise, annual_rate_pct}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rate | Yes | Annual return rate percent |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Given no annotations, the description discloses the behavior (Rule of 72 approximation), outputs both rule72 and precise values, and mentions the parameter. No side effects or limitations are mentioned, but for a simple calculator it is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, no wasted words, front-loaded with purpose and outputs. Efficient and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple one-parameter tool with an output schema, the description is complete: it explains the purpose, expected output, and suggests related tools. No gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes the parameter. The description adds minimal extra meaning beyond the schema, only contextualizing it as input for the Rule of 72. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates years to double an investment using the Rule of 72, and lists the output fields. It distinguishes from siblings by mentioning related calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not explicitly state when to use this tool versus alternatives (e.g., compound interest calculators). It only implies usage for doubling time estimation and references list_bundles, but lacks when-not-to-use or exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_rule_of_threeCInspect
Solve rule of three (cross-multiplication): if a→b, then c→? Use for proportions, recipe scaling, or unit pricing. Inputs: a, b, c. Returns x. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | Known value A | |
| b | Yes | Corresponding value B | |
| x | Yes | New value of A |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description should disclose behavioral traits like idempotency or side effects. It fails to do so, and moreover, the input/output mismatch between description and schema creates a misleading behavior profile.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (two sentences plus bundle reference) but contains an inaccuracy. While efficient in length, the error reduces its utility.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity of the tool and presence of an output schema, the description should be sufficient but is not due to the parameter mismatch. It omits that 'x' is an input and does not clarify the formula, leaving agents with incomplete and conflicting information.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description adds confusion by claiming inputs are 'a, b, c' while the schema lists 'a, b, x' as required. This contradicts and undermines the schema's own parameter descriptions, which are already minimal ('Known value A', etc.).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states the tool solves rule of three and provides use cases (proportions, recipe scaling, unit pricing). However, it incorrectly lists inputs as 'a, b, c' while the schema requires 'a, b, x', causing confusion and reducing clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives useful use cases but does not explicitly exclude other tools or explain when not to use. The mention of 'list_bundles' provides some context but is not a direct alternative comparison.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_running_paceAInspect
Calculate running pace (min/km) and speed (km/h) from distance and time. Returns: {pace_min_per_km, pace_formatted}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| distance_km | Yes | Distance in kilometers | |
| time_minutes | Yes | Total time in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool calculates and returns values, but does not disclose edge cases, error handling, or side effects. For a simple calculator, this is adequate but lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences, front-loading the core purpose and including return format and a cross-reference. No unnecessary information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator tool, the description covers purpose, input, and output. It does not provide examples or constraints beyond schema, but overall it is sufficient given the tool's simplicity and the presence of many similar siblings.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already includes descriptions for both parameters (distance_km and time_minutes) with 100% coverage. The description does not add additional meaning beyond what the schema provides, meeting the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates running pace (min/km) and speed (km/h) from distance and time, and specifies the return format. It distinguishes itself from other 'calculate' tools by focusing on running metrics.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context for when to use the tool (given distance and time) and directs users to 'list_bundles' for related sport calculators. However, it does not explicitly state when not to use it or mention prerequisites beyond the required parameters.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_salary_comparison_pppAInspect
Compare salaries across countries using PPP (FR=0.79, US=1.0, UK=0.81, DE=0.77, CH=1.36, BE=0.80). Returns: {ppp_from, ppp_to, equivalent, ratio}. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| salary | Yes | Salary in local currency | |
| to_country | Yes | Target country | |
| from_country | Yes | Source country |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description is the sole source of behavioral information. It does not disclose whether the tool is read-only, any side effects, authentication needs, or rate limits. The description lacks transparency about the tool's behavior beyond its core function.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at two sentences, front-loading the purpose. It efficiently combines purpose, example PPP values, return fields, and a pointer to related tools. Minor improvement could separate examples from core info.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the main purpose, parameter constraints via schema, and return fields. However, it lacks clarification on currency assumptions (e.g., is salary in from_country's currency?) and does not explain the PPP factors beyond examples. With a simple tool and output schema, some gaps remain.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptive parameter names and enums, so the description adds minimal extra meaning. It offers example PPP values but does not clarify currency assumptions or parameter usage beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool compares salaries across countries using PPP, with a specific verb 'Compare' and resource 'salaries'. It distinguishes from sibling salary calculators by focusing on cross-country comparison rather than single-country calculations.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use this tool (comparing salaries with PPP) and directs users to list_bundles for related calculators, but does not provide explicit exclusions or when-not-to-use scenarios. The guidance is helpful but not exhaustive.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_salary_hourly_to_annualCInspect
Convert hourly wage to monthly and annual salary, gross or net. Use for job comparisons. Inputs: hourly rate, hours/week, weeks/year. Returns weekly, monthly, and annual figures. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| hourly_rate | Yes | Hourly rate | |
| hours_per_week | No | Hours worked per week | |
| weeks_per_year | No | Weeks worked per year |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It does not explicitly state that the tool is read-only (no side effects) or that the output is purely based on hours (no tax deductions). The claim of generating both 'gross or net' figures is misleading without a tax input.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at three sentences, with the key information front-loaded. There is minor redundancy (mentioning monthly and annual twice), but overall efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple, but the description lacks guidance on when to prefer this over the many country-specific salary tools. The mention of 'gross or net' creates confusion about the output. For a basic converter, the description is mostly adequate but incomplete in terms of context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description adds minimal value beyond listing the inputs. It mentions the inputs' purpose ('hourly rate, hours/week, weeks/year'), which is already clear from the schema. The addition of 'job comparisons' provides mild extra context.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts hourly wage to monthly and annual salary, and mentions use for job comparisons. However, the phrase 'gross or net' is ambiguous since no tax inputs are provided, and the tool likely only computes gross figures. This slight confusion prevents a perfect score.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for job comparisons' and references related calculators via list_bundles, but does not explicitly differentiate from the many country-specific salary tools among siblings. It does not state when to use this simple conversion versus a country-specific net salary calculator.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sample_sizeAInspect
Compute required sample size for a survey to hit a target margin of error. Use for survey design and A/B testing. Inputs: population, confidence %, margin of error %. Returns minimum sample size. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| confidence | No | Confidence level | 95 |
| population | No | Population size | |
| margin_error_pct | Yes | Margin of error % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description is the sole source of behavioral information. It states the tool 'Returns minimum sample size', indicating it is a read-only computation. However, it does not disclose assumptions (e.g., simple random sampling, normal approximation) or potential constraints, which would be helpful for an agent to assess correctness.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, with the core purpose front-loaded in the first sentence. Every sentence adds value: the first states the primary function, the second clarifies usage context and directs to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the essential purpose, usage context, inputs, and output. Given that an output schema exists (as indicated in the context signals), the description does not need to detail return values. It is complete for a simple calculator tool, though it could mention underlying assumptions for completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All three parameters (population, confidence %, margin of error %) are listed in the description, but the schema already provides full coverage (100%) with descriptions for each. The description adds no additional meaning beyond what the schema already contains, so it meets the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes required sample size for a survey given a target margin of error, specifying the action (compute), resource (sample size), and context (survey design and A/B testing). The verb 'Compute' is specific and the resource 'sample size' is distinct from siblings like calculate_confidence_interval, which computes confidence intervals rather than sample sizes.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for survey design and A/B testing', providing clear context for when to use the tool. It also directs users to 'list_bundles for related math calculators' for alternative tools, but does not explicitly state when not to use it or list exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_savings_goalCInspect
Compute time and monthly contribution needed to reach a savings target. Use for goal-based personal finance. Inputs: target amount, current savings, monthly contribution, annual return %. Returns months to goal. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rate | Yes | Annual return rate percent | |
| target_amount | Yes | Savings target EUR | |
| monthly_savings | Yes | Monthly savings EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must fully disclose behavioral traits. It only states the computation and return value (months to goal), with no mention of side effects, permissions, rate limits, or safety. A calculation tool is likely read-only, but the description does not confirm this.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise (two sentences plus a pointer to list_bundles) and front-loaded with the core purpose. Every sentence contributes value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple nature of the tool (savings goal calculation), the description covers the basic inputs and output. However, it fails to mention edge cases (e.g., negative rates, zero contributions) and includes a misleading extra parameter ('current savings'). With an output schema present (though not shown), the description could be improved by clarifying the output structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. However, the description lists 'current savings' as an input, which is not in the schema (possible error), and provides no additional semantic meaning beyond the schema's own descriptions. This inaccuracy reduces the score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes time and monthly contribution for a savings target, using a verb (compute) and resource (savings goal). It is useful for goal-based personal finance, but does not explicitly differentiate from similar tools like calculate_compound_interest or calculate_future_value among the many siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for goal-based personal finance' but provides no guidance on when not to use the tool or mention alternatives. No explicit exclusions or comparisons to sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_scholarship_comparisonAInspect
Compare net tuition cost across multiple scholarships and aid packages. Use for college choice. Inputs: list of {tuition, aid} pairs. Returns ranked net costs and best option. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| tuition | Yes | Annual tuition EUR | |
| scholarship_1 | No | Scholarship 1 amount EUR | |
| scholarship_2 | No | Scholarship 2 amount EUR | |
| scholarship_3 | No | Scholarship 3 amount EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states inputs as 'list of {tuition, aid} pairs' but the schema expects separate tuition and three scholarship fields, creating a slight inconsistency. It mentions returns 'ranked net costs and best option' but lacks detail on output format. The output schema exists, so this is acceptable, but the input description could mislead.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (4 sentences) and well-structured: first sentence states purpose, second provides use case, third explains inputs and output summary, fourth points to related tool. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's moderate complexity (comparing scholarships), the description covers purpose, use case, input concept, output summary, and a link to related calculators. The output schema exists, so return values are sufficiently covered. Minor deduction for the input description mismatch with the actual schema structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%: each parameter (tuition, scholarship_1-3) has a description with units (EUR). The description adds a conceptual view ('list of {tuition, aid} pairs') but does not significantly enhance understanding beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Compare net tuition cost across multiple scholarships and aid packages.' It specifies the resource (net tuition cost) and the action (compare), and adds context 'Use for college choice.' This distinguishes it from the many other calculate tools in the sibling list.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives usage guidance: 'Use for college choice.' It also directs to list_bundles for related calculators. However, it does not explicitly state when not to use this tool or mention specific alternatives beyond the general pointer.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sci_is_vs_irAInspect
Compare SCI taxation under IS (corporate tax) vs IR (income tax). Use for French real-estate investors choosing tax regime. Inputs: rental income, charges, owner tax bracket. Returns net result under each regime. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rent | Yes | Annual gross rental income in EUR | |
| annual_charges | Yes | Annual deductible charges in EUR (management fees, interest, maintenance) | |
| property_value | Yes | Property value for amortization calculation under IS | |
| marginal_tax_rate_pct | Yes | Shareholder marginal income tax rate in percent (e.g. 30, 41, 45) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must cover behavioral aspects. It states inputs and 'Returns net result under each regime,' indicating a pure calculation with no side effects. However, it does not disclose any prerequisites or limitations beyond what is obvious.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences: purpose, usage, and return value plus reference. Every sentence adds value, and it is front-loaded with the key action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (not shown), the description adequately covers inputs, purpose, target audience, and returns. It also references related tools via list_bundles, making it self-contained for an investor.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds context by noting 'property value for amortization calculation under IS,' which explains the purpose of the property_value parameter beyond the schema description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool compares SCI taxation under IS vs IR, targeting French real-estate investors. It uses specific verbs ('Compare') and resources ('SCI taxation under IS vs IR'), distinguishing it from many 'calculate_' siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description clearly states when to use the tool: 'Use for French real-estate investors choosing tax regime.' It also references related tools ('See list_bundles...'), but does not explicitly state when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_seed_quantityBInspect
Calculate the number of seeds needed based on surface area, spacing and germination rate. See list_bundles for related 'jardinage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| surface_m2 | Yes | Surface area in square meters | |
| row_spacing_cm | Yes | Distance between rows in centimeters | |
| plant_spacing_cm | Yes | Distance between plants in a row in centimeters | |
| germination_rate_pct | No | Germination rate in percent (default 85%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool calculates seeds needed, but does not disclose whether it is read-only, stateless, has side effects, or requires authentication. For a simple calculation, basic transparency is missing.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: one sentence plus a referral. It is front-loaded with the main action. While efficient, it could add a bit more context without becoming verbose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the completeness of the input schema (100% coverage), the description is adequate for an agent to understand the basic purpose and parameters. It does not need to explain return values due to the output schema, though it could mention the optional parameter.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% parameter description coverage, so the description adds little extra meaning beyond paraphrasing the schema. It mentions surface area, spacing, and germination rate, matching the parameters, but offers no additional semantics like units or relationships.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates the number of seeds needed based on surface area, spacing, and germination rate. It specifies the verb and resource, and references a bundle of related calculators, though it does not explicitly distinguish from other gardening calculator siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal usage guidance: it directs users to 'list_bundles' for related 'jardinage' calculators but does not explain when to use this tool versus alternatives like calculate_lawn_seed or calculate_garden_soil. No when-not scenarios or explicit criteria are given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_senegalese_cssBInspect
Calculate Senegalese social contributions (CSS/IPRES) for employee and employer. Returns: {gross_monthly_xof, employee, employer}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| accident_rate_pct | No | Work accident insurance rate 1-5% (employer only, default 3%) | |
| gross_monthly_xof | Yes | Gross monthly salary in XOF |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It only states the purpose and return format, failing to disclose idempotency, side effects, or authentication needs. For a calculation tool, read-only nature is implied but not confirmed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise (two sentences), directly stating purpose, outputs, and a pointer to related tools. Every sentence adds value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 2 parameters, both well-documented in schema, and an output schema exists, the description is minimally complete. However, it lacks context about calculation methodology (e.g., rates, caps) and does not explain what CSS/IPRES covers, which could limit an agent's understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, with both parameters clearly described (gross monthly salary, accident rate range and default). The description adds no new meaning beyond what the schema provides, so baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates Senegalese social contributions (CSS/IPRES) for both employee and employer, and specifies the return structure. This distinctively separates it from similar tools like calculate_belgian_social_contributions or calculate_moroccan_cnss.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not provide explicit guidance on when to use this tool vs alternatives. It merely suggests seeing list_bundles for related calculators, but no context on prerequisites, excluded scenarios (e.g., independent contractors), or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_senegalese_income_taxAInspect
Calculate Senegalese income tax (IRPP) using DGI progressive brackets in XOF. Returns: {annual_income_xof, income_tax_xof, effective_rate_pct, marginal_rate_pct, brackets}. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_income_xof | Yes | Annual gross income in CFA Francs (XOF) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It explains the calculation uses progressive brackets and returns specific fields, but does not disclose whether the operation is read-only, requires authentication, or has side effects. For a mathematical tool, the transparency is adequate but not rich.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first covers purpose and return values, the second mentions related tools. Every sentence adds value, no fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter, one required), the presence of an output schema covering return values, and clear annotations (none needed), the description is complete enough. The mention of progressive brackets and XOF provides sufficient context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a clear description of the parameter. The description adds context by specifying 'in XOF' and referencing 'DGI progressive brackets', enhancing understanding beyond the schema alone. With high coverage, the baseline is 3, and the additional context justifies a 4.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description specifies it calculates Senegalese income tax using DGI progressive brackets in XOF, clearly stating the verb (calculate), resource (Senegalese income tax), and method (brackets). It distinguishes itself from numerous sibling tax calculators for other countries.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions a related bundle via list_bundles but does not explicitly state when to use this tool versus alternatives. While the name implies Senegal-specific usage, there is no guidance on prerequisites or situations where other tools are preferred.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_senegalese_vatAInspect
Compute Senegalese VAT (TVA) — 18% standard rate. Use for invoicing in Senegal. Inputs: amount, mode (ht/ttc). Returns HT, TTC, tax amount. See list_bundles for related 'finance-afrique-quebec' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | Input mode: ht=hors taxe, ttc=toutes taxes comprises | ht |
| rate | No | TVA rate in % (standard 18%) | |
| amount | Yes | Amount in XOF |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. It implies a safe, non-destructive calculation tool by listing inputs and outputs, but does not explicitly state that it performs no side effects or modifications. Given the tool's nature, this is adequate but not exemplary.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences cover purpose, inputs, outputs, and a pointer to related tools. No unnecessary words, perfectly front-loaded with the core action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with 3 well-documented parameters and an output schema (implied by 'Returns HT, TTC, tax amount'). The description is nearly complete; the only missing element is a brief note about the rate parameter (already in schema) and an explicit statement of read-only behavior, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, already documenting all parameters (amount, mode, rate). The description merely restates 'amount, mode (ht/ttc)' and does not add new semantics beyond what the schema provides. Thus, baseline 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Compute', the resource 'Senegalese VAT (TVA)', specifies the standard 18% rate, and distinguishes it from other VAT calculators by specifying Senegal. The purpose is specific and unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description states 'Use for invoicing in Senegal,' providing clear context for use. It hints at alternatives via the reference to list_bundles for related calculators, but does not explicitly state when not to use this tool (e.g., for other countries). This is a minor gap.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sequenceAInspect
Calculate nth term and sum of arithmetic or geometric sequence. Returns: {common_difference, nth_term, sum_of_n}. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| n | Yes | Number of terms | |
| type | Yes | Sequence type | |
| common | Yes | Common difference (arithmetic) or ratio (geometric) | |
| first_term | Yes | First term (a1) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full burden of behavioral disclosure. It states the tool calculates a sequence and returns specific fields, but does not mention any side effects, permissions, rate limits, or assumptions (e.g., valid inputs, error handling).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first states the core purpose clearly, the second lists the return object and points to related tools. No unnecessary words, every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists (context signals indicate it has one), the description appropriately lists the return keys. It covers the core functionality, though missing edge case handling. For a simple tool with rich schema, this is reasonably complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, so the schema already defines each parameter. The description adds no further semantic meaning beyond the return object keys, which hints at the parameters' roles. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses specific verbs ('calculate') and resources ('nth term and sum of arithmetic or geometric sequence'), clearly stating the tool's purpose. It distinguishes from numerous sibling calculators by naming the exact mathematical operation, and the mention of related calculators via list_bundles provides context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains what the tool does but gives no guidance on when to use it over alternatives like the many other calculator tools. The reference to list_bundles is vague and does not provide explicit when-to-use or when-not-to-use conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_severance_payAInspect
Calculate French severance pay for rupture conventionnelle or licenciement. Returns: {monthly_salary, years_first_10, years_above_10, gross_indemnite, note}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| monthly_salary | Yes | Reference gross monthly salary in euros | |
| years_seniority | Yes | Years of seniority in the company |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries the burden of disclosing behavioral traits. It states the return shape but does not explicitly indicate that the tool is read-only (no side effects) or mention authorization needs or rate limits. However, the action is inherently a calculation, so the lack of explicit safety note is a minor gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading the purpose and return shape, then directing to related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, clear return shape), the description covers purpose, return fields, and related tools. However, it lacks explicit usage guidelines and behavioral transparency, preventing a perfect score.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and both parameters have clear descriptions in the schema. The description does not add additional meaning beyond the schema, so the baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly specifies the verb ('Calculate'), the resource ('French severance pay'), and the context ('for rupture conventionnelle or licenciement'). It distinguishes from sibling tools by being the only severance pay calculator among many calculation tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for French severance pay and points to related calculators via 'See list_bundles...', but it does not explicitly state when to use this tool versus alternatives, nor does it provide exclusions or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_shipping_volumetricAInspect
Compute volumetric (dimensional) weight for shipping. Carriers bill the higher of actual and dim weight. Inputs: L×W×H cm, divisor (5000 air, 6000 ground). Returns dim weight kg. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| width_cm | Yes | Width cm | |
| actual_kg | Yes | Actual weight kg | |
| height_cm | Yes | Height cm | |
| length_cm | Yes | Length cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It transparently discloses the calculation formula (L*W*H/divisor) and mentions typical divisor values (5000 air, 6000 ground). It does not detail rounding or precision, but for a non-destructive calculation, it is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with three sentences: first sentence states purpose, second provides inputs and output, third points to related tool. No wasted words, front-loaded with purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool with full schema coverage and an output schema (present but not shown), the description is mostly complete. It explains the core concept, inputs, and output. Could mention precision or rounding, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage with basic descriptions. The description adds value by explaining the context (shipping) and clarifying that dimensions are in cm and output in kg. It also discusses the divisor concept (though not a parameter), which helps understanding. Exceeds the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes volumetric weight for shipping, explaining the concept of billing based on the higher of actual and dimensional weight. It specifies inputs (L×W×H cm, divisor) and output (dim weight kg). However, it does not explicitly differentiate from similar calculate_ tools, though the context is distinct.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains when to use (for shipping dim weight) and hints at context (carriers bill higher). It mentions a related tool list_bundles but does not provide explicit when-not-to-use or alternative tools among the many siblings. More guidance on when not to use would improve.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_shoe_sizeCInspect
Convert shoe size between EU, US, UK, and Japanese systems. Use for international shopping. Inputs: size, from-system, to-system, gender. Returns equivalent. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | Yes | Shoe size | |
| from_system | Yes | From system |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description must disclose behavioral traits. It states 'Returns equivalent' without explaining what happens for unsupported conversions, invalid sizes, or how the conversion works. The mismatch between described inputs ('to-system, gender') and schema undermines transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short with three sentences, but the third sentence listing inputs is inaccurate. It is concise but not entirely accurate, which reduces effectiveness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multiple shoe size systems) and the missing 'to-system' and 'gender' parameters in the schema, the description fails to provide complete context. It does not explain which conversions are possible, what the output looks like, or how to handle the missing parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with basic descriptions for 'size' and 'from_system'. However, the description lists additional parameters 'to-system' and 'gender' that are not present in the schema, creating confusion. This detracts from the semantics provided by the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Convert shoe size between EU, US, UK, and Japanese systems,' which specifies the verb and resource. However, it does not differentiate this tool from sibling tools like 'convert_shoe_size' and 'calculate_shoe_size_convert', and the mention of 'Japanese' is inconsistent with the schema's from_system enums.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using the tool for 'international shopping' and references 'list_bundles' for related calculators. This provides some context but lacks explicit guidance on when to use this tool versus alternatives like 'convert_shoe_size', and does not mention prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_shoe_size_convertAInspect
Convert shoe size between EU, US (M/W) and UK systems. Returns: {converted_size, eu_size, original_size}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| size | Yes | Shoe size in source system | |
| to_system | Yes | Target sizing system | |
| from_system | Yes | Source sizing system |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses the return format (object with converted_size, eu_size, original_size), which adds some transparency. However, it does not mention whether the operation is read-only, destructive, or requires any special permissions. For a simple conversion tool, this is adequate but not exhaustive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description consists of two concise sentences. The first sentence clearly states the action and scope, and the second lists the return fields along with a pointer to related tools. No unnecessary information; it is well-structured and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, the description covers the essential purpose and output. The presence of an output schema (as indicated by context signals) means the description does not need to detail return values. However, it lacks guidance on edge cases (e.g., invalid sizes or ambiguous conversions) and does not mention rounding or precision. Still, for a basic conversion tool, it is nearly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with parameter descriptions. The description reiterates the conversion systems and return structure but does not add new semantic meaning beyond what the schema already provides. Thus, baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the purpose: convert shoe sizes between EU, US (M/W), and UK systems. It is specific about the conversion task and mentions the involved systems. However, it does not differentiate itself from the sibling tool 'convert_shoe_size', which appears in the sibling list and may have overlapping functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when shoe size conversion is needed, but it lacks explicit guidance on when to use this tool versus alternatives. The mention of 'See list_bundles for related conversions calculators' is a weak pointer to other conversion tools, but no specific exclusions or contextual cues are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_simple_interestAInspect
Compute simple interest I=P·r·t. Use for short-term loans, basic savings accounts, and homework. Returns interest amount and final balance. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | Yes | Duration in years | |
| principal | Yes | Initial amount | |
| annual_rate | Yes | Annual interest rate in % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool returns 'interest amount and final balance', which is sufficient for a computational tool. It does not discuss error handling or edge cases, but given the simplicity, this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise—only two sentences—yet includes purpose, formula, use cases, output, and a cross-reference. Every sentence is informative with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 required parameters, no nested objects, output schema present), the description is complete. It explains what the tool does, when to use it, and what it returns. The presence of an output schema covers return details.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions (principal, annual_rate, years). The description adds the formula I=P·r·t, clarifying how the percentage-based annual rate is used, which enhances understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly specifies the verb ('Compute'), resource ('simple interest'), the formula I=P·r·t, and use cases like short-term loans, savings accounts, and homework. It distinguishes from sibling calculators (e.g., compound interest) by naming the formula.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly mentions appropriate use cases ('short-term loans, basic savings accounts, homework') and references 'list_bundles' for related calculators. While it does not explicitly state exclusions, the context implies when this tool is preferred over compound interest alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sleep_cyclesBInspect
Estimate optimal bedtime or wake time based on 90-min sleep cycles. Use for sleep optimization. Inputs: target wake or bedtime. Returns 4-6 cycle recommendations. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bedtime | Yes | Bedtime HH:MM | |
| wake_time | Yes | Wake time HH:MM |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavior. It claims inputs are 'target wake or bedtime' (implying only one needed), but the schema requires both bedtime and wake_time. This contradiction misleads the agent and undermines trust.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, efficiently covering purpose and a hint about related tools. It could benefit from listing parameters explicitly.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists, return values are partly explained ('returns 4-6 cycle recommendations'). However, the contradiction about required inputs makes the description incomplete and confusing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds that inputs are for target wake or bedtime, providing context beyond the schema, but the contradiction reduces clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it estimates optimal bedtime or wake time based on 90-minute sleep cycles, a specific verb and resource. It distinguishes from sibling calculators by focusing on sleep optimization.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for sleep optimization,' providing clear context. It mentions related tools via 'list_bundles,' but does not explicitly state when not to use or alternatives within this context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_slopeAInspect
Compute slope as percentage, angle in degrees, and ratio. Use for ramps, roofs, or terrain analysis. Inputs: rise, run. Returns slope% and angle°. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| run_m | Yes | Horizontal run m | |
| rise_m | Yes | Vertical rise m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It states the computation (slope percentage and angle) from rise and run, and implies non-destructive behavior. However, it does not clarify the fate of the ratio mentioned in the first sentence, nor does it discuss edge cases (e.g., very small run values are handled by schema minimum). The behavioral disclosure is adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences and concise. It front-loads the purpose, then use cases, then inputs/outputs. However, the inclusion of 'ratio' in the first sentence that is not reflected in the output listing is an error that slightly mars conciseness. Overall, it is well-structured and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity and existence of an output schema (which can describe return values), the description is reasonably complete. It covers purpose, use cases, inputs, and outputs. The omitted ratio from the output list is a minor gap but not critical. The reference to list_bundles adds contextual awareness for related tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters (rise_m, run_m) including units and a minimum for run_m. The description adds only the generic names 'rise' and 'run', which is redundant. It adds no new meaning beyond the schema, so the baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes slope as percentage, angle in degrees, and ratio for ramps, roofs, or terrain analysis. The verb 'Compute' and resource 'slope' are specific. However, there is a minor inconsistency: the first sentence includes 'ratio' but later says 'Returns slope% and angle°', which may cause confusion. Overall purpose is clear and distinguishes from many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by stating use cases 'for ramps, roofs, or terrain analysis' and references list_bundles for related construction calculators. It does not explicitly mention when not to use this tool or direct alternatives like calculate_drain_slope among siblings. The guidance is adequate but lacks explicit exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_smoking_savingsAInspect
Compute money and health time saved by quitting smoking. Use for motivation and budgeting. Inputs: cigarettes/day, pack price. Returns daily/monthly/yearly savings and life-time recovered. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| pack_price | Yes | Price per pack | |
| cigarettes_per_day | Yes | Cigarettes smoked per day | |
| cigarettes_per_pack | No | Cigarettes per pack |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations were provided, so the description carries the full burden. It discloses that it returns daily/monthly/yearly savings and life-time recovered, and lists required inputs. However, it does not mention side effects, idempotency, or authorization needs, but for a calculation tool this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first states the purpose and use case, the second lists inputs, outputs, and a pointer to a related tool. It is concise, front-loaded, and every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given that the tool has an output schema and is a simple calculator, the description is complete. It lists inputs and outputs, states the purpose, and provides a reference to a related tool. No gaps are apparent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for parameters. The description mentions 'cigarettes/day' and 'pack price' (the required ones) but omits the optional 'cigarettes_per_pack' parameter. It adds value by summarizing the output but doesn't fully detail all parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Compute money and health time saved by quitting smoking.' It specifies use cases ('for motivation and budgeting'), inputs, and outputs, making it easily distinguishable from sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for motivation and budgeting' and provides a pointer to a related tool ('See list_bundles for related 'sante' calculators'). It lacks explicit when-not-to-use advice, but the context is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_soil_ph_amendmentAInspect
Compute lime or sulfur amount to shift soil pH to a target. Use for gardening. Inputs: current pH, target pH, area m². Returns amendment type and kg/m². See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| area_m2 | Yes | Garden area m² | |
| target_ph | Yes | Target pH | |
| current_ph | Yes | Current soil pH |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions the tool returns amendment type and kg/m², but lacks details on assumptions, precision, or edge cases (e.g., pH out of range). For a simple calculator, this is adequate but not richly transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with the main action, followed by inputs and output references. Every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema, the description adequately covers purpose, inputs, and output. It also mentions related tools via list_bundles, providing further context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers 100% of parameters with descriptions. The description adds value by stating 'Returns amendment type and kg/m²', which is not in the input schema. It also clarifies the area unit as m², which the schema does not specify.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes lime or sulfur amounts to shift soil pH, specifying it is for gardening. It lists inputs and outputs, and distinguishes itself from the vast array of other calculate tools by its domain and function.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for gardening', providing clear context. It also points to list_bundles for related calculators in 'astronomie-nature', offering guidance on alternatives. However, it does not explicitly state when not to use or exclude other tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_solar_panel_outputCInspect
Estimate solar panel daily and yearly energy output by location and system size. Use for solar installation sizing. Inputs: kW capacity, latitude, panel orientation, shading %. Returns kWh/day and kWh/year. See list_bundles for related 'energie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| area_m2 | No | Panel surface area in m2 (optional, informational) | |
| panel_watt_peak | Yes | Total peak power of the installation in Watts (Wp) | |
| hours_sun_per_day | No | Average peak sun hours per day (default 4) | |
| efficiency_loss_pct | No | System efficiency loss percentage (default 15%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavior. It mentions returns kWh/day and kWh/year, but it omits important details such as assumptions (e.g., average sun hours), limitations, or that location parameters are not actually taken as input. The description misrepresents inputs, reducing transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences and front-loads the purpose. However, the second sentence lists inputs that are inconsistent with the schema, which wastes space and could mislead.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the existence of an output schema, the description does not need to detail return values. However, it fails to provide sufficient context for correct usage, particularly due to the input mismatch and lack of differentiation from sibling tools. The context signals show many similar calculators, but the description offers no guidance on choosing this one.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes parameters. The description claims inputs like 'kW capacity, latitude, panel orientation, shading %' which do not correspond to actual parameters (panel_watt_peak, area_m2, hours_sun_per_day, efficiency_loss_pct). This misalignment degrades rather than adds value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states the tool estimates daily and yearly energy output for solar installation sizing, which provides a clear verb-resource pair. However, there is a mismatch: the description mentions 'latitude, panel orientation, shading %' as inputs, but these are absent from the schema, causing potential confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for solar installation sizing' and references related calculators via list_bundles, but does not specify when to prefer this tool over the many other calculate_* siblings. No explicit exclusions or alternatives are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_solar_roiAInspect
Compute solar panel return on investment over their lifetime. Use for energy audit. Inputs: install cost, kW capacity, location production, kWh price. Returns ROI years and lifetime savings. See list_bundles for related 'energie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| price_kwh | No | Electricity price EUR/kWh | |
| annual_kwh | Yes | Annual production kWh | |
| system_cost | Yes | Total system cost EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It implies a safe, read-only computation but does not explicitly state that it does not modify data or require authentication. For a calculator, this is adequate but not explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences, front-loaded with main purpose, no redundant information. Efficient and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given a simple calculator with full schema and output schema, the description covers purpose, inputs, and outputs (ROI years, lifetime savings). It does not detail return format or assumptions, but is sufficient for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions. The description adds real-world context ('install cost', 'kW capacity', 'location production') but there is a slight mismatch with schema parameter names. Baseline 3 is appropriate as schema already defines parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states verb 'Compute', resource 'solar panel return on investment', and scope 'over their lifetime'. It distinguishes from generic 'calculate_roi' sibling by specifying solar and mentioning 'Use for energy audit'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for energy audit' and refers to related calculators via 'See list_bundles for related "energie" calculators', providing context and alternatives. However, it does not explicitly compare with similar tools like 'calculate_roi'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_solution_dilutionAInspect
Compute lab solution dilution C1·V1=C2·V2. Use for stock-to-working solution prep. Inputs: stock concentration, target concentration, target volume. Returns stock volume and diluent volume. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| c1 | Yes | Initial concentration mol/L | |
| c2 | Yes | Target concentration mol/L | |
| v1_ml | Yes | Initial volume mL |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden but only states the function and return values. It does not disclose any computational constraints, rounding behavior, or side effects. A simple read-only calculator assumption is inferred but not confirmed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading the core purpose and usage. It efficiently lists inputs and outputs and cross-references related tools. No extraneous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema, the description is largely sufficient. However, the minor parameter labeling issue and lack of behavioral nuance slightly reduce completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% but the description introduces confusion: it lists 'target volume' whereas the schema defines v1_ml as 'Initial volume mL'. This mismatch could mislead an agent about which parameter corresponds to which input. The description does not explain the formula's algebraic relationship.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute lab solution dilution C1·V1=C2·V2' with a specific verb and resource, and differentiates from siblings by mentioning 'list_bundles for related science calculators'. It explicitly lists inputs and outputs.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states 'Use for stock-to-working solution prep', providing clear usage context. It also cross-references list_bundles for alternatives, though does not specify when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_speed_distance_timeAInspect
Solve speed/distance/time — provide any 2 of 3 values to compute the missing one. Returns: {error}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| speed | No | Speed in km/h | |
| distance | No | Distance in kilometers | |
| time_minutes | No | Time in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions 'Returns: {error}' but does not explain success behavior or side effects. It fails to clarify that this is a read-only, deterministic calculation with no state changes. The mention of an error return is vague and potentially misleading.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first covers purpose and usage, second points to siblings. It is front-loaded and efficient. However, the 'Returns: {error}' fragment is awkwardly placed and could be integrated into a structured note about return values.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 optional parameters, output schema exists), the description covers the core logic. However, it does not specify what happens if all three or only one value is provided, nor does it explain the error return in detail. The presence of an output schema mitigates the need for return value explanation, but validation rules are missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage (speed, distance, time with units). The description adds value by clarifying the relationship that any two of three are needed. Beyond the schema, this usage context helps the agent understand parameter combinations. No further detail on each parameter is necessary.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool solves speed/distance/time problems by computing the missing value from any two of three inputs. It uses specific verbs ('solve', 'compute') and the resource is well-defined. It distinguishes itself from other calculate_* siblings by focusing on this specific relationship.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'provide any 2 of 3 values to compute the missing one,' which is a concise usage rule. It also suggests viewing list_bundles for related sport calculators, aiding tool selection. However, it doesn't explicitly exclude other use cases or specify when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_speed_of_soundAInspect
Compute speed of sound in air at a given temperature. Use for physics or audio engineering. Formula: c=331.3+0.606·T_C. Inputs: temperature °C. Returns speed in m/s. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| temperature_c | Yes | Celsius |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present, so the description carries full transparency burden. It discloses the return unit (m/s), the formula, and the input format (temperature°C). It does not mention limitations (e.g., air only) or assumptions, but the formula implies the ideal gas dependency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with purpose, followed by usage, formula, and a reference to related tools. Every sentence is essential and concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter, output schema present), the description adequately covers purpose, input, formula, and unit. It could mention that the calculation assumes air as the medium, but overall it is sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a well-described parameter. The description adds the formula linking temperature to speed, providing semantic context beyond the schema's basic description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute speed of sound in air at a given temperature' with a specific verb and resource. It provides the formula and mentions applicable fields (physics, audio engineering), distinguishing it from hundreds of sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises use for physics or audio engineering and references 'list_bundles' for related science calculators, offering context and alternative guidance. However, it does not explicitly state when NOT to use the tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sphereAInspect
Compute sphere volume V=(4/3)πr³ and surface area A=4πr². Use for ball, tank, or astronomy problems. Inputs: radius. Returns volume and area. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| radius | Yes | Radius |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the behavioral transparency burden. It states the tool computes and returns volume and area but does not mention that radius must be non-negative (schema already covers this) or other behavioral details like rounding or precision. Adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, front-loading key information (function and formulas) in the first sentence, followed by use cases and a reference to related tools. No superfluous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator tool with one parameter, the description sufficiently explains what the tool does, what it computes, and provides usage context. Having an output schema reduces the need to detail return values.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description only repeats 'Inputs: radius' which adds no additional meaning beyond the schema's parameter description. With 100% schema coverage, a baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool computes sphere volume and surface area using the formulas V=(4/3)πr³ and A=4πr², clearly distinguishing it from siblings like calculate_volume or calculate_cylinder.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases (ball, tank, astronomy) and references list_bundles for related math calculators, providing context for when to use. However, it lacks explicit exclusion of alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_spring_constantAInspect
Compute spring constant k from Hooke's law F=k·x. Use for physics or mechanical design. Inputs: force N, displacement m. Returns spring constant N/m. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| force_n | Yes | Applied force N | |
| displacement_m | Yes | Displacement m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It states the formula and output unit but does not discuss edge cases (e.g., zero displacement), error handling, or limitations. Minimal behavioral disclosure beyond the core function.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose, then usage, then input/output. The reference to list_bundles is a slight deviation but does not bloat. Efficient and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity, two parameters, and an output schema exists, the description covers the essential calculation formula, usage domain, and input/output units. No gaps for basic use cases.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, both parameters are described with name and unit. The description restates units but adds no additional semantics beyond the schema. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes the spring constant using Hooke's law, specifying the formula and output unit. However, it does not explicitly differentiate it from other physics-related calculators in the sibling list, which share similar naming patterns.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises use for physics or mechanical design, providing clear context. It does not mention when not to use or list alternatives, but the guidance is sufficient for typical scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_staircaseAInspect
Calculate staircase dimensions using Blondel formula. Returns: {step_height_cm, giron_cm, blondel, blondel_ok}. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| total_height_cm | Yes | Total height cm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description carries full burden. It discloses the formula and return fields, which is sufficient for a read-only calculator. However, it does not explicitly state that it does not modify data, but that is implied by its nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description consists of two concise sentences: one stating the purpose and formula, and another listing return values and suggesting related tools. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with one parameter and a clearly described return value, the description is complete. It references the related bundle, fulfilling any need for context among many sibling tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with a single parameter described as 'Total height cm'. The description adds no additional meaning beyond the schema, so it meets the baseline for high schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate staircase dimensions using Blondel formula.' It specifies the verb, resource, and method, effectively distinguishing it from siblings like calculate_concrete_stairs.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests seeing list_bundles for related 'construction' calculators, implying a category but does not explicitly state when to use this tool over alternatives like calculate_concrete_stairs or other dimension calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_staking_rewardsCInspect
Calculate staking rewards with optional compounding for a given APY and duration. Returns: {initial_amount}. See list_bundles for related 'crypto' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| amount | Yes | Initial staking amount in coins or fiat | |
| apy_pct | Yes | Annual Percentage Yield in percent | |
| compounding | Yes | Compounding frequency | |
| duration_days | Yes | Staking duration in days |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It states a pure calculation, implying no side effects, but doesn't disclose assumptions (e.g., no fees, simple interest model) or limitations. The placeholder return format also reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with purpose. However, the second sentence contains a placeholder 'Returns: {initial_amount}' which is uninformative and reduces clarity. It could be more concise by removing that placeholder.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists but description's attempt to explain return values is erroneous (returns initial amount, not rewards). Lacks details on formula, assumptions, or precision. For a calculation tool, it should disclose what the result actually represents.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for all four parameters. The description adds 'with optional compounding' but doesn't provide new meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states 'Calculate staking rewards' with parameters APY, duration, compounding. However, the return statement 'Returns: {initial_amount}' is ambiguous and may confuse about what the tool outputs. While specific to staking, it doesn't distinguish from other crypto calculation tools like calculate_crypto_profit_loss.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool vs alternatives. The only hint is 'See list_bundles for related crypto calculators', which is vague and doesn't help with decision-making. No when-not or exclusions provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_stamp_duty_ukCInspect
Compute UK Stamp Duty Land Tax (SDLT). Use for UK home buyers. Inputs: property price, first-time-buyer flag, second-home flag. Returns SDLT due and effective rate. See list_bundles for related 'finance-uk' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| price | Yes | Property purchase price in GBP | |
| first_time_buyer | No | Whether buyer is a first-time buyer (default false) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden for behavioral disclosure. It mentions inputs and outputs but lacks details on side effects, prerequisites, or limitations (e.g., read-only, rate limits).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but contains inaccuracy (non-existent parameter), which hurts even a concise structure. Every sentence should be accurate; this one fails.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with full schema coverage, but the description's error (second-home flag) and lack of output schema details (though output schema exists) make it incomplete for reliable agent usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% parameter documentation, so baseline is 3. However, the description erroneously mentions a 'second-home flag' parameter that does not exist in the schema, potentially misleading the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes UK Stamp Duty Land Tax for home buyers, distinguishing it from sibling tools like other UK tax calculators. However, it mentions a non-existent 'second-home flag' parameter, which slightly undermines clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for UK home buyers' and refers to list_bundles for related calculators, providing context. However, it does not explicitly state when not to use or offer alternatives, leaving room for ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_star_magnitude_distanceBInspect
Calculate star distance from apparent and absolute magnitude. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| absolute_magnitude | Yes | Absolute magnitude (M) | |
| apparent_magnitude | Yes | Apparent magnitude (m) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavior. It states 'calculate star distance' but does not mention any side effects, required permissions, rate limits, or output specifics (e.g., units of distance). Since there is an output schema, some behavior may be inferred, but the description adds minimal context beyond the basic action.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first clearly states the purpose, and the second provides a useful cross-reference to related calculators. It is free of unnecessary words and front-loaded with essential information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given a simple tool with two parameters and an existing output schema, the description is adequate but lacks important contextual details such as units of the result (parsecs vs light-years), example input values, and the underlying formula. It is not fully complete for an agent to confidently use without further inference.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and both parameters are already described in the schema. The description adds no additional context about parameter semantics, such as valid ranges, typical values, or constraints. With full schema coverage, a baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Calculate star distance from apparent and absolute magnitude.' This is a specific verb and resource. It is not a tautology and distinguishes from many sibling calculators, though it does not explicitly differentiate from other distance calculation tools like calculate_light_year_distance.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description includes 'See list_bundles for related 'astronomie-nature' calculators,' which hints at alternatives but does not provide explicit when-to-use or when-not-to-use guidance. There are no exclusions or comparisons to clarify when this tool is preferred over similar ones.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_statisticsAInspect
Calculate descriptive statistics: mean, median, mode, std dev, quartiles. Returns: {count, std_deviation, min, max, range, iqr}. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| values | Yes | Array of numbers |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It declares that the tool returns specific statistics but does not disclose any behavioral traits such as read-only nature, error handling (e.g., empty arrays), or performance characteristics. The behavior is generally predictable but under-described.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, with no filler. Every sentence adds value: first states purpose, second lists return fields and directs to sibling tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
An output schema exists, and the description lists return fields, which is helpful. Given the complexity of statistics and many sibling calculators, the description could be more specific about when to use this over other statistics tools, but it adequately covers the essentials.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, and the parameter is straightforward (array of numbers). The description does not add additional meaning beyond what the schema provides, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates descriptive statistics (mean, median, mode, std dev, quartiles) and lists return fields. It distinguishes itself from numerous sibling calculators by focusing on 'descriptive statistics' and referencing 'education' calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions related 'education' calculators via list_bundles, giving a hint about context. However, it lacks explicit guidance on when to use this tool vs alternatives like calculate_average, or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_string_tensionAInspect
Calculate guitar or bass string tension in pounds, kilograms and Newtons. See list_bundles for related 'musique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| frequency_hz | Yes | Target tuning frequency in Hz (e.g. 329.63 for E4) | |
| gauge_inches | Yes | String gauge in inches (e.g. 0.010 for a light gauge high E) | |
| scale_length_inches | Yes | Instrument scale length in inches (e.g. 25.5 for Fender Stratocaster) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of disclosing behavioral traits. It only states the calculation and output units, but does not indicate that it is a pure read operation (e.g., no side effects, no destructive behavior). The agent cannot infer safety or performance characteristics from this description.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences, front-loaded with the primary purpose. Every word serves a purpose—no redundancy. Highly efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has an output schema (exists but not shown) and only 3 parameters. The description covers purpose, output units, and a pointer to related tools. However, it lacks behavioral transparency, which is needed for completeness. Adequate but not thorough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with each parameter described in the schema (scale_length_inches, frequency_hz, gauge_inches). The description adds no additional meaning beyond the schema, only mentioning output units. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description specifies the tool calculates string tension for guitar or bass, with output in multiple units (pounds, kilograms, Newtons). This clearly identifies the tool's function and distinguishes it from the many sibling 'calculate_*' tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description states when to use (for string tension calculations) and refers to 'list_bundles' for related 'musique' calculators, providing context for alternatives. It does not explicitly state when not to use, but the purpose is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_student_loan_repaymentCInspect
Compute student loan repayment schedule and total interest. Use for graduates planning repayment. Inputs: loan amount, interest rate %, term years. Returns monthly payment, total paid, total interest. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_rate | Yes | Annual interest rate percent | |
| loan_amount | Yes | Loan amount EUR | |
| monthly_payment | Yes | Monthly payment EUR |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It describes return values (monthly payment, total paid, total interest) but fails to mention behavioral aspects like assumptions or side effects. More critically, it contradicts the input schema by listing 'term years' instead of the actual parameter 'monthly_payment', reducing transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief (two sentences plus a reference) but sacrifices accuracy for brevity. The inclusion of incorrect parameter information negates the benefit of conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (loan repayment calculation), the description is incomplete. It does not explain the repayment schedule type or assumptions, and the input mismatch creates confusion. The existence of an output schema is noted but not detailed.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
While the schema provides 100% coverage of parameter descriptions, the description adds misleading information: it states inputs include 'term years', which is not a parameter, and omits the required 'monthly_payment'. This misdirection makes the description harmful rather than helpful, scoring a 1.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Compute' and the resource 'student loan repayment schedule and total interest'. It identifies the tool's function, though it does not explicitly differentiate it from sibling tools like calculate_loan_payment or calculate_us_student_loan.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description offers usage context ('Use for graduates planning repayment') and directs users to list_bundles for related calculators. However, it does not provide when-not-to-use guidance or direct comparisons to sibling loan calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_study_scheduleCInspect
Generate a study schedule based on exam date and topics. Returns: {total_hours_needed, daily_hours_needed, feasible}. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| exam_date | Yes | Exam date YYYY-MM-DD | |
| topics_count | Yes | Number of topics to study | |
| hours_per_topic | Yes | Hours needed per topic |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses return fields (total_hours_needed, daily_hours_needed, feasible) but lacks details on side effects, authorizations, or assumptions. No annotations are provided, so the description carries full burden, yet it does not state whether the tool is read-only or any constraints.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (two sentences) and front-loaded with the purpose. The cross-reference to list_bundles is efficient, though it could be more integrated.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the basic purpose and return values but lacks details on calculation assumptions (e.g., how feasible is determined) and does not provide an output schema. Given the tool's simplicity, it is adequate but not comprehensive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with each parameter having a clear description. The description adds no additional meaning beyond what the schema provides, so a baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool generates a study schedule based on exam date and topics. It mentions related calculators via list_bundles, providing some context, but does not explicitly differentiate from similar siblings like calculate_anything.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not specify when to use this tool versus alternatives. The mention of list_bundles for related calculators is indirect and insufficient guidance for an agent to choose this tool correctly.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sun_exposureAInspect
Calculate safe sun exposure time based on UV index and Fitzpatrick skin type. Returns: {skin_description, safe_exposure_minutes, with_spf30_minutes, uv_risk_level, recommendations}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| uv_index | Yes | UV index at destination (1–11+) | |
| skin_type | Yes | Fitzpatrick skin type: 1=very fair, 6=very dark |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavior. It describes return fields but does not mention side effects, rate limits, or read-only nature. For a calculator, this is adequate but not exceptional. Score 3 for basic transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise with two sentences, front-loaded with purpose, and includes return fields and a sibling reference. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with 2 fully described parameters and an output schema mentioned, the description is sufficiently complete. It lacks usage examples or edge cases but covers the essentials.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents parameters. The description adds no additional meaning beyond what the schema provides. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates safe sun exposure time based on UV index and skin type, and lists return fields. It lacks explicit differentiation from sibling calculators like calculate_sunscreen_reapply, but mentions a related bundle. Score 4 for specific verb+resource but limited sibling distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when UV index and skin type are known, and hints at alternatives via 'see list_bundles'. However, no explicit when-not or exclusion criteria are provided. Usage guidance is implicit but not robust.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sunrise_approxCInspect
Estimate sunrise/sunset times for a latitude on a given day of year. Use for astronomy or outdoor planning. Inputs: latitude, day of year. Returns sunrise/sunset hours, daylight duration. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| latitude | Yes | Latitude | |
| day_of_year | Yes | Day of year (1-366) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral details. It mentions 'estimate' (implying approximation) and the outputs (sunrise/sunset hours, daylight duration), but lacks information on accuracy, limitations (e.g., does it account for refraction?), or dependencies like time zone.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loading the purpose and inputs. It is efficient and avoids unnecessary detail, though the reference to 'list_bundles' could be considered tangential.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description mentions outputs (sunrise/sunset hours, daylight duration), and with an output schema present, further detail on return values is less critical. However, it does not address how this tool relates to similar calculators like 'calculate_sunrise_sunset', which is a notable gap given the large number of sibling tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already describes both parameters (latitude and day_of_year) with ranges. The description merely lists these inputs without adding any additional semantic context, such as valid units or format hints. Since schema coverage is 100%, baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it estimates sunrise/sunset times for a given latitude and day of year, and mentions use cases like astronomy or outdoor planning. However, it does not differentiate from the sibling tool 'calculate_sunrise_sunset', which likely serves a similar purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It suggests using for astronomy or outdoor planning but provides no guidance on when not to use this tool or alternatives. The description directs to 'list_bundles' for related calculators but does not explicitly compare to siblings like 'calculate_sunrise_sunset'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sunrise_sunsetBInspect
Approximate sunrise and sunset times based on latitude and day of year. Returns: {sunrise_solar_time, sunset_solar_time, day_length_hours}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| latitude | Yes | Latitude in degrees | |
| day_of_year | Yes | Day of year (1-365) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It notes 'approximate' and lists return fields, but does not disclose important behavioral details such as accuracy, time zone handling (solar time implications), or any prerequisites. This is insufficient for a scientific calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first defines purpose and output, the second directs to related tools. No wasted words; front-loaded with essential information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Although there is an output schema (not shown), the description adds the return fields. However, for a scientific calculation, more context about accuracy, time conventions, and edge cases would be beneficial given the lack of annotations.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers both parameters with descriptions (latitude and day_of_year). The description repeats these inputs without adding new meaning. With 100% schema coverage, the baseline is 3, and the description does not elevate beyond that.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that it calculates approximate sunrise and sunset times using latitude and day of year, and lists the return fields. However, it does not distinguish this tool from other astronomy-related calculators like calculate_sun_exposure, though it references list_bundles for related tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal guidance on when to use this tool. It mentions 'approximate' but does not specify limitations or alternatives. The reference to list_bundles is a pointer to find related calculators but lacks direct when-to-use or when-not-to-use instructions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_sunscreen_reapplyCInspect
Compute when to reapply sunscreen based on SPF, activity, and water exposure. Use for sun safety. Inputs: SPF, skin type, activity (sweat/swim), UV index. Returns next reapply time. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| spf | Yes | SPF factor | |
| uv_index | Yes | Current UV index | |
| skin_type | Yes | Fitzpatrick skin type 1-6 |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. Description mentions return value but does not disclose if tool modifies data, requires authentication, or any side effects. Also misleading about inputs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is short (two sentences) but includes inaccurate information. Could be more concise and accurate.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists but description still omits how activity and water exposure impact calculation, which are central to the tool's purpose. Incomplete given tool complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Despite 100% schema coverage, description adds a non-existent parameter (activity) and misrepresents inputs. This contradicts the schema and can lead to erroneous tool calls.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description states it computes sunscreen reapply time, but lists inputs (activity, water exposure) that are not in the schema, causing confusion about actual parameters.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Only says 'Use for sun safety' and references list_bundles for related calculators. No explicit when-to-use vs when-not, no alternatives given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_surface_carrezAInspect
Calculate Carrez law surface area (French legal measurement). Returns: {carrez_surface_m2, total_surface_m2, excluded_m2, included_rooms, excluded_rooms, note}. See list_bundles for related 'immobilier' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| rooms | Yes | List of rooms with area and ceiling height |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses the output fields, which is helpful, but does not mention any behavioral traits like idempotency, data persistence, authentication needs, or error conditions. For a simple calculator tool, this is adequate but could be more transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: one sentence stating the purpose and return fields, followed by a link to related tools. Every sentence adds value. It is front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema, the description provides sufficient context: purpose, return fields, and related tools. However, it lacks prerequisites or edge cases, and behavioral transparency is minimal. For a straightforward calculator, it is moderately complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, so the schema thoroughly documents the 'rooms' parameter with its sub-properties. The description adds no extra meaning beyond what the schema provides, so a baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the specific purpose: 'Calculate Carrez law surface area (French legal measurement).' It uses a specific verb and resource, and distinguishes from siblings by referencing 'immobilier' calculators via list_bundles. The tool name itself is descriptive, and the description reinforces it.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives a pointer to related tools ('See list_bundles for related 'immobilier' calculators'), which helps an agent understand when to use this versus alternatives. However, it does not explicitly state when not to use or provide exclusions, but the context is clear enough given the specificity of the Carrez law measurement.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swimming_paceAInspect
Calculate swimming pace per 100m and SWOLF efficiency estimate. Returns: {pace_per_100m_min, pace_formatted, swolf_estimate}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| distance_m | Yes | Distance swum in meters | |
| time_minutes | Yes | Total swim time in minutes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description bears full responsibility. It correctly describes the tool as calculating pace and SWOLF, returning specific fields. However, it lacks explanation of what SWOLF represents or edge cases.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with the main action, and includes a return value hint and sibling reference. Every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the low complexity (2 params, simple operation), the description is complete. Output schema exists, and the description mentions the return fields. No gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description does not add new meaning beyond the schema: it only repeats that it calculates pace from distance and time.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it calculates swimming pace per 100m and SWOLF efficiency estimate. The verb 'calculate' and resource are precise, and it distinguishes itself from many sibling calculators by specifying a niche.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides a vague reference to 'list_bundles' for related sport calculators, but does not explicitly state when to use this tool vs others. No exclusions or clear guidance on context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swiss_income_taxBInspect
Calculate Swiss income tax — federal + estimated cantonal tax. Returns: {income, federal_tax, federal_marginal_rate_pct, cantonal_tax_estimate, cantonal_rate_pct, effective_rate_pct, ...}. See list_bundles for related 'finance-suisse' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| canton | No | Canton of residence | geneve |
| income | Yes | Annual taxable income in CHF |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It mentions 'estimated cantonal tax' but does not disclose other behavioral traits such as prerequisites, limitations, or error conditions. The partial output listing is useful but insufficient for full transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the core purpose, and efficiently uses a second sentence for return fields and a cross-reference. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (tax calculation) and the presence of an output schema (not shown), the description covers key return fields and acknowledges estimation. It points to related tools via list_bundles. It could further clarify methodology or limitations, but is fairly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds no new parameter details but provides return structure information, which is valuable. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates Swiss income tax, covering federal and estimated cantonal tax, and lists return fields. However, it does not explicitly distinguish from other country-specific tax calculators among siblings, which would warrant a 5.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not provide guidance on when to use this tool versus alternatives (e.g., other country tax calculators). The mention of 'see list_bundles' hints at related tools but does not offer explicit usage context or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swiss_lppAInspect
Calculate Swiss occupational pension (LPP / 2e pilier) contributions by age bracket. Returns: {eligible, note}. See list_bundles for related 'finance-suisse' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | Age of employee | |
| gross_annual | Yes | Annual gross salary in CHF |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It states the tool calculates and returns eligibility info, implying a non-destructive read operation. However, it does not disclose any potential side effects, authorization needs, or limits beyond the calculation itself.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: one for purpose, one for output format and reference. No extraneous information, efficiently structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the basic purpose and output, and references related tools via list_bundles. For a simple calculation tool, this is nearly complete, though the meaning of 'eligible' and 'note' is not explained.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds no additional parameter meaning beyond 'by age bracket', which slightly differs from the exact age input. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates Swiss occupational pension (LPP) contributions by age bracket, with a specific verb and resource. It distinguishes itself from sibling tools like Swiss income tax calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related calculators' but does not explicitly specify when to use this tool versus alternatives or provide exclusion conditions. Usage guidance is implied but not detailed.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swiss_pillar3aAInspect
Calculate Swiss pillar 3a tax savings (3e pilier lie). Returns: {annual_contribution, net_cost_after_saving, max_employee_2026, max_self_employed_2026}. See list_bundles for related 'finance-suisse' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| marginal_tax_rate | Yes | Marginal income tax rate in % (federal + cantonal combined) | |
| annual_contribution | Yes | Annual contribution to pillar 3a in CHF (max 7056 for employees, 35280 for self-employed) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description states it calculates and returns fields. It is a non-destructive computation tool, but lacks disclosure of assumptions (e.g., tax year, residency) or potential constraints. Adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, one on purpose and one on return fields with sibling reference. No redundancy, efficient communication.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Output schema exists (though not detailed in input) but description does not clarify meaning of return fields (e.g., net_cost_after_saving) or calculation context. Missing details like tax year assumption. Adequate but incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage with clear descriptions for both parameters. Description adds no additional meaning beyond what schema provides. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it calculates Swiss pillar 3a tax savings, specifies return fields, and uses '3e pilier lie' for clarification. It distinguishes from siblings by referencing related calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance. It suggests seeing list_bundles for related calculators, implying selection guidance, but does not specify circumstances that favor this tool over alternatives like calculate_swiss_income_tax.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swiss_salaryAInspect
Convert Swiss gross monthly salary to estimated net salary. Returns: {gross_monthly, avs_ai_apg_5_3pct, ac_chomage_1_1pct, lpp_2e_pilier_10pct, lamal_health_fixed, net_monthly, ...}. See list_bundles for related 'finance-suisse' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gross_monthly | Yes | Gross monthly salary in CHF |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must fully convey behavioral traits. It discloses that the tool returns an estimate and lists output fields, but it does not mention whether the tool is read-only, has side effects, or any prerequisites (e.g., need for swiss tax knowledge). The output overview is useful but incomplete.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is exceptionally concise: one sentence stating the purpose, followed by a list of output fields and a helpful reference to related tools. No filler words, and critical information is front-loaded. The output fields are clearly enumerated.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description provides sufficient context for a simple calculation tool: input, output fields, and a hint about related calculators. However, it could be improved by explaining that the deductions are based on standard Swiss rates or that the result is an estimate. The output schema is not provided but the field list compensates.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers the single parameter (gross_monthly) completely and clearly describes it as 'Gross monthly salary in CHF'. The tool description adds no further semantic detail beyond confirming the parameter's role in the conversion. Since schema_coverage is 100%, baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: 'Convert Swiss gross monthly salary to estimated net salary.' It specifies the input type (Swiss gross monthly salary) and output format, and the mention of related 'finance-suisse' calculators helps distinguish it from other calculators in the sibling list.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when one has a Swiss gross monthly salary and wants an estimated net salary, but it does not explicitly state when to use this tool versus alternatives (e.g., for specific deduction breakdowns). The reference to 'list_bundles' for related calculators is helpful but not a clear usage guideline.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swiss_vatAInspect
Compute Swiss VAT (TVA/MWST) — convert between net (HT) and gross (TTC). Use for invoicing or expense reimbursements in Switzerland. Inputs: amount, rate (8.1, 3.8, 2.6, 0). Returns HT, TTC, and tax amount. See list_bundles for related 'finance-suisse' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | Input mode: ht=before tax, ttc=after tax | ht |
| rate | No | VAT rate: 2.6% (reduced), 3.8% (hotel), 8.1% (standard) | 8.1 |
| amount | Yes | Amount in CHF |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It describes the calculation functionality (conversion, returns) but does not disclose any side effects, mutability, or permissions. For a read-only calculator, this is acceptable but lacks explicit reassurance (e.g., statelessness).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences efficiently cover purpose, use case, and input/output summary. Front-loaded and no wasted words. Structure is optimal for quick comprehension.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity, full schema coverage, and presence of output schema, the description provides sufficient context. It explains functionality, use cases, and related tools. Slightly lacking in explicit exclusions or non-functional behavior.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds a high-level summary of inputs (amount, rate with listed values including '0' which is not in schema) and outputs, but does not significantly enhance understanding beyond the schema. There is a minor discrepancy (rate '0' not in schema enums).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes Swiss VAT and converts between net and gross, with specific use cases (invoicing, expense reimbursements) and country scope (Switzerland). It implicitly distinguishes from siblings by naming but does not explicitly contrast with other VAT calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives context for use (Swiss invoicing/reimbursements) and mentions related tools via 'See list_bundles', but does not explicitly state when not to use this tool or provide direct alternatives. The guidance is adequate but not thorough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_swiss_wealth_taxBInspect
Calculate Swiss wealth tax (impot sur la fortune) by canton. Returns: {net_wealth, tax_free_threshold, taxable_wealth, wealth_tax_rate_pct, annual_wealth_tax, note}. See list_bundles for related 'finance-suisse' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| canton | No | Canton of residence | geneve |
| net_wealth | Yes | Net wealth in CHF (assets minus debts) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description carries full burden. It doesn't disclose key behavioral traits like tax year applicability, rate recency, or whether it requires authentication. The return format is provided, but essential context for a financial tool is missing.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences with no wasted words. First sentence captures purpose, second lists return fields and cross-reference to bundles. Efficient and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple 2-parameter calculator with output format given in description and a bundle reference, it's fairly complete. Minor gap: lacks statement about tax year or data freshness, but not critical for basic usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions in the schema. The description adds no additional semantic meaning beyond what's in the schema, aligning with baseline for high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the specific action 'Calculate Swiss wealth tax' and the resource 'by canton'. The name and description differentiate from other Swiss calculators like income tax or LPP, providing a distinct purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. Only mentions 'see list_bundles for related calculators', which is vague and doesn't provide decision criteria for agent to select this tool over similar Swiss tax calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_tdeeBInspect
Calculate Total Daily Energy Expenditure from BMR and activity level. Returns: {tdee_kcal}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bmr | Yes | Basal Metabolic Rate in kcal | |
| activity_level | Yes | Activity level |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description bears full responsibility. It states the calculation and return value structure but does not disclose any behavioral traits (e.g., assumptions, formula type, side effects, or permission needs). Minimal beyond what schema already indicates.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences, first stating purpose directly, second providing a cross-reference. No unnecessary words or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple tool (2 parameters) and existence of an output schema, the description is marginally sufficient. It mentions the return value hint and links to related tools, but lacks explanation of the underlying formula or how to obtain BMR (e.g., from calculate_bmr).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Both parameters have 100% schema coverage with descriptions. The description merely repeats 'from BMR and activity level', adding no new meaning or constraints beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it calculates Total Daily Energy Expenditure from BMR and activity level. Verb and resource are specific. Reference to list_bundles for related calculators hints at sibling differentiation, but does not explicitly distinguish from similar 'calculate_*' tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies usage context by mentioning related 'sante' calculators via list_bundles, but no explicit when-to-use or when-not-to-use guidance, nor alternatives to this specific tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_telescope_magnificationAInspect
Compute telescope magnification, exit pupil, and field of view. Use for astronomy hobbyists. Inputs: telescope focal length, eyepiece focal length, eyepiece field of view. Returns magnification and exit pupil. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| eyepiece_mm | Yes | Eyepiece focal length mm | |
| focal_length_mm | Yes | Telescope focal length mm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description only states it 'computes' and 'returns' values. It does not disclose any side effects, permissions, rate limits, or other behavioral details beyond the basic computation, leaving a significant gap for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loaded with the core function, and contains no redundant information. It is efficient and easily scannable.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has an output schema (indicated but not shown), which reduces the need to describe return values. The description adequately covers inputs and outputs for a simple calculation tool, though the mention of 'eyepiece field of view' as an input not in the schema slightly detracts.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers 100% of parameters with descriptions, so the description adds minimal value. It mentions 'eyepiece field of view' as an input, but this parameter is not in the schema (schema only has focal_length_mm and eyepiece_mm), causing slight confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool computes telescope magnification, exit pupil, and field of view, clearly indicating the purpose. It also references a sibling tool (list_bundles) for related calculators, helping distinguish it from other tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'Use for astronomy hobbyists' and points to list_bundles for related calculators, but it does not provide explicit guidance on when to use this tool versus alternatives or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_tile_groutAInspect
Compute grout quantity for a tiling job. Use for renovation budget. Inputs: surface m², tile size, joint width. Returns grout kg needed. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| area_m2 | Yes | Area m² | |
| tile_cm | Yes | Tile size cm | |
| depth_mm | No | Joint depth mm | |
| joint_mm | No | Joint width mm |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses that the tool computes and returns grout kg, implying a read-only calculation. However, it does not detail behavior for invalid inputs, side effects, or safety considerations beyond the implied compute-and-return nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences plus a reference. It is front-loaded with the purpose. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description mentions the return value (grout kg) and references related tools. However, it omits assumptions (e.g., square tiles, uniform joints) and edge cases. With an output schema present, the burden is lower, but still somewhat incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with clear parameter descriptions. The description lists inputs but adds minimal additional meaning beyond summarizing the schema fields. Baseline 3 is appropriate as the schema already does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes grout quantity for tiling jobs and mentions its use in renovation budgets. It distinguishes itself from other calculation tools by specifying the resource (grout) and domain (tiling), but does not explicitly differentiate from similar tools like calculate_tile_quantity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context for use ('renovation budget') and points to list_bundles for related calculators, but does not explicitly state when to use this tool over alternatives or mention any prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_tile_quantityAInspect
Compute tiles needed including a waste margin (default 10%). Use for floor or wall tiling. Inputs: surface m², tile size, waste %. Returns tile count and surface ordered. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| area_m2 | Yes | Area m² | |
| tile_l_cm | Yes | Tile length cm | |
| tile_w_cm | Yes | Tile width cm | |
| waste_pct | No | Waste % |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It mentions the default waste percentage and that it returns tile count and surface ordered, providing good behavioral insight. No side effects are expected for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences, front-loading the action and default, then providing context and related tools. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description is adequate for a simple calculation tool: it states purpose, inputs, outputs, and default. It also directs to related tools. It could explain how waste is applied, but with output schema present, it is reasonably complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents parameters. The description paraphrases the inputs but does not add new meaning beyond what the schema provides, meeting the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes tile quantity including waste margin for floor or wall tiling. It is specific about the resource and operation, but does not explicitly differentiate from siblings like calculate_brick_count or calculate_tile_grout, though it points to list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates usage for floor or wall tiling and lists inputs, providing clear context. However, it lacks explicit guidance on when not to use or direct alternatives beyond referring to list_bundles.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_time_differenceDInspect
Compute the difference between two times or dates in seconds, minutes, hours, days. Use for project tracking, age, or scheduling. Inputs: start datetime, end datetime. Returns delta in multiple units. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| city1 | Yes | First city | |
| city2 | Yes | Second city |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description should compensate but does not. It mentions the return type ('delta in multiple units') but lacks details on side effects, authorization needs, or output structure, leaving behavioral transparency low.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but inaccurate and contradictory. Conciseness is only beneficial if the information is correct; here it misrepresents the tool, making it ineffective.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool expects city parameters but the description implies datetime inputs, so it fails to explain how to use the tool correctly. It also omits details about date/time format, output units, and differentiation from similar tools, making it incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description claims inputs are 'start datetime, end datetime', but the schema defines city1 and city2 as enum strings of city names. This direct contradiction means the description adds no value and actively misleads.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Compute the difference between two times or dates', which indicates a clear purpose, but the input schema expects city names (city1, city2) rather than datetime inputs, creating a contradiction that undermines clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides vague usage scenarios ('project tracking, age, or scheduling') and mentions a sibling tool group, but offers no explicit guidance on when not to use it or how it differs from the many time-related sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_time_signature_beatsAInspect
Calculate total beats and duration for a musical passage in bars. Returns: {time_signature}. See list_bundles for related 'musique' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bpm | Yes | Tempo in beats per minute | |
| bars | Yes | Number of bars | |
| beat_value | No | Note value of one beat (denominator of time signature, e.g. 4 for quarter note) | |
| beats_per_bar | No | Number of beats per bar (numerator of time signature) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavior. It states the tool returns a time signature but does not explain the output format, any side effects, or limitations. For a calculation tool, the behavior is safe but incompletely described.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences: one for purpose and one for pointing to related tools. Every word serves a purpose, and it is efficiently front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 4 parameters (all documented in schema) and an output schema present, the description is minimal. It does not explain the output structure (the {time_signature} placeholder) or how optional parameters affect results, leaving gaps for an agent to interpret.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters described. The description adds no additional meaning beyond the schema; it only mentions 'total beats and duration' without connecting them to specific parameters. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate total beats and duration for a musical passage in bars.' It specifies the resource (musical passage) and action (calculate beats and duration), and mentions the return value as time signature. This effectively distinguishes it from the many sibling calculators by domain.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides implicit usage context ('musical passage') and references list_bundles for related calculators, but it does not explicitly state when to use this tool versus alternatives nor mention any prerequisites or conditions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_timezone_convertAInspect
Convert a time between two UTC offsets accounting for date rollover. Use for international meetings. Inputs: time, from-utc, to-utc. Returns local time and date offset. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| time | Yes | Time to convert HH:MM | |
| to_offset | Yes | Target UTC offset hours | |
| from_offset | Yes | Source UTC offset hours (e.g. 1 for UTC+1) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears full responsibility for behavioral disclosure. It mentions date rollover and returns local time and date offset, but omits details on input validation (e.g., invalid times), potential side effects, authorization needs, or rate limits. The lack of such context reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (two sentences plus a brief list) and front-loads the core functionality and rollover handling. However, the list of inputs is somewhat redundant given the schema, and the reference to 'list_bundles' adds slight clutter. Overall efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the existence of an output schema (not shown here but indicated), the description adequately covers the conversion behavior and key features (date rollover, return of local time/offset). However, it does not address edge cases like times near boundaries or handling of non-existent times due to DST (though UTC offsets avoid that).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with each parameter (time, from_offset, to_offset) already described with type and constraints. The description adds minimal value by listing them as 'Inputs: time, from-utc, to-utc' which repeats the schema. Baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'convert' and the resource 'time between two UTC offsets', and adds the key distinction of 'accounting for date rollover' which differentiates it from siblings like 'calculate_timezone_offset' and 'calculate_time_difference'. The use case 'Use for international meetings' further clarifies purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests 'Use for international meetings' and directs to 'list_bundles' for related calculators, but does not explicitly state when NOT to use this tool or compare with alternatives like 'calculate_timezone_offset'. The usage context is implied but not strictly bounded.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_time_zone_differenceAInspect
Compute the time difference (hours) between two timezones. Use for international meetings. Inputs: timezone A, timezone B. Returns delta hours and current local times. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| city1 | Yes | First city | |
| city2 | Yes | Second city |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool returns delta hours and current local times, but does not disclose constraints (e.g., the limited set of cities from the enum) or any behavioral traits like daylight saving time handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at 2-3 sentences, front-loaded with the main purpose, and contains no extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description adequately covers basic functionality and return values. However, it omits the limitation of the city enum and does not clarify how this tool differs from similar sibling tools. Given the output schema exists, return details are less critical.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds minimal context ('Inputs: timezone A, timezone B'), which does not significantly enhance understanding beyond the schema's property descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes time difference between two timezones and mentions its use for international meetings. It distinguishes from siblings only indirectly by referencing 'list_bundles' for related calculators, but does not differentiate from similar sibling tools like 'calculate_time_difference' or 'calculate_timezone_convert'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a usage hint ('Use for international meetings') but no explicit guidance on when not to use or alternatives. Sibling tools exist for similar purposes without differentiation.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_timezone_offsetCInspect
Compute current UTC offset for a timezone, accounting for DST. Use for scheduling and date math. Inputs: timezone (IANA name). Returns UTC offset and DST status. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to_zone | Yes | Target time zone | |
| from_zone | Yes | Source time zone |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description mentions that the tool accounts for DST and returns UTC offset and DST status, but it lacks details on real-time behavior, rate limits, or any required authentication. No annotations are provided to supplement transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and somewhat structured with a purpose statement, usage note, and a pointer to related tools. However, it could be organized more effectively, and the IANA/abbreviation inconsistency wastes conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with two enumerated parameters and a stated return of UTC offset and DST status. The description covers the basics, but given the context of scheduling, more detail on DST handling and offset precision would be beneficial. The output schema is mentioned but not described in detail.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description says 'Inputs: timezone (IANA name)', but the input schema uses timezone abbreviations (e.g., UTC, CET, EST) rather than IANA names. This contradicts the description and could mislead an agent into using the wrong format. Despite 100% schema coverage, the description introduces confusion.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes the current UTC offset for a timezone, accounting for DST. It specifies the verb 'compute' and the resource 'UTC offset', and mentions use cases for scheduling and date math. However, it does not explicitly distinguish itself from sibling tools like calculate_timezone_convert or calculate_time_difference.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal usage guidance: 'Use for scheduling and date math.' It does not specify when not to use this tool, nor does it compare with alternative tools. The reference to 'See list_bundles for related temps-rh calculators' is vague and does not give clear directives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_tipAInspect
Compute restaurant tip and per-person split. Use for shared meals. Inputs: bill, tip %, people count. Returns tip, total, per-person share. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bill | Yes | Bill amount | |
| split | No | Number of people splitting | |
| tip_pct | No | Tip percentage |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden for behavioral disclosure. It adequately states what the tool computes (tip, total, per-person share) and implies a read-only calculation, but does not mention any side effects, prerequisites, or limitations beyond the schema constraints.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is exceptionally concise at two sentences, with the primary purpose stated first followed by inputs and outputs. Every sentence adds value, and there is no redundant or unnecessary information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple three-parameter calculator with a generated output schema, the description covers the essential functionality. It mentions inputs and outputs clearly. Minor improvement could be noting default values (e.g., tip_pct defaults to 15) or behavior for required fields.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description adds minimal value by listing inputs as 'bill, tip %, people count', which closely mirrors the schema property descriptions without providing deeper context or usage tips.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool computes restaurant tip and per-person split for shared meals, which is specific and actionable. However, it does not differentiate from sibling tools like 'calculate_tip_split' or 'calculate_tip_worldwide', which may have overlapping functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises 'Use for shared meals' and references related calculators via list_bundles. This provides context but lacks explicit guidance on when not to use this tool or specific alternatives beyond a generic cross-reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_tip_splitAInspect
Calculate tip and per-person amount for a restaurant bill. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| tip_pct | Yes | Tip percentage | |
| num_people | Yes | Number of people splitting | |
| bill_amount | Yes | Total bill amount |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries burden. Describes what it calculates but does not disclose return format or that it's side-effect free. Adequate for a calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two clear sentences: first states purpose, second gives pointer to sibling tool. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Has output schema (provided), so return values are covered. Description is sufficient for a simple calculator with well-documented parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with good descriptions for each parameter. Description adds no additional meaning beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clear verb ('Calculate') and resource ('tip and per-person amount for a restaurant bill'). Distinguishes from siblings by referencing 'list_bundles' for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies usage context (restaurant bill) but does not explicitly say when to use this tool versus other tip calculators or alternatives. Provides a pointer to list_bundles but lacks when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_tip_worldwideAInspect
Compute restaurant tip following local custom (US 18-22%, FR included, JP no tip, etc.). Use when traveling. Inputs: bill, country. Returns recommended tip and total. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| bill | Yes | Bill | |
| country | Yes | Country |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. Description discloses basic behavior (follows local custom, returns tip and total) but does not mention any additional traits such as rounding, assumptions, or limitations of the custom rules.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is two sentences plus a cross-reference, succinct and front-loaded with key purpose. Every sentence contributes value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with two parameters and assumed output schema, the description covers purpose, usage context, inputs, outputs, and a cross-reference. It lacks details on edge cases (e.g., countries not in enum) but is largely complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds context by providing examples of country values and what the tip covers, but does not significantly enhance meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states compute restaurant tip following local custom with examples (US 18-22%, FR included, JP no tip). It includes inputs and outputs. However, it does not explicitly distinguish from sibling tools like calculate_tip or calculate_tip_split.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description says 'Use when traveling' and lists countries, implying context. It references list_bundles for related calculators but does not provide explicit when-to-use or when-not-to-use guidance relative to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_torusAInspect
Compute torus volume V=2π²Rr² and surface area. Use for ring-shaped objects (donuts, inner tubes). Inputs: major radius R, minor radius r. Returns volume and area. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| major_r | Yes | Major radius (center to tube center) | |
| minor_r | Yes | Minor radius (tube radius) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description must disclose behavior. It mentions inputs and outputs but does not state that the tool is non-destructive, idempotent, or any constraints like value ranges beyond minimums. Adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences: formula, use case, inputs/outputs. No fluff, front-loaded with the core purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present (though not shown), the description adequately covers the tool's behavior for its domain. Missing details like edge cases or input validation but sufficient for a simple geometry tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description reiterates the parameter names and formulas but adds no additional meaning beyond the schema, meeting the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes torus volume and surface area, provides the formula, and specifies the shape context (ring-shaped objects, donuts, inner tubes). It is specific and distinguishes from siblings by referencing the 'math' calculators bundle.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly mentions use cases ('donuts, inner tubes') and points to list_bundles for related calculators, but does not provide explicit exclusions or when-not-to-use scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_training_zones_runningAInspect
Calculate 6 running training zones as speed ranges based on VMA. Returns: {vma_kmh}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| vma | Yes | VMA (Maximal Aerobic Speed) in km/h |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full burden. It does not disclose that the tool is read-only or computational, nor does it mention any side effects, permissions, or other behavioral traits beyond the return value.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two clear, efficient sentences. The first states the core purpose, and the second provides a pointer to related tools. No fluff or repetition.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple parameter and presence of an output schema, the description is minimally adequate. However, it mentions a return value '{vma_kmh}' that may be incomplete (likely zones are returned), and it does not elaborate on the zones' structure or units. The pointer to list_bundles adds discoverability but does not fully compensate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% as the sole parameter 'vma' is described in the schema. The description adds 'Calculate 6 running training zones as speed ranges based on VMA,' which marginally reinforces the parameter's role but does not add new semantic info beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates '6 running training zones as speed ranges based on VMA,' which is a specific verb+resource combination. It distinguishes itself from siblings like calculate_vma and calculate_heart_rate_zones by focusing on running training zones based on VMA.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description hints at related calculators via 'See list_bundles for related sport calculators' but does not provide explicit guidance on when to use this tool versus alternatives. No when-not-to-use or specific context is given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_travel_budgetAInspect
Estimate total trip budget by category (transport, accommodation, food, activities). Use for trip planning. Inputs: destination, days, traveler count, comfort level. Returns total and per-day breakdown. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| days | Yes | Number of travel days | |
| travelers | Yes | Number of travelers | |
| destination | Yes | Destination region |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It states that the tool 'Returns total and per-day breakdown,' which is useful. However, it does not mention limitations, accuracy, read-only nature, or any side effects. Adequate but not detailed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the core purpose. It covers action, inputs, and outputs without fluff. Every sentence is essential and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and the presence of an output schema (not shown but indicated), the description provides the key components: purpose, inputs, and output structure. However, the misleading 'comfort level' input and lack of usage prerequisites leave some gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3, but the description incorrectly lists 'comfort level' as an input, which is not in the schema. This introduces false information that could mislead the agent. It adds no extra meaning beyond the schema for the existing parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: 'Estimate total trip budget by category (transport, accommodation, food, activities).' It specifies the verb (estimate) and resource (trip budget). However, it mentions 'comfort level' as an input that does not exist in the schema, causing slight confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for trip planning' and points to a related tool ('list_bundles for related 'voyage' calculators'). This provides clear context for when to use this tool versus alternatives, though it does not cover all exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_travel_insuranceBInspect
Calculate estimated travel insurance cost based on destination, duration, age and activities. Returns: {base_per_day_eur, activity_factor, estimated_premium_eur, coverage_tips}. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | Traveler age in years | |
| activities | Yes | Activity level: standard (city/beach), adventure (hiking/skiing), extreme (mountaineering/motorsport) | |
| destination | Yes | Travel destination zone | |
| duration_days | Yes | Trip duration in days |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description should fully cover behavioral traits. It partially does by stating the return structure and that it calculates an estimated cost, but it does not mention idempotency, side effects, or authoritative data sources. This is a significant gap for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first sentence states purpose and inputs, second sentence lists outputs and points to related tools. No wasted words, good front-loading.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is relatively simple with 4 enum-constrained parameters and an output schema. The description covers inputs, outputs, and a related bundle, which is nearly complete. It could mention any assumptions (e.g., age limits), but the schema already handles constraints.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents parameters adequately. The description adds the output fields but does not elaborate on input parameters beyond what the schema provides. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate' and the resource 'travel insurance cost' with key inputs listed. It does not explicitly differentiate from sibling tools like 'calculate_travel_insurance_estimate', but the reference to 'list_bundles' gives some context. Overall, purpose is clear.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides some guidance by mentioning related 'voyage' calculators via 'list_bundles', implying when to look for alternatives. However, it lacks explicit when-to-use or when-not-to-use conditions, leaving usage context implied.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_travel_insurance_estimateCInspect
Estimate travel insurance cost based on trip cost, age, duration, destination. Use for trip planning. Inputs: trip cost, traveler age, days, region. Returns premium estimate. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | Yes | Traveler age | |
| days | Yes | Trip duration days | |
| destination | Yes | Destination region | |
| coverage_level | Yes | Coverage level |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It only states 'Returns premium estimate' with no information about side effects, permissions, rate limits, or whether it is read-only. This is insufficient for a tool with no annotation support.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (two sentences plus a list) and starts with the core purpose. Every sentence adds value, though the parameter list duplicates schema information with errors. Overall well-structured but slightly redundant.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description is incomplete for a tool with 4 parameters and an output schema. It does not explain the destination enum, the required coverage_level, or how the output schema works. The reference to list_bundles is vague and does not integrate with the current tool's context. The parameter mismatch further reduces completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The description lists input parameters as 'trip cost, traveler age, days, region' but the schema has 'age, days, destination, coverage_level' (no trip cost, and region vs destination). This mismatch misleads the agent about the required parameters. Schema description coverage is 100%, so the description adds confusion instead of clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it estimates travel insurance cost with specific inputs (trip cost, age, duration, destination) and mentions it's for trip planning. However, it does not differentiate from the sibling tool 'calculate_travel_insurance' despite the explicit reference to list_bundles for related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only says 'Use for trip planning' which is vague. There is no explicit guidance on when to use this tool versus alternatives, no exclusions, and no mention of prerequisites or limitations.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_triangle_heronBInspect
Compute triangle area from three side lengths using Heron's formula. Use when angles aren't known. Inputs: sides a, b, c. Returns area, perimeter, type (equilateral/isoceles/scalene). See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| a | Yes | Side a | |
| b | Yes | Side b | |
| c | Yes | Side c |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full disclosure burden. It mentions return values (area, perimeter, type) but omits critical behavior like handling invalid triangle side lengths (e.g., triangle inequality). This is a significant gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences: purpose, usage condition, and outputs. It is front-loaded, concise, and free of unnecessary verbiage.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple mathematical tool, the description covers basic purpose and outputs. However, it lacks details on error handling (e.g., invalid triangle) and does not reference the output schema. Given the complexity, it is adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds minimal context by restating 'sides a, b, c' and the method (Heron's formula), but does not add meaningful semantic details beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes triangle area using Heron's formula, with a specific verb and resource. It hints at when to use ('when angles aren't known') but does not explicitly differentiate from sibling tools like calculate_area.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a conditional usage hint ('Use when angles aren't known') but does not name alternatives or specify when not to use. The suggestion to see list_bundles is vague.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_trigonometryBInspect
Compute sin, cos, tan and inverse functions in degrees or radians. Use for geometry, physics, navigation. Inputs: function, value, unit. Returns result and reciprocal. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| func | Yes | Trig function | |
| unit | No | Input angle unit | degrees |
| value | Yes | Input value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden for behavioral disclosure. It mentions returning result and reciprocal, which is helpful, but does not disclose error handling, input constraints (e.g., domain for inverse functions), or confirm it is read-only.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is exceptionally concise with three front-loaded sentences. Each sentence adds value: main action, use cases, inputs/outputs plus cross-reference. No redundant words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
An output schema exists, so return values are covered. The description adds that it returns result and reciprocal. For a moderately complex trig tool, it covers most needs, though missing domain warnings for inverse functions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All three parameters have schema descriptions (100% coverage), so baseline is 3. However, the description merely repeats 'Inputs: function, value, unit' without adding extra meaning or constraints like valid ranges or examples.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes sin, cos, tan, and inverse functions in degrees or radians, and lists usage contexts (geometry, physics, navigation). It differentiates from sibling calculators by focusing on trigonometry, but does not explicitly distinguish from similar tools like calculate_angle_convert.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using for geometry, physics, navigation, but does not provide when-not-to-use guidance or mention specific alternative tools beyond referring to list_bundles for related math calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_uk_council_taxBInspect
Compute UK Council Tax based on property band and local authority rate. Use for UK residents and home buyers. Inputs: band (A-H), local authority. Returns annual and monthly council tax. See list_bundles for related 'finance-uk' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| band | Yes | Council Tax band (A=lowest, H=highest) | |
| region | No | Region | england |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. Description indicates non-destructive calculation and returns annual/monthly tax. Lacks explicit safety/permission details but is consistent with a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first explains purpose and audience, second lists inputs and outputs. Efficient with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers basic purpose, inputs, and outputs. Missing detail about how region maps to rates and exact output format, but output schema likely fills gap. Adequate for simple calculation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage but description uses different terminology ('band (A-H)' matches schema, but 'local authority' instead of 'region'). Adds confusion about whether a separate rate parameter exists.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
States 'Compute UK Council Tax' with clear verb and resource. Mentions inputs and outputs. However, description says 'local authority' while schema parameter is 'region', causing minor confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Says 'Use for UK residents and home buyers' and references sibling tool list_bundles. Provides context for when to use but no explicit when-not or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_uk_income_taxAInspect
Calculate UK income tax for 2025/26 using HMRC progressive brackets with personal allowance taper. Returns: {gross_income, personal_allowance, taxable_income, income_tax, effective_rate_pct, marginal_rate_pct}. See list_bundles for related 'finance-uk' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income | Yes | Annual gross income in GBP |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, but description explains logic (HMRC brackets, personal allowance taper) and lists return fields. It could mention that it's read-only or that income must be positive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, no fluff, front-loaded with key information. Strucure is clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Complete for a simple calculator: covers purpose, usage, parameters, and return fields. Reference to related tools aids discovery.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema already describes the single parameter (income) with 100% coverage. Description adds tax year context, slightly enhancing semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states verb (calculate), resource (UK income tax), and scope (2025/26, HMRC rules). It distinguishes from sibling tax calculators like French or Belgian ones.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description implies use for UK income tax but provides no explicit when-to-use or when-not-to-use guidance. Reference to list_bundles for related calculators is minimal.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_uk_ni_contributionsAInspect
Calculate UK National Insurance contributions (Class 1 employee) for 2025/26. Returns: {annual_salary, ni_annual, ni_monthly, effective_rate_pct}. See list_bundles for related 'finance-uk' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| annual_salary | Yes | Annual gross salary in GBP |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; the description bears full burden. It discloses the input, output fields, year, and class. For a calculation tool, this is sufficient transparency; no contradictory or missing behavioral traits are evident.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states action and scope, second lists output fields and a reference. Front-loaded and efficient, every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given one parameter, an output schema, and a simple calculation domain, the description is complete. It covers what, when, and output, plus a pointer to related tools.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with the parameter 'annual_salary' described. The description's return fields add context but do not enhance parameter meaning beyond the schema description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates UK National Insurance contributions for Class 1 employee for 2025/26. The specific verb 'calculate' and resource 'UK National Insurance contributions' distinguish it from the many sibling calculate_* tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related finance-uk calculators', providing context. It does not explicitly exclude alternatives but the specific year, class, and output fields make its use case clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_uk_student_loanAInspect
Calculate UK student loan repayments based on plan type and salary. Returns: {annual_salary, repayment_rate_pct, annual_repayment, monthly_repayment}. See list_bundles for related 'finance-uk' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| plan | No | Student loan plan: 1, 2, 4, 5, or postgrad | 2 |
| annual_salary | Yes | Annual gross salary in GBP |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool calculates and returns specific fields, which implies read-only behavior, but does not explicitly declare idempotency, safety, or error handling. For a simple calculator this is adequate but not exemplary.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words. The key action, inputs, outputs, and a pointer to related tools are all included in a front-loaded structure.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, no nested objects, explicit return fields), the description fully covers what an agent needs to know: inputs, output format, and related context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions. The description adds value by listing the return object fields, which are not in the input schema, and clarifying the plan types. This enhances understanding beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Calculate UK student loan repayments based on plan type and salary', specifying the verb and resource. It differentiates from siblings by focusing on UK student loans, with a pointer to related calculators via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description mentions 'See list_bundles for related calculators' but does not provide explicit when-to-use or when-not-to-use guidance compared to alternatives like calculate_uk_income_tax. Usage context is implied but not detailed.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_uk_vatAInspect
Calculate UK VAT — convert between net (ex-VAT) and gross (inc-VAT) amounts. Returns: {amount_net, amount_gross, vat_amount, vat_rate_pct}. See list_bundles for related 'finance-uk' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | Input mode: ht=net (ex-VAT), ttc=gross (inc-VAT) | ht |
| rate | No | VAT rate: 0% (zero), 5% (reduced), 20% (standard) | 20 |
| amount | Yes | Amount in GBP |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavior. It describes the return object, which adds value, but does not explicitly state side effects (e.g., it is a read-only calculation) or any edge cases. It is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: two sentences. The first sentence delivers the core purpose, and the second lists the output and points to related tools. No wasted words, well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple nature of the tool and the presence of a clear output schema (return object described), the description covers conversion and output adequately. It also mentions related tools. However, it could include minor details like rounding behavior, but overall complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds the return structure but does not elaborate on parameter meaning beyond the schema. No contradiction or additional semantic value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Calculate UK VAT — convert between net (ex-VAT) and gross (inc-VAT) amounts.' It specifies the resource (UK VAT) and the action (convert), and differentiates from sibling VAT calculators by explicitly naming 'UK'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not explicitly state when to use this tool versus alternatives. It only mentions 'See list_bundles for related finance-uk calculators,' which hints at related tools but does not provide clear guidance on when to choose this one.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_unemployment_benefitAInspect
Estimate French unemployment benefit (ARE — Aide au Retour a l'Emploi). Returns: {daily_ref_salary, daily_are, monthly_are_estimate, min_daily, max_daily_75pct_sjr}. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| daily_ref_salary | Yes | Salaire Journalier de Reference (SJR) in euros — typically last 12 months salary / 261 |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It lists return values but does not explain assumptions, limitations (e.g., accuracy, applicability to all unemployment cases), or any prerequisites beyond the input parameter. Adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two efficient sentences: first states purpose and outputs, second directs to related tools. No redundant information, front-loaded with key details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-parameter tool with output schema, the description is mostly complete. It could explicitly state 'for France only' or note eligibility conditions, but overall provides sufficient context for an agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a clear description of daily_ref_salary. The description adds no extra parameter meaning beyond the schema. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Estimate French unemployment benefit (ARE)'. It specifies the return fields and differentiates from siblings by naming the exact benefit and referencing related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool vs alternatives (e.g., other French tax/salary calculators). The reference to list_bundles is vague and does not provide decision criteria for tool selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_unit_priceAInspect
Compare unit prices across packages to find the best deal. Use for shopping. Inputs: list of {price, quantity, unit}. Returns price per unit and best buy. See list_bundles for related 'education' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| items | Yes | Items to compare |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool compares and returns results, implying a read-only operation. However, it does not explicitly disclose behavioral traits (e.g., no side effects, statelessness). For a calculator, this is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise, consisting of two sentences and a reference. It front-loads the core purpose and includes all essential information without unnecessary detail.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the purpose, inputs, and outputs (price per unit and best buy). It references 'list_bundles' for related calculators, providing some context among many siblings. However, it could be more complete by explicitly distinguishing from similar tools like 'calculate_cost_per_use'.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema describes 'items' as 'Items to compare' without individual property descriptions. The description adds meaning by summarizing the nested structure: 'list of {price, quantity, unit}' and mentions 'name' is included. This helps the agent understand the required fields beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: comparing unit prices across packages to find the best deal. It specifies the inputs and outputs. However, it does not differentiate from sibling tools like 'calculate_cost_per_use' or 'calculate_grocery_unit_comparison', which are related.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for shopping' which provides context, but it does not explicitly guide when to use this tool versus alternatives, nor does it state prerequisites or cases where this tool should not be used.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_401kAInspect
Calculate US 401(k) contribution, employer match, and total retirement savings. Returns: {annual_salary, employee_contribution, employer_match, total_annual_contribution, catch_up_eligible, max_employee_limit}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| age | No | Employee age (50+ enables catch-up contributions) | |
| annual_salary | Yes | Annual salary in USD | |
| contribution_pct | Yes | Employee contribution percentage (1-100) | |
| employer_match_pct | No | Employer match percentage of employee contribution (default 50%) | |
| employer_match_limit | No | Employer match cap as % of salary (default 6%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavior. It lists return fields, which adds transparency about outputs, but does not disclose assumptions, plan rules, or potential side effects (though calculation tools are inherently read-only).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences that efficiently convey the purpose and a pointer to related tools, with no extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description and schema provide sufficient detail for a calculation tool: required parameters, optional parameters with defaults, and return fields. It lacks explicit mention of current tax year limits, but this is common for such calculators and does not hinder immediate use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All parameters have inline descriptions in the schema (100% coverage), so the description adds no extra meaning beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates US 401(k) contribution, employer match, and total retirement savings, and lists the return fields. This separates it from the many other 'calculate_*' siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'See list_bundles for related finance-us calculators,' which implies exploring related tools, but does not explicitly state when to use this tool vs. alternatives or provide exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_capital_gainsAInspect
Calculate US capital gains tax — short-term (ordinary rates) or long-term (preferential rates). Returns: {tax, gain_type}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sale_price | Yes | Sale price in USD | |
| filing_status | No | Filing status for rate thresholds | single |
| purchase_price | Yes | Original purchase price in USD | |
| holding_period_months | Yes | Holding period in months |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must convey behavior. It accurately indicates no side effects (calculation only) and specifies return fields. However, it omits details like rate assumptions or edge cases, which is acceptable for a straightforward calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences cover purpose, rate type distinction, return structure, and cross-reference to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a low-complexity tax calculator, the description covers essential aspects: what it calculates, key distinctions, and return format. The 'see list_bundles' provides context for related tools. Slight omission of underlying tax year/rates is acceptable.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 100% parameter description coverage, describing all four parameters thoroughly. The description adds no further parameter details, so it meets the baseline with no additional value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates US capital gains tax, distinguishes short-term vs long-term, and specifies the return structure. It also references 'list_bundles' for related calculators, differentiating it from siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides context on tax rate types (short-term vs long-term) and hints at related calculators via 'list_bundles'. Lacks explicit when-to-use or when-not-to-use guidance, but the context is sufficient for a calculation tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_child_tax_creditAInspect
Calculate US Child Tax Credit for 2026 with phase-out based on AGI. Returns: {agi, base_credit, credit_reduction, final_credit}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| agi | Yes | Adjusted Gross Income in USD | |
| filing_status | No | Filing status | single |
| children_under_17 | Yes | Number of qualifying children under age 17 |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description effectively discloses the tool's behavior by listing return fields (agi, base_credit, credit_reduction, final_credit) and mentioning the phase-out mechanism. It adds useful context beyond the input schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the core purpose, return structure, and a pointer to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the tool's purpose, return values, and directs to related calculators. For a calculation tool with complete schema coverage, this is sufficient. A minor gap is the lack of explanation about the phase-out formula or edge cases, but the core is covered.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already covers 100% of parameters with descriptions. The description adds minimal extra meaning by linking AGI to phase-out, but does not elaborate on parameter semantics beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates the US Child Tax Credit for 2026 with phase-out based on AGI. It uses specific language about the resource and action, distinguishing it from other tax calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises users to see list_bundles for related 'finance-us' calculators, providing a path to alternatives. While it doesn't give explicit when-not-to-use guidance, the context is clear for a specialized calculator.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_federal_taxAInspect
Calculate US federal income tax for 2026 using progressive brackets with standard deduction. Returns: {gross_income, standard_deduction, taxable_income, federal_tax, effective_rate_pct, marginal_rate_pct, ...}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| income | Yes | Gross annual income in USD | |
| filing_status | No | Filing status | single |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It discloses the use of progressive brackets and standard deduction, and outlines the return object fields. For a calculation tool, this is transparent enough; no destructive behavior is expected. The description could mention that it is read-only, but the purpose implies that.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first clearly states the tool's core purpose, and the second adds return details and a pointer to related calculators. No unnecessary words; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of US tax calculation, the description covers essential aspects: year, method, input parameters, and return structure. It lacks mention of possible limitations (e.g., no state tax, no credits), but for a federal tax calculator, it is sufficiently complete. The pointer to list_bundles helps with context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with both parameters (income, filing_status) having descriptions in the schema. The description adds value by specifying 'for 2026' and 'with standard deduction', and by listing return fields like effective_rate_pct and marginal_rate_pct, which are not in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates US federal income tax for 2026 using progressive brackets with standard deduction. The verb 'calculate' and specific resource 'US federal income tax' are unambiguous, and the tool is easily distinguished from sibling tax calculators for other countries (e.g., calculate_canada_federal_tax, calculate_french_income_tax).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests seeing list_bundles for related 'finance-us' calculators, which hints at context but does not explicitly state when to use this tool versus alternatives like calculate_us_capital_gains or calculate_us_state_tax. It lacks clear when-to-use or when-not-to-use guidance, earning a middling score.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_ficaAInspect
Calculate US FICA taxes (Social Security + Medicare) employee share for 2026. Returns: {gross_annual, social_security_taxable, social_security_tax, medicare_base_tax, medicare_additional_tax, medicare_total, ...}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| gross_annual | Yes | Gross annual salary in USD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description bears full burden. It states that it calculates employee share for 2026 and lists returned fields. This is adequate for a simple calculation tool, but does not disclose any assumptions, limitations, or side effects beyond what is stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise: two sentences. First sentence packs purpose, scope, and output summary. Second sentence directs to related tools. No wasted words. Front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a single-parameter calculation tool with an output schema, the description is complete: it states the year (2026), components, and links to the bundle. Could mention that it's only employee share (already done) and perhaps note that gross_annual must be within valid ranges, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Only one parameter (gross_annual) with 100% schema description coverage. The description adds the context of year 2026 but does not add meaning beyond the schema, which already defines it as 'Gross annual salary in USD'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it calculates US FICA taxes (Social Security + Medicare) employee share for 2026. Specifies the verb 'Calculate' and the resource, distinguishing it from other US tax calculators. Also hints at related tools via list_bundles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides no explicit when-to-use or when-not-to-use guidance. Only a reference to list_bundles for related calculators, which is a weak implicit suggestion. No prerequisites or conditions mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_mortgageAInspect
Calculate US mortgage with PMI, property tax, and insurance estimates. Returns: {home_price, down_payment, loan_amount, monthly_pi, monthly_pmi, monthly_property_tax, ...}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| years | No | Loan term in years (default 30) | |
| home_price | Yes | Home purchase price in USD | |
| annual_rate | Yes | Annual mortgage interest rate in % | |
| down_payment_pct | No | Down payment percentage (default 20%) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It states the tool calculates with PMI, property tax, and insurance estimates, and lists return fields. However, it does not explicitly state that it is read-only or non-destructive, nor does it mention authorization needs or rate limits. For a calculation tool, this is adequate but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise (two sentences) and front-loaded. The first sentence states the core purpose, and the second adds a preview of return fields and a reference to related tools. Every sentence provides value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the existence of an output schema and 100% parameter schema coverage, the description provides a sufficient overview. It includes a snippet of return fields and a pointer to related calculators. The only minor gap is that it does not explain the output schema in detail, but that is handled by the output schema itself.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and each parameter already has a clear description in the schema. The description does not add any additional meaning or context beyond what the schema provides, so it meets the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it calculates US mortgage with PMI, property tax, and insurance estimates. It names the resource and provides partial return fields. However, it does not explicitly distinguish from siblings like 'calculate_mortgage' (generic) or 'calculate_mortgage_insurance', though the specificity to US mortgages implies differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions seeing list_bundles for related calculators, which gives a hint for further information. However, it does not specify when to use this tool versus alternatives, nor does it provide explicit prerequisites or exclusions for usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_paycheckAInspect
Estimate US bi-weekly net paycheck after federal/state withholding and FICA. Returns: {annual_salary, fica_biweekly, net_biweekly, net_annual_estimate}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| state | No | State for state income tax (TX/FL/WA have no state tax) | TX |
| annual_salary | Yes | Annual salary in USD | |
| filing_status | No | Federal withholding filing status | single |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the behavioral burden. It lists the expected return fields (annual_salary, fica_biweekly, net_biweekly, net_annual_estimate), providing good insight into output. It does not explicitly mention that the result is an estimate or if any assumptions are made, but the word 'estimate' implies this. It could be more explicit about precision or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences that efficiently convey purpose and return fields. The first sentence states the core function, the second lists outputs and provides a pointer to related tools. No wasted words, front-loaded with essential information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has three parameters and an output schema (implied by context signals), the description covers purpose, output, and related tools. It could briefly note that state income tax only applies to the listed states (evident from schema) or that the estimate is bi-weekly. Overall, it is sufficient for most use cases.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already provides 100% coverage with descriptions for all three parameters (annual_salary, state, filing_status). The description does not add new meaning beyond the schema, but it does indirectly imply the parameters by mentioning the deductions. Since schema coverage is high, baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates US bi-weekly net paycheck after federal/state withholding and FICA. It specifies the verb 'estimate' and the resource 'US bi-weekly net paycheck', distinguishing it from sibling tools like calculate_us_federal_tax or calculate_us_fica. The mention of return fields adds specificity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context by naming the tool's purpose and pointing to the 'finance-us' bundle via list_bundles. However, it does not explicitly state when to use this tool versus alternatives like calculate_us_federal_tax or calculate_us_fica, leaving the differentiation implicit.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_property_taxAInspect
Estimate annual US property tax by state using effective tax rates. Returns: {home_value, effective_rate_pct, annual_property_tax, monthly_property_tax}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| state | No | State (affects effective property tax rate) | TX |
| home_value | Yes | Assessed home value in USD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully convey behavioral traits. It only says 'Estimate' and gives a return format, but fails to disclose that only 6 states are available (NJ, IL, TX, CA, FL, NY), or discuss accuracy, permissions, or side effects. This leaves the agent with incomplete information.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: one for purpose and method, one for output format and related tools. No extraneous words, efficient and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculator with two parameters, the description covers purpose, input, output, and related tools. However, it omits the limitation of only 6 states and does not mention data sources, making it slightly incomplete for a fully self-contained description.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds context about effective tax rates and output format, but does not provide additional semantics beyond the schema's descriptions for the parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Estimate annual US property tax by state using effective tax rates', specifying a precise verb, resource, and method. It also directs to list_bundles for related calculators, distinguishing it from the many sibling calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for US property tax estimation and points to a related bundle, but it does not explicitly state when to use this tool versus alternatives, nor does it mention prerequisites or constraints like the limited set of states.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_state_taxBInspect
Compute US state income tax for a chosen state. Use for paycheck planning across states. Inputs: state, gross income, filing status. Returns state tax due and effective rate. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| state | Yes | US state | |
| income | Yes | Annual taxable income in USD |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It mentions inputs that do not match the schema (filing status is listed but not in schema) and does not clarify state limitations or return details beyond the output schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and includes a reference to related tools, but the inclusion of an input not in the schema reduces its reliability and clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema and simple parameters, the description provides enough context for basic usage but lacks details on state scope, tax rules, and potential errors.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with basic descriptions, but the tool description lists 'filing status' as an input that is not in the schema, contradicting the structural definition and misleading the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes US state income tax for a chosen state, defines the resource (state tax), and distinguishes it from sibling tools like federal tax calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using it for paycheck planning across states and references related tools via list_bundles, but does not explicitly specify when not to use it or name direct alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_us_student_loanBInspect
Calculate US student loan repayment under standard, graduated, or income-driven plans. Returns: {loan_balance}. See list_bundles for related 'finance-us' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| plan | No | Repayment plan | standard |
| annual_rate | Yes | Annual interest rate in % | |
| loan_balance | Yes | Outstanding loan balance in USD | |
| annual_income | No | Annual income (required for income_driven plan) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry behavioral disclosure. It only states the tool calculates repayment and returns a loan balance, omitting any mention of side effects, authorization needs, or rate limits. For a calculation tool, read-only behavior is implied but not stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Extremely concise, two sentences that are front-loaded with the core purpose. No filler or redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
While the tool has 4 parameters, 2 required, an enum, and an output schema, the description is minimal. It covers the basic purpose and plan types but does not mention default parameter values (e.g., plan defaults to 'standard') or the conditional nature of annual_income. The output is vaguely referenced as '{loan_balance}', but the output schema exists to fill gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description mentions the plan options ('standard, graduated, or income-driven') which aligns with the enum, but does not add new parameter semantics beyond what the schema already provides (e.g., the conditional requirement for annual_income is in the schema but not in the description).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates US student loan repayment, specifying three plan types (standard, graduated, income-driven). This distinguishes it from generic student loan calculators (e.g., calculate_student_loan_repayment) and other country-specific tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The description mentions 'See list_bundles for related finance-us calculators' but does not specify prerequisites, plan selection criteria, or when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_vacation_days_frAInspect
Compute French paid vacation days earned (congés payés). Use for HR planning. Inputs: months worked, contract type. Returns days earned (2.5/month rule) and equivalent in working days. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| full_time | No | Full-time | |
| months_worked | Yes | Months worked |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the calculation rule (2.5/month) and return values but does not mention behavioral traits like read-only status, authentication needs, or potential side effects. For a calculation tool, this is adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences with no unnecessary words. The first sentence clearly states the action and resource, followed by usage context and a reference to related tools. It is front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has an output schema (context signal), the description does not need to detail return values. It mentions the key return items (days earned and working days equivalent). However, it does not explain the conversion factor for working days or how the full_time parameter influences the result. For a simple calculation tool, this is mostly complete but leaves minor gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage for both parameters (months_worked and full_time). The description adds minimal value by mentioning 'months worked' and 'contract type', but does not explain how 'full_time' affects the calculation or clarify the 'contract type' mapping. Baseline 3 is appropriate given schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute French paid vacation days earned' with specific verb and resource. It mentions the 2.5/month rule, making the tool's purpose unambiguous. However, it does not explicitly differentiate from sibling tools like 'calculate_vacation_days_optimal' or 'calculate_leave_days', which could cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for HR planning', providing a context for usage. It also references 'list_bundles' for related calculators, hinting at alternatives. However, it lacks explicit when-to-use or when-not-to-use guidance, and does not address prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_vacation_days_optimalCInspect
Compute optimal vacation usage by chaining bridge days with public holidays. Use for HR or worker planning. Inputs: vacation days, country, year. Returns best plan with day count. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| public_holidays_count | Yes | Number of public holidays near weekends | |
| vacation_days_available | Yes | Annual vacation days available |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry full burden. It states 'Returns best plan with day count' but does not explicitly declare non-destructive behavior, auth needs, or rate limits. The behavioral traits are inferred but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (2 sentences) but includes irrelevant and incorrect information (country, year). This undermines conciseness and accuracy. The mention of list_bundles is helpful but does not compensate for the error.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The output schema exists, but the description does not explain key concepts like 'bridge days' or what 'optimal' means. It fails to provide a complete understanding of the tool's functionality, and the erroneous parameter mention further degrades completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description incorrectly lists 'country, year' as inputs, which are not in the schema. This misleads the agent and contradicts the schema. The description adds no additional semantic value beyond the schema's own parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes optimal vacation usage by chaining bridge days with public holidays, using specific verbs and resources. Mention of 'HR or worker planning' provides context. However, it does not explicitly differentiate from sibling tool calculate_vacation_days_fr.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use for HR or worker planning and points to list_bundles for related 'voyage' calculators. This gives some guidance but lacks explicit when-not-to-use or clear alternatives among the many sibling calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_vat_genericAInspect
Calculate VAT/GST/sales tax for any country with custom rate. Returns: {amount_before_tax, amount_after_tax, tax_amount, tax_rate}. See list_bundles for related 'finance-universal' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mode | No | ht=before tax, ttc=after tax | ht |
| rate | Yes | Tax rate in % | |
| amount | Yes | Amount |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description only states return format. It does not disclose behavioral traits like idempotency or safety, but the nature of calculation tools makes it inherently non-destructive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with front-loaded purpose and return format. Points to related tools efficiently.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and presence of output schema, the description is complete enough. Could mention mode parameter behavior but schema covers it.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. The description adds little beyond the schema, repeating the purpose but not explaining mode or rate format.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states verb (calculate), resource (VAT/GST/sales tax), and scope (any country with custom rate). It distinguishes from country-specific VAT tools like calculate_belgian_vat by being generic.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description implies usage for custom rates and mentions related tools via list_bundles. However, it does not explicitly state when to use this tool over country-specific ones or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_vat_reverseAInspect
Reverse-VAT: extract the VAT and net price from a TTC amount. Use to back out tax from a gross invoice. Inputs: TTC amount, VAT rate %. Returns HT, VAT amount. See list_bundles for related 'finance-france' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| vat_rate | No | VAT rate % | |
| amount_incl | Yes | Amount including VAT |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It describes the calculation behavior (inputs and outputs) but does not explicitly state that it is a read-only, stateless operation. However, the nature of a calculator implies no side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences, front-loaded with purpose. No wasted words. Efficiently communicates the tool's function.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple calculation tool, the description is complete: it explains inputs, outputs, and usage context. It mentions a related bundle tool for further exploration.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (both parameters documented). The description adds context (TTC, HT) but does not add meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'extract the VAT and net price from a TTC amount' and 'back out tax from a gross invoice'. It uses specific verbs and distinguishes from siblings by referencing related calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states when to use: 'Use to back out tax from a gross invoice'. It also provides an alternative by pointing to 'list_bundles for related 'finance-france' calculators'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_velo_developmentBInspect
Calculate bicycle development in meters per pedal revolution. Returns: {gear_ratio, development_m, speed_at_90rpm_kmh}. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cog_teeth | Yes | Number of teeth on the rear cog/sprocket | |
| chainring_teeth | Yes | Number of teeth on the front chainring | |
| wheel_circumference_mm | No | Wheel circumference in mm (700c road default = 2105mm) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description must disclose behavioral traits. It only describes the calculation and returns, but does not mention that it is a read-only operation, any side effects, or permission requirements. For a simple calculator, this is minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose, and no wasted words. It efficiently states the function and points to related tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool is a simple calculation with an existing output schema, the description adequately covers purpose, return structure, and related calculators. It does not explain the formula or units, but the output schema likely fills that gap.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for all three parameters. The description adds no extra parameter information beyond the schema, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate' and the resource 'bicycle development in meters per pedal revolution', and mentions the return fields. However, among many sibling calculators, it does not explicitly distinguish itself from 'calculate_braquet' or others.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides some guidance by referencing 'list_bundles' for related sport calculators, implying a context of use. However, it lacks explicit when-to-use or when-not-to-use guidance compared to other similar tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_vmaAInspect
Compute VMA (Maximal Aerobic Speed) from a fitness test result. Use for runners building training plans. Inputs: test type (Cooper 12-min, Luc Léger), result. Returns VMA km/h and zones. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| test | Yes | Test type: cooper (12min run), demi_cooper (6min run), vameval (final speed km/h) | |
| result_value | Yes | Distance in meters (cooper/demi_cooper) or final speed in km/h (vameval) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It mentions 'Returns VMA km/h and zones,' indicating output, but does not clarify that it is a read-only calculation or note any side effects. For a calculation tool, this is adequate but minimal.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences plus a cross-reference, front-loaded with the core purpose. Every sentence adds value, with no wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present (as per context), the description appropriately focuses on input/output and usage context. It mentions return format (VMA km/h and zones) and directs users to related tools, making it complete for a simple calculator.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds context by mentioning test types (Cooper 12-min, Luc Léger) but omits 'demi_cooper' and misaligns with the enum. It repeats the parameter roles without significant additional meaning.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Compute VMA (Maximal Aerobic Speed) from a fitness test result,' specifying the verb and resource. It distinguishes from siblings by focusing on VMA calculation and referencing list_bundles for related sport calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for runners building training plans,' providing clear context. It also directs to list_bundles for related tools, implying alternatives, but does not explicitly state when not to use or exclude other calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_vo2maxAInspect
Estimate VO2max from VMA (maximal aerobic speed). Use for runners assessing cardio fitness. Formula: VO2max ≈ VMA × 3.5. Inputs: VMA in km/h. Returns VO2max in mL/kg/min and fitness category. See list_bundles for related 'sport' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| vma | Yes | VMA in km/h |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavior. It explains the formula (VO2max ≈ VMA × 3.5) and that it returns VO2max in mL/kg/min and a fitness category. It could mention estimation limitations, but it's adequate for a simple calculation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loads the purpose, and every sentence adds value: purpose, usage scenario, formula and output. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the output schema exists and the input schema is fully described, the description covers what the tool does, how to use it, and what it returns. It could mention that it's an estimation for athletes, but it's complete enough for use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with description 'VMA in km/h'. The description repeats this and adds the formula but does not add new semantic meaning beyond the schema's field description. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool estimates VO2max from VMA, specifying it's for runners assessing cardio fitness. It distinguishes itself from sibling tools like calculate_vma by mentioning the formula and specific use case.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for runners assessing cardio fitness' and points to related calculators via 'See list_bundles for related 'sport' calculators.' It implies a specific context but doesn't explicitly state exclusions or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_voltage_adapterCInspect
Determine voltage adapter and plug type needed for a destination country. Use for international travel with electronics. Inputs: home country, destination, device wattage. Returns adapter type, voltage, plug shape. See list_bundles for related 'voyage' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to_country | Yes | Destination country | |
| from_country | Yes | Country of origin |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must fully disclose behavior. It states inputs include device wattage, which is not in the schema—a significant error. It does not mention rate limits, auth, or side effects, but the main issue is the inaccurate parameter claim.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at three sentences, but the inaccuracy regarding device wattage undermines its value. It could be better structured by aligning inputs with the actual schema.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the tool's purpose and outputs, but omits that countries are limited to the supplied enum list. The mention of device wattage as an input is incomplete since it's not a parameter.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers 100% of parameters with descriptions, but the description adds 'device wattage' as an input, which is not in the schema. This introduces confusion and contradicts the schema, reducing clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool determines voltage adapter and plug type for a destination country, addressing international travel. It references related tools via list_bundles but does not explicitly differentiate from sibling calculators beyond that.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises using the tool for international travel, providing context. It points to list_bundles for related calculators, but lacks explicit when-not-to-use guidance and introduces an input (device wattage) not present in the schema, which could mislead.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_volumeAInspect
Compute volume for common 3D shapes (cube, cylinder, sphere, cone, prism). Use for geometry, packaging, or construction. Inputs: shape + dimensions. Returns volume in input-units cubed. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| shape | Yes | Shape | |
| width | No | Width | |
| height | No | Height | |
| length | No | Length/side | |
| radius | No | Radius | |
| base_area | No | Base area for prism/pyramid |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. It states the output is 'volume in input-units cubed' and lists inputs. However, it does not disclose error handling, validation, or side effects, nor does it specify which dimensions are required for each shape.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single concise sentence with a helpful tip about list_bundles. It is front-loaded with the primary action and key shapes. While efficient, it could be slightly more structured by bullet points for shapes.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the basic purpose, inputs, and output. Given the tool has 6 parameters and multiple shapes, it lacks details on which dimensions are needed per shape. The existence of an output schema mitigates the need for return value explanation, but shape-specific parameter requirements are missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so all parameters are described individually. The description summarizes inputs as 'shape + dimensions' but does not add extra meaning about relationships or required combinations. Baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes volume for common 3D shapes (cube, cylinder, sphere, cone, prism) and lists use cases (geometry, packaging, construction). This distinguishes it from sibling tools that are specialized for specific volumes (e.g., aquarium, pool).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description advises 'Use for geometry, packaging, or construction' and points to list_bundles for related calculators. It provides context but does not explicitly state when to avoid this tool in favor of more specialized volume tools among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_waist_hip_ratioBInspect
Calculate waist-to-hip ratio and cardiovascular risk level. Returns: {risk_threshold, cardiovascular_risk}. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| sex | Yes | Biological sex | |
| hip_cm | Yes | Hip circumference in centimeters | |
| waist_cm | Yes | Waist circumference in centimeters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the return format {risk_threshold, cardiovascular_risk}, but does not elaborate on behavioral aspects such as read-only nature, side effects, or prerequisites. Since no annotations are provided, the description should cover these aspects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief with clear purpose and return info, but could be better organized (e.g., bullet format for parameters or usage).
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description provides the purpose and return format, and directs to related tools via list_bundles. However, it does not include important contextual information such as units (already in schema), or any disclaimers or assumptions. For a health risk calculator, more completeness might be expected.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already describes all three parameters with clear text. The description does not add any additional semantic meaning or context beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates waist-to-hip ratio and cardiovascular risk level, which is specific and distinct from other calculators. However, it does not explicitly differentiate from other health-related calculators like calculate_bmi, but the unique risk component provides some distinction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description only directs to list_bundles for related calculators, but does not provide explicit guidance on when to use this tool versus alternatives, nor any prerequisites or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_wallpaperCInspect
Compute wallpaper rolls needed for a room with pattern repeat factor. Use for renovation budget. Inputs: room dimensions, roll size, pattern repeat. Returns roll count. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height_m | Yes | Height m | |
| roll_width | No | Roll width m | |
| perimeter_m | Yes | Room perimeter m | |
| roll_length | No | Roll length m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It only states that the tool 'Returns roll count' but does not disclose any behavioral traits such as rounding behavior, waste assumptions, or how the pattern repeat factor is applied. This is insufficient for a calculation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is compact with three sentences and front-loads the core purpose. However, it includes an inaccurate mention of a pattern repeat input, which reduces effectiveness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of a very similar sibling tool and the mention of a pattern repeat factor not in the schema, the description lacks key details about calculation assumptions and limitations. The output schema exists but does not compensate for the gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters. However, the description mentions 'pattern repeat factor' as an input, but no such parameter exists in the schema. This is misleading and adds negative value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'Compute wallpaper rolls needed for a room with pattern repeat factor' which clearly identifies the tool's purpose. However, it fails to distinguish from the similar sibling 'calculate_wallpaper_rolls' present in the tool list, so it does not achieve a perfect score.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for renovation budget' and suggests looking at list_bundles for related calculators, providing some context. However, it lacks explicit guidance on when to use this tool versus similar ones like paint or flooring calculators, and does not mention when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_wallpaper_rollsCInspect
Compute wallpaper rolls for a room including pattern repeat and waste. Use for renovation. Inputs: walls m², roll dimensions, pattern repeat. Returns roll count and length needed. See list_bundles for related 'construction' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| height_m | Yes | Wall height m | |
| roll_width | No | Roll width m | |
| perimeter_m | Yes | Room perimeter m | |
| roll_length | No | Roll length m |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It mentions pattern repeat and waste but pattern repeat is not in the input schema, and waste calculation is not explained. The tool's behavior regarding these factors is unclear and potentially misleading.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences and a reference. It front-loads the purpose. However, the inaccuracies (pattern repeat, m²) reduce its effectiveness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a specialized calculation involving pattern repeat and waste, the description is incomplete. It does not explain how pattern repeat is handled, what default values are used for width/length despite schema defaults, or the algorithm. The output is partially described (roll count and length), but lacks detail on waste factor or assumptions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. However, the description mentions 'walls m²' and 'pattern repeat' which do not correspond to actual parameters (perimeter_m and height_m derive area, no pattern repeat parameter). This misaligns with the schema and adds confusion rather than clarity.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes wallpaper rolls for a room, including pattern repeat and waste. It identifies the domain as renovation. However, it does not differentiate from the sibling tool 'calculate_wallpaper' which may serve a similar purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for renovation,' providing a general context. It refers to 'list_bundles' for related construction calculators, but gives no specifics on when to use this tool versus others, nor any exclusions or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_water_billBInspect
Compute water bill from cubic meters consumed and tariff bands. Use for household budget. Inputs: m³ consumed, fixed fee, variable rate. Returns total bill. See list_bundles for related 'vie-quotidienne' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| cubic_meters | Yes | Water consumption in m³ | |
| price_per_m3 | No | Price per m³ (default 4.34 EUR — France 2026) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states the tool computes a bill and returns total cost, implying a read-only calculation. However, it does not explicitly confirm no side effects or highlight the default parameter value. The description is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise at three sentences. It front-loads the purpose then expands. However, the inaccuracy in listing inputs reduces its effectiveness. It earns its keep but could be better structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with only two parameters and an output schema exists. The description covers the basics but fails to mention the default value of 'price_per_m3' and misrepresents inputs. It is sufficient for a basic understanding but incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
While the input schema has 100% description coverage, the tool's description incorrectly lists 'fixed fee' and 'variable rate' as inputs, which are not parameters. The correct parameters are 'cubic_meters' and 'price_per_m3'. This misalignment reduces clarity for selecting and invoking the tool.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes a water bill from cubic meters consumed and tariff bands, and provides a usage context ('household budget'). It distinguishes itself among many similar 'calculate_*' tools. However, the mention of 'fixed fee' and 'variable rate' is not reflected in the input schema, which may cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using it for household budgets and references 'list_bundles' for related calculators. However, it does not explicitly mention when not to use this tool or provide direct alternatives. The guidance is minimal but adequate for a simple calculator.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_water_hardnessAInspect
Calculate water hardness in French degrees from calcium and magnesium concentrations. Returns: {thresholds}. See list_bundles for related 'cuisine' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| calcium_mg_l | Yes | ||
| magnesium_mg_l | Yes |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry full behavioral disclosure. It only states it calculates and returns thresholds, but does not mention whether the operation is read-only, has side effects, requires authentication, or has rate limits. Safety and idempotency are not addressed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences long, front-loads purpose and inputs, and includes a useful cross-reference. Every sentence contributes meaning without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple two-parameter calculator, the description covers purpose and return value (thresholds) and points to related tools. The presence of an output schema (not shown but indicated) reduces the need to detail return structure. Some guidance on when to use French degrees vs other scales would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 0% coverage for parameter descriptions, so the description must compensate. It names the two inputs ('calcium and magnesium concentrations') but does not specify units or provide guidance beyond the schema constraints (min=0). Parameter names are self-explanatory, so the description adds minimal extra value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates water hardness in French degrees from specific inputs. It uses a specific verb-resource combination and is distinct among siblings, which are all named with 'calculate_' prefix covering different domains.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a cross-reference to 'list_bundles' for related 'cuisine' calculators, implying a context but not explicit when-to-use or when-not-to-use. No exclusions or alternatives given, though no direct sibling tool performs the same function.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_water_heater_sizeBInspect
Calculate recommended water heater tank size for a household. See list_bundles for related 'plomberie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| usage | Yes | Water usage level: low (30L/person), normal (50L/person), high (70L/person) | |
| household_size | Yes | Number of people in the household |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavioral traits. It does not indicate whether the tool is read-only, mutates state, requires authentication, or has side effects. Since it is a calculation tool, it likely has no side effects, but this is not stated. The lack of behavioral transparency is a gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise, consisting of a single sentence plus a reference to list_bundles. Every word serves a purpose, and the structure is front-loaded. However, it sacrifices some detail for brevity, making it slightly less informative than ideal.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 params, no nesting, output schema exists), the description is adequate but not comprehensive. It does not mention the output format or edge cases, though the output schema likely covers return values. The pointer to list_bundles adds some context but does not fully round out the tool's purpose.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, already defining both parameters and their ranges/enums. The description adds no additional semantic information beyond what the schema provides. Therefore, it meets the baseline of 3 but does not enhance parameter understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the purpose: 'Calculate recommended water heater tank size for a household.' It specifies the verb 'calculate' and the resource 'water heater tank size', distinguishing it from many sibling tools that calculate other things. The reference to list_bundles for related plumbing calculators further contextualizes its scope.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies when to use this tool (for household water heater size), but it does not explicitly state when not to use it or provide alternatives beyond a pointer to list_bundles. No prerequisites or exclusions are mentioned, making usage guidance implicit rather than explicit.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_water_intakeAInspect
Compute recommended daily water intake in liters by weight, activity, climate. Use for hydration planning. Inputs: weight kg, activity level, climate (temperate/hot). Returns L/day and glass count. See list_bundles for related 'sante' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| climate | Yes | Climate | |
| weight_kg | Yes | Body weight kg | |
| activity_level | Yes | Activity level |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It explains inputs and output but does not disclose any side effects, authorization needs, or rate limits. For a calculation tool, this is adequate but could be more transparent about its read-only nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three short, focused sentences: purpose, usage, and related tools. No redundancy or fluff. Each sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers inputs, output, and context. Mentions output schema returns L/day and glass count. Could note weight range or precision, but overall complete for a simple calculation tool with rich schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description adds little beyond restating parameter names and enum values (temperate/hot). It does not provide additional semantics like units or format beyond what the schema already offers.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool computes water intake based on weight, activity, and climate. It specifies inputs, outputs (L/day and glass count), and distinguishes itself from siblings by focusing on hydration planning and referencing related 'sante' calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use for hydration planning' and directs to list_bundles for related calculators, providing context. However, it does not explicitly state when not to use or mention alternatives beyond the bundle reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_water_pressure_lossBInspect
Calculate water pressure loss in a pipe circuit including friction and elevation. Returns: {equiv_length_m}. See list_bundles for related 'plomberie' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| diameter_mm | Yes | Pipe internal diameter in millimeters | |
| flow_rate_lpm | Yes | Flow rate in liters per minute | |
| pipe_length_m | Yes | Pipe length in meters | |
| fittings_count | No | Number of fittings and elbows (each adds ~0.5m equivalent length) | |
| elevation_change_m | No | Elevation change in meters (positive = uphill, negative = downhill) |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses that the calculation includes friction and elevation, and mentions the return value. However, it does not address edge cases, input validation, or that this is a read-only calculation (no annotations exist to cover this). The transparency is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with two sentences. The first sentence conveys the core purpose and return value. The second provides a helpful cross-reference. There is no extraneous information, making it efficient for an AI agent to parse.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the absence of an output schema, the description should detail the complete return structure. It only mentions '{equiv_length_m}', which is vague. For a tool with five parameters, the description does not specify the output format or any additional return fields, making it incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with each parameter having a clear description. The tool description adds context (e.g., 'including friction and elevation') but does not provide additional semantic detail beyond what the schema already gives. Therefore, a baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Calculate' and the resource 'water pressure loss in a pipe circuit including friction and elevation'. It distinguishes itself from sibling calculators by specifying the hydraulic context and mentioning elevation, which is not typical for most calculator tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides minimal guidance on when to use this tool. It mentions a cross-reference to 'list_bundles' for related 'plomberie' calculators, but does not explain when this specific tool should be chosen over others, nor does it state prerequisites or limitations.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_wavelength_frequencyAInspect
Solve c=λ·f for EM waves. Provide wavelength or frequency. Also returns photon energy E=hf. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| frequency_hz | No | Frequency in Hz | |
| wavelength_m | No | Wavelength in meters |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description covers the basic behavior: solving for EM waves and returning photon energy. However, it does not specify what happens if both or neither parameter is provided, nor does it mention any side effects or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief and front-loaded with the core function. Every sentence adds value without redundancy, making it efficient and easy to parse.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple input schema and presence of output schema, the description adequately covers the tool's purpose and inputs. It mentions related resources but lacks explicit handling of edge cases like missing inputs.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds context ('for EM waves') and hints at usage (provide one), but does not significantly enhance parameter meaning beyond the schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool solves c=λ·f for EM waves and returns photon energy, using specific verbs and resources. It distinguishes itself from siblings like calculate_wave_properties by specifying 'EM waves' and mentioning photon energy.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description instructs the user to provide either wavelength or frequency, and directs them to list_bundles for related calculators. However, it does not explicitly exclude alternative tools or scenarios where this tool should not be used, though the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_wave_propertiesCInspect
Compute wave frequency, wavelength, or period from any two. Formula: c=λ·f. Use for physics or acoustics. Inputs: any 2 of (frequency Hz, wavelength m, speed m/s). Returns missing values. See list_bundles for related 'science' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| speed_ms | No | Wave speed m/s (343=sound) | |
| frequency_hz | Yes | Frequency in Hz |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Description states it returns missing values, which is a basic behavioral trait. However, with no annotations, it does not disclose error handling, unit handling, or output format. An output schema exists but is not described. The description falls short of full transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but contains inaccurate information regarding available inputs. While concise, the incorrect content makes it less useful and potentially harmful. It would be better to be accurate or omit the false detail.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given only 2 schema parameters, the description should accurately cover inputs. It fails by claiming three possible inputs and omitting that wavelength is not directly acceptable. With an output schema available, return value explanation is not required, but basic accuracy is missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but description introduces a non-existent 'wavelength m' parameter, which misleads the AI agent. It does clarify unit Hz for frequency and default speed (343 for sound). The mismatch reduces semantic value significantly.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description claims it computes from 'any 2 of (frequency Hz, wavelength m, speed m/s)', but the input schema does not include wavelength as a parameter. This mismatch makes the purpose unclear and misleading, as the AI agent cannot input wavelength. The formula is stated correctly, but the parameter list is wrong.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description gives context 'Use for physics or acoustics' and suggests seeing list_bundles for related tools. This implies appropriate use but does not specify when not to use or provide explicit alternatives. No exclusion criteria or comparison with siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_wind_chillBInspect
Calculate the perceived wind chill temperature (Environment Canada formula). Returns: {feels_colder_by_degrees}. See list_bundles for related 'astronomie-nature' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| temperature_c | Yes | Air temperature in degrees C (must be 10C or below) | |
| wind_speed_kmh | Yes | Wind speed in km/h |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must carry full burden. It discloses the formula source and return format, but lacks details on side effects (none expected), accuracy, or limitations. Adequate for a simple calculator.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with the main purpose, then return value and related tools. No extraneous content, efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema and full parameter schema, the description is mostly complete for a simple calculator. It lacks edge-case handling explanation but is sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for both parameters. Description adds no extra parameter-specific information beyond the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool calculates wind chill temperature using Environment Canada formula. It does not explicitly differentiate from sibling tools like calculate_heat_index, but the mention of a related bundle provides some context.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. It only mentions a related bundle, but does not say when this tool is appropriate or when to use other calculators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_wire_gauge_convertAInspect
Convert AWG wire gauge to diameter (mm) and resistance (ohms/m). Use for electrical projects. Inputs: AWG number. Returns diameter, area, resistance, max current. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| awg | Yes | AWG gauge number |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry full behavioral disclosure. It states what is returned but does not disclose assumptions (e.g., material type for resistance), limitations, or whether the tool is read-only. This is insufficient for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise: one sentence plus a reference to list_bundles. It front-loads the purpose and efficient no unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description lists return values (diameter, area, resistance, max current) and units. With an output schema present, this is nearly complete. However, it lacks information about material dependency and is somewhat sparse for a conversion tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a description for the awg parameter. The description repeats that the input is an AWG number but adds no new semantic details beyond the schema. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts AWG wire gauge to diameter (mm) and resistance (ohms/m), with a specific verb and resource. It also mentions use for electrical projects, making the purpose unambiguous and distinct from other tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a general context ('Use for electrical projects') but does not specify when to use this tool over alternatives like calculate_cable_section or other conversion tools. It mentions seeing list_bundles for related calculators, but no explicit exclusions or when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_working_daysAInspect
Count working days Mon-Fri between two dates with optional French public holiday exclusion. Returns: {working_days, public_holidays_excluded}. See list_bundles for related 'temps-rh' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| country | No | Country for public holidays (FR supported) | FR |
| end_date | Yes | YYYY-MM-DD — End date | |
| start_date | Yes | YYYY-MM-DD — Start date |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses that the tool counts weekdays (Mon-Fri) and can exclude French public holidays, and returns a specific structure. It does not address edge cases like start_date > end_date or timezone handling, but covers the primary behavior adequately.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no filler. It front-loads the primary purpose and includes a reference to related tools. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema (context signal) and the schema descriptions, the description reasonably covers the tool's behavior. It mentions the return format and optional holiday exclusion. Minor details like inclusive/exclusive interpretation are omitted, but overall it provides sufficient context for an AI agent to use the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description adds value by explaining the tool's purpose (working days, holiday exclusion) which gives context to the parameters, especially 'country' and the date range. It clarifies that the date range defines the period over which working days are counted.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool counts working days Mon-Fri between two dates, optionally excluding French public holidays. It also mentions the return format. However, it does not explicitly differentiate from sibling tools like 'calculate_days_between' or 'calculate_leave_days', which count days or leave days respectively.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions optional French public holiday exclusion and references 'list_bundles' for related calculators, providing some context for when to use this tool. However, it lacks explicit guidance on when not to use it or specific scenarios (e.g., inclusive/exclusive dates, handling of OTHER country).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
calculate_z_scoreAInspect
Compute z-score (standardized score) and percentile from a value, mean, and standard deviation. Use for statistics and outlier detection. Returns z, percentile, p-value. See list_bundles for related 'math' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| mean | Yes | Population mean | |
| value | Yes | Observed value | |
| std_dev | Yes | Standard deviation |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must fully disclose behavior. It states the tool returns z, percentile, and p-value, but does not mention edge cases (e.g., zero standard deviation), side effects, or authorization needs. For a computational tool, this is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the core purpose, and includes a pointer to related tools. Every word adds value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple mathematical tool with three parameters and an output schema, the description covers the computation, use case, and return fields. It is sufficient for an agent to invoke correctly, although it could mention potential pitfalls like zero standard deviation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema already describes all three parameters (value, mean, std_dev) with 100% coverage. The description adds little beyond restating 'from a value, mean, and standard deviation,' so the baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes z-score and percentile from a value, mean, and standard deviation, and specifies use for statistics and outlier detection. This distinguishes it from sibling tools, many of which are domain-specific calculators.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Use for statistics and outlier detection,' providing clear context for when to use the tool. It also directs users to list_bundles for related math calculators, but does not explicitly state when not to use it or compare directly with specific siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_angleAInspect
Convert angle between degrees, radians, gradians, turns, arcminutes, arcseconds. Use for math, navigation, surveying. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Angle value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. The description claims support for arcminutes and arcseconds, but the input schema enum only includes degree, radian, gradian, turn. This contradiction reduces transparency. The return description 'Returns: {input}' is vague and does not clarify the output format.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence, concise and front-loaded with purpose. However, it could be structured better to separate supported units and usage.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description fails to align with the schema (missing arcminutes/arcseconds), making it confusing. It does not explain return value or provide enough context for correct usage. Output schema is available but not described.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so description adds limited value beyond the schema. It redundantly mentions parameter names but provides context on usage domains. Does not add new semantic details.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts angle units, listing specific units and use cases (math, navigation, surveying). It distinguishes from sibling conversion tools by focusing on angles.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives context for usage ('Use for math, navigation, surveying') and references list_bundles for related calculators, but does not explicitly state when not to use or compare with alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_areaAInspect
Convert area between m², km², ha, are, acre, ft², yd², cm². Use for real estate, agriculture, or construction. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Area value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description should fully disclose behavioral traits. It only states 'Returns: {input}', which is vague and does not confirm the operation is read-only or explain any side effects. For a simple conversion tool, the description could mention that no data is mutated or that it is a pure function. The lack of detail reduces transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three short sentences, all essential. The action verb is front-loaded. No filler words. Every sentence adds value: the units, use cases, and a pointer to related tools.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 basic parameters, no nested objects, output schema likely clarifies return type), the description is adequately complete. It covers what units are supported, when to use it, and hints at related tools. The omission of return format details is acceptable as output schema exists.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter (value, from, to). The description repeats 'Inputs: value, from, to' without adding new meaning beyond the schema. It does not provide formatting or usage rules for the parameters. Baseline 3 is appropriate since schema already does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Convert area between m², km², ha, are, acre, ft², yd², cm²', specifying the exact verb and resource. It distinguishes itself from sibling convert tools (e.g., convert_angle, convert_temperature) by focusing solely on area units, and the use cases (real estate, agriculture, construction) provide additional clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Use cases for real estate, agriculture, or construction are explicitly mentioned, providing clear context. The reference to list_bundles for related calculators suggests alternatives. However, there is no explicit exclusion for when not to use this tool versus other convert tools, but the sibling names make the domain distinction clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_cookingBInspect
Convert cooking measurements between ml, L, g, kg, cup, tbsp, tsp, fl_oz, oz. Use for recipe scaling and translating. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Quantity |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided. The description adds minimal behavioral context: returns format '{input}' but does not disclose precision, error handling, or permissions. The tool is simple, so a score of 3 is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short but includes an awkward 'Returns: {input}' that is not informative. It could be restructured for clarity. Every sentence earns its place but the structure feels rushed.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with 3 required params and enums. The description covers units and use case. Output schema exists (not shown) so return value explanation is minimized. Still, could mention conversion precision or edge cases.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all parameters (value, from, to). The description redundantly lists 'Inputs: value, from, to' but adds no new meaning beyond the schema. Baseline of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts cooking measurements and lists specific units. It mentions the use case for recipe scaling. However, it does not differentiate from similar sibling tools like calculate_cooking_conversion or convert_volume/weight.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a brief context 'Use for recipe scaling and translating' and directs to list_bundles for related calculators. No explicit when-not-to-use or alternative tools are mentioned; guidance is implied.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_data_storageBInspect
Convert between digital storage units (binary and decimal). Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Storage value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must cover behavioral traits. It only states 'Returns: {input},' which is ambiguous (does not confirm the converted result). No disclosure of side effects, permissions, or idempotency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Very short but includes a confusing placeholder 'Returns: {input}' that undermines clarity. Information is front-loaded but could be better structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With an output schema present, full return details are not required. Description points to list_bundles for related tools, aiding discoverability. However, lacks guidance on typical use cases or constraints.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with basic descriptions for each parameter. Description adds value by mentioning 'binary and decimal' to clarify the units, but this is minimal. Baseline 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Convert between digital storage units (binary and decimal).' This specifies the exact resource and distinguishes it from sibling conversion tools like convert_angle or convert_area. The binary/decimal nuance adds precision.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. Only mentions 'See list_bundles for related conversions calculators,' which is vague. Does not specify scenarios, prerequisites, or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_distanceAInspect
Convert distance between metric (m, km, cm, mm) and imperial (in, ft, yd, mi) plus nautical miles. Use for travel, sport, or engineering. Inputs: value, from, to. Returns: {input, factor}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Distance value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses the return format ({input, factor}) and implies it's a pure conversion without side effects. The description adds value by specifying the output structure beyond the schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences front-load the key information: what it does, use cases, inputs, output format, and sibling reference. No filler or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple conversion tool with full schema and output structure given, the description covers all needed context: unit systems, inputs, output format, use cases, and sibling guidance. It is complete for effective agent usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and provides enums. The description lists inputs and mentions unit systems, but adds minimal extra meaning beyond what the schema already provides. Baseline of 3 is appropriate as schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts distance between metric, imperial, and nautical miles, and distinguishes it from other conversion tools by mentioning specific unit systems and use cases. It also references list_bundles for related calculators, showing awareness of siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases (travel, sport, engineering) and lists inputs. It doesn't explicitly state when not to use, but the context is clear. The reference to list_bundles guides users to alternatives among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_energyCInspect
Convert energy between J, kJ, cal, kcal, kWh, BTU, eV, ft-lb. Use for nutrition, electricity, science. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Energy value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. Description merely says 'Inputs: value, from, to. Returns: {input}' without detailing behavior like validation, precision, or side effects. For an unannotated tool, this lacks necessary transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences are reasonably compact, but the first sentence includes an inaccurate unit (ft-lb), and '{input}' is imprecise. Not as tight as possible.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple conversion tool with 3 parameters, the description covers use cases and related tools. However, the ft-lb discrepancy and lack of output specification (despite output schema existing) leave gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so description adds little. But mention of ft-lb (not in schema) misleads about allowed values. The phrase 'Returns: {input}' is vague and doesn't clarify output format.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Convert energy between J, kJ, cal, kcal, kWh, BTU, eV, ft-lb' identifying the specific verb and resource, and the list of units distinguishes it from sibling conversion tools. However, ft-lb is not in the schema enum, causing a slight mismatch.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
'Use for nutrition, electricity, science' provides context, but no explicit when-to-use vs alternatives. Reference to list_bundles helps but doesn't directly guide agent away from sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_fuel_consumptionBInspect
Convert fuel consumption between L/100km, mpg-US, mpg-UK, km/L. Use for car comparison across regions. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Consumption value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It only says 'Returns: {input}', which is vague and does not explain the actual output structure or any side effects. The output schema exists but is not leveraged in the description. This is insufficient for an agent to understand the tool's behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short (3 sentences) and front-loads the purpose. It is efficient, though it could be slightly more structured. Every sentence contributes value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple operation (conversion) and presence of an output schema, the description is somewhat complete. However, the lack of behavioral transparency and detailed usage guidance leaves gaps for an AI agent. Adequate but not thorough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents each parameter. The description adds no new meaning beyond listing the inputs. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts fuel consumption between specific units (L/100km, mpg-US, mpg-UK, km/L) and provides use context (car comparison across regions). However, it does not explicitly differentiate from the sibling tool 'calculate_fuel_economy_conversion', which may cause ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests using it for car comparison across regions, but does not provide explicit when-not-to-use guidance or compare to siblings like 'calculate_fuel_consumption'. The reference to 'list_bundles' is helpful but does not substitute for direct exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_pressureAInspect
Convert pressure between Pa, kPa, MPa, bar, psi, atm, mmHg, mbar, torr. Use for engineering, tires, weather. Inputs: value, from-unit, to-unit. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Pressure value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It states 'Convert pressure' and 'Returns: {input}', but 'Returns: {input}' is ambiguous (likely a placeholder) and the description does not clarify if the operation is read-only or has any side effects. For a simple conversion tool, this is adequate but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded with the main action. Each sentence serves a purpose: listing units, use cases, parameters, return, and related tools. However, the return statement 'Returns: {input}' is unclear and detracts from conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with 3 parameters, the schema covers all descriptors, and an output schema exists. The description provides use cases and a reference to related tools. It does not fully explain the return format, but the output schema likely handles that, making the description fairly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with each property described (e.g., 'Source unit', 'Target unit'). The description lists 'value, from-unit, to-unit' and enumerates the units, but this adds minimal value beyond the schema. Baseline is 3, and the description does not significantly enhance understanding of the parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts pressure between specific units (Pa, kPa, MPa, bar, psi, atm, mmHg, mbar, torr) and lists use cases (engineering, tires, weather), making it distinct from sibling conversion tools like convert_angle or convert_temperature.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases ('Use for engineering, tires, weather') and references 'list_bundles for related conversions calculators', giving context for when to use this tool. However, it does not explicitly state when not to use it or compare with similar tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_shoe_sizeCInspect
Convert shoe sizes between EU, US (men/women), and UK systems. Use for online shopping. Inputs: size, from, to, gender. Returns converted size. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target system | |
| from | Yes | Source system | |
| value | Yes | Shoe size |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It lacks disclosure of conversion accuracy, handling of half sizes, valid ranges, or the need for gender (which is not in the schema). The mismatch between described inputs (gender) and actual parameters undermines trust.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is short and front-loaded but contains inaccurate information (gender input). Every sentence should be correct, so this fails to be concise without error.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
An output schema exists, but the description does not describe the output format beyond 'Returns converted size'. Sibling tools are not differentiated, and missing details about ranges or assumptions make the description incomplete for a conversion tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers 100% of parameters with descriptions. The description adds no extra meaning and confuses by renaming 'value' to 'size' and including 'gender' as an input. This actively misleads the agent about the actual parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts shoe sizes between EU, US (men/women), and UK systems and gives a use case (online shopping). However, it mentions a 'gender' input that does not exist in the schema, and it does not distinguish from siblings like calculate_shoe_size or calculate_shoe_size_convert.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Only a single use case ('Use for online shopping') is given. No guidance on when not to use, prerequisites, or alternatives among the many sibling conversion tools. The mention of list_bundles is vague.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_speedBInspect
Convert speed between km/h, mph, m/s, knots, ft/s. Use for travel, sport, or maritime/aviation. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Speed value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description must disclose behavioral traits itself. It only repeats schema info ('Inputs: value, from, to. Returns: {input}') and lacks details on precision, rounding, error handling, or side effects. This is insufficient for a tool with no annotation support.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (three sentences) but includes slightly redundant phrasing ('Inputs: value, from, to. Returns: {input}') that is already evident from the schema. Front-loading is good with the main purpose stated first.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity (3 parameters, output schema present), the description covers basic purpose and use cases but lacks detail on output format or potential edge cases. The output schema mitigates some gaps, but the description could be more informative.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage with descriptions for all parameters (value, from, to) and enums for units. The description adds no additional meaning beyond listing parameter names, so it meets but does not exceed the baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states 'Convert speed between km/h, mph, m/s, knots, ft/s' and provides specific use cases (travel, sport, maritime/aviation). This clearly defines the tool's purpose and distinguishes it from sibling conversion tools (e.g., convert_distance, convert_temperature) by domain and units.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions relevant contexts ('travel, sport, or maritime/aviation') and references list_bundles for related calculators, but does not explicitly state when to use this tool versus alternatives or when not to use it. The guidance is implied rather than directive.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_temperatureAInspect
Convert temperature between Celsius, Fahrenheit, and Kelvin. Use for cooking, weather, science. Inputs: value, from, to. Returns converted temperature. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Temperature value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavior. It mentions inputs (value, from, to) and output (converted temperature), but does not go beyond the schema. It omits details like rounding, validation, or edge cases, which are not critical for a simple converter.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (two sentences) with the core purpose front-loaded. Every sentence adds value, though the second sentence is slightly crowded with multiple clauses.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and full schema coverage, the description adequately covers purpose and usage. The output schema (exists but not described) reduces the need for return value details. Could mention that conversions are exact or include rounding behavior.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents all parameters. The description merely repeats 'Inputs: value, from, to' without adding extra meaning or clarifications beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Convert' and resource 'temperature' with specific units (Celsius, Fahrenheit, Kelvin). It distinguishes from sibling conversion tools (e.g., convert_angle, convert_area) by naming three exact units and use cases (cooking, weather, science).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context on when to use (cooking, weather, science) and mentions 'list_bundles' for related calculators, offering a hint for alternative tools. However, it lacks explicit when-not or direct sibling differentiation.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_timeAInspect
Convert time between seconds, minutes, hours, days, weeks, months, years. Use for project planning or unit homogenization. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Time value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It only mentions 'Returns: {input}' which is vague and does not disclose typical behavioral traits like idempotency, side effects, or rate limits. Minimal behavioral insight beyond functional purpose.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise with two sentences. It front-loads the core purpose and provides context efficiently without extraneous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity, the description covers purpose, use cases, and related tools. Output schema exists, so lack of output detail is acceptable. The mention of list_bundles is helpful. Slightly incomplete due to vague return statement, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for each parameter, so baseline is 3. The description only lists parameter names (value, from, to) without adding new meaning or format details beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts time between specific units (seconds to years) and provides use cases (project planning, unit homogenization), distinguishing it from sibling conversion tools like convert_angle or convert_area.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states use cases (project planning, unit homogenization) and directs users to list_bundles for related calculators, providing good context. It does not include when not to use it, but the purpose is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_volumeAInspect
Convert volume between L, mL, cL, fl_oz, cup, tbsp, tsp, gal_us, gal_uk. Use for cooking and science. Inputs: value, from, to. Returns: {input}. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Volume value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears full burden. It states 'Returns: {input}', which is ambiguous and uninformative—it does not clarify the return format or whether the tool is safe/idempotent. This omission reduces transparency for an agent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is compact (two sentences plus a reference), with key information front-loaded. The 'Returns: {input}' line is vague and somewhat wasteful, but overall the structure is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity and the presence of an output schema (not shown but confirmed), the description covers purpose and usage context. However, the vague return statement and lack of clarification on output format leave a minor gap.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with each parameter having a description. The description lists parameters ('Inputs: value, from, to') and enumerates units, but adds minimal meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts volume, lists all supported units (L, mL, cL, fl_oz, cup, tbsp, tsp, gal_us, gal_uk), and specifies use cases (cooking and science). This provides a specific verb+resource distinction from sibling conversion tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use the tool ('Use for cooking and science') and directs to list_bundles for related conversion calculators, providing context. However, it does not explicitly mention when not to use it or offer alternative tools (e.g., calculate_volume).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
convert_weightBInspect
Convert weight/mass between kg, g, mg, lb, oz, st, tonne. Use for cooking, shipping, or fitness. Inputs: value, from, to. Returns converted mass. See list_bundles for related 'conversions' calculators.
| Name | Required | Description | Default |
|---|---|---|---|
| to | Yes | Target unit | |
| from | Yes | Source unit | |
| value | Yes | Weight value |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden for behavioral disclosure. It only states 'Returns converted mass' without mentioning any limitations, precision, or side effects. For a simple conversion tool, this is minimal but acceptable.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose and units, followed by use cases and a reference to related tools. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple conversion tool with low parameter count and high schema coverage, the description covers purpose, use cases, and provides a pointer to related tools. It does not explain return format or error handling, but overall it is sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% parameter description coverage, so the description adds little beyond listing the parameter names. No additional semantics are provided beyond what the schema already offers.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool converts weight/mass between specified units and lists common use cases. It does not explicitly differentiate from other convert_* tools, but the units listed are specific to weight, making the purpose clear.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions three use cases (cooking, shipping, fitness) and points to list_bundles for related calculators, but does not provide exclusions or tell when to use alternatives like other convert tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_bundle_toolsAInspect
Get the list of tools in a specific bundle. Returns tool names and descriptions for the requested domain bundle.
| Name | Required | Description | Default |
|---|---|---|---|
| bundle_id | Yes | Bundle ID from list_bundles |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose behavioral traits such as read-only nature, authentication, or error handling beyond the basic purpose. The description carries the full burden but offers no extra behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with the verb, no redundant information. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with an output schema, the description is minimally adequate but lacks context about how to obtain the bundle_id (e.g., from list_bundles) and what the returned descriptions entail.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and the schema itself documents the parameter with an enum. The description adds no additional meaning beyond referencing a 'specific bundle'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action 'Get the list of tools', the resource 'in a specific bundle', and what is returned ('tool names and descriptions'). It distinguishes from sibling 'list_bundles' which lists bundles themselves.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Usage context is implied (use after selecting a bundle from list_bundles) but not explicitly stated. No guidance on when not to use this tool or alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
list_bundlesAInspect
List all available tool bundles (grouped by domain). Use this to discover which tools are available for a specific topic instead of browsing all 500+ tools.
| Name | Required | Description | Default |
|---|---|---|---|
| _unused | No | No parameters needed |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | No | Computed result. Object whose fields depend on the tool (e.g. {tax, marginal_rate, brackets} for tax tools, {volume_l, gallons} for volume tools). |
| source | No | Authoritative source for the rule or formula (e.g. "Article 197 CGI", "NF DTU 21"). |
| formula | No | Human-readable formula or method used (e.g. "I=P·r·t", "Magnus formula"). |
| reference_url | No | Link to a calcul2 page documenting the calculation in detail. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It describes a simple read-only listing behavior, but does not disclose any further traits (e.g., caching, rate limits). Adequate given tool simplicity.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first states action, second provides use case. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple listing tool with an output schema (not shown), the description sufficiently explains what it returns (bundles grouped by domain) and why to use it.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 100% description coverage with a single placeholder parameter. The description adds no new parameter info beyond the schema, meeting the baseline for high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action: 'List all available tool bundles' and indicates grouping by domain. It distinguishes from sibling tools like get_bundle_tools by emphasizing discovery over browsing all tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly advises using this tool for discovering tools by topic instead of browsing all 500+ tools. No explicit when-not-to-use, but the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!