Skip to main content
Glama

Server Details

Official MCP server for HireSquire. Automate resume screening, candidate ranking, and interview scheduling for autonomous agents.

Status
Healthy
Last Tested
Transport
Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client
Glama
MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.
Tool DescriptionsA

Average 3.9/5 across 24 of 24 tools scored. Lowest: 2.9/5.

Server CoherenceA
Disambiguation5/5

Every tool has a distinct purpose and scope. From bulk vs individual email generation to different stages of screening (create, status, results, cost estimate, cancel), there is no ambiguity. Even related tools like enable/disable_auto_reload are clearly differentiated.

Naming Consistency5/5

All tool names follow a consistent verb_noun pattern in snake_case (e.g., create_screening, get_results, enable_auto_reload). No mixing of conventions, making it easy to predict tool names for missing functionality.

Tool Count4/5

24 tools is slightly above the ideal range but justified by the broad domain (screening, scheduling, billing, system management). Each tool addresses a specific need without feeling bloated.

Completeness4/5

The tool surface covers the main workflows: onboarding, screening, scheduling, interviewing, billing, and outcome reporting. Minor gaps exist (e.g., no candidate update or search), but core operations are well represented.

Available Tools

24 tools
bulk_generate_emailsAInspect

Generate multiple personalized candidate emails in a single call. Use this for high-volume outreach batches (max 50 candidates) after a large screening job.

ParametersJSON Schema
NameRequiredDescriptionDefault
toneNoCommon tone for the batch.
typeYesCommon email type for the batch.
job_idYesThe job ID.
candidate_idsYesArray of candidate IDs to process (max 50).
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It mentions a max of 50 candidates and personalization, but it does not disclose whether the emails are actually sent or just created as drafts, what happens on failure, authentication requirements, or any rate limits. For a batch generation tool, these behavioral traits are important but missing.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences with no superfluous words. The key information is front-loaded. Every sentence adds value: first states what it does, second gives usage context and a batch limit.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description should explain return values or side effects, but it does not. It provides a batch limit and usage context, but lacks details on whether emails are sent immediately, how errors are handled, or what the tool returns. This is adequate for a simple tool but incomplete for a batch operation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds 'after a large screening job' for context on job_id and implies personalization (though not reflected in schema). The note about 'common tone/type' is already in the schema descriptions. The description adds marginal value beyond the schema, so a score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'generate', the resource 'multiple personalized candidate emails', and the scope 'in a single call'. It also mentions 'high-volume outreach batches (max 50 candidates) after a large screening job', which distinguishes it from the sibling 'generate_email' tool. Purpose is specific, actionable, and well-differentiated.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Use this for high-volume outreach batches (max 50 candidates) after a large screening job', providing clear context for when to use the tool. However, it does not explicitly state when not to use it or mention alternatives, though the context strongly implies it is for batches vs. single emails.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

cancel_screening_jobAInspect

Immediately stop a running screening job. Use this if the user made a mistake or the job is no longer needed. Note: Credits for already-processed candidates are not refunded.

ParametersJSON Schema
NameRequiredDescriptionDefault
job_idYesThe ID of the job to cancel.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses important behavioral detail: credits for processed candidates are not refunded. No annotations exist, and description conveys mutation and irreversible aspect well, though could mention immediate effect more explicitly.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences front-load action and usage, with no wasted words. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Tool is simple (single required param, no output schema), and description fully covers purpose, usage context, and key behavioral side effect. No gaps remain.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema provides full coverage (100%) with parameter 'job_id' described as 'The ID of the job to cancel.' Description adds no additional meaning beyond schema, so baseline score applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states action 'Immediately stop a running screening job' with specific verb and resource, and distinguishes from siblings like create_screening or get_screening_status.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use ('if user made a mistake or job no longer needed'), providing clear context. Does not explicitly mention when not to use or alternatives, but context is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

compare_candidatesAInspect

Perform a side-by-side AI comparison of 2-10 candidates. Use this when you need a clear "Delta Analysis" to decide between top-tier applicants for a final interview.

ParametersJSON Schema
NameRequiredDescriptionDefault
job_idYesThe job ID.
candidate_idsYesList of 2-10 candidate IDs to compare.
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It mentions 'AI comparison' but does not disclose behavioral traits such as whether the operation is read-only, whether it modifies data, required permissions, or any side effects. This lack of transparency is a significant gap.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the purpose, and no redundant information. Every word earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The tool has no output schema, but the description does not explain what the comparison result looks like (e.g., text, table). While the tool is simple, the lack of return format information is a notable gap. Adequate but not complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so the schema already documents both parameters. The description adds context about the purpose (Delta Analysis, final interview) but does not add additional meaning to the parameters beyond what the schema provides. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (perform side-by-side AI comparison), the resource (candidates), and the scope (2-10, for final interview decision). It is a specific verb+resource combination that distinguishes it from sibling tools like generate_email or create_screening.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Use this when you need a clear Delta Analysis to decide between top-tier applicants for a final interview,' providing clear context for when to use. However, it does not offer alternatives or state when not to use this tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

create_calendar_connectionAInspect

Securely connect a scheduling provider to HireSquire using an API key. Use this during initial user onboarding or when adding a new interview tool.

ParametersJSON Schema
NameRequiredDescriptionDefault
api_keyYesValid API key/token from the provider.
providerYesThe scheduling tool provider.
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description must carry burden. Mentions 'securely connect using API key' but lacks details on consequences, error handling, or side effects.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences with no waste: first defines purpose, second gives usage context. Efficient and front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple 2-param tool with no output schema, the description adequately covers purpose and usage context. Could include mention of authentication or return value but not necessary.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description adds minimal extra meaning beyond the schema descriptions for 'api_key' and 'provider'.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses specific verb 'connect' and resource 'scheduling provider to HireSquire', clearly distinguishing from sibling 'list_calendar_connections'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use ('during initial user onboarding or when adding a new interview tool'), provides clear context, but does not list alternatives or when not to use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

create_checkout_sessionAInspect

Generate a Stripe Checkout URL for manual credit purchase. Use this to provide the human user with a secure payment link in the chat.

ParametersJSON Schema
NameRequiredDescriptionDefault
packNoThe desired credit pack.
amountNoCustom amount (min $5.00).
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description bears full responsibility. It describes generating a URL but omits any side effects (e.g., creating a Stripe session), idempotency, expiration, or error handling. The behavioral impact is understated.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences front-load the purpose and usage. Every word adds value; no redundant or irrelevant content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers the core action but lacks details about the return value (e.g., URL format), error conditions, or that the URL is one-time use. Given no output schema, some completeness is missing, but it's acceptable for a simple URL generator.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and both parameters already have clear descriptions (enum for pack, min amount for amount). The tool description adds no additional param info beyond the schema, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the verb 'generate' and the resource 'Stripe Checkout URL' for manual credit purchase. The phrase 'provide the human user with a secure payment link in the chat' differentiates it from sibling tools like bulk_generate_emails or purchase_credits.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'Use this to provide the human user with a secure payment link in the chat,' giving clear context for when to invoke it. Does not mention alternatives or when not to use, but the context is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

create_interviewAInspect

Confirm an interview time, create a calendar event, and notify the candidate. Use this as the final step in the scheduling process once a time is selected.

ParametersJSON Schema
NameRequiredDescriptionDefault
job_idYesThe job ID.
providerNoScheduling provider.
candidate_idYesThe candidate ID.
scheduled_atYesISO 8601 timestamp (e.g., "2024-05-20T14:30:00Z").
duration_minutesNoDuration in minutes.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations exist, so the description fully carries the burden. It discloses that the tool creates a calendar event and notifies the candidate—both important side effects. Could mention provider requirements or idempotency, but overall transparency is good.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, entirely front-loaded with the tool's purpose and usage context. Every word earns its place; no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description adequately covers the tool's purpose and major effects for a multi-step scheduling process. Lacks return value information (no output schema) and error handling hints, but given the clarity and parameter coverage, it is mostly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the description adds no new parameter-level meaning beyond the schema's own descriptions. The description's mention of 'final step' contextualizes 'scheduled_at' but doesn't enhance parameter understanding.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses specific verbs ('confirm', 'create', 'notify') and resources ('interview time', 'calendar event', 'candidate'), clearly distinguishing this as the final scheduling step from siblings like 'create_calendar_connection' or 'get_available_slots'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states 'Use this as the final step in the scheduling process once a time is selected', providing clear context for when to invoke. Does not list alternatives or exclusions, but the context is strong enough for an agent to decide.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

create_screeningAInspect

Submit candidate resumes for AI screening against a specific job description. Use this as the entry point for candidate evaluation. It parses resumes, scores them (1-100), and provides reasoning. Side effect: Debits credits from the user account based on the number of resumes.

ParametersJSON Schema
NameRequiredDescriptionDefault
titleYesThe official job title for the position.
resumesYesA list of resumes to screen. Each item must include a filename and the content (raw text, base64, or public URL).
descriptionYesThe complete job description including requirements and responsibilities (minimum 50 characters).
webhook_urlNoHTTPS URL for asynchronous completion notifications. Recommended for large batches.
leniency_levelNoStrictness scale (1-10). 1 is lenient, 10 is strict. Default is 5. Use 8+ for critical senior roles.

Output Schema

ParametersJSON Schema
NameRequiredDescription
job_idNo
statusNo
messageNo
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses a key side effect: 'Debits credits from the user account based on the number of resumes.' It also explains the scoring and reasoning behavior, adding value beyond the schema.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences, no wasted words. The purpose is front-loaded, and every sentence contributes to understanding the tool's function and side effects.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description explains the tool's role, outcome, and side effect. It mentions the webhook for async, but does not explicitly state that results are stored or how to retrieve them later. Given the output schema exists, this is not a major gap.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description does not add detailed parameter semantics beyond what the schema already provides, but it gives context about the overall process (parsing, scoring) that indirectly relates to parameters.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (submit resumes), resource (AI screening against a job description), and results (parses, scores 1-100, provides reasoning). It positions itself as the entry point for candidate evaluation, distinguishing it from sibling tools like compare_candidates or get_results.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Use this as the entry point for candidate evaluation,' which guides when to use it. However, it does not mention when not to use or provide alternatives, though the context of being the first step is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

disable_auto_reloadAInspect

Turn off automatic credit reloading. Use this to revert to manual credit management.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations, so description must disclose behavioral traits. Only states the action; no mention of side effects, permissions, or state changes if already disabled.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no filler. Every word is necessary and direct.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Simple tool with no parameters or output schema. Description is sufficient for basic understanding, though could mention no parameters required.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters exist, so description adds no parameter info. Schema coverage 100% means no gap. Baseline of 4 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states action ('turn off') and resource ('automatic credit reloading'). Distinguishes from sibling 'enable_auto_reload'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

States use case: 'revert to manual credit management'. Does not provide explicit when-not-to-use or alternatives, but sibling tool offers clear counterpart.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

enable_auto_reloadAInspect

Set up automatic credit purchases when the account balance drops below a specific threshold. Use this to ensure autonomous agents never run out of credits.

ParametersJSON Schema
NameRequiredDescriptionDefault
amountYesAmount to reload each time (min $5.00).
thresholdYesBalance threshold in dollars (e.g., 10.00).
payment_method_idYesSaved Stripe payment method ID.
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses that it sets up automatic purchases but does not detail the exact behavior (e.g., immediate purchase on threshold hit, frequency, whether it replaces existing settings). Without annotations, more behavioral details would be beneficial.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise, consisting of two sentences that efficiently convey the purpose and usage context without unnecessary details.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description adequately covers the tool's function given the 3 required parameters and no output schema. It could be slightly more complete by explaining the behavior after setup, but it is sufficient for typical use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with clear parameter descriptions. The description adds minimal additional semantic value beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: setting up automatic credit purchases when balance drops below a threshold. It distinguishes itself from siblings like 'disable_auto_reload' and 'purchase_credits' by specifying the automation aspect.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It explicitly advises when to use the tool ('ensure autonomous agents never run out of credits'), providing clear context. However, it lacks explicit guidance on when not to use it or comparisons to alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

estimate_screening_costAInspect

Calculate the exact credit cost for a planned screening job based on candidate count. Use this as a pre-flight check before calling "create_screening".

ParametersJSON Schema
NameRequiredDescriptionDefault
candidate_countYesNumber of resumes you intend to screen.
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so the description carries the burden. It implies a read-only, non-destructive operation (calculating cost) but does not disclose any additional behavioral traits like rate limits, authentication, or side effects. Adequate but not enriched beyond the obvious.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no fluff, front-loaded with the core action. Every sentence is necessary and the structure is efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple single-parameter tool with no output schema, the description adequately covers the purpose, input, and usage context. It implies the return value is a credit cost number. Minor gap: no explicit mention of output format, but not critical.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with parameter description 'Number of resumes you intend to screen.' The description adds 'based on candidate count' which aligns but does not add new meaning beyond the schema. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool calculates exact credit cost for a screening job based on candidate count, with a specific verb and resource. It also distinguishes the tool from its sibling 'create_screening' by calling it a pre-flight check.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly advises using the tool as a pre-flight check before 'create_screening', which clarifies when to use it. No explicit alternatives or when-not-to-use, but the sibling list provides context and the guidance is sufficient for the intended use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

generate_emailAInspect

Generate a personalized outreach or follow-up email for a specific candidate. Use this after identifying a strong candidate from "get_results" to move them through the pipeline.

ParametersJSON Schema
NameRequiredDescriptionDefault
toneNoThe desired tone of the generated email.professional
typeYesType of email: "invite" for interviews, "rejection" for non-matches, etc.
job_idYesThe job ID.
candidate_idYesThe specific candidate ID from the results.
custom_messageNoAdditional context or instructions to include in the AI-generated email.

Output Schema

ParametersJSON Schema
NameRequiredDescription
bodyNo
subjectNo
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description must fully disclose behavior. It states the tool 'generates' an email but does not clarify side effects (e.g., whether it saves or sends the email, or if any data is modified). Lacks details on prerequisites, rate limits, or permissions. For a generation tool, more transparency is needed.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single, well-structured sentence. Front-loaded with the primary action, concise, and no extraneous words. Earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Has an output schema (not shown) so return values are covered elsewhere. The description covers the core purpose and usage context, but lacks detail on the generation process and any constraints. Adequate but not rich.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers all parameters with descriptions, enums, and defaults. The description does not add extra meaning beyond what the schema provides. Baseline of 3 is appropriate since the schema already gives adequate semantics.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Purpose is clearly stated: generate a personalized outreach or follow-up email for a specific candidate. It explicitly distinguishes from sibling 'bulk_generate_emails' by focusing on a single candidate and references 'get_results' for context. The action and resource are specific.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Describes when to use: after identifying a strong candidate from 'get_results' to move them through the pipeline. Provides clear context but does not explicitly exclude other scenarios or contrast with siblings like 'compare_candidates' or 'create_interview'. Still, it offers useful guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_api_schemaAInspect

Fetch the complete OpenAPI 3.0 specification for the HireSquire ecosystem. Use this when you need detailed documentation on REST endpoints or data models.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the burden. It states it fetches a complete specification but doesn't disclose authorization needs, rate limits, or cache behavior. For a read-only tool, this is adequate but could be more detailed.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded, and every word adds value. No unnecessary information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no parameters, no output schema, and no annotations, the description is fairly complete. It could mention that the specification can be used to understand API capabilities, but overall sufficient.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

There are zero parameters, so schema coverage is 100%. The description doesn't need to add parameter information; a baseline of 4 is appropriate as it's clear without it.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool fetches the complete OpenAPI 3.0 specification for the HireSquire ecosystem. It distinguishes from sibling tools, as none are for fetching documentation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Use this when you need detailed documentation on REST endpoints or data models.' It provides clear context but doesn't mention when not to use it, which is acceptable given its specific nature.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_available_slotsAInspect

Query a connected calendar for open interview windows on a specific date. Use this to present valid time options to a candidate before scheduling.

ParametersJSON Schema
NameRequiredDescriptionDefault
dateYesTarget date in YYYY-MM-DD format.
durationNoInterview length in minutes.
providerYesThe connected provider to query.
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It only states that the tool queries a calendar, but does not disclose the return format (e.g., list of time windows), authentication requirements, rate limits, or any side effects. The description lacks behavioral details beyond the basic query nature.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise with two sentences that front-load the purpose and usage. Every word serves a purpose with no redundancy, making it easy to quickly grasp the tool's function.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite good parameter documentation, the description lacks critical context about the tool's output. Since there is no output schema, the description should at least specify that it returns available time slots. This omission reduces completeness, especially for a query tool where the return format is essential.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so the schema already documents all parameters and their descriptions. The tool description adds no additional meaning or context beyond the schema, providing no extra value for parameter understanding.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Query' and resource 'connected calendar' with specific context 'open interview windows on a specific date'. It distinguishes from sibling tools like 'generate_meeting_link' or 'list_calendar_connections' by focusing on slot availability.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage context: 'Use this to present valid time options to a candidate before scheduling.' This clearly indicates when to use the tool, though it does not mention when not to use it or provide alternative tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_credit_balanceAInspect

Check the current credit balance and spending metrics for the account. Use this before starting large jobs to prevent "Insufficient Credits" errors.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses the read-only nature and specifies what is checked (balance and spending metrics). However, it does not detail auth requirements, rate limits, or exact return structure, which would be beneficial.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences. The first states purpose, the second provides usage guidance. No redundancy or wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Without an output schema, the description could specify what 'spending metrics' includes, but it is sufficient for a simple read tool with no parameters. It covers purpose and usage well.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

There are zero parameters, so schema coverage is 100%. The description does not need to add parameter semantics, and the baseline of 4 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Check the current credit balance and spending metrics for the account,' providing a specific verb and resource. It distinguishes from sibling tools like 'get_credit_transactions' and 'purchase_credits' by focusing on the balance and metrics.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly advises 'Use this before starting large jobs to prevent 'Insufficient Credits' errors,' giving clear context for when to invoke the tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_credit_transactionsAInspect

Fetch a history of credit purchases, spends, and adjustments. Use this for auditing or when a user asks about their recent billing history.

ParametersJSON Schema
NameRequiredDescriptionDefault
limitNoNumber of recent records to return.
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided; description only mentions fetching history but lacks details on read-only nature, authentication needs, rate limits, or response structure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences that front-load purpose and usage, with no unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Lacks output schema and does not describe the transaction data format (e.g., fields returned), which is needed for a complete understanding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description covers the only parameter (limit) with a clear description; tool description adds no extra meaning beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the tool fetches history of credit purchases, spends, and adjustments, distinguishing it from siblings like get_credit_balance or purchase_credits.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says to use for auditing or user billing history requests, but does not mention when not to use or suggest alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_resultsAInspect

Retrieve detailed candidate scores, AI reasoning, and interview questions for a completed job. Only call this once "get_screening_status" returns a "completed" status.

ParametersJSON Schema
NameRequiredDescriptionDefault
job_idYesThe unique ID of the completed screening job.
min_scoreNoOptional: Minimum score threshold (0-100) to filter results.
only_top_nNoOptional: Limit results to the top N candidates. Defaults to 20 to protect context windows. Max 100.

Output Schema

ParametersJSON Schema
NameRequiredDescription
has_moreNoTrue if there are more candidates than returned.
candidatesNo
total_candidatesNo
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It indicates the tool is read-only and conditional on job completion, but doesn't disclose other behavioral traits like error handling, rate limits, or authentication needs. It adds the completion prerequisite but lacks depth.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences: first states purpose, second gives usage condition. No unnecessary words, perfectly front-loaded and efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With an output schema present, the description need not explain return values. It covers the prerequisite condition and summary of results. Could mention edge cases like invalid job_id, but overall sufficient for a retrieval tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema provides 100% coverage with descriptions for all three parameters. The description does not add parameter-specific information beyond what the schema already provides, so baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Retrieve' and resource 'detailed candidate scores, AI reasoning, and interview questions for a completed job.' It distinguishes from sibling tool 'get_screening_status' by specifying that this tool is for after completion.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to call: 'Only call this once get_screening_status returns a completed status.' This provides clear usage conditions and references a prerequisite sibling tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_screening_statusAInspect

Check the processing status and progress of a screening job. Use this to poll for completion after calling "create_screening" if a webhook was not provided.

ParametersJSON Schema
NameRequiredDescriptionDefault
job_idYesThe unique ID returned by create_screening.

Output Schema

ParametersJSON Schema
NameRequiredDescription
statusNoCurrent status: processing, completed, or failed.
progressNoPercentage of completion (0-100).
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries the burden. It implies a read-only operation but lacks details on rate limits, authentication, or other behaviors. Adequate but not comprehensive.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences, each earning its place: first states purpose, second gives usage context. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given a simple parameter and output schema, the description fully covers necessary information: what the tool does and when to use it.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with parameter description. Tool description adds context ('after calling create_screening') but does not significantly extend beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the verb 'Check' and the resource 'processing status and progress of a screening job'. Distinguishes from siblings like create_screening and cancel_screening_job.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly instructs to poll for completion after create_screening if no webhook provided. Does not cover when not to use, but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_calendar_connectionsAInspect

List all connected interview scheduling tools (Calendly, Cal.com). Use this to check if a provider is ready before attempting to fetch slots or schedule.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided. Description does not mention any behavioral traits like idempotency, side effects, or permissions. The read-only nature is implied but not stated.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficient sentences: first states purpose, second gives usage context. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema or annotations. Description provides purpose and usage but lacks details about response format or connection status. Adequate but could be more complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has no parameters (0 params, 100% coverage). Baseline for 0 params is 4. Description adds examples of what connections are listed, which is helpful but not parameter-specific.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the tool lists connected interview scheduling tools with specific examples (Calendly, Cal.com). Distinguishes from sibling tools like create_calendar_connection and get_available_slots.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says to use this to check provider readiness before fetching slots or scheduling. Provides clear context, though no explicit when-not-to-use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_credit_packsAInspect

Browse available credit bundles and their current pricing. Use this to help the user choose the best pack for their "purchase_credits" call.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description must cover behavior. It implies read-only operation (browse), which is sufficient, but lacks details like data freshness or whether results are exhaustive.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences with no wasted words. First states purpose, second gives usage directive.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequate for a simple listing tool with no parameters. Could elaborate on what fields each pack contains (e.g., name, price, credits), but not essential.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Zero parameters in schema, so description needs no parameter info. Baseline of 4 is appropriate; no param details needed.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists credit bundles with pricing and directly connects to the purchase_credits tool, distinguishing it from sibling tools like purchase_credits.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says to use this to help choose a pack for the purchase_credits call, providing clear when-to-use guidance. Could mention when not to use, but siblings distinguish well.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

purchase_creditsAInspect

Instantly buy credits using a pre-saved Stripe payment method. CAUTION: This will charge the user's card immediately. Use only when balance is critical.

ParametersJSON Schema
NameRequiredDescriptionDefault
packNoThe credit pack name (check list_credit_packs for prices).
amountNoCustom dollar amount (min $5).
payment_method_idYesStripe PM ID (e.g., "pm_...").
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description includes a caution: 'This will charge the user's card immediately,' which communicates an important behavioral trait. However, with no annotations, it lacks details on failure handling, idempotency, or prerequisites beyond the caution.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise: two sentences that front-load the purpose and immediately follow with a critical caution. No redundant or unnecessary information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite the tool's financial nature and three parameters, the description omits return value details, error scenarios, and prerequisites like payment method validity. More context would be expected for a transaction tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema already describes all three parameters with 100% coverage. The description adds value by warning about the immediate charge but does not enhance understanding of individual parameters beyond what the schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('buy credits'), the resource ('credits'), and the method ('pre-saved Stripe payment method'). It effectively distinguishes from sibling tools like create_checkout_session and get_credit_balance.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides a specific usage guideline: 'Use only when balance is critical.' This gives context for when to invoke the tool, though it does not explicitly mention when not to use it or compare with alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

report_hiring_outcomeCInspect

Submit the final hiring decision (hired/rejected/withdrawn) for a candidate. Use this to close the loop and help refine the specialized AI models for future accuracy.

ParametersJSON Schema
NameRequiredDescriptionDefault
job_idYesThe job ID.
outcomeYesThe final outcome of the candidate's journey.
candidate_idYesThe candidate ID.
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description bears full responsibility for behavioral disclosure. It describes a write operation ('Submit') and mentions a side effect (model refinement), but does not detail required permissions, reversibility, or response behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise—two sentences—with the action front-loaded. Every word adds value without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the lack of output schema, the description omits important details such as response format, success indicators, or error conditions. It also fails to mention any required authorizations or prerequisites, leaving the agent with critical gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

All three parameters are fully described in the input schema (100% coverage), so the description adds no additional semantic value beyond what is already structured. Baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Submit the final hiring decision') and the specific outcome values, making the tool's purpose immediately understandable. It is distinct from sibling tools which focus on other hiring-related actions like generating emails or creating interviews.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides a vague rationale ('close the loop and help refine AI models') but does not specify when to use this tool versus alternatives, nor does it mention any prerequisites or exclusion criteria.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

test_webhookAInspect

Verify that a webhook endpoint is properly reachable and responds correctly. Use this during setup to ensure the user's server is ready for async notifications.

ParametersJSON Schema
NameRequiredDescriptionDefault
urlYesThe HTTPS webhook URL to test.
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description must fully disclose behavior. It only mentions verifying reachability and correctness without details on what constitutes a correct response, timeout settings, or potential side effects.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences with no wasted words. The action is front-loaded and the purpose is immediately clear.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple single-parameter tool with no output schema, the description is adequate but lacks details on what the test results look like (e.g., success/failure indicators), which would help users interpret outcomes.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The parameter 'url' has a clear schema description, and schema coverage is 100%. The description does not add extra meaning beyond the schema, meeting the baseline.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool verifies webhook endpoint reachability and correctness, which matches the name and distinguishes it from sibling tools that handle other tasks like generating emails or screening jobs.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description advises using this during setup to ensure the server is ready for async notifications. This provides clear context and timing, though it could explicitly state when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

whoamiAInspect

Validate the current API token and retrieve user profile, credit balance, and agent key limits. Use this as a "Health Check" at the start of every session.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden. It clearly discloses the tool validates the token and retrieves three types of data, implying no destructive side effects. No contradictory information.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences: first describes functionality, second gives usage guidance. No wasted words, front-loaded with purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no parameters, no output schema, and low complexity, the description fully covers what the tool does and when to use it. Mentions the data retrieved effectively substitutes for an output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters exist, and schema coverage is 100%. Description adds meaning by explaining the tool's purpose, meeting the baseline of 4 for zero-parameter tools.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool validates the API token and retrieves specific data: user profile, credit balance, and agent key limits. It also frames it as a 'Health Check' for session start, distinguishing it from more specific sibling tools like get_credit_balance.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly recommends using as a 'Health Check at the start of every session', providing clear context. It does not explicitly state when not to use or list alternatives, but the recommendation is strong and sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Sign in to create a connector for this server.

Resources