Skip to main content
Glama

Do Not Act

Ownership verified

Server Details

Fail-closed guard clause for prediction-market agents before trade().

Status
Healthy
Last Tested
Transport
Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client
Glama
MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.
Tool DescriptionsA

Average 4.2/5 across 7 of 7 tools scored.

Server CoherenceA
Disambiguation5/5

Each tool has a distinct purpose: alerts, dispute diagnostics, CSV export, preflight checks (decision and token), resolution risk, and watchlist management. No two tools overlap in functionality; descriptions clearly differentiate them.

Naming Consistency5/5

All tools follow a consistent 'donotact_<noun>' pattern with snake_case (e.g., donotact_alerts, donotact_dispute_history_csv). No mixing of naming conventions.

Tool Count5/5

With 7 tools, the server is well-scoped for its domain of diagnostics and early warnings. Each tool serves a specific capability without superfluous additions.

Completeness4/5

The tool set covers core diagnostics (alerts, disputes, preflight, resolution risk) and watchlist management, but is missing a tool to list watchlist items, which is a minor gap given its read-only diagnostic focus.

Available Tools

7 tools
donotact_alertsA
Read-onlyIdempotent
Inspect

Read the DoNotAct alert feed for the authenticated API key. Early-warning only. Treat uma_dispute_detected and uma_dispute_already_active as blocking unless a human operator explicitly overrides outside DoNotAct. No execution, custody, signing, routing, or advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
limitNoMaximum number of recent alerts to read.

Output Schema

ParametersJSON Schema
NameRequiredDescription
alertsNoRecent risk alerts for the authenticated account.
not_adviceNoAlways true; alerts are operational diagnostics, not financial advice.
Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations provide readOnlyHint=true, etc. Description adds valuable behavioral context beyond annotations: 'Early-warning only', the special treatment of specific alert types, and the negative scope statement. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences, no waste. Front-loads purpose, then critical usage guidelines, then negative scope. Each sentence earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Simple tool with one parameter and output schema. Description covers purpose, usage, behavioral context, and limitations comprehensively. Fully adequate.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% for the single parameter 'limit'. The description does not add any additional meaning beyond the schema's description ('Maximum number of recent alerts to read.'). Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Read the DoNotAct alert feed' with a specific verb and resource. It distinguishes from siblings by specifying it is read-only and early-warning, unlike donotact_dispute which likely involves actions.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'Early-warning only' and instructs to treat specific alert types as blocking unless overridden. Also states what it does NOT do ('No execution, custody, signing, routing, or advice'), guiding when to use versus alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

donotact_disputeA
Read-onlyIdempotent
Inspect

Run DoNotAct dispute lifecycle diagnostics where public lifecycle evidence is covered. Early-warning only. Treat uma_dispute_detected and uma_dispute_already_active as blocking unless a human operator explicitly overrides outside DoNotAct. No execution, custody, signing, routing, or advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
venueYesPrediction-market venue to inspect. Use polymarket for UMA/Gamma/CLOB evidence or kalshi where supported.
market_idYesVenue market identifier to inspect before an agent takes action.

Output Schema

ParametersJSON Schema
NameRequiredDescription
receiptNoSigned response receipt metadata when the endpoint returns one.
verdictNoFail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE.
decisionNoDecision boundary for the requested action. It should match the effective verdict when present.
not_adviceNoAlways true for DoNotAct diagnostics; responses are operational controls, not financial advice.
evidence_setNoEvidence references used by the diagnostic when available.
reason_codesNoShort reason-code identifiers when the endpoint exposes a flat list.
no_trade_reasonsNoMachine-readable reasons that explain non-READY or blocking outcomes.
safe_operating_limitsNoConservative size and operating limits derived from available public evidence.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Description adds early-warning limitation and blocking behavior details beyond annotations (readOnlyHint, idempotentHint, etc.). No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences, front-loaded with purpose, no redundant words. Each sentence provides essential information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the two parameters, annotations, and presence of output schema, the description covers key aspects: purpose, usage, caveats. It does not explain return values, but output schema covers that.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with clear descriptions. Description adds value for the 'venue' parameter by giving usage examples ('Use polymarket for UMA/Gamma/CLOB evidence or kalshi where supported'). No additional info for 'market_id'.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states it runs dispute lifecycle diagnostics with early-warning only. It implicitly differentiates from action tools by stating 'No execution, custody, signing, routing, or advice,' but does not explicitly name sibling alternatives.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides clear context on when to use ('where public lifecycle evidence is covered') and when not to (blocking conditions unless overridden). Lacks explicit comparison to siblings but offers actionable guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

donotact_dispute_history_csvA
Read-onlyIdempotent
Inspect

Fetch the paid dispute-history CSV export for the authenticated API key. The CSV is sourced from the dispute ledger and carries no ROI, alpha, or avoided-loss claim. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
daysNoLookback window in days for the dispute-history export.
limitNoMaximum number of dispute-history rows to return.

Output Schema

ParametersJSON Schema
NameRequiredDescription
csvNoCSV export body returned by the dispute-history endpoint.
row_countNoNumber of CSV data rows when available.
not_adviceNoAlways true; the history feed is evidence, not financial advice.
Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already mark it as read-only, idempotent, and non-destructive. The description adds context beyond annotations: the CSV source ('dispute ledger'), disclaimers about financial claims ('no ROI, alpha, or avoided-loss claim'), and the fail-closed policy. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is very concise: two sentences plus a brief caution. It front-loads the key action and resource, then provides necessary behavioral context. Every sentence adds value and there is no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the presence of a full output schema and 100% parameter descriptions, the description sufficiently covers the tool's purpose, source, disclaimers, and usage constraints. It is complete for the tool's complexity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% (both parameters have descriptions). The description does not add additional semantic information about the parameters beyond what the schema provides. Baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Fetch'), the resource ('paid dispute-history CSV export'), and the scope ('for the authenticated API key'). It distinguishes from sibling tools like 'donotact_dispute' which likely fetches individual dispute details, not CSV export.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides usage context: 'Diagnostics only' and a critical guideline about the fail-closed behavior ('if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed'). It does not explicitly contrast with sibling tools but conveys appropriate when-to-use boundaries.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

donotact_preflightA
Read-onlyIdempotent
Inspect

Default prediction-market decision-boundary check. Pass requested action context when available so DoNotAct can evaluate resolver_type, decision, size-vs-book capacity, evidence_set, and safe operating limits. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
sideNoOutcome side or order direction the agent intends to take.
venueYesPrediction-market venue for the pre-action check.
intentNoIntended action before capital moves, used to choose the correct fail-closed boundary.
policyNoRisk policy used to floor or escalate the verdict.
horizonNoExpected holding or execution horizon for the action.
market_idYesVenue market identifier your agent is about to act on.
notional_eurNoRequested action size in EUR when the agent reasons in euros.
notional_usdNoRequested action size in USD for size-vs-book capacity checks.
max_slippage_bpsNoMaximum tolerated slippage in basis points for the requested action.

Output Schema

ParametersJSON Schema
NameRequiredDescription
receiptNoSigned response receipt metadata when the endpoint returns one.
verdictNoFail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE.
decisionNoDecision boundary for the requested action. It should match the effective verdict when present.
not_adviceNoAlways true for DoNotAct diagnostics; responses are operational controls, not financial advice.
evidence_setNoEvidence references used by the diagnostic when available.
reason_codesNoShort reason-code identifiers when the endpoint exposes a flat list.
no_trade_reasonsNoMachine-readable reasons that explain non-READY or blocking outcomes.
safe_operating_limitsNoConservative size and operating limits derived from available public evidence.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already mark readOnlyHint, idempotentHint, and destructiveHint=false. The description adds context beyond annotations by stating it is 'diagnostics only,' 'operational diagnostics, not financial advice,' and lists specific checks (resolver_type, decision, size-vs-book capacity). It reinforces the fail-closed verdict behavior. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is four sentences, front-loaded with the main purpose, and each sentence adds value: defines the tool, instructs on usage, explains fail-closed behavior, and clarifies it is not financial advice. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has 9 parameters (2 required) and an output schema (not shown), the description explains the overall purpose, fail-closed behavior, and the key checks performed. The output schema likely provides detailed return fields, so the description is sufficient for an agent to decide when and how to invoke the tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents all parameters. The description groups parameters conceptually (e.g., 'requested action context') but does not add substantive per-parameter meaning beyond what the schema provides. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a 'default prediction-market decision-boundary check' and 'diagnostics only,' specifying the verb 'check' and the resource 'decision-boundary.' It distinguishes itself from sibling tools like donotact_alerts (alerts), donotact_dispute (disputes), and donotact_resolution_risk (resolution risk) by its preflight role.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly instructs to 'Pass requested action context' and explains fail-closed behavior: 'if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed.' It implies use before any action but does not explicitly state when not to use or compare to alternatives, though the tool's distinct purpose is clear given sibling names.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

donotact_preflight_tokenA
Read-onlyIdempotent
Inspect

Run DoNotAct token-risk preflight using the GoPlus T1-T8 fail-closed taxonomy. Authenticated MCP calls use DONOTACT_API_KEY; accountless agents can use the x402 token preflight route documented in llms.txt. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
chainYesToken chain to inspect with the fail-closed T1-T8 token-risk taxonomy.
addressYesToken contract address to inspect before an agent buys or routes capital.

Output Schema

ParametersJSON Schema
NameRequiredDescription
receiptNoSigned response receipt metadata when the endpoint returns one.
verdictNoFail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE.
decisionNoDecision boundary for the requested action. It should match the effective verdict when present.
not_adviceNoAlways true for DoNotAct diagnostics; responses are operational controls, not financial advice.
evidence_setNoEvidence references used by the diagnostic when available.
reason_codesNoShort reason-code identifiers when the endpoint exposes a flat list.
no_trade_reasonsNoMachine-readable reasons that explain non-READY or blocking outcomes.
safe_operating_limitsNoConservative size and operating limits derived from available public evidence.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already mark the tool as readOnly, idempotent, non-destructive, and open world. The description adds crucial behavioral context: it is diagnostics only, not financial advice, and is fail-closed with specific verdict meanings. It also describes authentication requirements (API key vs. x402 route). This adds value beyond annotations without contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is 4-5 sentences, each adding distinct value: purpose, authentication, diagnostics disclaimer, fail-closed behavior. It is front-loaded with the core function and avoids fluff. While slightly longer than minimal, the information density justifies its length, earning a 4.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the presence of an output schema and rich annotations, the description covers authentication, fail-closed behavior, and a disclaimer. However, it does not elaborate on the T1-T8 taxonomy or what a typical output looks like (beyond the verdicts mentioned). For a tool that returns risk assessments, an agent might benefit from more guidance on interpreting results. Still, the output schema likely fills this gap, so a 3 is appropriate.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the baseline is 3. The main description reiterates the 'preflight' purpose and mentions the T1-T8 taxonomy, adding context that the chain and address parameters are for inspecting token risk within that taxonomy. However, this does not significantly extend the semantic meaning beyond what the schema descriptions already provide (e.g., 'Token contract address to inspect before an agent buys or routes capital'). Hence, score 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool runs a token-risk preflight using the GoPlus T1-T8 fail-closed taxonomy. It specifies a specific verb ('Run'), resource ('token-risk preflight'), and methodology ('GoPlus T1-T8 fail-closed taxonomy'). The title and sibling tools list includes 'donotact_preflight' without '_token', implying this is the token-specific variant, which helps distinguish from the generic preflight tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides some usage context: authenticated calls vs. accountless agents, and that this is diagnostics only. It also states the fail-closed behavior and that verdicts of DO_NOT_ACT or INSUFFICIENT_EVIDENCE should block action. However, it does not explicitly compare this tool to siblings like donotact_alerts or donotact_preflight, nor does it state when not to use it (e.g., for contract-level vs. broader risk). Thus, while helpful, it lacks explicit when-to-use vs. alternatives guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

donotact_resolution_riskA
Read-onlyIdempotent
Inspect

Run DoNotAct resolution-governance and ambiguity diagnostics for a venue market. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
venueYesPrediction-market venue to inspect. Use polymarket for UMA/Gamma/CLOB evidence or kalshi where supported.
market_idYesVenue market identifier to inspect before an agent takes action.

Output Schema

ParametersJSON Schema
NameRequiredDescription
receiptNoSigned response receipt metadata when the endpoint returns one.
verdictNoFail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE.
decisionNoDecision boundary for the requested action. It should match the effective verdict when present.
not_adviceNoAlways true for DoNotAct diagnostics; responses are operational controls, not financial advice.
evidence_setNoEvidence references used by the diagnostic when available.
reason_codesNoShort reason-code identifiers when the endpoint exposes a flat list.
no_trade_reasonsNoMachine-readable reasons that explain non-READY or blocking outcomes.
safe_operating_limitsNoConservative size and operating limits derived from available public evidence.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnlyHint=true and destructiveHint=false, so the description shines by adding the fail-closed context and clarifying it is operational diagnostics, not financial advice. This enriches the agent's understanding beyond the structured annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three short, focused sentences. Each sentence serves a distinct purpose: stating the action, providing critical behavioral guidance, and clarifying the scope. No unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the presence of an output schema and thorough annotations, the description covers the essential context: what the tool does, its operational nature, and how to interpret its output (fail-closed). Minor gap: it doesn't explicitly state that it checks resolution risk before an agent acts, but this is implied.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the schema descriptions for venue and market_id are already clear. The main description does not add new parameter-level details beyond the schema, achieving the baseline expectation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool runs resolution-governance and ambiguity diagnostics for a venue market. It uses specific verbs ('Run') and resources ('diagnostics'), and the phrase 'Diagnostics only' distinguishes it from action-oriented siblings like donotact_dispute or donotact_alerts.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides guidance by explaining the DoNotAct fail-closed behavior ('if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed'), implying when to use the tool (before taking action based on resolution). However, it does not explicitly compare to sibling tools or state when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

donotact_watchlist_addA
Idempotent
Inspect

Add or update a DoNotAct early-warning watchlist item. Early-warning only. Treat uma_dispute_detected and uma_dispute_already_active as blocking unless a human operator explicitly overrides outside DoNotAct. No execution, custody, signing, routing, or advice.

ParametersJSON Schema
NameRequiredDescriptionDefault
emailNoOptional email address for early-warning delivery.
venueYesVenue for the watchlist item.
market_idYesMarket identifier to monitor for dispute or resolution-risk state changes.
webhook_urlNoOptional HTTPS callback URL for early-warning delivery.

Output Schema

ParametersJSON Schema
NameRequiredDescription
venueNoVenue associated with the watched market.
statusNoOperation status returned by the watchlist API.
market_idNoMarket identifier that was added or updated.
not_adviceNoAlways true; watchlists monitor risk states and do not execute trades.
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate idempotentHint=true and destructiveHint=false. The description adds valuable context about blocking behavior and human override requirements, enhancing transparency beyond structured fields.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Very concise at three sentences. The first sentence immediately states the main action. No unnecessary words; every sentence contributes meaningful information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With output schema present and good annotations, the description is complete. It covers purpose, behavioral notes, and usage constraints, leaving no critical gaps for an agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description does not add meaning beyond what the schema already provides for the parameters (e.g., email, venue, market_id, webhook_url).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Add or update' and the resource 'DoNotAct early-warning watchlist item'. It distinguishes from siblings by specifying it is 'early-warning only' and listing exclusions like 'No execution, custody, signing, routing, or advice'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It provides clear context on when to use (early-warning) and when to treat as blocking unless human override. However, it does not explicitly name alternative sibling tools for comparison, missing a direct when-not-to-use reference.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Sign in to create a connector for this server.

Resources