Do Not Act
Server Details
Fail-closed guard clause for prediction-market agents before trade().
- Status
- Healthy
- Last Tested
- Transport
- Streamable HTTP
- URL
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 4.2/5 across 7 of 7 tools scored.
Each tool has a distinct purpose: alerts, dispute diagnostics, CSV export, preflight checks (decision and token), resolution risk, and watchlist management. No two tools overlap in functionality; descriptions clearly differentiate them.
All tools follow a consistent 'donotact_<noun>' pattern with snake_case (e.g., donotact_alerts, donotact_dispute_history_csv). No mixing of naming conventions.
With 7 tools, the server is well-scoped for its domain of diagnostics and early warnings. Each tool serves a specific capability without superfluous additions.
The tool set covers core diagnostics (alerts, disputes, preflight, resolution risk) and watchlist management, but is missing a tool to list watchlist items, which is a minor gap given its read-only diagnostic focus.
Available Tools
7 toolsdonotact_alertsARead-onlyIdempotentInspect
Read the DoNotAct alert feed for the authenticated API key. Early-warning only. Treat uma_dispute_detected and uma_dispute_already_active as blocking unless a human operator explicitly overrides outside DoNotAct. No execution, custody, signing, routing, or advice.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Maximum number of recent alerts to read. |
Output Schema
| Name | Required | Description |
|---|---|---|
| alerts | No | Recent risk alerts for the authenticated account. |
| not_advice | No | Always true; alerts are operational diagnostics, not financial advice. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations provide readOnlyHint=true, etc. Description adds valuable behavioral context beyond annotations: 'Early-warning only', the special treatment of specific alert types, and the negative scope statement. No contradiction.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, no waste. Front-loads purpose, then critical usage guidelines, then negative scope. Each sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Simple tool with one parameter and output schema. Description covers purpose, usage, behavioral context, and limitations comprehensively. Fully adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the single parameter 'limit'. The description does not add any additional meaning beyond the schema's description ('Maximum number of recent alerts to read.'). Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Read the DoNotAct alert feed' with a specific verb and resource. It distinguishes from siblings by specifying it is read-only and early-warning, unlike donotact_dispute which likely involves actions.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Early-warning only' and instructs to treat specific alert types as blocking unless overridden. Also states what it does NOT do ('No execution, custody, signing, routing, or advice'), guiding when to use versus alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
donotact_disputeARead-onlyIdempotentInspect
Run DoNotAct dispute lifecycle diagnostics where public lifecycle evidence is covered. Early-warning only. Treat uma_dispute_detected and uma_dispute_already_active as blocking unless a human operator explicitly overrides outside DoNotAct. No execution, custody, signing, routing, or advice.
| Name | Required | Description | Default |
|---|---|---|---|
| venue | Yes | Prediction-market venue to inspect. Use polymarket for UMA/Gamma/CLOB evidence or kalshi where supported. | |
| market_id | Yes | Venue market identifier to inspect before an agent takes action. |
Output Schema
| Name | Required | Description |
|---|---|---|
| receipt | No | Signed response receipt metadata when the endpoint returns one. |
| verdict | No | Fail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE. |
| decision | No | Decision boundary for the requested action. It should match the effective verdict when present. |
| not_advice | No | Always true for DoNotAct diagnostics; responses are operational controls, not financial advice. |
| evidence_set | No | Evidence references used by the diagnostic when available. |
| reason_codes | No | Short reason-code identifiers when the endpoint exposes a flat list. |
| no_trade_reasons | No | Machine-readable reasons that explain non-READY or blocking outcomes. |
| safe_operating_limits | No | Conservative size and operating limits derived from available public evidence. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Description adds early-warning limitation and blocking behavior details beyond annotations (readOnlyHint, idempotentHint, etc.). No contradiction with annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose, no redundant words. Each sentence provides essential information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the two parameters, annotations, and presence of output schema, the description covers key aspects: purpose, usage, caveats. It does not explain return values, but output schema covers that.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions. Description adds value for the 'venue' parameter by giving usage examples ('Use polymarket for UMA/Gamma/CLOB evidence or kalshi where supported'). No additional info for 'market_id'.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it runs dispute lifecycle diagnostics with early-warning only. It implicitly differentiates from action tools by stating 'No execution, custody, signing, routing, or advice,' but does not explicitly name sibling alternatives.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context on when to use ('where public lifecycle evidence is covered') and when not to (blocking conditions unless overridden). Lacks explicit comparison to siblings but offers actionable guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
donotact_dispute_history_csvARead-onlyIdempotentInspect
Fetch the paid dispute-history CSV export for the authenticated API key. The CSV is sourced from the dispute ledger and carries no ROI, alpha, or avoided-loss claim. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | Lookback window in days for the dispute-history export. | |
| limit | No | Maximum number of dispute-history rows to return. |
Output Schema
| Name | Required | Description |
|---|---|---|
| csv | No | CSV export body returned by the dispute-history endpoint. |
| row_count | No | Number of CSV data rows when available. |
| not_advice | No | Always true; the history feed is evidence, not financial advice. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already mark it as read-only, idempotent, and non-destructive. The description adds context beyond annotations: the CSV source ('dispute ledger'), disclaimers about financial claims ('no ROI, alpha, or avoided-loss claim'), and the fail-closed policy. No contradiction with annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is very concise: two sentences plus a brief caution. It front-loads the key action and resource, then provides necessary behavioral context. Every sentence adds value and there is no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of a full output schema and 100% parameter descriptions, the description sufficiently covers the tool's purpose, source, disclaimers, and usage constraints. It is complete for the tool's complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (both parameters have descriptions). The description does not add additional semantic information about the parameters beyond what the schema provides. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('Fetch'), the resource ('paid dispute-history CSV export'), and the scope ('for the authenticated API key'). It distinguishes from sibling tools like 'donotact_dispute' which likely fetches individual dispute details, not CSV export.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context: 'Diagnostics only' and a critical guideline about the fail-closed behavior ('if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed'). It does not explicitly contrast with sibling tools but conveys appropriate when-to-use boundaries.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
donotact_preflightARead-onlyIdempotentInspect
Default prediction-market decision-boundary check. Pass requested action context when available so DoNotAct can evaluate resolver_type, decision, size-vs-book capacity, evidence_set, and safe operating limits. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.
| Name | Required | Description | Default |
|---|---|---|---|
| side | No | Outcome side or order direction the agent intends to take. | |
| venue | Yes | Prediction-market venue for the pre-action check. | |
| intent | No | Intended action before capital moves, used to choose the correct fail-closed boundary. | |
| policy | No | Risk policy used to floor or escalate the verdict. | |
| horizon | No | Expected holding or execution horizon for the action. | |
| market_id | Yes | Venue market identifier your agent is about to act on. | |
| notional_eur | No | Requested action size in EUR when the agent reasons in euros. | |
| notional_usd | No | Requested action size in USD for size-vs-book capacity checks. | |
| max_slippage_bps | No | Maximum tolerated slippage in basis points for the requested action. |
Output Schema
| Name | Required | Description |
|---|---|---|
| receipt | No | Signed response receipt metadata when the endpoint returns one. |
| verdict | No | Fail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE. |
| decision | No | Decision boundary for the requested action. It should match the effective verdict when present. |
| not_advice | No | Always true for DoNotAct diagnostics; responses are operational controls, not financial advice. |
| evidence_set | No | Evidence references used by the diagnostic when available. |
| reason_codes | No | Short reason-code identifiers when the endpoint exposes a flat list. |
| no_trade_reasons | No | Machine-readable reasons that explain non-READY or blocking outcomes. |
| safe_operating_limits | No | Conservative size and operating limits derived from available public evidence. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already mark readOnlyHint, idempotentHint, and destructiveHint=false. The description adds context beyond annotations by stating it is 'diagnostics only,' 'operational diagnostics, not financial advice,' and lists specific checks (resolver_type, decision, size-vs-book capacity). It reinforces the fail-closed verdict behavior. No contradiction.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences, front-loaded with the main purpose, and each sentence adds value: defines the tool, instructs on usage, explains fail-closed behavior, and clarifies it is not financial advice. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 9 parameters (2 required) and an output schema (not shown), the description explains the overall purpose, fail-closed behavior, and the key checks performed. The output schema likely provides detailed return fields, so the description is sufficient for an agent to decide when and how to invoke the tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents all parameters. The description groups parameters conceptually (e.g., 'requested action context') but does not add substantive per-parameter meaning beyond what the schema provides. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is a 'default prediction-market decision-boundary check' and 'diagnostics only,' specifying the verb 'check' and the resource 'decision-boundary.' It distinguishes itself from sibling tools like donotact_alerts (alerts), donotact_dispute (disputes), and donotact_resolution_risk (resolution risk) by its preflight role.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly instructs to 'Pass requested action context' and explains fail-closed behavior: 'if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed.' It implies use before any action but does not explicitly state when not to use or compare to alternatives, though the tool's distinct purpose is clear given sibling names.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
donotact_preflight_tokenARead-onlyIdempotentInspect
Run DoNotAct token-risk preflight using the GoPlus T1-T8 fail-closed taxonomy. Authenticated MCP calls use DONOTACT_API_KEY; accountless agents can use the x402 token preflight route documented in llms.txt. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | Yes | Token chain to inspect with the fail-closed T1-T8 token-risk taxonomy. | |
| address | Yes | Token contract address to inspect before an agent buys or routes capital. |
Output Schema
| Name | Required | Description |
|---|---|---|
| receipt | No | Signed response receipt metadata when the endpoint returns one. |
| verdict | No | Fail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE. |
| decision | No | Decision boundary for the requested action. It should match the effective verdict when present. |
| not_advice | No | Always true for DoNotAct diagnostics; responses are operational controls, not financial advice. |
| evidence_set | No | Evidence references used by the diagnostic when available. |
| reason_codes | No | Short reason-code identifiers when the endpoint exposes a flat list. |
| no_trade_reasons | No | Machine-readable reasons that explain non-READY or blocking outcomes. |
| safe_operating_limits | No | Conservative size and operating limits derived from available public evidence. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already mark the tool as readOnly, idempotent, non-destructive, and open world. The description adds crucial behavioral context: it is diagnostics only, not financial advice, and is fail-closed with specific verdict meanings. It also describes authentication requirements (API key vs. x402 route). This adds value beyond annotations without contradiction.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is 4-5 sentences, each adding distinct value: purpose, authentication, diagnostics disclaimer, fail-closed behavior. It is front-loaded with the core function and avoids fluff. While slightly longer than minimal, the information density justifies its length, earning a 4.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema and rich annotations, the description covers authentication, fail-closed behavior, and a disclaimer. However, it does not elaborate on the T1-T8 taxonomy or what a typical output looks like (beyond the verdicts mentioned). For a tool that returns risk assessments, an agent might benefit from more guidance on interpreting results. Still, the output schema likely fills this gap, so a 3 is appropriate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The main description reiterates the 'preflight' purpose and mentions the T1-T8 taxonomy, adding context that the chain and address parameters are for inspecting token risk within that taxonomy. However, this does not significantly extend the semantic meaning beyond what the schema descriptions already provide (e.g., 'Token contract address to inspect before an agent buys or routes capital'). Hence, score 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool runs a token-risk preflight using the GoPlus T1-T8 fail-closed taxonomy. It specifies a specific verb ('Run'), resource ('token-risk preflight'), and methodology ('GoPlus T1-T8 fail-closed taxonomy'). The title and sibling tools list includes 'donotact_preflight' without '_token', implying this is the token-specific variant, which helps distinguish from the generic preflight tool.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides some usage context: authenticated calls vs. accountless agents, and that this is diagnostics only. It also states the fail-closed behavior and that verdicts of DO_NOT_ACT or INSUFFICIENT_EVIDENCE should block action. However, it does not explicitly compare this tool to siblings like donotact_alerts or donotact_preflight, nor does it state when not to use it (e.g., for contract-level vs. broader risk). Thus, while helpful, it lacks explicit when-to-use vs. alternatives guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
donotact_resolution_riskARead-onlyIdempotentInspect
Run DoNotAct resolution-governance and ambiguity diagnostics for a venue market. Diagnostics only. DoNotAct is fail-closed: if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed. This is operational diagnostics, not financial advice.
| Name | Required | Description | Default |
|---|---|---|---|
| venue | Yes | Prediction-market venue to inspect. Use polymarket for UMA/Gamma/CLOB evidence or kalshi where supported. | |
| market_id | Yes | Venue market identifier to inspect before an agent takes action. |
Output Schema
| Name | Required | Description |
|---|---|---|
| receipt | No | Signed response receipt metadata when the endpoint returns one. |
| verdict | No | Fail-closed stop/go verdict such as READY, CAUTION, DO_NOT_ACT, or INSUFFICIENT_EVIDENCE. |
| decision | No | Decision boundary for the requested action. It should match the effective verdict when present. |
| not_advice | No | Always true for DoNotAct diagnostics; responses are operational controls, not financial advice. |
| evidence_set | No | Evidence references used by the diagnostic when available. |
| reason_codes | No | Short reason-code identifiers when the endpoint exposes a flat list. |
| no_trade_reasons | No | Machine-readable reasons that explain non-READY or blocking outcomes. |
| safe_operating_limits | No | Conservative size and operating limits derived from available public evidence. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already indicate readOnlyHint=true and destructiveHint=false, so the description shines by adding the fail-closed context and clarifying it is operational diagnostics, not financial advice. This enriches the agent's understanding beyond the structured annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three short, focused sentences. Each sentence serves a distinct purpose: stating the action, providing critical behavioral guidance, and clarifying the scope. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of an output schema and thorough annotations, the description covers the essential context: what the tool does, its operational nature, and how to interpret its output (fail-closed). Minor gap: it doesn't explicitly state that it checks resolution risk before an agent acts, but this is implied.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the schema descriptions for venue and market_id are already clear. The main description does not add new parameter-level details beyond the schema, achieving the baseline expectation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool runs resolution-governance and ambiguity diagnostics for a venue market. It uses specific verbs ('Run') and resources ('diagnostics'), and the phrase 'Diagnostics only' distinguishes it from action-oriented siblings like donotact_dispute or donotact_alerts.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides guidance by explaining the DoNotAct fail-closed behavior ('if the verdict is DO_NOT_ACT or INSUFFICIENT_EVIDENCE, do not proceed'), implying when to use the tool (before taking action based on resolution). However, it does not explicitly compare to sibling tools or state when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
donotact_watchlist_addAIdempotentInspect
Add or update a DoNotAct early-warning watchlist item. Early-warning only. Treat uma_dispute_detected and uma_dispute_already_active as blocking unless a human operator explicitly overrides outside DoNotAct. No execution, custody, signing, routing, or advice.
| Name | Required | Description | Default |
|---|---|---|---|
| No | Optional email address for early-warning delivery. | ||
| venue | Yes | Venue for the watchlist item. | |
| market_id | Yes | Market identifier to monitor for dispute or resolution-risk state changes. | |
| webhook_url | No | Optional HTTPS callback URL for early-warning delivery. |
Output Schema
| Name | Required | Description |
|---|---|---|
| venue | No | Venue associated with the watched market. |
| status | No | Operation status returned by the watchlist API. |
| market_id | No | Market identifier that was added or updated. |
| not_advice | No | Always true; watchlists monitor risk states and do not execute trades. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already indicate idempotentHint=true and destructiveHint=false. The description adds valuable context about blocking behavior and human override requirements, enhancing transparency beyond structured fields.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Very concise at three sentences. The first sentence immediately states the main action. No unnecessary words; every sentence contributes meaningful information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With output schema present and good annotations, the description is complete. It covers purpose, behavioral notes, and usage constraints, leaving no critical gaps for an agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description does not add meaning beyond what the schema already provides for the parameters (e.g., email, venue, market_id, webhook_url).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Add or update' and the resource 'DoNotAct early-warning watchlist item'. It distinguishes from siblings by specifying it is 'early-warning only' and listing exclusions like 'No execution, custody, signing, routing, or advice'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides clear context on when to use (early-warning) and when to treat as blocking unless human override. However, it does not explicitly name alternative sibling tools for comparison, missing a direct when-not-to-use reference.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!