The Stall
Server Details
173 pay-per-call tools: US stocks, DeFi, crypto, Polymarket, sanctions — USDC on Base
- Status
- Healthy
- Last Tested
- Transport
- Streamable HTTP
- URL
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 4.1/5 across 172 of 172 tools scored. Lowest: 2.9/5.
Many tools serve overlapping purposes: at least 4 stock price tools, 5+ yield/DeFi tools, 3+ transaction explainers, multiple weather tools, multiple prediction market tools, etc. This makes it very difficult for an agent to select the correct tool.
Naming patterns are highly inconsistent: some use verb_noun (e.g., 'get_...'), some use hyphenated compounds (e.g., 'air-quality'), some are short single words (e.g., 'weather'), others use camelCase or mixed styles. No consistent convention is followed.
With 172 tools, the server is severely over-scoped. No single domain justifies this many tools; it appears to be a collection of many unrelated capabilities without clear curation. A well-scoped server typically has 3-15 tools.
While the server covers many domains, there are obvious gaps and redundancies. For example, there are multiple stock price tools but no tool for fundamental financial statement data. The broad but shallow coverage leads to missing operations in specific areas.
Available Tools
173 toolsaddress-securityAInspect
Wallet/address security and reputation check. Detects phishing, sanctions, cybercrime, money laundering, dark-web activity, and blacklisted wallets using GoPlus Labs + SlowMist + BlockSec data. Returns risk score (0-100) and per-flag breakdown. Supports Ethereum, Base, BSC, Polygon, Arbitrum, Optimism, Avalanche, and more.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Chain name or EIP-155 chain ID. Supported names: ethereum, base, bsc, polygon, arbitrum, optimism, avalanche. Default: ethereum | |
| address | Yes | EVM wallet address (0x...) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description bears full burden. It discloses data sources, detected threats, and output format (risk score 0-100 with per-flag breakdown), but lacks details on computation, rate limits, auth, or error handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a tight, informative paragraph: purpose, key features, data sources, and supported chains in three sentences. No redundancy or wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, but description partially covers return structure (risk score and per-flag breakdown). Missing details on exact output fields, error states, and chain coverage ('and more'). Adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% with descriptions; description adds no new parameter semantics beyond repeating supported chains. Baseline 3 is appropriate as schema already explains parameters well.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Wallet/address security and reputation check.' It lists specific threat categories (phishing, sanctions, etc.) and data sources, distinguishing it from sibling security tools like sanctions-screening or evm-token-security.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for quick security checks but does not explicitly state when to use this tool versus siblings like block-intel or wallet-screener. No alternatives or exclusions are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
agent-access-checkAInspect
Checks whether a website is accessible and agent-friendly. Fetches robots.txt, .well-known/ai.txt, and sitemap.xml; inspects HTTP headers (CORS, CSP, rate-limit); and returns a readiness verdict. Useful for agents that need to decide whether to scrape, crawl, or interact with a domain before committing to a workflow. Returns allowed/blocked status, disallowed paths, crawl delay, AI-specific rules, and sitemap URL if present.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Domain or URL to check. Can be a bare domain (example.com) or full URL (https://example.com/path). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description fully carries the burden. It transparently describes all actions (fetching specific files and headers) and the return values (allowed/blocked status, disallowed paths, etc.). No hidden behaviors or contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with four sentences, each adding value. The main purpose is front-loaded, followed by specific actions, use case, and return values. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multi-file fetch and header inspection) and the lack of an output schema, the description adequately covers input and output. Minor omissions like error handling do not significantly detract from completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The single parameter 'url' is already well-described in the schema (100% coverage). The description does not add additional meaning beyond what the schema provides, so the baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: checking website accessibility and agent-friendliness, with specific actions listed (fetching robots.txt, .well-known/ai.txt, sitemap.xml, inspecting HTTP headers). This distinguishes it from sibling tools like http-headers or page-intel.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly mentions when to use: when agents need to decide whether to scrape, crawl, or interact with a domain. It does not explicitly state when not to use or name alternatives, but the context from sibling tool names and the clear use case provides sufficient guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
agent-kya-scoreAInspect
Know Your Agent (KYA) trust score for any EVM wallet. Returns a 0–100 score, tier (trusted/verified/unknown), sanctions screening result, per-dimension score breakdown (wallet age, activity, sanctions, ERC-8004 identity, KYB, tx history, trust bond, endpoint quality), on-chain attestations (Coinbase KYC, Gitcoin Passport, USDC balance), wallet stats (balance, tx count, estimated age), and per-transaction spending limits. Use before accepting payment from an agent or before dispatching a sub-agent with funds. Answers: is this agent wallet sanctioned? how established is it? has it passed KYC?
| Name | Required | Description | Default |
|---|---|---|---|
| address | Yes | EVM wallet address to score (0x-prefixed, 42 chars). Works on any EVM chain — score is chain-agnostic. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully carries the burden. It enumerates all return elements (score, tier, sanctions, breakdown, attestations, stats, spending limits) and states the tool is chain-agnostic. No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is moderately lengthy but each sentence adds value. It is structured with a core statement followed by output details and usage context. Could be slightly tighter but effective.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description thoroughly explains all return values and usage context. It provides a complete picture for an agent to understand and invoke the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (one parameter 'address'). The description adds context by repeating the schema's description and clarifying chain-agnostic nature. However, it does not add meaning beyond the schema, meeting the baseline expectation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Know Your Agent (KYA) trust score for any EVM wallet.' It details the output (score, tier, breakdown) and distinguishes from siblings by focusing on trust assessment for agents. The verb 'score' and resource 'EVM wallet' are specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says when to use: 'Use before accepting payment from an agent or before dispatching a sub-agent with funds.' It provides clear context but does not mention alternatives or when not to use, which would elevate it to a 5.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
ai-image-genAInspect
Generate an AI image from a text prompt using DALL-E 3. Returns a public URL for the image (hosted for 1 hour) plus the model's revised prompt. Supports vivid or natural style, and three aspect ratios: square (1024×1024), portrait (1024×1792), or landscape (1792×1024). $0.080/image — 20% below closest x402 competitor. Output is base64-encoded PNG or a direct URL depending on response_format.
| Name | Required | Description | Default |
|---|---|---|---|
| size | No | Image dimensions. '1024x1024' is square (default), '1024x1792' is portrait, '1792x1024' is landscape. | |
| style | No | Image style. 'vivid' produces hyper-real, dramatic compositions. 'natural' produces softer, more realistic images. Default: vivid. | |
| prompt | Yes | Text description of the image to generate. Be specific — include subject, style, lighting, mood, and composition for best results. Max 4000 characters. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description carries the burden. It discloses that the image URL is hosted for 1 hour, returns a public URL and revised prompt, and specifies output format depending on response_format. It also mentions pricing. However, it does not address auth requirements or rate limits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with several sentences, front-loading the main purpose. It is well-structured but could be slightly more organized (e.g., grouping pricing info). No unnecessary repetition.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (3 parameters, no output schema), the description explains what is returned (public URL, revised prompt, output format) and provides pricing. It lacks information about whether authentication is needed to access the URL, but overall covers key aspects adequately.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% for all 3 parameters, providing adequate baseline. The description adds extra context by explaining the default style (vivid), exact pixel dimensions for sizes, and prompt tips. It also mentions response_format (which is not in schema), adding some confusion but also additional value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states 'Generate an AI image from a text prompt using DALL-E 3', which is a specific verb+resource. Distinguishes from siblings like meme-generator and image-detect by specifying the model (DALL-E 3), supported styles, sizes, and pricing.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides features but no explicit guidance on when to use this tool vs alternatives. It does not mention when not to use it or suggest any sibling tools for related tasks.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
air-qualityAInspect
Real-time US AQI and pollutant readings for any lat/lon. Returns current AQI with category label (Good/Moderate/Unhealthy/Hazardous), PM2.5, PM10, nitrogen dioxide, ozone, carbon monoxide, dust levels, and UV index. Data from CAMS (Copernicus Atmosphere Monitoring Service), updated hourly. Use for health advisories, outdoor event planning, environment-sensitive routing, or regulatory compliance checks.
| Name | Required | Description | Default |
|---|---|---|---|
| lat | Yes | Latitude in decimal degrees (e.g. 40.71 for New York City). | |
| lon | Yes | Longitude in decimal degrees (e.g. -74.01 for New York City). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided. The description discloses data freshness (updated hourly) and source (CAMS) but does not clarify geographic scope (US-only vs global) or mention any limitations, rate limits, or authentication needs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose, no fluff, and efficiently conveys key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description lists return fields, data source, update frequency, and use cases. However, without an output schema, it could be improved by hinting at the response format or units.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions (lat/lon with examples). The description adds context about what the tool returns but does not add meaning beyond the schema for the parameters themselves.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns real-time US AQI and pollutant readings for any lat/lon, listing specific pollutants and data source. It distinguishes itself from weather-related siblings by focusing on air quality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases (health advisories, outdoor event planning, etc.) but does not mention when not to use the tool or compare it to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
analyst-ratingsAInspect
Wall Street analyst consensus and price targets for any US equity. Returns buy/hold/sell breakdown, mean recommendation score (1=Strong Buy, 5=Strong Sell), analyst count, and price target range (low/mean/median/high) with upside-to-target. Free Yahoo Finance data, no API key. Complements us-stock-price and equity-technicals with the analyst conviction layer.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. AAPL, NVDA, MSFT). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries full burden. It discloses data source (Yahoo Finance, free, no API key) and output details. No contradictions; transparency is adequate for a non-destructive tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three well-structured sentences: purpose, output details, and context. Every sentence adds value, front-loaded with main action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but description lists all returns: breakdown, score, count, price target range, upside. Also notes US equity scope and free data source. Complete for the tool's complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers ticker with description. Description adds that ticker is case-insensitive, providing extra clarity beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns analyst consensus and price targets for US equities, listing specific outputs. It distinguishes itself from siblings by mentioning the 'analyst conviction layer' complementing us-stock-price and equity-technicals.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Context is provided: the tool complements specific siblings and uses free Yahoo Finance data. While it doesn't explicitly state when not to use it, the purpose is clear enough for appropriate selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
base-seasonAInspect
Base chain season snapshot: total chain TVL, top 10 protocols by Base-native TVL, category breakdown, 7d trend, and top Base ecosystem tokens by market cap. No input required — agents use this for pre-trade orientation before DeFi, lending, or liquidity calls on Base.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It implies a read operation ('snapshot') but does not confirm whether it is read-only, mention auth requirements, rate limits, or data freshness. The lack of such details in the description leaves the agent without critical behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first lists contents, second explains usage. No extraneous words, front-loaded information, and efficient for agent consumption.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description covers the main data points. It does not detail return format (e.g., numbers vs percentages), but the listing is sufficient for a snapshot tool. Could be more complete on data structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With zero parameters, schema coverage is 100%. The description adds value by listing the output components (e.g., top protocols, category breakdown) beyond what the empty schema provides. Baseline for 0 params is 4, and the description meets this.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides a Base chain snapshot with specific data points: total TVL, top 10 protocols, category breakdown, 7d trend, and top tokens. It explicitly requires no input, distinguishing it from other crypto tools that may require parameters.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description specifies usage for pre-trade orientation before DeFi, lending, or liquidity calls on Base. It provides clear context but does not explicitly list cases when not to use it or alternatives, though the specificity to Base chain differentiates it from siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
block-intelAInspect
Returns block header data (number, hash, timestamp, gas used/limit, base fee, tx count, validator address) for any block on Base, Ethereum, or Arbitrum. Accepts block tag (latest/safe/finalized) or number. Free-RPC alternative to skills.onesource.io at 33% lower price — $0.002 vs $0.003.
| Name | Required | Description | Default |
|---|---|---|---|
| block | No | Block identifier: 'latest' (default), 'safe', 'finalized', 'earliest', decimal block number, or 0x-prefixed hex number. | |
| network | No | Chain to query: 'base' (default), 'ethereum', or 'arbitrum'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Although no annotations are provided, the description implies a read-only operation by listing the returned data fields. It does not explicitly state the absence of side effects or mention authentication needs, but the context makes it reasonably clear.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, with the first sentence conveying the core purpose and output, and the second providing input details and a pricing comparison. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has no output schema, so the description compensates by listing output fields. It covers input types and defaults. Minor gap: no mention of error handling or rate limits, but sufficient for a simple block data tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema description coverage, the description adds significant meaning: it specifies valid block identifiers ('latest', 'safe', 'finalized', numbers) and provides default values for both parameters, going beyond the bare schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns block header data and lists specific fields (number, hash, timestamp, etc.). It explicitly names the supported chains (Base, Ethereum, Arbitrum), distinguishing it from chain-specific tools like 'eth-block'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains the input types (block tag or number) and provides default values. It also mentions a pricing alternative, but lacks functional guidance on when to use this over sibling tools like 'eth-block' or 'tx-intel'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
breadcrumb-extractorAInspect
Extracts structured breadcrumb navigation from a URL. Returns domain, ordered path segments with human-readable labels, query parameters as key-value pairs, and a formatted breadcrumb trail string. Identifies numeric IDs vs. named path segments. Pure URL parsing — zero external calls. Useful for agents that process sitemaps, navigation menus, or need to understand page hierarchy.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Full URL to extract breadcrumbs from (e.g. 'https://docs.example.com/api/v2/users/123/profile'). | |
| separator | No | Breadcrumb separator string (default: ' > '). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description fully covers behavioral aspects: 'Pure URL parsing — zero external calls' assures safety and speed. It also discloses internal logic like identifying numeric vs named path segments. No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is two sentences: first sentence states the primary action, second adds detail and use cases. No wasted words, front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a tool with 2 parameters and no output schema, the description provides a complete picture: it explains inputs, outputs (even without schema), behavior (zero external calls), and appropriate use cases. Nothing essential is missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% but description adds significant value beyond field descriptions: it explains the output structure (domain, path segments, labels, query params, trail string) and the numeric ID identification feature. This helps the agent understand what the tool returns.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool extracts structured breadcrumb navigation from a URL. It lists exact outputs (domain, path segments, labels, query params, trail string) and identifies a distinguishing feature (numeric IDs vs named segments). This differentiates it from sibling tools like page-links or page-intel.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description provides explicit use cases: agents processing sitemaps, navigation menus, or understanding page hierarchy. It does not explicitly state when not to use or list alternatives, but the context is clear enough for an agent to decide.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
btc-game-theoryAInspect
Bitcoin mining game theory and systems dynamics in one call. Returns: selfish-mining profitability threshold (Eyal-Sirer), 51%-attack electricity cost estimate, fee-vs-subsidy revenue split, difficulty epoch trajectory (expansion / contraction / neutral), Nash-equilibrium state for honest mining, and current epoch progress. Sourced from mempool.space and CoinGecko — no API key required. Use for miner-incentive analysis, network security assessment, and pre-investment regime detection.
| Name | Required | Description | Default |
|---|---|---|---|
| connectivity_gamma | No | Assumed fraction of honest miners who extend the selfish chain when block heights are equal (0–1). Higher γ means better-connected selfish miner. Default 0 (worst-case for honest miners). | |
| efficiency_w_per_th | No | Assumed ASIC power efficiency in W/TH. Default 20 (representative of modern S21/M60 hardware). Lower = more efficient attackers. | |
| electricity_kwh_usd | No | Assumed electricity cost in $/kWh for 51%-attack cost estimate. Default 0.07. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses that no API key is required and sources are from mempool.space and CoinGecko. However, it does not mention rate limits, caching behavior, or whether the tool modifies any state (it appears read-only). This is adequate but leaves gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-organized, starting with purpose, followed by a list of outputs, then sources, and finally use cases. It is reasonably concise but slightly verbose with the output list. Could be tightened by removing minor redundancies.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multiple outputs, 3 optional parameters, no output schema), the description covers outputs, parameters implicitly via schema, sources, and usage. Minor gaps include behavioral details and explicit output schema description, but overall adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with each parameter having its own description that includes defaults and implications. The tool description does not add additional parameter meaning beyond that, so the baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: providing Bitcoin mining game theory and systems dynamics metrics. It lists specific outputs (selfish-mining threshold, 51% attack cost, etc.) and sources. However, it does not explicitly differentiate from siblings like 'btc-miner-econ' or 'btc-systems-theory', which may overlap.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly lists use cases: 'miner-incentive analysis, network security assessment, and pre-investment regime detection.' This provides clear context. However, it does not mention when not to use this tool or suggest alternatives, which could be useful given the presence of related siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
btc-miner-econAInspect
Bitcoin mining economics and fee-market game theory via mempool.space. Returns current fee rates and pressure tier, miner revenue split (subsidy vs fees), next difficulty adjustment (direction, magnitude, blocks remaining), mining pool concentration (top-3 hashrate share), and mempool backlog size. Useful for agents reasoning about BTC transaction timing, miner incentive structures, or on-chain network health.
| Name | Required | Description | Default |
|---|---|---|---|
| include | No | Subset of data to return. 'all' returns every section (default). 'fees' = fee market only. 'difficulty' = next adjustment info only. 'pools' = pool hashrate distribution only. 'mempool' = backlog size only. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It describes what data is returned without side effects, but lacks details on rate limits, error handling, or performance. For a read-only data tool, it is adequate but not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first sentence states purpose and source, second lists outputs and use cases. Every word earns its place, front-loaded with key info. No redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description enumerates all returned data categories (fees, revenue split, difficulty, pools, mempool) and mentions use cases. It is nearly complete, though it could mention error conditions or data freshness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (the 'include' parameter is described in the schema). The tool description does not repeat or add parameter information, but the schema already provides sufficient meaning. Baseline 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states it returns Bitcoin mining economics data from mempool.space, listing specific data points (fee rates, miner revenue split, difficulty adjustment, pool concentration, mempool backlog). It clearly distinguishes from sibling tools like btc-game-theory by focusing on mining economics.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear use cases: 'USEFUL for agents reasoning about BTC transaction timing, miner incentive structures, or on-chain network health.' It implies context but does not explicitly compare to alternatives or state when not to use, which is acceptable given the specific domain.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
btc-systems-theoryAInspect
Seven-lens systems theory analysis of the Bitcoin network. Returns: (1) difficulty feedback loop state + regulator lag, (2) mempool stock-flow ratio and queue depth, (3) confirmation delay at economy/priority fee tiers, (4) fee nonlinearity index and hash-rate growth curvature, (5) Nakamoto coefficient (min pools for 51% consensus), (6) self-organization score for fee market equilibrium, (7) HHI mining concentration index and decentralization grade A–F. Sourced from mempool.space — no API key required. Pairs with btc-game-theory for full Bitcoin systems and incentive analysis.
| Name | Required | Description | Default |
|---|---|---|---|
| pool_window | No | Rolling window for mining pool distribution. Default: 1w. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of behavioral disclosure. It mentions the data source (mempool.space) and that no API key is required, but does not disclose any side effects, authorization requirements, rate limits, or impact of parameter choices beyond the schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise yet detailed, listing all seven output lenses in a structured bullet-like format. It is front-loaded with the core purpose, with no redundant sentences.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given one optional parameter and no output schema, the description thoroughly explains what each of the seven lenses returns. It also mentions the data source and complementary tool, making the context relatively complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents the single parameter. The description does not add additional meaning beyond what is in the schema, meeting the baseline for high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it performs a seven-lens systems theory analysis of the Bitcoin network, listing all seven output categories. It distinguishes itself from the sibling tool 'btc-game-theory' by noting they pair for full analysis.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states it pairs with 'btc-game-theory' for comprehensive analysis, providing clear usage context. However, it does not explicitly state when not to use the tool or list alternative tools for specific needs.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
chain-pulseAInspect
Returns an Ethereum block header + current stablecoin depeg status in one call. Collapses the eth-block → stablecoin-watch 2-hop chain. Block fields: number, hash, miner, timestamp, gas_used, tx_count. Stablecoin fields: symbol, price, peg_deviation, depeg_status, composite alert level. Supports Ethereum, Base, Polygon, Arbitrum. Free upstreams (DRPC + DeFiLlama), no API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| block | No | Block number (integer or 0x hex) or tag: latest/pending/earliest/safe/finalized. Default: latest. | |
| network | No | Chain to query for block data. Default: ethereum. | |
| alert_only | No | If true, only return stablecoins depegged (MILD_DEPEG or worse). | |
| stablecoin_symbol | No | Filter stablecoins to a specific symbol (e.g. USDT, USDC, DAI). Omit to return top 10 by market cap. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It adds useful context: data sources (DRPC + DeFiLlama), free usage, no API key required, and supported chains (Ethereum, Base, Polygon, Arbitrum). However, it does not mention rate limits, caching behavior, or what happens on invalid inputs. For a read-only tool, this is adequate but not comprehensive, missing potential warnings about downstream dependencies.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences long, with the main purpose stated first. It efficiently lists key fields, supported chains, and data sources without redundancy. Every sentence adds unique information, and there is no fluff or repetition of schema details. Excellent balance of completeness and brevity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema is provided, but the description lists the return fields for both block and stablecoin data, giving a clear understanding of the output structure. It also covers supported networks and data sources. While it does not explain terms like 'composite alert level' or 'MILD_DEPEG', these are likely domain-specific and acceptable. Overall, it provides sufficient context for an AI agent to effectively invoke and interpret results.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (each parameter has a description), so baseline is 3. The description adds extra value by listing the supported networks (Ethereum, Base, Polygon, Arbitrum) which is not in the schema's network parameter description. It also clarifies the behavior of alert_only ('only return stablecoins depegged (MILD_DEPEG or worse)'). This goes beyond the schema's terse description, justifying a higher score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns a combined result: Ethereum block header and stablecoin depeg status. It specifies the exact fields for both (block fields: number, hash, miner, timestamp, gas_used, tx_count; stablecoin fields: symbol, price, peg_deviation, depeg_status, composite alert level). This directly differentiates it from sibling tools like eth-block and stablecoin-watch, as it collapses a two-hop chain into one call.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies the tool is for when you need both block and stablecoin data, but does not explicitly state when to use it vs. alternatives. It lacks exclusions or conditions like 'use this instead of calling eth-block and stablecoin-watch separately.' The phrase 'collapses the eth-block → stablecoin-watch 2-hop chain' hints at its combined nature, but more direct guidance would improve clarity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
chromatic-dispersionAInspect
Fiber optic chromatic dispersion calculator. Computes dispersion coefficient D(λ) in ps/(nm·km), accumulated dispersion (ps/nm) over a fiber span, and dispersion slope for SMF-28, NZDSF, DSF, LEAF, DCF, and ULLSF fiber types. Optionally sweeps a wavelength range (C-band, L-band, or custom). Pure math — instant, zero API calls. Useful for optical network design, DWDM link budget, and dispersion compensation planning.
| Name | Required | Description | Default |
|---|---|---|---|
| sweep | No | Optional wavelength sweep to compute dispersion across a range. | |
| fiber_type | No | Fiber type. Default: 'smf28' (standard SMF-28, G.652D). | |
| wavelength_nm | No | Operating wavelength in nanometers (default 1550). Range: 1260–1675 nm. | |
| fiber_length_km | No | Fiber span length in km for accumulated dispersion calculation (default 80). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description states 'Pure math — instant, zero API calls,' which is good transparency for a calculator tool. No annotations provided, so the description carries the burden and adequately discloses the behavior (no side effects, no external calls).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences, front-loaded with the core purpose, and every sentence contributes meaning without redundancy. Very concise and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
All 4 parameters are covered, no required parameters, and no output schema needed. The description explains what is computed and common use cases, making it sufficiently complete for a calculator tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all 4 parameters. The description adds value by mentioning optional wavelength sweep (C-band, L-band, or custom) and implying default values for fiber_length_km and wavelength_nm, which enriches the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states it is a 'Fiber optic chromatic dispersion calculator' and specifies exactly what it computes (dispersion coefficient, accumulated dispersion, dispersion slope) and for which fiber types (SMF-28, NZDSF, etc.). This clearly distinguishes it from the unrelated sibling tools (all financial, crypto, etc.).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description says 'Useful for optical network design, DWDM link budget, and dispersion compensation planning,' giving clear use cases. However, it does not explicitly state when not to use it or compare alternatives within the same domain (though none exist among siblings).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
citation-formatterAInspect
Looks up an academic paper by DOI and formats it as BibTeX, APA, MLA, or Chicago citation. Returns full paper metadata: authors, year, journal, volume, pages, publisher. Covers 148M+ works via CrossRef (free registry). Useful for research agents building reference lists, literature review workflows, and knowledge extraction pipelines.
| Name | Required | Description | Default |
|---|---|---|---|
| doi | Yes | Digital Object Identifier (e.g. '10.1038/nature12345' or 'https://doi.org/10.1038/nature12345'). | |
| format | No | Citation format. 'all' returns every format. Default: 'bibtex'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses behavior: it accesses CrossRef (free registry), covers 148M+ works, and returns formatted citations and metadata. It implies read-only, safe operation without contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words. The first sentence states core functionality, the second adds scope and use cases. Front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema and simple input, the description covers purpose, scope, and use cases. It lacks details on error handling or rate limits, but is adequate for the tool's complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, baseline 3. The description adds value by listing specific format options (BibTeX, APA, MLA, Chicago) beyond the schema's generic description, clarifying the expected input.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it looks up a paper by DOI and formats citations in specific styles (BibTeX, APA, MLA, Chicago). It distinguishes from sibling 'research-paper-search' by focusing on citation formatting rather than general search.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context: useful for reference lists, literature reviews, and knowledge extraction. However, it lacks explicit guidance on when not to use it (e.g., if a DOI is unknown, use research-paper-search).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
city-lookupAInspect
Search airports and cities by keyword, IATA code, or city name. Returns IATA/ICAO codes, coordinates, country, and timezone for each match — useful for travel planning, routing, and geographic enrichment.
| Name | Required | Description | Default |
|---|---|---|---|
| max | No | Max results to return (default 10, max 20) | |
| country | No | Optional ISO country name filter (e.g. 'United Kingdom') | |
| keyword | Yes | City name, airport name, or IATA/ICAO code (e.g. 'London', 'LHR', 'Paris') |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It states it returns data but lacks information on side effects, rate limits, or limitations. The tool is likely read-only, but this is not explicitly stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first states the core action, the second lists outputs and use cases. No wasted words; it is clear and efficiently structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (3 parameters, no output schema), the description adequately covers inputs, outputs, and use cases. It explains what the tool returns without needing additional context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with each parameter described. The description reinforces the keyword parameter with examples (e.g., 'London', 'LHR'), adding marginal value. Baseline is 3 since schema already does most of the work.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool searches airports and cities by keyword, IATA code, or city name, and specifies what is returned (IATA/ICAO codes, coordinates, country, timezone). It uses specific verbs and resources, distinguishing it from siblings like 'geocode' or 'country-info'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for travel planning and geographic enrichment but does not explicitly state when to use this tool versus alternatives. No exclusion criteria or sibling comparisons are provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
classic-novelsAInspect
Looks up classic and contemporary books by title, author, or ISBN via Open Library (748M+ editions). Returns publication year, subjects/genres, page count, cover images, and links to read online via Project Gutenberg or Internet Archive. Useful for research agents, reading recommendation workflows, literature analysis, and bibliographic data enrichment.
| Name | Required | Description | Default |
|---|---|---|---|
| isbn | No | ISBN-10 or ISBN-13 for exact lookup. | |
| limit | No | Max results (default 5, max 20). | |
| title | No | Book title to search (e.g. 'Pride and Prejudice', 'Moby-Dick'). | |
| author | No | Author name to search (e.g. 'Jane Austen', 'Herman Melville'). | |
| subject | No | Subject/genre filter (e.g. 'science fiction', 'philosophy', 'Victorian literature'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavior fully. It describes the read operation and data source, but does not address rate limits, authentication, data freshness, or potential errors. This is a moderate gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences: purpose, output, use cases. It is well-structured and front-loaded, though the use-case sentence is somewhat generic and could be omitted without loss.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 5 non-required parameters and no output schema, the description covers inputs and outputs well. However, it does not explain default behavior when no parameters are provided, nor error scenarios. Still fairly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema already describes all 5 parameters with 100% coverage. The description adds value by explaining what returned data (subjects, cover images, etc.) corresponds to each parameter, going beyond schema definitions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool looks up books by title, author, or ISBN via Open Library, listing specific output fields (year, subjects, page count, etc.). It effectively distinguishes itself from unrelated siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions use cases like research and recommendation workflows, but does not explicitly state when not to use or provide alternatives. Since no similar sibling exists, it's adequate but lacks exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
clinical-trialsAInspect
Search ClinicalTrials.gov for clinical studies by condition, intervention, or keyword. Returns phase, status, enrollment, sponsor, and dates. NIH primary source — no markup. $0.005/call.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Maximum results to return (1-50). Default 10. | |
| phase | No | Trial phase filter (optional). Omit to include all phases. | |
| status | No | Study recruitment status filter. Default: 'recruiting'. | |
| sponsor | No | Filter by lead sponsor name (partial match). E.g. 'Pfizer', 'NIH'. | |
| condition | Yes | Disease or condition to search (e.g. 'diabetes', 'lung cancer', 'Alzheimer'). Also accepts free-text keywords. | |
| intervention | No | Drug, device, or intervention name to filter by (optional). E.g. 'metformin', 'CAR-T'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses source and cost but lacks details on pagination, default sorting, rate limits, or error handling. The basic read-only behavior is implied, but more specificity would improve transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences: first states the action, second lists returns, third adds source and cost. No wasted words, front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (6 defined params, no output schema) and lack of annotations, the description covers purpose, returns, and source adequately. It misses output structure details, but the schema's parameter descriptions and the tool's straightforward nature keep it mostly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% parameter description coverage, so the schema already explains each parameter. The description adds no additional parameter meaning beyond summarizing search fields, meeting the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool searches ClinicalTrials.gov for clinical studies by condition, intervention, or keyword, and lists the return fields. It distinguishes itself from sibling tools by specifying the authoritative NIH source and a cost per call.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context by stating the source (NIH) and cost, implying authority and cost-consciousness, but does not explicitly mention when not to use this tool or provide alternatives. The unique purpose among siblings partially compensates.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
code-api-surfaceAInspect
Analyzes a code snippet and returns its API surface: HTTP routes (method + path), exported symbols, and middleware. Supports Express, FastAPI, Flask, Django, Spring Boot, ASP.NET, Rails, Gin. Pure static analysis — no code execution. Returns JSON with routes[], exports[], middleware[], lang, framework, and a plain-English summary. $0.10/call.
| Name | Required | Description | Default |
|---|---|---|---|
| code | Yes | Source code to analyze (any language/framework). Max ~50KB recommended. | |
| detail | No | Output scope: 'full' (default) = all fields; 'routes' = HTTP routes only; 'exports' = exported symbols only. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It explicitly states 'pure static analysis — no code execution', mentions cost per call, and indicates recommended max code size in schema. It does not cover error handling or rate limits but is sufficiently transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four sentences with no fluff. Front-loaded with main purpose, then supported frameworks, safety note, and output/cost info. Every sentence adds unique value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given complexity and no output schema, the description covers purpose, inputs, output fields, supported frameworks, and safety. It lacks error handling or performance details, but is complete enough for an agent to decide to call.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% so baseline is 3. The description mentions return structure (routes[], exports[], etc.) which provides output context, but adds no new information about parameters beyond what the schema already describes for 'code' and 'detail'.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it analyzes code snippets to return API surface, listing specific supported frameworks and specifying static analysis. It distinguishes from siblings like code-test-detector which focus on test detection.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context on when to use (code analysis for API surface) and which frameworks are supported, but lacks explicit exclusions or alternatives. It implies usage for any of the listed frameworks.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
code-test-detectorAInspect
Detects testing frameworks and test coverage presence in a code snippet or GitHub repository. For code snippets: identifies test functions, assertions, mocks, fixtures, and frameworks (Jest, pytest, go test, JUnit, RSpec, etc.). For GitHub repos: counts test files vs source files, surfaces config files, and gives a coverage verdict. No code execution — pure static analysis.
| Name | Required | Description | Default |
|---|---|---|---|
| code | No | Code snippet to analyze for test patterns. | |
| filename | No | Optional filename for the code snippet (e.g. 'utils.test.js') — helps confirm test file naming conventions. | |
| github_repo | No | GitHub repository in 'owner/repo' format (e.g. 'facebook/react'). Analyzes the full repo file tree for test coverage. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It clearly states 'No code execution — pure static analysis', which discloses safety and read-only nature. It also details what is identified (test functions, assertions, etc.) for both modes, providing rich behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with the main verb, and efficiently covers both modes without redundancy. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a tool with 3 optional parameters and no output schema, the description is fairly complete. It covers both input modes and what to expect (e.g., 'gives a coverage verdict'). However, it does not describe the return format, which would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% so baseline is 3. The description adds meaningful context beyond schema: explains the purpose of 'filename' in confirming naming conventions, and describes the 'github_repo' analysis scope. This adds value above the schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses a specific verb ('Detects') and resource ('testing frameworks and test coverage presence') and clearly distinguishes two usage modes: code snippets and GitHub repos. It is not a tautology and differentiates from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states the two primary use cases (code snippet analysis and GitHub repo analysis), providing clear context for when to use. However, it does not explicitly mention when not to use or provide alternatives, which would improve guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
commodity-futuresAInspect
Returns live price and intraday metrics for major commodity futures: crude oil, natural gas, gold, silver, copper, platinum, wheat, corn, soybeans, and coffee. Includes price, day high/low, 52-week range, previous close, exchange, and contract name. Filter by commodity name or category (energy/metals/agricultural). $0.010/call — Yahoo Finance free API, no key required.
| Name | Required | Description | Default |
|---|---|---|---|
| category | No | Filter by category: energy (crude_oil, natural_gas), metals (gold, silver, copper, platinum), agricultural (wheat, corn, soybeans, coffee), or all. Default: all. Ignored if commodities list is provided. | |
| commodities | No | Specific commodities to fetch. One or more of: crude_oil, natural_gas, gold, silver, copper, platinum, wheat, corn, soybeans, coffee. Omit to get all commodities (limited by category if provided). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description implies a read-only operation and mentions the free API and cost, but does not explicitly state non-destructive nature or other behavioral traits. No annotations exist to contradict.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences that front-load purpose, list included data, and mention filtering and cost. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description sufficiently explains what data the tool returns (price, high/low, range, previous close, exchange, contract name) and how to use it, despite lacking an output schema. No gaps are apparent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described. The description rephrases the same filtering options, adding minimal new meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns live price and intraday metrics for major commodity futures, listing specific commodities and metrics. It is distinct from sibling tools which cover stocks, crypto, etc.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains when to use (to fetch commodity futures data) and how to filter, but does not explicitly state when not to use or provide alternatives among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
company-due-diligenceAInspect
AI-agent due diligence on any company. Queries SEC EDGAR for public company data (CIK, ticker, SIC, address, filing history) and optionally analyzes the company website for contact details, social profiles, and legitimacy signals. Returns a structured report with risk flags. Accepts company name plus optional domain.
| Name | Required | Description | Default |
|---|---|---|---|
| domain | No | Optional company website domain or URL (e.g. 'stripe.com'). When provided, adds website-based intelligence to the report. | |
| company | Yes | Company name to look up (e.g. 'Coinbase Global', 'Stripe Inc'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must convey all behavioral traits. It discloses that it queries SEC EDGAR and optionally analyzes websites, but omits details like external API calls, latency, authentication needs, or side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, efficiently conveying purpose, data sources, and optional functionality without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description mentions a 'structured report with risk flags' but lacks details on report fields. For a complex due diligence tool, slightly more detail would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and the description closely mirrors the schema's parameter descriptions. It adds minimal new meaning beyond restating the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool performs due diligence on companies using SEC EDGAR and optionally website analysis, returning a structured report with risk flags. It distinguishes itself from siblings like 'company-intel' and 'web-company-intel' by combining SEC data and website signals.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies the tool is for comprehensive company due diligence using SEC data and optional website checks. It does not explicitly contrast with alternatives, but the context of 'AI-agent due diligence' helps guide use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
company-intelAInspect
Returns SEC EDGAR due diligence data for any US public company by ticker symbol: legal name, CIK, SIC industry code and description, state of incorporation, fiscal year end, SEC filer category, primary business location, and 2-year filing history (10-K/10-Q/8-K counts and most-recent dates). Use before any agent task involving US public company identification, regulatory filing assessment, financial analysis, or industry classification. Free upstream: SEC EDGAR public API (US government data, no key required, always current).
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. 'AAPL', 'MSFT', 'TSLA'). Case-insensitive. Standard tickers only — class-suffix tickers like BRK.A may need to be submitted as BRKA. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses the data source (SEC EDGAR public API), cost (free), and currency (always current). However, it doesn't mention rate limits or potential latency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences: first lists outputs, second gives usage context, third notes data source. Every sentence adds value, front-loaded, no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a single-parameter tool with no output schema, the description fully covers all returned data fields and usage context. No missing critical information.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline 3. Description adds useful nuance beyond schema, such as case-insensitivity and handling of class-suffix tickers like BRK.A being submitted as BRKA.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns SEC EDGAR due diligence data for US public companies by ticker symbol, listing specific data fields. It distinguishes itself from sibling tools like company-due-diligence and sec-filing-intel by focusing on comprehensive due diligence data including filing history.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states to use before tasks involving US public company identification, regulatory filing assessment, financial analysis, or industry classification. While it doesn't mention alternatives, the guidance is clear and contextual.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
concentration-risk-scoreAInspect
Returns a concentration-risk score for an x402 pay_to wallet: HHI, unique payer count, top-payer share, persistence across scans, and a risk tier (LOW / MEDIUM / HIGH / CRITICAL). An agent uses this to assess whether an endpoint is a single-agent dependency before building a workflow that depends on it.
| Name | Required | Description | Default |
|---|---|---|---|
| address | Yes | The pay_to wallet address to score (0x-prefixed, 42 hex chars). | |
| window_days | No | Observation window in days (default 7, max 30). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries burden. Describes computation and return of a score, implying read-only behavior, but doesn't explicitly state non-mutation, auth needs, or other constraints. Adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first lists return components, second explains use case. No wasted words, front-loaded, efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Lists all return components (HHI, payer counts, risk tier) and explains purpose. However, lacks definitions of metrics like HHI or persistence, which may be needed for interpretation. No output schema, but description covers main aspects.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema provides 100% coverage with descriptions for both parameters (address format, window_days default/max). Description does not add significant new meaning beyond listing return metrics, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it returns a concentration-risk score for an x402 pay_to wallet, listing specific metrics (HHI, payer count, top-payer share, persistence, risk tier). Describes agent use case for assessing single-agent dependency, distinguishing it from general wallet tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states when an agent should use this tool: to assess whether an endpoint is a single-agent dependency before building a workflow. Doesn't mention alternatives or when not to use, but the use case is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
congressional-tradesAInspect
US Congressional stock trades (STOCK Act disclosures). Two modes: supply a ticker to get all congress member trades in that stock (with excess-return-vs-SPY performance), or omit ticker for recent market-wide congressional activity. Returns representative, party, chamber, transaction type, dollar range, dates, and historical performance vs. SPY. Sourced from Quiver Quant's STOCK Act aggregator — no API key required. $0.022/call.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Maximum trades to return. Default 25, max 100. | |
| ticker | No | Stock ticker to filter by (e.g. AAPL, NVDA). Case-insensitive. Omit for market-wide recent trades. | |
| chamber | No | Filter by congressional chamber. Default: all. | |
| transaction_type | No | Filter by transaction direction. Default: all. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses the data source (Quiver Quant), output fields, and cost. It lacks explicit mention of limitations (e.g., data freshness, depth of history) but provides a good overview. The description does not contradict any annotations (none provided), and it adds useful behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four well-structured sentences: purpose, modes, returns, and source/cost. It is front-loaded and avoids redundancy, with each sentence serving a clear role. No extraneous information, making it ideal for quick agent consumption.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description effectively lists return fields. It explains the two usage modes and provides cost and source. It does not cover error cases or default behaviors beyond the schema, but for a straightforward tool with optional parameters, it is sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers all 4 parameters with descriptions (100% coverage). The description adds value by explaining the role of ticker (mode switch) and connecting it to performance data. The other parameters (limit, chamber, transaction_type) are left to the schema, which is sufficient. The description goes beyond the schema for the ticker parameter.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool is for US Congressional stock trades from STOCK Act disclosures and explicitly explains two distinct usage modes (with ticker vs. without ticker). It also lists the fields returned, making the purpose unambiguous and distinguishable from sibling tools like insider-trades.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly outlines two modes: supply a ticker for specific stock trades or omit for market-wide activity. It also mentions cost and no API key required, aiding the agent's decision to use it. While it doesn't explicitly name alternatives, the modes are sufficiently contextualized for the intended use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
consumer-briefAInspect
AI-synthesized US consumer health briefing. Fetches 8 FRED signals (Michigan sentiment, retail sales MoM, real PCE, real disposable income, savings rate, total/revolving consumer credit) and uses GPT-4o-mini to produce consumer posture, spending regime, confidence level, savings stress, credit dependency, 150-word narrative, dominant risk, and agent implication. One call collapses 8 FRED lookups + LLM synthesis for retail sector exposure, recession probability, and consumer credit risk.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the tool fetches 8 FRED signals and uses GPT-4o-mini for LLM synthesis, indicating external calls and processing behavior. Without annotations, it carries the transparency burden well, though it could mention response time or data freshness.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single dense paragraph containing all key information. It is concise but could benefit from bullet points or clearer separation of input vs. output. Still, every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately lists the seven output components (consumer posture, spending regime, etc.) and mentions the narrative and risk. It covers the tool's purpose and output comprehensively.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With zero parameters, the description adds meaning by explaining the data sources and output components. Since schema coverage is 100% and no params exist, a baseline of 4 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool generates an AI-synthesized US consumer health briefing, which is a specific verb+resource combination. It distinguishes from sibling tools like labor-brief and housing-brief by focusing on consumer health using FRED signals.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description outlines when to use the tool for retail sector exposure, recession probability, and consumer credit risk. It does not explicitly list when not to use it or compare to alternatives, but the specific output context implies appropriate usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
country-infoAInspect
Country information lookup by name, ISO code (alpha-2 or alpha-3), or capital city. Returns: official name, ISO codes, capital, population, area (km²), region, languages, currencies (code + symbol), borders, calling code, timezones, and flag image URL. Useful for international business agents, geographic enrichment, currency identification, and compliance workflows.
| Name | Required | Description | Default |
|---|---|---|---|
| code | No | ISO 3166-1 alpha-2 (e.g. 'DE') or alpha-3 (e.g. 'DEU') code for exact lookup. | |
| name | No | Country name or partial name to search (e.g. 'Germany', 'United States', 'Korea'). | |
| region | No | Return all countries in a region. | |
| capital | No | Capital city name to look up the country by (e.g. 'Berlin', 'Tokyo'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must fully disclose behavior. It lists returned fields but does not explain how multiple parameters interact (e.g., if both name and code provided), error handling, or whether partial name searches return multiple matches. This is insufficient for an agent to reliably invoke the tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, no fluff. First sentence front-loads the action and search methods. Second sentence lists output fields concisely. Third sentence provides context. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple lookup tool with no output schema, the description adequately covers search options and return fields. However, it omits details on behavior when multiple parameters are supplied, error conditions, or case sensitivity. Still, it is mostly complete for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has 100% coverage with descriptions for all 4 parameters. The description adds no significant meaning beyond what the schema already provides, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: country lookup by name, ISO code, or capital. It lists the returned fields and distinguishes itself from siblings like city-lookup and geocode by focusing on country-level data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions use cases (international business, geographic enrichment, etc.) but does not provide guidance on when to use this tool versus alternatives, nor does it specify when not to use it. Lacks explicit exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
credit-spreadsAInspect
Returns current US corporate credit spreads from ICE BofA indices via FRED (free, no API key): High Yield OAS, Investment Grade OAS, and BBB (lowest IG tier) OAS. Includes HY-IG differential and risk regime classification. Pairs with treasury-yields for complete fixed-income discount rate construction.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden of behavioral transparency. It discloses the data source (FRED, free, no API key) and lists returned metrics. However, it lacks information on update frequency, data latency, output format, or error handling, which limits transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, consisting of two sentences. The first sentence clearly enumerates the tool's outputs, and the second provides a usage context by linking to a sibling tool. There is no redundant or irrelevant information, and the most critical details are front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (zero parameters, no output schema), the description covers key aspects: specific metrics returned, data source, and pairing advice. It lacks details on error cases or data freshness, but overall it is sufficiently complete for a straightforward data retrieval tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has zero parameters, so the description does not need to add parameter meaning. According to the guidelines, a baseline of 4 is appropriate for tools with no parameters. The description does not mention any parameters, which is consistent and acceptable.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly identifies the tool's purpose: returning current US corporate credit spreads from ICE BofA indices via FRED. It lists specific metrics (High Yield OAS, Investment Grade OAS, BBB OAS, HY-IG differential, risk regime) and distinguishes itself from the sibling tool 'treasury-yields' by mentioning they pair together for discount rate construction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states that the tool pairs with 'treasury-yields' for complete fixed-income discount rate construction, providing clear context on when to use it. However, it does not include when-not-to-use scenarios or alternative tools, missing an opportunity for stronger guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
crypto-fear-greedAInspect
Crypto Fear & Greed Index — current score (0=extreme fear, 100=extreme greed), 7-day trend, 30-day min/max/avg, and trading regime signal. Free alternative.me data updated daily. $0.005/call.
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | History depth for trend and summary (1–30). Default: 7. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses update frequency and cost, but does not discuss rate limits, authentication needs, or output format. For a read-only data retrieval tool, this is adequate but not thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with key outputs, no wasted words. Efficiently conveys purpose, data source, and cost.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one optional param, no output schema), the description covers the main return values and operational details. It could briefly note that the index is for the overall crypto market, but this is implied. Still, it is mostly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% for the single parameter 'days'. The description adds context like 'trend and summary' but does not significantly enhance understanding beyond the schema's description of 'History depth for trend and summary (1–30). Default: 7.' Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides the Crypto Fear & Greed Index with specific outputs: current score, 7-day trend, 30-day min/max/avg, and trading regime signal. It is distinct from sibling tools like 'crypto-pulse' or 'crypto-news-impact' which focus on different aspects.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions the data source ('Free alternative.me data updated daily') and cost, providing context for when to use it. However, it does not explicitly state when not to use it or mention alternatives among the many sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
crypto-fiat-priceAInspect
Cryptocurrency price in any fiat currency — JPY, EUR, CNY, GBP, KRW, INR, AUD, BRL, or 80+ more. Input a coin name or CoinGecko ID (bitcoin, ethereum, solana, btc, eth, sol, etc.) and one or more currency codes. Returns current price, 24h percent change per currency, and last updated timestamp. Free upstream: CoinGecko public API (no key). 85% below specialized fiat oracles. Useful for Asian/European market agents, cross-border DeFi pricing, and multi-currency portfolio valuation.
| Name | Required | Description | Default |
|---|---|---|---|
| coin | Yes | Cryptocurrency ID or common ticker (e.g. bitcoin, ethereum, solana, btc, eth, sol, bnb, xrp). CoinGecko IDs also accepted. | |
| currencies | No | Comma-separated fiat codes to return (e.g. jpy,eur,cny or usd,gbp,krw,inr). Default: usd,jpy,eur. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries the full burden and discloses that it returns current price, 24h percent change, and last updated timestamp. It states the upstream is CoinGecko public API (free, no key). It could mention rate limits or data freshness but is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, consisting of four well-structured sentences. Each sentence adds value: purpose, input details, output details, and use cases. No redundant or irrelevant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has only 2 parameters and no output schema, the description adequately covers input, output, and use cases. It explains return values (price, 24h change, timestamp) and provides enough context for an agent to select and invoke the tool appropriately.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents both parameters. The description adds value by providing concrete examples (e.g., bitcoin, btc) and mentioning the default for currencies, which enhances understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns cryptocurrency price in any fiat currency, listing specific coin names and currency codes. It distinguishes from sibling tools like 'forex-rates' by focusing on crypto-to-fiat conversion and mentions use cases for Asian/European markets.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context on when to use this tool: free, no API key, useful for multi-currency portfolio valuation. However, it does not explicitly mention alternatives or when not to use it, leaving room for improvement.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
crypto-news-impactAInspect
Latest cryptocurrency news headlines from CoinDesk with live price correlation for mentioned assets. Returns up to 10 recent articles — each with title, URL, published timestamp, primary category, and a keyword-derived sentiment signal (bullish/bearish/neutral). For each article, identified crypto assets (BTC, ETH, SOL, etc.) are enriched with current USD price and 24h price change. Use before any crypto research, portfolio review, or market sentiment task to understand what news is driving the market right now. Data: CoinDesk RSS (TTL 5 min) + CoinGecko prices.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of headlines to return (1–20). Default: 10. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description fully bears the burden. It discloses data sources (CoinDesk RSS with 5-min TTL, CoinGecko prices), return limits ('up to 10' but schema allows up to 20, a minor inconsistency), and behavior (enriches articles with asset prices). This is thorough for a read-only tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph, well-structured, front-loading the core purpose. It efficiently lists return fields and usage advice without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description thoroughly explains return fields (title, URL, timestamp, category, sentiment, enriched assets with price change). Parameter count is minimal (1), and the description covers all relevant aspects for a news fetching tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the single 'limit' parameter. The description adds context ('up to 10 recent articles') but does not significantly extend beyond the schema's own description. The baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides latest cryptocurrency news headlines with live price correlation, explicitly stating 'Returns up to 10 recent articles' with specific fields. It distinguishes from siblings by positioning itself as the go-to tool for understanding market-driving news, contrasting with other crypto tools like crypto-fear-greed or crypto-pulse.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description includes direct usage guidance: 'Use before any crypto research, portfolio review, or market sentiment task'. It does not explicitly state when not to use or name alternatives, but the context is clear and actionable.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
crypto-pulseAInspect
Crypto market pulse — latest Ethereum (or Base) block context plus top crypto gainers and losers by 24h change, in a single call. Returns: block_number, timestamp, gas info (base_fee_gwei, gas_used/limit, tx_count), and top movers (symbol, name, price_usd, change_24h, volume_24h, market_cap). Use for crypto portfolio context, on-chain/market correlation, or pre-trade situational awareness. Free upstream: DRPC + CoinGecko. $0.007/call — 30% below the comparable two-endpoint chain.
| Name | Required | Description | Default |
|---|---|---|---|
| network | No | Chain for block context. Default: ethereum. | |
| movers_limit | No | Number of gainers + losers each (1–20). Default 5. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Discloses it's a single call, returns specific data, mentions free upstream sources (DRPC + CoinGecko) and cost ($0.007/call). No destructive implications.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two well-structured sentences that immediately state purpose, then list return fields. Efficient and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, description details all return fields: block_number, timestamp, gas info, top movers with specific fields. Also covers cost and data sources. Complete for a data retrieval tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions. The description adds context about combining block and market data but doesn't provide additional parameter semantics beyond defaults already in schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool returns 'latest Ethereum (or Base) block context plus top crypto gainers and losers' with detailed field listing. Distinct from siblings like 'eth-block' or 'crypto-top-movers' by combining both aspects.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides explicit use cases: 'crypto portfolio context, on-chain/market correlation, or pre-trade situational awareness.' Does not explicitly state when not to use or alternatives, but context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
crypto-top-moversAInspect
Real-time cryptocurrency market snapshot: top 5 gainers and top 5 losers by 24-hour percentage change (among the top 100 coins by market cap), plus the 10 largest coins by market cap with current prices and 24h change. Also returns global market statistics: total market cap (USD), BTC dominance percentage, and 24h trading volume. Stablecoins excluded from movers ranking. Use before any crypto portfolio, trading, or market analysis task to get a current regime read. Data: CoinGecko public API (refreshes every 1–5 minutes).
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses that stablecoins are excluded from movers ranking, data comes from CoinGecko public API, and refreshes every 1-5 minutes. This is adequate for a read-only market snapshot tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and well-structured, front-loading the core output (top movers, largest coins, global stats) in the first sentence. Every sentence adds value, and there is no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description covers all necessary return values: top 5 gainers and losers, top 10 largest coins, global market cap, BTC dominance, volume. It also notes stablecoin exclusion and data source/refresh rate. Complete for a zero-parameter tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has no parameters (100% coverage), so baseline is 4. The description adds meaning by detailing what the tool returns without needing any input, which aligns with zero-parameter tools.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides a real-time cryptocurrency market snapshot including top gainers, losers, largest coins, and global stats. It uses specific language and distinguishes from siblings by indicating it is for a 'current regime read' before any crypto analysis.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use before any crypto portfolio, trading, or market analysis task to get a current regime read.' This provides clear when-to-use guidance. It does not list alternatives or when-not-to-use, but the context is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
db-perf-intelAInspect
Database performance intelligence: current versions, EOL status, and benchmark-grounded performance profiles for PostgreSQL, MySQL, MariaDB, MongoDB, Redis, Elasticsearch, SQLite, Cassandra, CockroachDB, and SQL Server. Useful mid-task for infrastructure audits, database selection, and upgrade urgency checks. Live EOL data from endoflife.date; performance profiles from TPC-C, pgbench, sysbench, and YCSB benchmarks.
| Name | Required | Description | Default |
|---|---|---|---|
| include | No | What to include: 'all' (default), 'versions' (EOL/release data only), 'performance' (benchmark profiles only). | |
| database | No | Database to query. Accepts: postgresql, mysql, mariadb, mongodb, redis, elasticsearch, sqlite, cassandra, cockroachdb, mssql. Omit to return all supported databases. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It describes the data sources (live EOL from endoflife.date, benchmarks) but does not disclose any side effects, rate limits, or error handling. It is clearly read-only, so no contradiction, but more detail could improve transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three well-structured sentences: purpose, use cases, and data sources. It is concise without being vague, though it could be slightly more compact. Word choice is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simplicity (no required params, no output schema), the description gives enough context about what is returned (versions, EOL, performance profiles). It is complete for its complexity, though a brief note on return format would push it to 5.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so both parameters are already documented. The description adds context by listing the supported databases and data types, but it does not add meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides database versions, EOL status, and performance profiles for a specific list of databases. The verb 'provides' is implied, and it distinguishes itself from siblings by covering a unique domain (database performance intelligence).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It explicitly states the tool is useful for infrastructure audits, database selection, and upgrade urgency checks, giving clear context. However, it does not mention when NOT to use it or provide alternative tools, which would make it a 5.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
defi-market-pulseAInspect
Combined DeFi yield intelligence and market momentum in one call — 33% cheaper than separate yield-farming-active + market-movers calls ($0.009). Returns top yield pools from DeFiLlama, crypto and equity market movers, and a cross-signal layer that flags 'boosted' pools (high APY + rising token) vs 'at_risk' pools (high APY + falling token). Use for capital allocation decisions, pre-trade DeFi context, and portfolio rebalancing signals.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Filter yield pools by blockchain (e.g. 'Ethereum', 'Base', 'Arbitrum'). Case-insensitive. Omit for all chains. | |
| min_apy | No | Minimum 30-day mean APY percentage. Default 5%. | |
| protocol | No | Filter yield pools by protocol name (e.g. 'aave-v3', 'uniswap-v3'). Substring match. | |
| min_tvl_usd | No | Minimum TVL in USD for yield pools. Default $1M. | |
| yield_limit | No | Number of yield pools to return (1–30, default 15). | |
| market_limit | No | Number of market movers per category (1–20, default 10). | |
| include_equity | No | Include equity market movers (US stocks). Default false — crypto only for speed. | |
| stablecoin_only | No | Return only stablecoin yield pools (no impermanent loss). Default false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses data sources (DeFiLlama, crypto/equity markets), cross-signal logic, and cost. No annotations provided, so description carries full burden. Could mention data freshness or rate limits, but is still transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences plus usage statement, front-loaded with key benefits. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Explains returns (yield pools, market movers, cross-signal layer) and includes usage. No output schema, but description partially compensates by naming 'boosted' and 'at_risk' pools. Could elaborate on return structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with detailed descriptions. The description adds little beyond schema for parameters, so baseline score applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides combined DeFi yield intelligence and market momentum, listing specific outputs (yield pools, market movers, cross-signal layer). It distinguishes from siblings by noting cost savings over separate calls.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states use cases: capital allocation, pre-trade context, portfolio rebalancing. Mentions cost savings vs. alternatives. However, lacks explicit when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
defi-portfolioAInspect
Multi-chain DeFi portfolio scanner. Returns token holdings and USD values for any EVM wallet across Ethereum, Base, Polygon, and Arbitrum mainnet. Covers ETH, major stablecoins (USDC, USDT, DAI), WBTC, ARB, cbETH, and chain-native assets. Collapses the defi.hugen.tokyo/defi/address seam — 41 payers. $0.007/call.
| Name | Required | Description | Default |
|---|---|---|---|
| chains | No | Chains to scan. Defaults to all four. | |
| address | Yes | EVM wallet address (0x + 40 hex chars). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the transparency burden. It mentions collapsing a 'seam' and the cost per call, providing some insight into the underlying service, but does not disclose rate limits, latency, or potential error behaviors, leaving gaps for a read-only scanner.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is succinct and front-loaded with the core purpose, but includes extraneous details like '41 payers' and cost per call that may confuse an AI agent. It earns its place overall but has minor noise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately explains return values (token holdings and USD values) and covered assets/chains. It lacks details on pagination, limits, or error handling, but is sufficiently complete for a straightforward portfolio scanner.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema description coverage, the baseline is 3. The description adds value by explaining the chains parameter defaults to all four and listing covered assets, which enriches semantic understanding beyond the schema definitions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is a multi-chain DeFi portfolio scanner that returns token holdings and USD values for any EVM wallet, specifying supported chains (Ethereum, Base, Polygon, Arbitrum) and assets. It effectively differentiates from sibling tools like wallet-balance by its multichain scope and portfolio focus.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implicitly indicates when to use this tool: for a comprehensive multi-chain portfolio snapshot. However, it does not explicitly state when not to use it or list alternative tools for single-chain or simpler balance checks, which would improve guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
defi-state-packAInspect
Returns Ethereum block header + stablecoin depeg status + top DeFi yield farming pools in one call. Collapses the 3-hop eth-block → stablecoin-watch → yield-farming chain (PROSPECTOR signals 64812 + 60977, strength 1.0). All three upstreams fetched in parallel. Supports Ethereum, Base, Polygon, Arbitrum. Filter yield pools by chain, protocol, min TVL, min APY. Free upstreams — DRPC + DeFiLlama — no API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| block | No | Block number (integer or 0x hex) or tag: latest/pending/earliest/safe/finalized. Default: latest. | |
| min_apy | No | Minimum 30-day mean APY (%) for yield pool inclusion. Default 0. | |
| network | No | Chain to query for block data. Default: ethereum. | |
| top_pools | No | Max number of yield pools to return, sorted by 30-day mean APY desc. Default 10, max 25. | |
| alert_only | No | If true, only return stablecoins that are depegged (MILD_DEPEG or worse). | |
| min_tvl_usd | No | Minimum TVL in USD for yield pool inclusion. Default 1000000 ($1M). | |
| yield_chain | No | Filter yield pools by blockchain (e.g. 'Ethereum', 'Base', 'Arbitrum'). Case-insensitive. Omit for all chains. | |
| yield_protocol | No | Filter yield pools by protocol name (e.g. 'aave-v3', 'uniswap-v3'). Case-insensitive substring. | |
| stablecoin_symbol | No | Filter stablecoins to a specific symbol (e.g. USDT, USDC, DAI). Omit to return top 10 by market cap. | |
| stablecoin_pools_only | No | If true, only return stablecoin-only yield pools. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description carries full burden. It transparently states parallel fetching, free upstreams, no API key, supported chains, and filtering. However, it does not mention response structure, error handling, or potential upstream failures.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with the main purpose, no redundant information. Efficiently conveys key points: combination, parallel, chains, filters, free access.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite 10 optional parameters and no output schema, the description lacks details on response structure, error handling, and what happens if an upstream fails. For a composite tool, this is a significant gap.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with individual parameter descriptions. The description adds summary context (e.g., filters) but doesn't significantly enhance individual parameter meaning beyond the schema. Slight improvement from baseline 3 due to clarifying usage of string booleans.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool combines three resources (Ethereum block header, stablecoin depeg status, top DeFi yield pools) in one call, distinguishing it from siblings like eth-block, stablecoin-watch, and yield-farming-active which each handle only one.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use when all three data types are needed and mentions parallel fetching, but lacks explicit guidance on when not to use it or alternatives. It does not state to use specific sibling tools for individual needs.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
defi-yieldsAInspect
Returns top DeFi yield pools ranked by APY from DeFiLlama. Covers 16,000+ pools across 400+ protocols and 50+ chains (Ethereum, Base, Solana, Arbitrum, Polygon, etc.). Filter by chain, minimum APY, minimum TVL, or stablecoin-only. Each pool includes APY breakdown (base + reward), TVL, 7-day APY change, and a direct DeFiLlama link. Sourced from DeFiLlama public API — no key required, updated continuously.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Filter to a specific chain (e.g. Ethereum, Base, Solana, Arbitrum, Polygon). Case-insensitive. Omit for all chains. | |
| limit | No | Max results to return (default 20, max 50). | |
| min_apy | No | Minimum APY percentage (e.g. 5 for 5%). Default: 0. | |
| min_tvl_usd | No | Minimum TVL in USD (e.g. 1000000 for $1M). Default: 100000. | |
| stablecoin_only | No | If true, return only stablecoin-denominated pools. Default: false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Discloses it is a read operation from a public API with no key needed and continuous updates. Describes response content (APY breakdown, TVL, 7-day change, link). Does not mention rate limits or errors, but is transparent enough for a safe read tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Concise paragraph of four sentences, front-loaded with purpose, then coverage, filters, and response. No redundant or verbose content.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers purpose, data source, filters, and response fields. Lacks mention of sorting behavior (implied by 'ranked by APY') and any error handling, but is largely complete for a simple data retrieval tool without an output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, providing full parameter descriptions. The description reiterates these filters without adding significant new semantics, staying at baseline.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool returns top DeFi yield pools ranked by APY from DeFiLlama, specifying the exact resource and action. It distinguishes from siblings by focusing on yield pool rankings rather than broader DeFi data or yield farming positions.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Describes when to use: to get top yield pools with optional filters. No explicit when-not-to-use or alternatives, but the context of ranking and filters is clear enough for an agent to decide.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
dex-pair-searchAInspect
Search DEX trading pairs for any token (by symbol, name, or contract address) across 50+ chains including Ethereum, Solana, Base, BSC, Arbitrum, Polygon, and Avalanche. Returns top pairs by liquidity with real-time price, 24h volume, buy/sell transaction counts, price change %, and buy pressure metric. Free via DexScreener. Ideal for agents tracking on-chain trade flow, entry/exit signals, or multi-chain token prices without maintaining DEX integrations.
| Name | Required | Description | Default |
|---|---|---|---|
| q | Yes | Token symbol (e.g. SOL, ETH, USDC), name, or contract address to search. | |
| chain | No | Optional: filter to a specific chain ID (solana, base, eth, bsc, arbitrum, polygon, etc.). | |
| limit | No | Max pairs to return (1–30). Default 10. | |
| min_liquidity_usd | No | Filter pairs with less than this USD liquidity. Default 10000. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses the tool's read-only nature (searching and returning data) and mentions it is free. It does not specify rate limits, auth requirements, or error handling, but the core behavior is transparent enough for an agent to understand it is a query operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a compact paragraph of 4 sentences, each adding value: purpose, scope, return details, and use cases. It is front-loaded with the primary action and avoids redundancy with the schema. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 4 parameters, no output schema, and no annotations, the description covers the tool's purpose, supported tokens, return data, and use cases. It mentions the underlying source (DexScreener). It could be slightly more explicit about optional parameter effects, but the schema handles that. Overall, it is nearly complete for a search tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description adds context by listing example chains and the return fields (price, volume, etc.), but the parameter meanings are already fully explained in the schema. The description does not add significant additional semantic detail beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool searches DEX trading pairs by token symbol, name, or contract address across 50+ chains. It specifies the output includes top pairs by liquidity with real-time data. While siblings like dex-trending-pools exist, the description implicitly distinguishes by focusing on search rather than trending.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context: ideal for tracking on-chain trade flow, entry/exit signals, and multi-chain token prices without DEX integrations. It mentions the tool is free via DexScreener. However, it does not explicitly state when not to use it or provide alternatives, but the use-case guidance is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
dex-swap-quoteAInspect
Best-route DEX swap quote across 20+ chains via Li.Fi aggregator. Returns expected output, exchange rate, gas cost, price impact, and route (DEX/bridge names). Works for same-chain and cross-chain swaps. Use ETH/USDC/WBTC symbols or ERC-20 addresses. Chains: ethereum, base, polygon, arbitrum, optimism, bsc, avalanche, and more. $0.012/call — free upstream.
| Name | Required | Description | Default |
|---|---|---|---|
| slippage | No | Max slippage tolerance as a decimal (0.005 = 0.5%). Default 0.03 (3%). | |
| to_chain | Yes | Destination chain. Same as from_chain for same-chain swaps. Use chain name or ID. | |
| to_token | Yes | Destination token — symbol or ERC-20 contract address. | |
| from_chain | Yes | Source chain — name (ethereum, base, polygon, arbitrum, optimism, bsc, avalanche) or chain ID integer. | |
| from_token | Yes | Source token — symbol (ETH, USDC, WBTC, USDT, MATIC, LINK) or ERC-20 contract address. | |
| from_amount | Yes | Amount of source token to swap, in human-readable units (e.g. '0.1' for 0.1 ETH, '100' for 100 USDC). For unknown tokens, provide wei/smallest-unit string with '!' prefix (e.g. '!1000000'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description must disclose behavior. It reveals it uses Li.Fi aggregator, returns specific data points, and costs $0.012/call. It implies read-only (quote). No destructive behavior stated; could be more explicit about read-only nature, but sufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is compact: 4 sentences. First sentence states core function, second lists outputs, third gives usage examples, fourth lists chains and cost. No wasted words, well front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, description explains return fields (expected output, exchange rate, gas cost, price impact, route). Covers chains and cost. Missing error handling or rate limits, but adequate for a quote tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, baseline 3. Description adds value by explaining token symbol/address usage and the '!' prefix for wei strings, but schema already describes parameters well. No major additional semantics beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it returns a best-route DEX swap quote across 20+ chains via Li.Fi aggregator. It lists specific outputs (expected output, exchange rate, gas cost, etc.) and distinguishes from sibling tools like dex-pair-search (pair search) and dex-trending-pools (trending pools). The verb 'quote' and resource 'DEX swap' are specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description mentions it works for same-chain and cross-chain swaps, provides example tokens and chains. It implicitly advises using this for quotes before execution, though it doesn't explicitly state when not to use it or mention alternatives like dex-pair-search for finding token addresses.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
dex-trending-poolsAInspect
Trending DEX liquidity pools with buy/sell pressure data across multiple timeframes (5m, 1h, 6h, 24h). Sourced from GeckoTerminal (free, no key). Supports 100+ networks: eth, base, bsc, arbitrum, polygon_pos, solana, etc. Returns pool name, price, price change %, volume, buy vs sell transaction counts, and a buy_pressure_24h_pct metric (% of transactions that are buys — above 50% = net accumulation, below 50% = net distribution). Useful for spotting early momentum, identifying which on-chain tokens agents are accumulating, and validating signals from price feeds with raw flow data.
| Name | Required | Description | Default |
|---|---|---|---|
| page | No | Page of trending pools (20 per page). Default 1, max 10. | |
| network | No | Network ID to query. Common: eth, base, bsc, arbitrum, polygon_pos, solana, avax, optimism. Default: base. | |
| buy_pressure_min | No | Only return pools where buy_pressure_24h_pct >= this value (e.g. 60 = at least 60% buys). Omit for no filter. | |
| min_volume_usd_24h | No | Filter pools with less than this 24h volume. Default 10000. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full transparency burden. It discloses the tool is free, requires no API key, supports 100+ networks, and returns specific metrics (price, volume, buy/sell counts, buy_pressure_24h_pct). It does not mention rate limits or pagination behavior beyond max page, but overall provides a good understanding of what the tool does and its limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph of about 6 sentences, front-loaded with the core purpose. It includes all necessary details (timeframes, source, networks, return fields, use cases) without unnecessary repetition. It is efficient and clear, though could benefit from slight restructuring for readability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description effectively explains the return fields (pool name, price, price change %, volume, buy vs sell counts, buy_pressure_24h_pct). It covers parameters, use cases, and data source. It is complete enough for an agent to understand how and when to use the tool, though more detail on edge cases (e.g., empty results) would be helpful.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by explaining the buy_pressure_min parameter in context and defining the buy_pressure_24h_pct metric in detail. This goes beyond the schema's parameter descriptions, providing a richer understanding of how to use the filters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns trending DEX pools with buy/sell pressure data across multiple timeframes, sourced from GeckoTerminal. It specifies the verb ('provides' implied), the resource (liquidity pools), and the distinct data (buy/sell counts, pressure metric). It distinguishes itself from sibling tools like dex-pair-search by focusing on trending pools and flow data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains when to use the tool: 'spotting early momentum', 'identifying which on-chain tokens agents are accumulating', and 'validating signals from price feeds with raw flow data'. It does not explicitly state when not to use it or list alternative tools, but the use cases are clearly defined.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
dividend-calendarAInspect
Upcoming dividend ex-dates from NASDAQ — all stocks going ex-dividend on a given date (default: today) or in the next 1–7 days. Returns symbol, company name, ex-date, record date, payment date, dividend amount, and annual indicated dividend. $0.008/call.
| Name | Required | Description | Default |
|---|---|---|---|
| date | No | Target date in YYYY-MM-DD format. Default: today (UTC). | |
| days_ahead | No | Fetch dividends for this many calendar days starting from `date`. Default: 1 (single day). Max: 7. | |
| min_dividend | No | Filter: minimum dividend amount per share (USD). Default: no filter. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses that the tool returns specific fields and importantly mentions the cost ($0.008/call). This adds value beyond basic read-only behavior. However, it lacks details on rate limits, idempotency, or potential side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence that concisely states the purpose, scope, return fields, and cost. No extraneous information. Front-loaded with the most critical information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately explains the return values by listing the fields. It covers the essential context for a date-range query tool. Could be improved by noting ordering or default limit, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage for all three parameters. The description adds no extra context beyond what the schema already provides (e.g., date format, default values, max days). Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly specifies the tool's purpose: retrieving upcoming dividend ex-dates from NASDAQ for a given date or date range. It identifies the resource (dividends), source (NASDAQ), and scope (all stocks). The list of returned fields distinguishes it from sibling tools like 'dividend-intel'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for fetching upcoming dividend ex-dates with a default of today and a range of 1-7 days. However, it does not explicitly state when to use this tool versus alternatives like 'dividend-intel' or provide when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
dividend-intelAInspect
Full dividend intelligence for any US equity: trailing 12-month yield, forward annual rate, payout frequency (monthly/quarterly/semi-annual/annual), 5-year dividend CAGR, consecutive years paid, consecutive years of growth, and complete 5-year dividend history with dates and amounts. Single call — no API key. Ideal for income screening, dividend safety analysis, and yield comparison across a portfolio.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker (e.g. AAPL, JNJ, KO, T, SCHD). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description covers single-call nature and no API key requirement. However, it omits details on data freshness, rate limits, or what happens with invalid tickers. For a simple data-retrieval tool, this is adequate but not exemplary.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences with no fluff. Front-loaded with 'Full dividend intelligence' and immediately lists contents. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given single parameter, 100% schema coverage, no output schema, the description fully explains what the tool does and what data it returns. Covers all necessary context for an AI agent to use it appropriately.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description for 'ticker' already covers case-insensitivity and examples. Description adds 'any US equity' but doesn't add significant new meaning beyond schema. Schema coverage is 100%, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it provides 'Full dividend intelligence for any US equity' and lists specific data points (yield, frequency, CAGR, history). Distinguishes itself from sibling 'dividend-calendar' by offering comprehensive historical data rather than just upcoming dates.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly mentions ideal use cases: 'income screening, dividend safety analysis, and yield comparison across a portfolio'. Does not explicitly mention when not to use or alternatives, but the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
dns-lookupAInspect
DNS record lookup for any domain via Cloudflare DoH. Supports A, AAAA, MX, TXT, CNAME, NS, SOA, CAA, or ALL record types. TXT lookups include SPF/DMARC/DKIM email security signal extraction. Useful for domain audits, email configuration verification, CDN setup checks, and infrastructure reconnaissance.
| Name | Required | Description | Default |
|---|---|---|---|
| type | No | DNS record type to query. 'ALL' queries A, MX, TXT, NS, and CNAME in parallel and returns all results. Default: 'A'. | |
| domain | Yes | Domain name to look up (e.g. 'github.com', 'mail.google.com'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description partially compensates by noting Cloudflare DoH backend and TXT extraction details, but omits rate limits, caching policy, or response size constraints.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with main action, no fluff, each sentence adds distinct value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 parameters, no output schema), the description is nearly complete: covers purpose, supported records, use cases, and backend. Minor gap on behavioral details.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (both parameters described). Description adds value beyond schema by explaining TXT lookups include email security signals and listing all supported record types.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool performs DNS lookups via Cloudflare DoH, lists supported record types (A, AAAA, etc.), and mentions TXT extraction SPF/DMARC/DKIM. This differentiates it from sibling tools like domain-whois or email-verify.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description enumerates use cases (domain audits, email configuration, CDN checks, infrastructure recon) but does not explicitly state when not to use or name alternatives, leaving slight ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
document-qa-prepAInspect
Prepares a document for question-answering and RAG pipelines. Chunks the input text at paragraph/sentence boundaries, assigns deterministic chunk IDs, estimates token counts, and extracts document metadata (word count, type, headings). Returns ready-to-embed chunks with overlap support. No LLM or external API — pure text processing. Use mid-task when you've fetched a document and need it split before querying a vector store.
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Document text to prepare (plain text, Markdown, or lightly-structured prose). Max 500,000 chars. | |
| metadata | No | Optional key-value metadata to attach to every chunk (e.g. source URL, document ID). | |
| overlap_tokens | No | Token overlap between consecutive chunks for context continuity (default 50, max 512). | |
| chunk_size_tokens | No | Target chunk size in tokens (default 512, max 4096). Uses 4-char-per-token estimate. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full weight. It discloses that processing is local with no LLM calls, chunking at paragraph/sentence boundaries, deterministic IDs, token estimation, and overlap support. This is adequate for understanding behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with no fluff: first sentence states core function, second lists key actions, third gives usage placement. Front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Though no output schema, the description explains returns 'ready-to-embed chunks' and lists specific metadata extracted (word count, type, headings). Sufficient for an agent to understand output without excessive detail.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers all parameters with descriptions (100% coverage). The tool description adds extra context on chunking boundaries and token estimation method, enhancing understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool prepares documents for QA and RAG pipelines, with specific actions like chunking, ID assignment, token estimation, and metadata extraction. It distinguishes itself from a large set of unrelated sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly advises to use 'mid-task' after fetching a document and before querying a vector store. Mentions that it is pure text processing without LLM/API, guiding safe usage. Lacks explicit when-not-to-use scenarios but overall clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
domain-whoisAInspect
Domain WHOIS/RDAP lookup. Returns registration date, expiration date, last updated, registrar, nameservers, status flags, and registrant/admin contact info (where available). Uses RDAP — the structured JSON replacement for WHOIS. Useful for domain due diligence, expiry monitoring, and identifying registrar/owner changes.
| Name | Required | Description | Default |
|---|---|---|---|
| domain | Yes | Domain name to look up (e.g. 'example.com', 'github.io', 'bbc.co.uk'). Strip 'www.' prefix. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, and the description does not disclose behavioral traits such as rate limits, authentication needs, or error handling. It is a read-only lookup, but this is implied rather than stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three short sentences provide all essential information without redundancy. The purpose is front-loaded and every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a one-parameter tool with no output schema, the description covers basic use and data returned. However, it omits details about potential failures, output format, or limitations, which would be helpful for agents.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The sole parameter 'domain' has a clear description with examples and instruction to strip 'www.' prefix. Schema coverage is 100%, and the description adds valuable context beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Domain WHOIS/RDAP lookup' and lists specific data fields returned. It distinguishes from sibling tools like dns-lookup by focusing on registration details.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions use cases: 'domain due diligence, expiry monitoring, and identifying registrar/owner changes.' However, it does not explicitly contrast with similar tools like dns-lookup or ssl-cert.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
drug-intelAInspect
FDA drug intelligence: labeling (warnings, dosage, drug interactions, contraindications, indications), adverse event report summary (top reactions + total count), and recent recall history. Accepts brand or generic name. Data from openFDA — no API key. Useful for pharmaceutical research, clinical AI, drug safety due diligence.
| Name | Required | Description | Default |
|---|---|---|---|
| drug_name | Yes | Drug name — brand or generic (e.g. 'ibuprofen', 'Tylenol', 'metformin', 'Lipitor'). | |
| query_type | No | Which FDA data to retrieve. 'all' returns label + adverse event summary + recent recalls. Default: 'label'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so description carries full burden. It mentions data source ('openFDA — no API key') and input type, but lacks explicit statements about read-only nature, error handling (e.g., unknown drug), or rate limits. Acceptable but not rich.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four short sentences, front-loaded with core functionality. Each sentence adds unique information (data types, input, source, use cases). No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Describes return data (adverse event summary with top reactions + total count) without output schema. Could mention return format (JSON) or structure, but overall covers key outputs sufficiently for a 2-param tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers both parameters with descriptions (100% coverage). Description adds value by clarifying that drug_name accepts brand or generic, and that query_type defaults to 'label'. This enhances understanding beyond raw schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly specifies 'FDA drug intelligence' with enumerated data types (labeling, adverse events, recalls) and input format (brand or generic name). It distinguishes from sibling tools like fda-recall-watch and clinical-trials by covering multiple FDA datasets.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description states it is 'useful for pharmaceutical research, clinical AI, drug safety due diligence,' providing clear context for when to use. It does not explicitly list when not to use or name alternatives, but the use cases adequately guide tool selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
earnings-calendarAInspect
Upcoming US stock earnings — report date, EPS estimate, pre/post-market timing. Filter by ticker or look N days ahead (1–90). Data: Alpha Vantage 3-month calendar, cached 2 hr.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max results (default 20, max 100). Ignored when symbol is provided (returns all matches). | |
| symbol | No | Optional. Ticker to filter by (e.g. NVDA, AAPL). Omit to get all earnings in the window. | |
| days_ahead | No | Calendar days ahead to include (1–90, default 7). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses data source (Alpha Vantage 3-month calendar) and cache duration (2 hr), which is useful context beyond the tool's basic function. No annotations to contradict.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with no redundant text. Every phrase adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Describes return fields and filtering, but could be slightly stronger on ordering or what happens without a symbol. Still fairly complete given tool simplicity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All parameters are already well-described in the input schema (100% coverage). The description adds no new param info beyond confirming the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool provides upcoming US stock earnings info (report date, EPS estimate, timing) and filtering by ticker or days ahead, distinguishing it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies usage (filter by ticker or days ahead) but does not explicitly state when to use versus alternatives like earnings-surprises or dividend-calendar.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
earnings-surprisesAInspect
Historical EPS beat/miss data for any US equity: actual EPS, consensus estimate, surprise %, beat rate, estimate revisions (30-day EPS drift), and next earnings date. Free Yahoo Finance data, no API key. Pairs with analyst-ratings and equity-fundamentals for a complete earnings intelligence stack.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. AAPL, NVDA, MSFT). Case-insensitive. | |
| quarters | No | Number of past quarters to return (1-8, default 4). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions the data source (Yahoo Finance) and that no API key is needed, but does not disclose rate limits, data freshness, or the impact of the quarters parameter on output. It lists return fields but could provide more behavioral details.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, dense sentence that covers purpose, data points, source, and pairing. It is front-loaded and concise, though could be slightly restructured for readability without losing information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately lists the seven key data items returned. It also provides context on data source and complementary tools. However, it does not explain the relationship to the quarters parameter or how results are structured.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, with both parameters well-described (ticker with examples, quarters with range and default). The description does not add new semantic meaning beyond the schema; it merely lists output fields which are not parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides historical EPS beat/miss data for US equities, listing specific data points (actual EPS, consensus estimate, surprise %, beat rate, estimate revisions, next earnings date). It differentiates from sibling tools by mentioning pairing with analyst-ratings and equity-fundamentals.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for historical earnings analysis and suggests pairing with analyst-ratings and equity-fundamentals for a complete stack, but lacks explicit when-to-use or when-not-to-use conditions versus alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
earthquake-intelAInspect
Real-time earthquake intelligence from USGS. Fetch recent global significant quakes (M5.0+ last 7 days) or quakes near a lat/lon (M3.0+ within 500 km). Returns magnitude, depth, location, tsunami flag, and shake intensity. Free USGS FDSN API — no key required.
| Name | Required | Description | Default |
|---|---|---|---|
| lat | No | Latitude in decimal degrees. Required for location mode. | |
| lon | No | Longitude in decimal degrees. Required for location mode. | |
| days | No | Look-back window in days (1–30). Default: 7. | |
| mode | No | recent = global events sorted by time/magnitude; location = near a lat/lon coordinate. Default: recent. | |
| limit | No | Maximum results to return (5–50). Default: 20. | |
| radius_km | No | Search radius in km for location mode (10–1000). Default: 500. | |
| min_magnitude | No | Minimum Richter magnitude to include. Default: 5.0 (recent) or 3.0 (location). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses the API source (USGS FDSN), real-time nature, magnitude thresholds, and that no API key is required. It does not cover rate limits or error handling, but for a read-only tool, the essential behavioral traits are transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences: first introduces the tool and source, second details modes, thresholds, and returned data. Every word adds value, no redundancy or filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 7 parameters and no output schema, the description is fairly complete. It explains the two modes, default values, and lists returned data fields (magnitude, depth, location, tsunami flag, shake intensity). It does not describe pagination or exact response format but covers the essentials for a data retrieval tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds significant value by explaining the two modes, default magnitude thresholds per mode, radius, and days context. It clarifies which parameters are relevant for each mode, going beyond the schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states it fetches earthquake data from USGS with two distinct modes: global significant quakes (M5.0+, last 7 days) and location-based quakes (M3.0+, within 500 km). It lists the returned data fields, clearly distinguishing it from potential siblings like 'usgs-earthquake' by specifying thresholds and functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear guidance on when to use each mode (recent vs location) and default thresholds. However, it does not explicitly mention when not to use this tool or compare it to alternatives like 'usgs-earthquake', which is a sibling tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
economic-calendarAInspect
Upcoming US macro data release schedule: CPI, NFP, FOMC, GDP, PCE, PPI, JOLTS, Retail Sales, Housing Starts, and 20+ more releases with exact dates, times (ET), and market-impact priority. BLS live calendar + Fed/BEA/Census static 2026 schedule. Essential for agents timing trades or building macro-regime signals.
| Name | Required | Description | Default |
|---|---|---|---|
| days_ahead | No | How many calendar days ahead to include (default: 30, max: 90). | |
| category_filter | No | Filter by economic category. Default: 'all'. | |
| priority_filter | No | Filter by priority level. 'high' returns only market-moving releases (CPI, NFP, FOMC, GDP, PCE, PPI, JOLTS). 'high_medium' adds Retail Sales, Housing Starts, Durable Goods. 'all' returns every release. Default: 'all'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses source types (BLS live calendar, Fed/BEA/Census static 2026 schedule), time format (ET returns exact dates/times), and market-impact priority. It does not contradict any annotations (none exist). Additional behavioral details like output format or pagination are not covered, but the core behavior is transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences: first sentence densely packs the tool's purpose and scope; second sentence provides use-case context. No waste, front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given zero required parameters, three optional params, and no output schema, the description adequately covers what the tool does and what it returns (schedule with dates, times, priority). It implies output structure but does not explicitly describe the return format (e.g., list of events). Slight gap but still complete enough for most agents.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds context about US macro releases and explains market-impact priority categories (high includes CPI, NFP, etc.), which reinforces schema descriptions but does not add significant new meaning beyond the parameter descriptions already present.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it provides an upcoming US macro data release schedule with specific indicators (CPI, NFP, FOMC, GDP, PCE, PPI, JOLTS, Retail Sales, Housing Starts, and 20+ more). It uses a specific verb (schedule) and resource (US macro releases), distinguishing it from sibling tools like earnings-calendar or dividend-calendar.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states it is essential for agents timing trades or building macro-regime signals, providing clear context for use. However, it does not mention when not to use this tool or explicitly name alternatives among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
email-verifyAInspect
Email address validation and quality scoring. Checks RFC-5322 syntax, detects disposable/throwaway domains (via Kickbox free API), and verifies DNS MX record presence (via Cloudflare DoH). Returns a quality score (0-100) and verdict: DELIVERABLE | RISKY | UNDELIVERABLE | INVALID. Useful for lead qualification, form validation, and filtering bot signups.
| Name | Required | Description | Default |
|---|---|---|---|
| Yes | Email address to validate (e.g. 'user@example.com'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses use of external APIs (Kickbox free API, Cloudflare DoH) and explains the verification steps. It also states return values (score and verdict). This is transparent and adds value beyond basic function.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is concise, with three sentences that front-load the primary purpose and then provide technical details. No redundant or unnecessary information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With one parameter, no output schema, and no annotations, the description is nearly complete: it explains the validation process, external dependencies, and output format. Minor improvement could clarify score interpretation, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the single parameter 'email' with a good description. The tool description does not add additional parameter-level details beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it validates email addresses, checks syntax, detects disposable domains, and verifies DNS MX records. It specifies output: quality score and verdict types. This differentiates it from sibling tools which have different functions.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description provides explicit use cases (lead qualification, form validation, filter bot signups) but does not mention when to avoid using this tool or suggest alternative tools. Implied usage context is present but no exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
energy-briefAInspect
AI-synthesized US energy market situation briefing. Gathers 7 real-time signals from FRED (free, no auth): WTI crude price, regular gasoline price, Henry Hub natural gas, CPI Energy, PPI Oil & Gas, utilities output, and electric power production. Uses GPT-4o-mini to synthesize: energy regime label (energy_shock/elevated/normal/energy_glut/uncertain), dominant risk, agent implication, and a 200-word narrative. Extends the macro intelligence suite to energy — critical for inflation analysis, commodity exposure, and sector rotation decisions.
| Name | Required | Description | Default |
|---|---|---|---|
| style | No | Output length. 'standard' = 200-word narrative (default). 'concise' = 100-word summary. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses the use of GPT-4o-mini for synthesis, lists all signals and outputs, and notes the FRED data is free and no-auth. It does not detail limitations or latency, but the behavior is adequately described.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with purpose, and every sentence adds value without redundancy. It is highly concise yet informative.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description lists all return elements (regime label, risk, implication, narrative length). The tool is simple with one optional parameter, and the description provides sufficient context for an AI agent to understand and invoke it correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the single parameter 'style', and the schema description fully explains its values and defaults. The description does not add further meaning beyond the schema, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is an AI-synthesized US energy market briefing, lists specific signals and outputs, and distinguishes it from siblings like macro-brief and consumer-brief by emphasizing energy-specific data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states the tool is critical for inflation analysis, commodity exposure, and sector rotation, providing clear context for when to use it. However, it does not mention when not to use it or compare to specific alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
ens-lookupAInspect
ENS name ↔ Ethereum address resolution. Forward: pass a .eth name to get the address, avatar, and social profile records. Reverse: pass a 0x address to get its primary ENS name and profile. Returns address, ens_primary, avatar_url, description, twitter, github, discord, telegram, url, and content_hash.
| Name | Required | Description | Default |
|---|---|---|---|
| name | Yes | ENS name (e.g. 'vitalik.eth') for forward lookup, or 0x Ethereum address for reverse lookup. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the full disclosure burden. It lists the returned fields (address, ens_primary, avatar_url, etc.) and describes both modes, but does not mention any behavioral aspects such as rate limits, authentication requirements, or potential errors. Adequate but not exhaustive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first sentence captures the core purpose, second enumerates return fields. No superfluous words. Front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-parameter tool with no output schema, the description fully explains the two use cases and lists all return fields. Covers both forward and reverse resolution completely.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds the same information as the schema's property description (forward/reverse mode). It does not add significant new meaning beyond what the schema already provides. Baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Explicitly states the tool resolves ENS names to Ethereum addresses and vice versa, with clear distinction between forward and reverse lookups. This verb+resource description is specific and differentiates from sibling tools like 'dns-lookup' which handles DNS, not ENS.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Describes when to use forward lookup (pass .eth name) and when to use reverse (pass address), but lacks explicit guidance on alternatives or when not to use this tool. Usage context is clear, but no exclusions are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
equity-briefAInspect
AI-synthesized equity situation brief for any US stock. Gathers five signal layers in parallel — current price/52w range, RSI-14 + SMA20/50/200 trend regime, insider buy/sell activity (SEC EDGAR Form 4, last 60 days), options IV30 + put/call ratio (CBOE), and next earnings date + EPS estimate (Yahoo Finance) — then uses GPT-4o-mini to produce a structured brief: regime label, bull/bear case, dominant risk, agent implication, and a 160-word narrative. Replaces a 4-call agent chain (equity-technicals + us-stock-price + insider-trades + options-snapshot) at $0.350. Free upstreams (no API keys required for data gathering).
| Name | Required | Description | Default |
|---|---|---|---|
| style | No | 'standard' = 160-word narrative (default). 'concise' = 90-word summary. | |
| ticker | Yes | US stock ticker symbol (e.g. AAPL, MSFT, NVDA, TSLA). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It transparently details the five parallel data sources, the use of GPT-4o-mini for synthesis, the output structure, and cost savings.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single dense paragraph that front-loads the purpose. It efficiently lists data layers and output, but could be slightly more structured for readability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description explains the return value (structured brief with regime label, bull/bear case, etc.) and narrative word counts. It mentions data recency (last 60 days for insider trades), making it complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%. The description adds context by specifying ticker as US stock symbol, default style as 'standard' (160-word narrative), and 'concise' (90-word summary), supplementing the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it produces an AI-synthesized equity situation brief for any US stock, listing the five data layers and output components. It distinguishes from siblings by explicitly replacing a multi-call chain with a single call.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates it replaces a 4-call agent chain, implying efficiency, and notes no API keys required. However, it does not explicitly state when not to use it (e.g., if raw data is preferred).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
equity-fundamentalsAInspect
Fundamental valuation metrics for any US public company — P/E TTM, forward P/E, PEG, P/B, EV/EBITDA, margins, ROE, ROA, revenue TTM, earnings/revenue growth, free cash flow, market cap, beta. Raw data for valuation screening and DCF inputs. No API key. $0.015/call.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. AAPL, NVDA, MSFT). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full disclosure burden. It mentions cost ($0.015/call) and 'No API key' but omits critical behavioral traits like rate limits, data freshness, error handling for invalid tickers, or whether data is real-time. This lack of safety or mutation information limits agent understanding.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences. The first lists the metrics, the second adds context (raw data for screening/DCF, no API key, cost). Efficient and front-loaded. Every word adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter, no output schema), the description covers the output contents thoroughly. However, it could be more complete by mentioning the response format (likely JSON), data latency, or behavior on missing data. Slightly lacking for full autonomous use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
There is only one parameter (ticker) with 100% schema coverage. The description adds meaning by specifying 'US public company' and listing the full set of returned metrics (P/E, margins, etc.), which is not present in the schema property description. This enriches the agent's understanding of what the parameter affects.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description specifies the verb 'provides fundamental valuation metrics' and explicitly lists over a dozen metrics (P/E TTM, forward P/E, etc.), distinguishing it from sibling tools like stock-brief or equity-brief which offer summaries or other data. The phrase 'Any US public company' sets clear scope.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for valuation screening and DCF inputs but does not explicitly state when to use this tool versus alternatives (e.g., equity-brief, company-intel). No exclusions or when-not-to-use guidance is provided, leaving the agent to infer from context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
equity-sentimentAInspect
Equity market Fear & Greed composite. Four signals: VIX vs 90-day percentile, SPY vs 200-day moving average, US high-yield credit spread vs 90-day range (FRED BAMLH0A0HYM2), SPY RSI-14. Returns composite score 0–100 (0=extreme greed, 100=extreme fear) with regime label and per-signal breakdown. Distinct from market-sentiment (crypto). Use before sizing positions, adjusting portfolio risk, or routing capital. Free sources, no API keys.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses free sources, no API keys, and describes output format (composite score, regime label, per-signal breakdown). No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, each informative: purpose, signal details, usage guidance. No redundant words, well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description fully explains inputs (none), signals, output structure, and use cases. Complete for a parameterless tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
No parameters exist; baseline is 4. Description adds meaning by explaining the output and signals beyond the empty schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides an equity market Fear & Greed composite with four specific signals, and distinguishes itself from the sibling tool market-sentiment (crypto).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states when to use: before sizing positions, adjusting portfolio risk, or routing capital. Also notes it is distinct from the crypto sentiment tool, providing clear guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
equity-technicalsAInspect
Returns a complete technical analysis package for any US stock: RSI(14) with oversold/overbought signal, MACD(12/26/9) with histogram, Bollinger Bands(20,2σ) with price position, SMA 20/50/200 with crossover state, 20-day volume ratio, and a consensus signal (BULLISH/BEARISH/NEUTRAL) based on indicator agreement. Sourced from 1-year Yahoo Finance daily OHLCV — no API key, live data. Richer than a simple price endpoint: agents use this for entry/exit signal confirmation, momentum screening, and pre-trade context without managing their own TA library.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. AMD, AAPL, NVDA, STRC). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must disclose behavior. It mentions data sourced from 1-year Yahoo Finance daily OHLCV, live data, no API key. This adds context beyond a simple return. However, it does not explain error handling (e.g., invalid ticker), rate limits, or whether the consensus signal is always provided. Lacks full transparency for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is a single dense paragraph listing indicators in a run-on sentence. While informative, it could be more structured (e.g., bullet points) for easier parsing. It front-loads purpose but the list format reduces clarity. Not overly verbose but not optimally concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, description does a good job explaining what the output contains (list of indicators, data source, use cases). It covers input, output components, and data origin. However, it omits the exact structure of the output (e.g., nested objects) and potential caveats (e.g., delisted stocks). Still, it is largely complete for an agent to decide if the tool meets its needs.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Only one parameter (ticker) with schema description already covering US stock ticker. The tool description repeats 'US stock' but adds examples and case-insensitivity note, which are already in schema. Schema coverage is 100%, so description adds minimal extra value. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it returns a complete technical analysis package for any US stock, listing specific indicators (RSI, MACD, Bollinger, SMAs, volume ratio, consensus signal). It explicitly distinguishes from simpler price endpoints, making the tool's purpose highly specific and actionable.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description provides explicit use cases: entry/exit signal confirmation, momentum screening, pre-trade context. It contrasts with simple price endpoints, helping agent decide when to use this tool. However, it does not explicitly exclude other technical analysis tools or mention when not to use, leaving some ambiguity among sibling tools like equity-fundamentals or stock-brief.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
erc20-snapshotAInspect
Complete ERC20 token state in one call: name, symbol, decimals, total supply (raw + formatted), wallet balance, and allowance. Collapses four onesource chain calls (total-supply + erc20-balance + allowance + contract — 155–229 payers each) into one $0.007 payment — 36% below the $0.011 combined chain. Supports Ethereum (default), Base, Polygon, Arbitrum. No API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| wallet | No | Wallet address to check token balance for (optional). | |
| network | No | EVM chain. Default: ethereum. | |
| spender | No | Spender address to check allowance against wallet (optional; requires wallet). | |
| contract | Yes | ERC20 token contract address (0x-prefixed, 42 chars). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It explains the tool returns name, symbol, decimals, total supply (raw + formatted), wallet balance, and allowance. It also notes that no API key is required and supports multiple networks. It does not detail parameter dependencies (e.g., spender requires wallet) but that is covered in the schema. Overall behavior is well communicated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no superfluous information. The first sentence states the purpose and outputs, the second provides efficiency context and supported networks. It is front-loaded and every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite having no output schema, the description enumerates all returned fields (name, symbol, decimals, total supply, wallet balance, allowance) which is sufficient for understanding the tool's behavior. It also covers networks and cost details, making it complete for a tool of this complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by explaining what each parameter corresponds to in the output: 'wallet balance' for wallet, 'allowance' for spender, and supported networks for network. This goes beyond the schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states clearly that the tool provides a complete ERC20 token state including name, symbol, decimals, total supply, wallet balance, and allowance. It distinguishes itself from sibling tools that only offer a subset of these values, such as 'wallet-balance' or 'token-top-holders'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains when to use this tool: when a comprehensive token snapshot is needed, and it highlights the cost and efficiency gain over making four separate calls. It mentions supported networks but does not explicitly list cases where a user should use an alternative tool (e.g., if only balance is needed). However, the context is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
etf-holdingsBInspect
Top holdings, sector weights, and asset allocation for any US ETF (SPY, VOO, QQQ, AGG, XLK, etc.). Returns up to 25 positions with weights, sector breakdown, and equity/bond/cash split. No API key. $0.018/call.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | ETF ticker symbol (e.g. SPY, VOO, QQQ, AGG, XLK, VTI). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It only mentions cost and a 25-position limit, omitting rate limits, error handling, or data freshness. Minimal behavioral disclosure beyond core functionality.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with main function, no unnecessary words. Every sentence serves a purpose: output description, data scope, and pricing details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple single-parameter tool without output schema, the description adequately explains return data (holdings, weights, sector/cash split). Missing error handling or freshness details, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (ticker described). The description adds value by noting case-insensitivity, listing example tickers, and summarizing the response structure (positions, weights, breakdown), which enhances understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns top holdings, sector weights, and asset allocation for US ETFs, with specific examples. It distinguishes itself from siblings like hedge-fund-holdings by focusing on ETFs, but does not explicitly differentiate.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides cost info but no guidance on when to use this tool versus alternatives, nor conditions or exclusions. Usage is implied as 'for any US ETF', but lacks explicit context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
eth-blockAInspect
Returns an Ethereum block header and transaction hashes by block number, hex string, or tag (latest/pending/earliest/safe/finalized). Fields: block_number, hash, parent_hash, miner, timestamp_iso, gas_used, gas_limit, base_fee_gwei (EIP-1559), tx_count, transaction_hashes. Supports Ethereum (default), Base, Polygon, and Arbitrum. $0.002/call — 33% below comparable market rate.
| Name | Required | Description | Default |
|---|---|---|---|
| number | No | Block number as an integer, 0x-prefixed hex, or tag: latest/pending/earliest/safe/finalized. Defaults to latest. | |
| network | No | Chain to query. Default: ethereum. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses return fields, pricing, and supported networks, but lacks details on rate limits, authentication, error handling, or any destructive behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single well-structured sentence that efficiently conveys purpose, fields, networks, and pricing. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the main purpose, all parameters, return fields, and pricing. However, it could mention potential errors or limitations (e.g., max block number, tag restrictions).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage, but the description adds value by explaining parameter defaults (number defaults to latest), enumerating valid tags, and listing supported networks for the network parameter.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Returns an Ethereum block header and transaction hashes by block number, hex string, or tag'. It lists specific fields and supported networks, making the purpose unambiguous and distinguishing it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context on parameters (number types, tags) and supported networks, but does not explicitly mention when to use this tool versus alternatives or any exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
evm-log-eventsAInspect
Query EVM contract event logs via eth_getLogs. Filter by contract address, event topic (signature hash), and block range. Returns up to 50 decoded log entries with topics, data, tx hash, block number. Supports Ethereum/Base/Polygon/Arbitrum via free DRPC. $0.004/call — 20% below comparable market rate.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Chain to query. Default: ethereum. | |
| limit | No | Max number of log entries to return (1–50). Default: 20. | |
| topic0 | No | Event signature hash (topic[0]) to filter by. Common values: Transfer = '0xddf252ad1be2c89b69c2b068fc378daa952ba7f163c4a11628f55a4df523b3ef', Approval = '0x8c5be1e5ebec7d5bd14f71427d1e84f3dd0314c0f7b2291e5b200ac8c7c3b925', Swap (Uniswap V2) = '0xd78ad95fa46c994b6551d0da85fc275fe613ce37657fb8d5e3d130840159d822'. | |
| address | No | Contract address to query logs from (e.g. '0xC02aaA39b223FE8D0A0e5C4F27eAD9083C756Cc2' for WETH). | |
| to_block | No | End block — integer, hex string, or tag. Default: latest. | |
| from_block | No | Start block — integer, hex string, or tag (latest/earliest). Default: 100 blocks before latest. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It mentions a limit of 50 log entries and a per-call cost, but does not discuss rate limits, authentication, or what happens on broad queries (e.g., large block ranges). More detail on behavior beyond the basic limits would improve transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three concise sentences that front-load the core purpose and key details (filters, limits, chains, pricing). Every sentence provides useful information with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the return format, supported chains, and pricing. However, it lacks details on error handling (e.g., invalid address/block range) and does not mention pagination beyond the limit. For a tool with no output schema, it provides adequate but not exhaustive context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All 6 parameters have schema descriptions (100% coverage). The description adds value by providing example topic0 hashes (Transfer, Approval, Uniswap Swap) for common contract events, which is helpful beyond the schema's generic descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool queries EVM contract event logs via eth_getLogs. It specifies filtering by address, topic, and block range, and lists return fields (decoded logs with topics, data, tx hash, block number). It also names supported chains, differentiating it from unrelated sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context by listing supported chains and pricing, but does not explicitly state when to use this tool versus alternatives, or note any limitations/conditions for use. No 'when not to use' guidance is given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
evm-nonceAInspect
Returns the current nonce (confirmed transaction count) and pending nonce for any EVM wallet address. Supports Ethereum, Base, Polygon, Arbitrum, and Optimism. Use the pending nonce when building new transactions. $0.002/call — 33% below comparable market rate.
| Name | Required | Description | Default |
|---|---|---|---|
| address | Yes | EVM wallet address (0x-prefixed, 42 characters). | |
| network | No | Chain to query. Default: base. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description carries full burden. It discloses the output (confirmed and pending nonce) and cost, but does not explicitly state it is a read-only operation or cover error cases. Adequate but missing explicit safety or idempotency info.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with no redundancy: first states output, second lists networks, third gives usage tip and pricing. Essential information only.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple query tool with 2 parameters and no output schema, the description covers the main purpose, supported chains, and a usage hint. It could mention the response format (e.g., object with confirmed and pending), but the name and context suffice.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds value by listing the specific supported networks (Ethereum, Base, Polygon, etc.) beyond the schema's general description. It reinforces the address format already in schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns current and pending nonce for EVM wallet addresses. It lists supported chains (Ethereum, Base, etc.), distinguishing it from siblings like 'tx-explainer' or 'wallet-balance'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says to use the pending nonce when building new transactions, providing clear guidance. It supports multiple chains but does not mention when not to use this tool or suggest alternatives, though context makes it obvious.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
evm-token-securityAInspect
Honeypot, rug-pull, and scam detection for any EVM token. Returns a 0–100 risk score with labeled flags: honeypot status, hidden ownership, mint authority, self-destruct, buy/sell tax rates, creator wallet concentration, and open-source status. Covers 40+ chains (Ethereum, Base, BSC, Arbitrum, Polygon, Solana, etc.) via GoPlusLabs. Useful pre-trade before buying unknown tokens, before routing payments through new contracts, or when validating DeFi protocol addresses. Pairs with solana-token-risk (Solana-native rug detection) and market-intelligence (endpoint verification).
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Chain name or numeric chain ID. Common: ethereum, base, bsc, arbitrum, polygon, solana, optimism, avalanche. Default: base. | |
| address | Yes | Token contract address to screen (e.g. '0x833589fCD6eDb6E08f4c7C32D4f71b54bdA02913' for USDC on Base). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It implies a read-only query (returns risk score) but does not explicitly state no side effects, authentication needs, or rate limits. More explicit transparency would improve this dimension.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is concise, well-structured, and front-loaded. Every sentence adds information: purpose, output, coverage, use cases, and sibling references. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description thoroughly explains the risk score range, flags, and coverage across 40+ chains. Use cases and sibling references provide complete context for tool invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and description adds value by providing common chain names, a default (base), and an example address. This goes beyond the schema alone, justifying a 4 (above baseline of 3).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description explicitly states the tool detects honeypots, rug-pulls, and scams for EVM tokens, and returns a risk score with specific flags. It also distinguishes from siblings by mentioning pairing with solana-token-risk and market-intelligence.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear use cases: pre-trade before buying unknown tokens, before routing payments through new contracts, or when validating DeFi protocol addresses. Also mentions alternatives (siblings) for Solana-native detection and endpoint verification.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
fact-checkAInspect
AI-powered claim verification. Searches DuckDuckGo, Wikipedia, Hacker News, and arXiv in parallel, then uses GPT-4o-mini to assess the claim and return a structured verdict: confirmed / contradicted / uncertain, with confidence score (0–1), supporting and contradicting evidence excerpts with source URLs, key entities, and step-by-step reasoning. Use before an agent acts on a factual assertion it received from another agent or user. $0.500/call.
| Name | Required | Description | Default |
|---|---|---|---|
| claim | Yes | The factual assertion to verify (e.g. 'Bitcoin was created in 2008' or 'TypeScript is a superset of JavaScript'). Be specific — vague claims return uncertain verdicts. | |
| context | No | Optional background context that helps interpret the claim (e.g. domain, time period, known related facts). Narrows the verification scope. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description bears full burden. It comprehensively discloses the search sources, the AI model used (GPT-4o-mini), the verdict types (confirmed/contradicted/uncertain), confidence score, evidence excerpts with URLs, key entities, step-by-step reasoning, and cost ($0.500/call). No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with a cost note. The key action is stated first, followed by details. Every sentence adds value; no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multi-source search, AI reasoning), the description covers all necessary behavioral and output details despite lacking an output schema. It explains what the agent will receive and how it works, making it complete for agent invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds valuable nuance: 'Be specific — vague claims return uncertain verdicts' for the 'claim' parameter and 'Narrows the verification scope' for 'context'. This enhances understanding beyond schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states 'AI-powered claim verification' and details the sources searched (DuckDuckGo, Wikipedia, Hacker News, arXiv) and the structured verdict output. It clearly distinguishes itself from sibling tools, none of which appear to perform claim verification.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states when to use: 'Use before an agent acts on a factual assertion it received from another agent or user.' This provides clear guidance and implies when not to use (e.g., for non-factual queries).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
fda-recall-watchAInspect
FDA recall and enforcement search across drugs, food/cosmetics, and medical devices (85,000+ actions). Returns classification (Class I/II/III), recall reason, product description, status, and distribution pattern. Seam cap: fills the product-safety layer missing from drug-intel + company-due-diligence chains. No API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Maximum recalls to return per category (1–10). Default: 5. | |
| query | Yes | Search term: company name, product name, ingredient, NDC, or recall reason keyword (e.g. 'Pfizer', 'acetaminophen', 'Listeria', 'pacemaker battery'). | |
| class_filter | No | Filter by recall classification. 'Class I' = most serious. Default: 'any'. | |
| product_type | No | Product category to search. 'all' queries drugs + food + devices in parallel. Default: 'all'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses that no API key is required and mentions the data scope (85,000+ actions) and return fields. But lacks details on rate limits, pagination, or error handling. Without annotations, description carries full burden, but it is moderately transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences front-load the purpose, outputs, and contextual usage. Every sentence adds value with no fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the 4-parameter schema with full coverage and no output schema, the description is largely complete. It explains the tool's purpose, outputs, and fit with sibling tools. Minor gap: no mention of pagination or performance characteristics.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, meaning the schema already documents all parameters well. The description does not add significant new meaning beyond the schema's parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool searches FDA recalls across drugs, food/cosmetics, and medical devices, lists specific outputs (classification, reason, description, status, distribution), and distinguishes itself from siblings by filling a product-safety gap.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly frames the tool as filling a missing layer in drug-intel and company-due-diligence chains, indicating when to use. However, does not explicitly state when not to use or provide alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
fec-donor-intelAInspect
FEC campaign finance lookup — search all US federal political donations by individual or organization name. Returns recent contributions (sorted newest-first) with committee names, donation amounts, election cycles, employer/occupation, and aggregate totals (total donated, number of contributions, committees supported). Official FEC Open Data, updated daily. Use for executive due diligence, political affiliation screening, ESG analysis, or investigative research.
| Name | Required | Description | Default |
|---|---|---|---|
| name | Yes | Donor name to search. For individuals use 'LAST, FIRST' format (e.g. 'MUSK, ELON') for best results. Organization names work too (e.g. 'Google LLC'). | |
| cycle | No | Election cycle year to filter (e.g. 2024 for the 2023-2024 cycle). Omit for all cycles. | |
| limit | No | Max results to return (1–50, default 20). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses sort order (newest-first), data source (Official FEC Open Data), and update frequency (daily). No annotations exist, so description carries burden; it covers key behavioral traits for a read-only lookup tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose, no redundant information. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Describes return fields (committee names, amounts, cycles, employer/occupation, aggregate totals) despite no output schema. Covers update frequency and use cases, making it self-contained.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Adds significant value beyond schema: explains name format ('LAST, FIRST' for individuals), clarifies cycle as election cycle year, and default limit of 20. All three parameters are well-described.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool's function: search US federal political donations by individual or organization name. Specifies return fields and distinguishes from sibling tools like congressional-trades.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Lists use cases (due diligence, screening, ESG, research) but does not explicitly state when not to use or provide alternatives. However, the context is clear and helpful.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
federal-contract-intelAInspect
US federal contract and grant intelligence for any company via USASpending.gov. Returns total obligated amount, award count, top awards (award ID, amount, agency, description, start date), and agency breakdown — covering $10T+ in federal spending. Useful for procurement research, competitive intelligence, vendor due diligence, and government contractor analysis. No API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| top_n | No | Number of top awards to return. Default: 5. | |
| award_type | No | Award type to query. 'contracts' = procurement contracts; 'grants' = financial assistance grants; 'all' = both. Default: 'contracts'. | |
| years_back | No | How many fiscal years back to search (1 = current FY only, 2 = 2 most recent FYs). Default: 2. | |
| company_name | Yes | Company or organization name (e.g. 'Boeing', 'Lockheed Martin', 'SpaceX', 'Johns Hopkins University'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses that no API key is required and that the data source is USASpending.gov, covering $10T+ in spending. For a tool without annotations, this provides useful behavioral insight, though it could mention rate limits or data freshness.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences efficiently convey purpose, return data, and key benefit. Front-loaded with the core function, but could be slightly more structured with bullet points for clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description thoroughly explains return values (total obligated amount, award count, top awards with fields, agency breakdown) and the data source. Combined with complete parameter descriptions in the schema, the tool is well-defined for an agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All 4 parameters are fully described in the input schema (100% coverage), so the description does not add additional semantics. The baseline is 3, and no extra value is provided beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it provides US federal contract and grant intelligence for any company via USASpending.gov, listing specific return fields. Distinguishes from siblings like company-intel by focusing on government spending data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly mentions use cases such as procurement research, competitive intelligence, vendor due diligence, and government contractor analysis, providing clear context for when to use. However, lacks explicit exclusions or comparisons to alternative tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
flight-trackerAInspect
Recent departures or arrivals at any major airport via OpenSky Network (free, crowd-sourced ADS-B). Accepts 3-letter IATA (JFK) or 4-letter ICAO (KJFK) codes. Returns callsign, origin/destination airports, estimated times, and aircraft identifiers. Useful for logistics workflows, travel agents, itinerary builders, and supply-chain tracking tasks. Default: departures from airport in last 4 hours.
| Name | Required | Description | Default |
|---|---|---|---|
| hours | No | Look-back window in hours (1–24). Defaults to 4. Larger windows return more flights. | |
| airport | Yes | Airport code in IATA (3-letter, e.g. 'JFK') or ICAO (4-letter, e.g. 'KJFK') format. | |
| direction | No | Whether to return departing or arriving flights. Defaults to 'departures'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses the data source (OpenSky Network, free, crowd-sourced) and defaults. However, it does not cover limitations like data coverage ('major airports' only), potential delays, rate limits, or error handling. Adequate but not exhaustive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: two sentences plus a default statement, no fluff. Every sentence adds value (purpose, accepted codes, return fields, use cases, defaults). Front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, but the description lists return fields explicitly (callsign, origin/destination, estimated times, aircraft identifiers). It also states defaults and accepted input formats. For a simple list tool, this is complete. No missing critical information.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for each parameter. The description adds value by explaining default behavior (4-hour window, departures) and hinting at usage (logistics workflows), but these are minor additions. Per guidelines, baseline 3 is appropriate when schema does heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves recent departures or arrivals at major airports via OpenSky Network, specifies accepted codes (IATA/ICAO), and lists return fields (callsign, origin/destination, times, aircraft identifiers). It also mentions use cases. This is specific and distinguishes from unrelated sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context: useful for logistics, travel agents, etc., and notes the default behavior (departures, 4-hour window). It does not explicitly state when not to use it or compare with alternatives, but given the diverse sibling tools, this is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
fomc-trackerAInspect
US Federal Funds Rate, next FOMC meeting date + countdown, rate trend (hiking/holding/cutting), and full 2026 schedule. FRED public CSV + static calendar. Pairs with treasury-yields and credit-spreads.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses data sources (FRED public CSV, static calendar) and return content (rate, meeting, trend, schedule). No annotations exist, so the description carries the burden. It does not mention update frequency, caching, or any limitations, but it is accurate and non-misleading.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: the first lists the tool's outputs concisely, the second covers data sources and sibling pairing. Every sentence adds value with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the main outputs and data sources. No output schema exists, so the description fulfills the explanation of return values. It could add details on format or update frequency, but for a simple info-retrieval tool, it is largely complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has no parameters (100% coverage by schema). With zero parameters, the baseline is 4. The description adds no parameter information because none are needed.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool provides US Federal Funds Rate, next FOMC meeting date with countdown, rate trend (hiking/holding/cutting), and full 2026 schedule. It names specific data sources (FRED public CSV + static calendar) and differentiates from siblings by advising pairing with treasury-yields and credit-spreads.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description clearly indicates the tool's domain (FOMC rates, meetings, trends) and suggests pairing with specific sibling tools (treasury-yields, credit-spreads). However, it does not provide explicit when-not-to-use or alternative tools beyond the pairing suggestion.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
forex-ratesAInspect
Real-time fiat foreign exchange rates. Base currency defaults to USD; returns rates for all 166 supported currencies, or a filtered subset. Sourced from open.er-api.com (free, no key, daily updates ~00:00 UTC). Supports any ISO 4217 base: EUR, GBP, JPY, KRW, CNY, AUD, etc. Use for: USD→KRW conversion when reading Korean exchange data, cross-border payment amounts, international price normalization, or any multi-currency DeFi workflow.
| Name | Required | Description | Default |
|---|---|---|---|
| base | No | Base currency ISO 4217 code (e.g. USD, EUR, GBP, JPY). Default: USD. | |
| convert | No | Optional: convert an amount. E.g. {amount: 1000, from: 'USD', to: 'KRW'}. | |
| symbols | No | Specific currency codes to return (e.g. ['KRW','EUR','GBP']). If omitted, returns all 166+ currencies. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses the data source, update schedule (daily UTC), free access, and supported currencies. It implies read-only behavior but could mention rate limits or error handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (5 sentences) and well-structured: purpose first, then details, then use cases. No redundant or unnecessary information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
While the description covers inputs and use cases well, it lacks information about the response format (e.g., structure of rates object). Since there is no output schema, this omission reduces completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds meaningful context: default base currency, all vs. filtered subsets, and a concrete example for the 'convert' parameter. This enhances understanding beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides real-time fiat forex rates, specifies default base currency and supported currencies, and lists concrete use cases. This distinguishes it from sibling tools like crypto-fiat-price, which deal with crypto assets.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases (e.g., USD→KRW conversion, cross-border payments) and contextual information (source, update frequency, default base). While it does not exclude alternatives, the specificity guides appropriate usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
funding-ratesAInspect
Returns current perpetual funding rates for 200+ assets on Hyperliquid DEX, sorted by absolute funding magnitude. Includes 8-hour and annualized rates, open interest, mark price, and directional signal (longs-pay or shorts-pay). Use before entering a perpetual position to factor funding cost/income into strategy ROI, or to scan for high-rate short-bias opportunities in delta-neutral strategies.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max results to return (default 25, max 100). | |
| symbol | No | Filter to a specific asset symbol (e.g. BTC, ETH, SOL). Case-insensitive. Omit for all assets. | |
| sort_by | No | Sort order. 'abs_funding' (default) = highest absolute rate first. 'funding' = most positive first. 'open_interest' = largest OI first. | |
| direction | No | Filter by funding direction. 'positive' = longs pay shorts (bullish funding). 'negative' = shorts pay longs (bearish funding). Default: 'all'. | |
| min_oi_usd | No | Minimum open interest in USD to filter out illiquid markets (e.g. 500000 for $500K). Default: 0. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It correctly implies a read-only operation ('returns') and lists the data fields. However, it omits details on data freshness, rate limits, or authentication requirements, which would be valuable for a live market data tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with the tool's purpose, then listing contents, and finally providing usage guidance. Every sentence adds value without redundancy or fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With no output schema, the description names the returned fields but does not specify format or example data. For a tool returning multiple numeric and categorical fields, this may be insufficient for an agent to parse the output. The usage scenarios are well-covered, but structural details of the response are lacking.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with adequate descriptions for all 5 parameters. The description adds minimal additional meaning beyond the schema, such as implying that 'sort_by' relates to absolute funding magnitude. Baseline score of 3 is appropriate as the schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns perpetual funding rates for a specific resource (Hyperliquid DEX), with a specific verb ('returns'), and lists the included data fields. It distinguishes itself from 150+ sibling tools by focusing on a niche DeFi metric.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly advises using the tool before entering perpetual positions to assess funding costs and for scanning high-rate opportunities in delta-neutral strategies. While it does not enumerate when not to use it or list alternatives, the provided context is sufficient for typical use cases.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
gas-estimateBInspect
Multi-chain gas price oracle: fast/standard/slow Gwei + USD cost for a transfer. Chains: ethereum, base, polygon, arbitrum, bsc. Seam: x402node/myceliasignal gas feeds.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Chain to query. Options: ethereum, base, polygon, arbitrum, bsc. Omit for all 5. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavior. It indicates this is a read-only oracle returning gas prices but lacks details on caching, update frequency, or precision. The behavior is somewhat transparent but incomplete.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is relatively concise (two sentences) but includes the unclear 'Seam:' clause which detracts from clarity. The core purpose is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, so the description should indicate return format. It mentions fast/standard/slow Gwei and USD cost, but lacks structural details. The behavior for omitting chain (returns all) is implied by schema but not reiterated.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and already documents the single parameter 'chain' with options. The description adds no new parameter information beyond the schema, achieving baseline adequacy.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is a multi-chain gas price oracle returning fast/standard/slow Gwei and USD cost for a transfer, listing specific chains. However, the mention of 'Seam: x402node/myceliasignal gas feeds' is cryptic and reduces clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives like 'gas-prices'. The description only lists supported chains without context on selection criteria or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
gas-pricesAInspect
Current gas prices and EIP-1559 fee recommendations across 6 major EVM chains: Ethereum, Base, Arbitrum, Optimism, Polygon, BNB Chain. Returns base fee, priority fee percentiles (slow/standard/fast), and estimated ETH/MATIC/BNB cost for a standard 21k-gas transfer. All sourced from free public RPC endpoints — no API key needed. Use before sending on-chain transactions, estimating agent operating costs, or comparing chain fees for routing decisions.
| Name | Required | Description | Default |
|---|---|---|---|
| networks | No | Which networks to query. Default: all 6. Specify a subset for faster response. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states data is sourced from free public RPC endpoints with no API key needed, implying it is non-destructive and always available. It does not mention rate limits or errors but covers essential behavioral traits for a read-only tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no redundancy. It front-loads the purpose and efficiently covers key details: chains, output fields, source, and use cases.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description explains the return structure (base fee, priority fee percentiles, estimated cost by chain). For a one-parameter tool, this is sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Only one parameter 'networks' is present, and schema coverage is 100%. The description adds value beyond the schema by noting 'Default: all 6. Specify a subset for faster response.' This clarifies effect and usage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns current gas prices and EIP-1559 fee recommendations across 6 major EVM chains. It specifies the data returned (base fee, priority fee percentiles, estimated cost) and the source. This distinguishes it from siblings like 'gas-estimate' or 'crypto-fiat-price'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicit usage scenarios are given: 'Use before sending on-chain transactions, estimating agent operating costs, or comparing chain fees for routing decisions.' While it lacks explicit when-not-to-use, the context is clear and provides practical guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
generate-memeAInspect
Generates a meme image from 211 built-in templates. Returns a direct PNG URL and embed-ready markdown. Input: template ID (default: drake), top line, optional bottom line, optional middle line (for 3-line templates like panik-kalm-panik). Popular templates: drake, buzz (X Everywhere), fry, fine (This Is Fine), success, panik-kalm-panik, boat (I Should Buy a Boat), aag (Ancient Aliens), blb (Bad Luck Brian). $0.005/call — free upstream, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| top | Yes | Top / first line text. Required. | |
| width | No | Output image width in pixels (100–800). Default 600. | |
| bottom | No | Bottom / second line text. Optional. | |
| middle | No | Middle / third line text. Only used if the template has 3+ lines (e.g. panik-kalm-panik). Ignored otherwise. | |
| template | No | Meme template ID. Popular options: drake, buzz, fry, fine, success, panik-kalm-panik, boat, aag, bad, blb, yuno. Call /cap/list-templates (free) for the full 211. Default: drake. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Explains output format (PNG URL + markdown), pricing ($0.005/call), and no API key required. Since no annotations exist, the description covers the key behavioral aspects. Missing potential rate limits or error handling, but adequate for the tool's simplicity.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three succinct sentences: purpose/output, input structure, and additional details (popular templates, pricing). No wasted words; all information is relevant and front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately explains the return value. Covers all parameters and provides usage context. Missing error scenarios or behavior on invalid templates, but still fairly complete for a simple meme generation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds value by listing popular templates, explaining the default, and clarifying when the 'middle' parameter is used. This goes beyond the schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it generates memes from 211 templates and returns PNG URL and markdown. Specific verb+resource. However, it does not explicitly differentiate from the sibling 'meme-generator', which may have overlapping functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear input structure and mentions pricing, but lacks explicit guidance on when not to use this tool or which alternatives (e.g., 'meme-generator') might be more appropriate for certain cases.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
geocodeAInspect
Forward and reverse geocoding via OpenStreetMap Nominatim. Forward: convert an address or place name to latitude/longitude, bounding box, and OSM metadata. Reverse: convert lat/lon to a structured street address. Supports any location worldwide. Useful for location enrichment, address validation, coordinate lookup, and building location-aware agent flows.
| Name | Required | Description | Default |
|---|---|---|---|
| lat | No | Latitude for reverse geocoding. Requires 'lon' also. | |
| lon | No | Longitude for reverse geocoding. Requires 'lat' also. | |
| limit | No | Max results for forward geocode (default 3, max 10). Ignored for reverse. | |
| query | No | Address or place name to geocode (forward lookup). Example: '1600 Pennsylvania Ave Washington DC' or 'Eiffel Tower Paris'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It explains the two modes and mentions worldwide support and OSM metadata. However, it omits any discussion of rate limits, attribution requirements, or request restrictions, which are important for an API-driven tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and front-loaded, with no redundant sentences. Every sentence adds value, clearly explaining the tool's purpose, modes, and use cases.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given there is no output schema, the description adequately explains what each mode returns (coordinates, bounding box, address). It mentions both modes and parameter usage. A minor gap is not emphasizing that reverse geocoding requires both lat and lon, though the schema implies it.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents parameters adequately. The description adds high-level context but does not provide additional semantics beyond summarizing forward vs. reverse modes. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it performs forward and reverse geocoding via OpenStreetMap Nominatim, converting addresses to coordinates and vice versa. It distinguishes itself from sibling tools like 'city-lookup' and 'place-details' by specifying its use of OSM and its dual-mode capability.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists useful applications such as location enrichment and address validation, providing context for when to use the tool. However, it does not explicitly mention when not to use it or suggest alternatives, missing an opportunity to guide the agent toward the best tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
github-repo-intelAInspect
GitHub repository intelligence: stars, forks, open issues, language, license, last push date, latest release version and date, topics, and whether the repo is actively maintained. Input any GitHub repo as 'owner/repo' or a full GitHub URL. Use before wiring a new library as a dependency, when evaluating a project for acquisition or integration, or when you need to assess community health (stars/forks ratio, issue velocity, maintainer recency). Free upstream: GitHub public API (no key needed, 60 req/hr unauthenticated).
| Name | Required | Description | Default |
|---|---|---|---|
| repo | Yes | GitHub repo in 'owner/repo' format, or a full GitHub URL (e.g. 'torvalds/linux' or 'https://github.com/vercel/next.js'). | |
| include_release | No | If true, fetches the latest release tag and date (extra API call). Default true. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses that it uses the free GitHub public API, mentions the 60 req/hr unauthenticated rate limit, and lists the data points returned. It does not discuss error behavior or response format, but for a read-only data retrieval tool, this is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, each serving a purpose: output fields, input format, use cases + rate limit. Front-loaded and no unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description lists all key return fields and adds rate limit context. It could mention the response format (JSON) but remains fairly complete for a simple data retrieval tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds context for the 'repo' parameter (supports owner/repo or URL) and implies the 'include_release' parameter by mentioning release info, but does not significantly extend beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves GitHub repository intelligence including specific data points like stars, forks, open issues, language, license, last push date, release info, and maintenance status. This distinguishes it from sibling tools which are broader or unrelated.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases: before adding a library dependency, evaluating acquisition/integration, or assessing community health. It also mentions the rate limit (60 req/hr) for frequency guidance. It does not explicitly state when not to use, but the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
global-equity-indicesAInspect
Global equity snapshot: 9 major indices (Nikkei 225, Hang Seng, ASX 200, Nifty 50, Shanghai, FTSE 100, DAX, CAC 40, Euro Stoxx 50) plus DXY. Returns current level, daily % change, 52-week range context, and region posture (bullish/mixed/bearish). Free Yahoo Finance data. No API key. Overnight context for global macro agents and forex positioning. $0.010/call.
| Name | Required | Description | Default |
|---|---|---|---|
| regions | No | Limit to specific regions. Omit for all regions. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses data source ('Free Yahoo Finance data'), access requirements ('No API key'), and cost ('$0.010/call'). It also explains what the tool returns, providing complete transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three concise sentences. The first lists the indices, the second details return fields, the third adds usage context and cost. Every sentence adds value, and key information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers purpose, return fields, data source, cost, and usage context. It lacks examples of valid region values for the parameter, but this is a minor gap for a simple tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema has 100% coverage for the single optional 'regions' parameter, so the description adds no additional meaning beyond the schema's description. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it's a 'Global equity snapshot' listing 9 specific major indices plus DXY, and details the return fields (current level, daily % change, 52-week range, region posture). This distinguishes it from sibling tools like 'equity-brief' or 'market-overview'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives clear context: 'Overnight context for global macro agents and forex positioning.' It implies when to use (for macro/forex analysis) but does not explicitly state when not to use or mention alternative tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
gov-votesAInspect
US Congressional vote records from official government XML sources (senate.gov + clerk.house.gov). Search by chamber (house/senate), category (passage, amendment, cloture, nomination, procedural, etc.), or limit. Returns vote question, result (Passed/Failed), yea/nay totals, bill number, and description. 119th Congress (2025–2026). No API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max results (default 10, max 25). | |
| chamber | No | Chamber to search. Default: 'senate'. | |
| category | No | Vote category filter (inferred from vote question). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool is a search/query operation with no API key required, implying it is read-only. However, it does not disclose potential rate limits, data freshness, or any behavioral constraints beyond what is implied.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a concise paragraph of three sentences that front-loads the main purpose. It is efficient and avoids unnecessary detail, though it could be slightly more structured with bullet points for clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the tool's scope (119th Congress), return values (vote question, result, totals, bill number, description), and required authentication (none). While no output schema exists, the description adequately explains the output. It is complete for a straightforward search tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage, but the description adds value by explaining the chamber parameter options (house/senate) and listing example categories (passage, amendment, etc.). This clarifies the intended use beyond the schema's brief descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool retrieves US Congressional vote records from official government XML sources, specifying it covers the 119th Congress. It distinguishes itself from sibling tools by focusing on a specific domain (government votes) that is unique among the listed siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear guidance on when to use the tool: to search for congressional vote records by chamber, category, or limit. It does not explicitly state when not to use it or mention alternatives, but the context is sufficient for an AI agent to understand appropriate usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
hedge-fund-holdingsAInspect
Returns top stock holdings from any institution's latest SEC 13F filing. Input: institution name (e.g. 'Renaissance Technologies', 'Bridgewater Associates'). Output: top 25 positions by market value with shares, value, and portfolio weight. No API key. SEC EDGAR source. $0.025 vs $25-50/mo Quiver Quant subscription.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max number of top holdings to return (default: 25, max: 100). | |
| institution | Yes | Institutional investor name as it appears in SEC filings (e.g. 'Renaissance Technologies', 'Berkshire Hathaway', 'Two Sigma'). | |
| include_options | No | If true, includes PUT and CALL option positions alongside equity holdings. Default: false (equity SH positions only). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description carries the burden. It discloses cost, source (SEC EDGAR), and output format, but does not mention error handling, rate limits, or behavior when institution is not found.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is concise and front-loaded with the core purpose. The cost comparison is slightly extraneous but not wasteful. No unnecessary repetition.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema and few parameters, the description adequately explains the tool's function. It could mention edge cases, but is sufficient for an agent to use correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers all 3 parameters with descriptions (100% coverage). The description only repeats 'institution' and output fields, adding no new meaning for 'limit' or 'include_options'. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns top stock holdings from an institution's latest SEC 13F filing, specifying input (institution name) and output (top 25 with shares, value, weight). This differentiates it from sibling financial tools like etf-holdings or insider-trades.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context: input institution name, output top holdings, and mentions no API key compared to alternatives like Quiver Quant. However, it lacks explicit 'when not to use' or comparison to sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
hf-model-searchAInspect
Search HuggingFace Hub for ML models. Specify a keyword (e.g. 'bert', 'llama', 'stable diffusion') and optional task filter (e.g. 'text-classification', 'text-generation', 'image-classification'). Returns top results sorted by downloads or likes, including model ID, author, pipeline task, framework library, download count, likes count, and tags.
| Name | Required | Description | Default |
|---|---|---|---|
| sort | No | Sort order. Default: downloads. | |
| task | No | Optional pipeline task filter (e.g. 'text-classification', 'text-generation', 'image-classification'). Omit to search all tasks. | |
| limit | No | Number of results to return (1–20). Default: 10. | |
| query | Yes | Search query — model name, architecture, or keyword (e.g. 'bert', 'llama', 'sentiment'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description bears full burden. It discloses the output fields (model ID, author, pipeline task, etc.) and sorting behavior implicitly, but does not mention rate limits, auth requirements, or other behavioral traits. The disclosure is adequate but not highly detailed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences with no fluff. The first sentence states purpose, the second lists output. It is front-loaded and every word contributes meaning.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple search tool with full schema coverage and no output schema, the description covers the action, inputs with examples, and output fields. It is complete for an agent to understand what the tool does and what it returns.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by providing concrete examples for 'query' and 'task' parameters (e.g., 'bert', 'text-classification'), which are not in the schema descriptions. It enhances understanding beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool searches HuggingFace Hub for ML models, uses specific verbs ('Search'), and provides examples of keywords and task filters. It is distinct from sibling tools which cover domains like research papers (research-paper-search) or news (hn-search).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives clear guidance on input (keyword required, optional task filter) with examples. It implies the tool is for finding ML models but does not explicitly exclude other uses or mention alternatives, so it lacks explicit when-not-to-use instructions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
hn-searchAInspect
Hacker News story and comment search via Algolia. Returns titles, scores, comment counts, authors, and URLs for posts matching the query. Filter by type (story/comment) and date range (day/week/month/year/all). Sorted by relevance by default; use sort=date for newest-first. Useful for tech news, community sentiment, or discovering discussion threads about a topic.
| Name | Required | Description | Default |
|---|---|---|---|
| q | Yes | Search query (required). Use quotes for exact phrases. | |
| sort | No | Sort order — relevance (default) or date (newest first). | |
| type | No | Content type to search. Default: story. | |
| limit | No | Number of results (1–20). Default: 10. | |
| date_range | No | Filter results to a time window. Default: all. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must disclose behavioral traits. It describes return fields and filtering but does not explicitly state that the tool is read-only or idempotent. It fails to notify the agent about any side effects or safety characteristics, which is a significant gap for a tool with zero annotation coverage.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (5 sentences) and front-loaded with the core purpose. Each sentence adds unique information: purpose, return fields, filters, sorting, and use cases. No redundant or unnecessary content.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's moderate complexity (5 params, no output schema), the description provides adequate contextual information: it lists returned fields, explains sorting and filtering, and gives use cases. However, it does not mention pagination or result count beyond the limit parameter, and there is no output schema to fill that gap. This is a minor omission.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by providing usage examples (e.g., 'Use quotes for exact phrases' for q, 'Sorted by relevance by default; use sort=date for newest-first' for sort) and clarifying defaults and allowed values (date_range options). This enriches the parameter understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: 'Hacker News story and comment search via Algolia'. It specifies the resource (Hacker News) and actions (search, filter, sort), and lists returned fields. This distinguishes it from sibling tools like 'research-paper-search' or 'reddit-intel' which target different sources.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context: 'Useful for tech news, community sentiment, or discovering discussion threads about a topic.' This implies when to use it but lacks explicit guidance on when not to use or comparisons to sibling tools. No alternatives or exclusions are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
housing-briefAInspect
AI-synthesized US housing market briefing. Fetches 8 FRED signals (starts, permits, existing/new sales, months supply, 30Y mortgage rate, Case-Shiller HPI, median price) and uses gpt-4o-mini to produce market phase, direction, supply posture, affordability regime, 150-word narrative, dominant risk, and agent implication. One call collapses 8 FRED lookups + LLM synthesis for REIT analysis, mortgage exposure, consumer wealth, and macro housing drag.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full behavioral burden. It discloses that it fetches 8 FRED signals and uses gpt-4o-mini for synthesis, and enumerates outputs. It does not cover potential edge cases, rate limits, or failure modes, but for a zero-parameter tool, transparency is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences that efficiently convey purpose, inputs, outputs, and use cases. It is front-loaded with the core action ('AI-synthesized US housing market briefing') and has no redundant or filler content.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Without an output schema, the description enumerates all key outputs: market phase, direction, supply posture, affordability regime, narrative, dominant risk, and agent implication. It also lists the 8 FRED signals and mentions the LLM model. For a zero-parameter tool, this is comprehensive and gives the agent a clear expectation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The tool has zero parameters, and schema description coverage is 100%. The description adds no parameter-level detail because none is needed. A baseline of 4 is appropriate for a no-parameter tool.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it produces an AI-synthesized US housing market briefing using 8 specific FRED signals and an LLM. It lists outputs (market phase, direction, etc.) and use cases (REIT analysis, mortgage exposure), distinguishing it from sibling briefs like energy-brief or macro-brief.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implicitly tells when to use: for a comprehensive housing market summary without multiple API calls. It lists specific applications (REIT analysis, mortgage exposure, consumer wealth, macro housing drag). It does not explicitly state when not to use or name alternatives, but sibling tool names provide context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
http-headersAInspect
HTTP response headers inspector and security grader. Fetches headers from any public URL and evaluates OWASP-recommended security headers: HSTS, CSP, X-Frame-Options, X-Content-Type-Options, Referrer-Policy, Permissions-Policy. Returns raw headers, per-header security findings, overall grade (A–F), and actionable recommendations. Useful for web app security audits, CDN configuration verification, and compliance checks.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Public HTTP/HTTPS URL to inspect. Redirects are followed. | |
| include_all_headers | No | If true, return all response headers (not just security-relevant ones). Default: false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses that it fetches headers from public URLs, follows redirects, and returns raw headers, per-header findings, grade, and recommendations. It does not mention rate limits, authentication, or timeouts, but the core behavior is well covered.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, each serving a purpose: stating the core function, listing security headers, and summarizing output and use cases. No wasted words; front-loaded with the verb and resource.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with two parameters and no output schema, the description adequately explains input and output. It does not cover all edge cases (e.g., non-public URLs), but given the tool's simplicity, it is sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for both parameters (url and include_all_headers). The description adds value by listing the security headers evaluated and explaining the output format, which goes beyond the schema's parameter descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool inspects HTTP response headers and grades security headers according to OWASP recommendations. It specifies the verb 'fetches' and the resource 'headers from any public URL', and lists specific security headers evaluated, distinguishing it from siblings like ssl-cert or dns-lookup.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit contexts for use: web app security audits, CDN configuration verification, and compliance checks. However, it does not explicitly state when not to use the tool or mention alternatives among siblings, leaving some ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
image-detectAInspect
Detects the true image format of any URL via magic byte inspection — works even when the file extension or Content-Type header lies (common with proxied or CDN-hosted images). Returns: format (png/jpeg/gif/webp/avif/bmp/tiff/svg/ico/unknown), detected MIME type, whether Content-Type header matches, file size (bytes), and pixel dimensions for PNG and JPEG. No API key required. $0.040/call — 20% below x402node.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL of the image to inspect. Must be publicly accessible. HTTP or HTTPS. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It details the output (format, MIME type, size, dimensions) and states that no API key is required and the cost. It does not cover error handling or rate limits, but for a simple detection tool, this is fairly transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (3 sentences) and front-loads the key value proposition. Every sentence adds essential information: detection method, output details, and pricing. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool is simple with one parameter and no output schema. The description adequately explains the return fields and use case. However, it lacks information on error behavior (e.g., invalid URL) and potential limitations, which would add completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The single 'url' parameter is described in the schema with basic constraints. The description adds value by explaining why the tool is useful (handles lying headers) and what it returns, which goes beyond the schema's minimal description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: detecting the true image format via magic byte inspection, even when the extension or Content-Type lies. It also lists the output fields, making it distinct from any sibling tools that might deal with images (e.g., ai-image-gen, meme-generator).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for verifying image format when headers are unreliable, but it does not explicitly state when to use this tool versus alternatives, nor does it provide when-not-to-use guidance. The mention of pricing is useful but not a replacement for usage guidelines.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
imf-country-outlookAInspect
IMF World Economic Outlook forecasts — current year + 3-year horizon for 180+ countries. Returns GDP growth, CPI inflation, unemployment rate, current account balance (% GDP), government gross debt (% GDP), and fiscal balance (% GDP). Sourced from IMF DataMapper API (no key required). Distinct from World Bank data — these are IMF forward projections updated Apr/Oct. Use for sovereign risk, EM allocation, currency thesis, fiscal sustainability analysis.
| Name | Required | Description | Default |
|---|---|---|---|
| country | Yes | ISO 3-letter country code (e.g. 'USA', 'CHN', 'DEU'). Accepts ISO2 for common countries (e.g. 'US'). Comma-separate up to 5 countries for comparison (e.g. 'USA,CHN,DEU'). | |
| horizon | No | Forecast years ahead to include (1–5). Default: 3. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses data source (IMF DataMapper API, no key required), update frequency (Apr/Oct), horizon (current year + 3), and return fields. This provides good behavioral context beyond the input schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences with no wasted words. It front-loads the core purpose and then efficiently lists the return metrics and source details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple data retrieval tool with 2 parameters and no output schema, the description lists return fields, source, and update frequency, making it complete enough for an agent to understand the output. However, it could mention limits on number of countries or any pagination.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds value by explaining that country accepts ISO3 and ISO2 codes, supports comma-separation up to 5 countries, and horizon defaults to 3. This goes beyond the schema's basic description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns IMF World Economic Outlook forecasts for 180+ countries with specific metrics (GDP growth, CPI inflation, etc.). It distinguishes from sibling tools like world-bank-data by noting these are IMF forward projections updated Apr/Oct.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases (sovereign risk, EM allocation, etc.) and notes the data source (IMF vs World Bank). However, it does not explicitly state when not to use this tool or directly compare to alternatives like world-bank-data, which is a sibling.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
insider-tradesAInspect
Recent SEC Form 4 insider trading activity for any US public company. Returns who bought or sold (director, officer, 10%+ owner), transaction date, shares, price per share, total value, and position after trade. Sourced from SEC EDGAR public API — authoritative, free, no API key. Use for investment analysis, governance checks, or flagging unusual insider selling before an event.
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | Lookback window in days (default 30, max 180). | |
| ticker | Yes | US stock ticker (e.g. AAPL, NVDA, MSFT). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must cover behavior. It mentions source (SEC EDGAR), authority, and free access, but does not disclose rate limits, latency, error handling, or that the lookback window is controlled by the 'days' parameter (max 180). Basic transparency but missing some key operational details.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with front-loaded purpose, succinct listing of returned data, and clear source/use-case context. No wasted words; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Good coverage of returned fields and source. Lacks explicit mention of lookback window (max 180 days) and differentiation from the sibling 'sec-insider-trades'. No output schema, but description lists returned data sufficiently for most use cases.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with adequate descriptions for both parameters. The tool description adds no additional meaning beyond the schema, and does not mention the 'days' parameter at all. Baseline score of 3 is appropriate since schema already explains parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it provides recent SEC Form 4 insider trading activity for US public companies, including details on who bought/sold and transaction specifics. However, it does not differentiate from the sibling tool 'sec-insider-trades', which may cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides use cases like investment analysis and governance checks, but lacks guidance on when to prefer this over alternatives (e.g., congressional-trades or sec-insider-trades). No explicit when-not-to-use or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
intel-packAInspect
Three-source intelligence pack in one x402 call: equity market snapshot (SPY/QQQ/IWM/VIX/risk signal) + top DeFi yield pools by APY + top prediction markets by volume. Replaces three separate calls. $0.175 purchased individually; $0.15 as a pack. No inputs required.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses cost comparison and that no inputs are required, but does not describe output format or any potential side effects. Adequate for a simple no-input tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is three sentences: contents, replacement benefit, pricing/input requirement. No wasted words, clearly structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema and no annotations, description adequately covers what the tool does and its value proposition. Could mention output format but not necessary for a bundled data pack.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
No parameters exist, so description does not need to add param details. Baseline 4 for zero-param tools.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool provides three intelligence sources (equity snapshot, DeFi yields, prediction markets) in one call, specifying symbols and that it replaces three separate calls. This distinguishes it from siblings like equity-brief, defi-yields, and prediction-markets.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It implies the pack should be used instead of three individual calls, but does not explicitly state when not to use it (e.g., if only one component is needed). No alternatives are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
ip-intelAInspect
Geolocation and network intelligence for IP addresses or domain names. Returns country, region, city, coordinates, ISP, organization, ASN, reverse DNS, and proxy/VPN/mobile/hosting flags. Accepts IPv4, IPv6, or domain names (auto-resolved to IP). Useful for infrastructure audits, fraud detection, blockchain node geography, and origin enrichment.
| Name | Required | Description | Default |
|---|---|---|---|
| target | No | IPv4 address, IPv6 address, or domain name to look up (e.g. '8.8.8.8', '2001:4860:4860::8888', 'github.com'). | |
| targets | No | Batch lookup: up to 10 IP addresses or domain names. If provided, 'target' is ignored. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It successfully discloses the tool is a read-only lookup, accepts multiple input types, auto-resolves domains, and has a batch mode. However, it omits potential limitations like rate limits, data freshness, or error handling behaviors.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is efficiently packed into four sentences: first sentence defines main purpose, second lists outputs, third specifies inputs, fourth gives use cases. No unnecessary words, and the most critical information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers input types, output fields, and use cases well for a lookup tool. Without an output schema, it compensates by listing return values. However, it lacks details on error states, performance expectations, or any prerequisites beyond the inputs.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear parameter descriptions. The tool description adds value by explaining that domain names are auto-resolved to IP and listing the extensive output fields, which goes beyond the schema but does not introduce new parameter constraints.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides geolocation and network intelligence for IPs and domains, listing specific data fields. It distinguishes from siblings by detailing the exact type of intelligence (location, ISP, proxy flags) not covered by related tools like dns-lookup or domain-whois.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives explicit use cases (infrastructure audits, fraud detection, etc.), helping an agent decide when to use it. However, it does not provide exclusionary guidance or direct comparisons to sibling tools, leaving some ambiguity about when alternative tools might be preferred.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
ipo-calendarAInspect
Returns live IPO calendar from Nasdaq — upcoming deals with expected pricing dates, recently priced offerings, new S-1 filings, and withdrawn deals. Covers all major US exchanges. Includes company name, ticker, exchange, price range, share count, and total offer amount. Structured extraction WITH summary analytics AND deal counts — richer than raw scraped calendars. Use section='upcoming' for near-term deal flow, section='priced' for post-IPO watchlist setup, section='all' for complete week view.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Maximum results per section. | |
| section | No | Calendar section to return. 'upcoming' = deals pricing this week; 'priced' = recently priced IPOs; 'filed' = new S-1 registrations; 'withdrawn' = cancelled deals; 'all' = all sections. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full burden. It describes the tool as returning live data with summary analytics, but does not disclose rate limits, data freshness, authentication requirements, error behavior, or other operational traits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with purpose, followed by data fields and usage tips. It is well-structured but slightly verbose with phrases like 'Structured extraction WITH summary analytics AND deal counts — richer than raw scraped calendars.'
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has 2 parameters and no output schema. The description covers input parameter usage well but lacks details on output format structure (e.g., array of objects) and potential edge cases like pagination or limit interaction with sections.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% (both parameters have descriptions). The description adds value by explaining the section parameter's usage scenarios, but does not elaborate on the limit parameter beyond the schema's 'Maximum results per section.' The baseline is 3 due to high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns live IPO calendar from Nasdaq, listing specific categories (upcoming, priced, filed, withdrawn) and data fields. It distinguishes itself from siblings like dividend-calendar or earnings-calendar by focusing on IPOs and notes it is richer than raw scraped calendars.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly advises when to use each section: 'section=''upcoming'' for near-term deal flow, section=''priced'' for post-IPO watchlist setup, section=''all'' for complete week view.' This provides clear usage context, though it does not mention alternatives or when not to use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
job-searchAInspect
Search remote and hybrid job listings by keyword and location. Returns up to 10 postings with title, company, industry, employment type, description summary, apply URL, and posting date. Uses jobicy.com — covers tech, finance, healthcare, marketing, and more. $1.50/call — 10% below closest x402 competitor.
| Name | Required | Description | Default |
|---|---|---|---|
| geo | No | Geographic filter — country name or 'Anywhere' for worldwide (e.g. 'USA', 'UK', 'Canada', 'Anywhere'). Optional. | |
| tag | Yes | Search keyword — role title, skill, or technology (e.g. 'software engineer', 'react', 'data analyst'). Required. | |
| count | No | Number of results to return (1–10, default 5). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations present. Description discloses source (jobicy.com), industries covered, and pricing. Missing details on error handling, rate limits, or read-only nature, but adds value beyond bare minimum.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences that efficiently convey action, output, source, and pricing. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but the description lists return fields. Covers scope (remote/hybrid), source, limitations (up to 10 results), and pricing. Complete for a search tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers all parameters with descriptions. Description adds context about the source and coverage, reinforcing the meaning of the parameters beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool searches remote and hybrid job listings by keyword and location, and lists the return fields. Distinguishes from siblings as there is no other job search tool.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides context on coverage (tech, finance, etc.) and pricing, but does not explicitly mention when not to use or alternatives. However, no direct sibling competitor exists.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
json-extractAInspect
Extracts and parses JSON from mixed-content text. Handles LLM output with JSON embedded in prose, code fences (```json), trailing commas, single-quoted strings, JS-style comments, and bare object keys (JSON5-style). Returns the parsed data, a cleaned JSON string, extraction method used, and any repair applied. Pure text processing — zero external API calls.
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Text containing JSON, possibly mixed with prose or code fences. Max 200,000 chars. | |
| schema_check | No | Optional: JSON Schema (draft-07 subset) to validate the extracted data against. If provided, returns a validation result. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description fully carries the burden. It discloses that it is pure text processing with zero external API calls, details edge cases handled (trailing commas, comments, etc.), and mentions output components. However, it does not describe error behavior when no JSON is found, which is a minor gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences with no wasted words. First sentence states purpose, second lists key features, third summarizes output and constraints. It is front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description explains the output components but lacks information about error handling or return format details (e.g., what happens on failure). Given the absence of an output schema, the description should cover these gaps, making it only moderately complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, with both parameters well-described. The main description reinforces but does not add significant meaning beyond the schema. Per guidelines, a baseline of 3 is appropriate when schema covers all parameters adequately.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool extracts and parses JSON from mixed-content text, with specific verb-resource combination. It distinguishes itself from sibling tools which are primarily data retrieval or analysis tools, none of which perform JSON extraction from free text.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage scenarios (LLM output, mixed content) but does not explicitly state when not to use it or mention alternatives. The context is clear enough given sibling tools are unrelated, but a note on exclusion (e.g., for well-formed JSON) would improve it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
korean-crypto-moversCInspect
Top movers and volume leaders on Korean exchanges (Upbit, 263 KRW markets). Returns biggest 24h price movers (% change from prev close), highest-volume tokens by KRW trade value, and optional raw snapshot. 20% cheaper than printmoneylab's market-movers endpoint at the same Upbit/Bithumb data layer.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of tokens to return per section (1–50). Default: 20. | |
| section | No | Data section to return. 'movers'=top price movers, 'volume'=top by 24h KRW volume, 'all'=both. Default: all. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It describes what is returned but does not mention data freshness, rate limits, or whether the operation is safe (read-only). The mention of cost comparison is irrelevant to behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is only two sentences and front-loads the core purpose. However, the second sentence contains a marketing comparison ('20% cheaper') that is not essential for tool usage, slightly reducing conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has no output schema, so the description should fully explain return values. It mentions price movers and volume leaders but does not specify the output format (e.g., JSON structure, field names). The addition of 'optional raw snapshot' is vague.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description restates the default values for 'limit' and 'section' but adds no new semantic information beyond what the schema already provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns top movers and volume leaders on Korean exchanges, specifying Upbit and 263 KRW markets. However, it does not differentiate from the similarly named sibling 'korean-market-movers', missing an opportunity to clarify distinctions.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives like 'korean-market-movers' or 'crypto-top-movers'. There is no mention of prerequisites, limitations, or exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
korean-market-moversCInspect
Real-time movers and volume-spike leaders across all KRW-denominated markets on Upbit (South Korea's largest crypto exchange). Returns top risers, top fallers, and top volume leaders ranked by 24h change or accumulated trade value. Korean exchange data is a leading indicator frequently observed by institutional agents — the 'kimchi premium' and local retail sentiment often precede global price moves. Free upstream source (Upbit public API), covers 260+ KRW markets.
| Name | Required | Description | Default |
|---|---|---|---|
| top_n | No | Number of results per category (risers, fallers, volume leaders). Default 10, max 30. | |
| symbol | No | Return data for a specific token symbol only (e.g. 'BTC', 'ETH'). Case-insensitive. If provided, top_n and min_volume_usd are ignored. | |
| min_volume_usd | No | Minimum 24h volume in USD to include in results (filters illiquid micro-caps). Default 50000. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description should disclose behavioral traits like idempotency, rate limits, or authentication. It mentions the source is free but does not clarify if calls are read-only or if there are usage restrictions. The agent is left guessing about safety and performance implications.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loaded with the core purpose. The institutional agent context adds value but could be more concise. Overall, it is well-structured and avoids redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description explains the tool's functionality and data source but omits the output structure (e.g., fields returned). Since there is no output schema, the agent needs to know whether it gets prices, percentages, or volume. This is a notable gap for a simple data retrieval tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers all parameters with descriptions (100% coverage), so the description adds no additional semantics. It does not clarify defaults, relationships between parameters, or how they affect output beyond what the schema states.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns 'top risers, top fallers, and top volume leaders' from Upbit's KRW markets. It specifies the exchange and currency, differentiating it from general crypto movers. However, it does not explicitly distinguish itself from the sibling 'korean-crypto-movers', which may cover similar data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The description hints at its use as a leading indicator but does not mention when not to use it or provide comparisons with similar tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
labor-briefBInspect
AI-synthesized US labor market briefing. Fetches 7 FRED signals (initial claims, continued claims, JOLTS openings, nonfarm payrolls MoM, unemployment rate, wage growth YoY, labor force participation) and uses GPT-4o-mini to produce labor regime, wage pressure, claims trend, Fed posture signal, 150-word narrative, dominant risk, and agent implication. One call collapses 7 FRED lookups + LLM synthesis for wage inflation models, recession probability, and Fed policy forecasting.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description discloses the FRED+GPT synthesis process and outputs, but omits side effects, rate limits, or read-only status. Adequate but leaves gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with key information. Reasonably concise though first sentence is dense with terms.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Lists all outputs (regime, wage pressure, etc.) and mentions use cases. Lacks explicit return format but sufficient given no output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
No parameters; baseline 4 as schema covers all. Description adds no parameter-specific meaning beyond what schema already shows.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it produces an AI-synthesized US labor market briefing using FRED data and GPT, but does not differentiate from sibling tool 'labor-market'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool vs siblings like 'labor-market' or any context about alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
labor-marketAInspect
Returns US labor market leading indicators from FRED (free, no API key): initial jobless claims (weekly), continued claims, JOLTS job openings, nonfarm payrolls, labor force participation rate, average hourly earnings with YoY wage growth, and the openings-per-unemployed Beveridge curve ratio. Pairs with macro-indicators for the complete employment picture.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description fully bears the burden of behavioral disclosure. It states the data is free, requires no API key, and lists specific indicators. While it lacks details about rate limits or update frequency, it adequately conveys that the tool is read-only and non-destructive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence that front-loads the tool's purpose. It is concise but slightly dense due to listing many indicators. Breaking into two sentences could improve readability, but overall it is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema and no parameters, the description adequately explains what the tool returns by listing specific data points. It mentions pairing with macro-indicators, which adds context. However, it does not specify the output format (e.g., JSON structure), which could be helpful.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has zero parameters, so the description does not need to explain parameter usage. Per the guidelines, a baseline of 4 is appropriate when there are no parameters, and the description correctly avoids unnecessary parameter information.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses specific verbs ('Returns') and identifies the resource ('US labor market leading indicators from FRED'), listing concrete data points (claims, JOLTS, payrolls, etc.). It distinguishes itself from sibling tools by explicitly mentioning pairing with 'macro-indicators' for a complete employment picture.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests when to use the tool (to get labor market indicators) and advises pairing with macro-indicators. However, it does not explicitly state when not to use it or provide alternative tools beyond macro-indicators.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
lbo-modelAInspect
Full LBO model: sources & uses, year-by-year operating model, debt schedule with cash sweep, IRR and MOIC, plus 3×3 entry/exit sensitivity tables. Pure computation — no API dependency. Priced $4.50, 10% below the closest x402 competitor at $5.00, with richer output including full operating model and dual sensitivity tables.
| Name | Required | Description | Default |
|---|---|---|---|
| da_pct | No | D&A as % of revenue. Default 0.04. | |
| tax_rate | No | Effective tax rate. Default 0.25. | |
| capex_pct | No | CapEx as % of revenue. Default 0.03. | |
| hold_years | Yes | Investment hold period in years (1–10). | |
| cash_on_hand | No | Target cash acquired at close (USD). Reduces equity check. Default 0. | |
| entry_ebitda | Yes | LTM EBITDA at acquisition (USD). | |
| debt_multiple | Yes | Initial leverage: Term Loan = entry_ebitda × debt_multiple. | |
| entry_revenue | Yes | LTM revenue at acquisition (USD). | |
| existing_debt | No | Target existing debt refinanced at close (USD). Default 0. | |
| exit_multiple | Yes | Exit EV/EBITDA multiple at end of hold period. | |
| interest_rate | No | Annual interest rate on term loan (e.g. 0.08). Default 0.08. | |
| cash_sweep_pct | No | Fraction of FCF applied to optional debt repayment (0–1). Default 1.0. | |
| ebitda_margins | Yes | EBITDA margin for each hold year (e.g. [0.25,0.26,0.27,0.27,0.27]). | |
| entry_multiple | Yes | Entry EV/EBITDA multiple. | |
| transaction_fee_pct | No | Total transaction fees as % of purchase price (e.g. 0.04 = 4%). Default 0.04. | |
| revenue_growth_rates | Yes | Annual revenue growth rate for each hold year (e.g. [0.08,0.07,0.06,0.05,0.05]). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description states 'Pure computation — no API dependency', which discloses the tool's self-contained, non-dependent nature. However, it does not elaborate on potential side effects, idempotency, or input validation behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph with two sentences, front-loading the core purpose. The inclusion of pricing and competitor comparison adds some minor clutter, but overall it is efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a pure computation tool with full schema coverage and no output schema, the description adequately explains what the tool produces and that it has no API dependency. It could mention input constraints like the hold period range, but overall it is sufficient for decision-making.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already defines each parameter. The description adds high-level model structure but does not enhance parameter meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states it is a 'Full LBO model' and lists specific output components (sources & uses, operating model, debt schedule, IRR, MOIC, sensitivity tables), clearly distinguishing it from sibling tools which are not LBO-specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when an LBO model is needed but does not explicitly contrast with alternatives or provide exclusion criteria. It mentions pricing and a competitor but offers no guidance on when to choose this over siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
legal-searchAInspect
Searches 5M+ US court opinions (SCOTUS, federal circuits, district courts, state courts) via CourtListener. Returns case name, court, date, citation, docket number, and a text snippet. Filter by court level, date range, or judge name. Useful for legal research agents, contract risk analysis, precedent lookup, and regulatory compliance workflows.
| Name | Required | Description | Default |
|---|---|---|---|
| court | No | Filter by court level: 'scotus', 'appeals', 'district', or a specific court ID (e.g. 'ca9', 'nysd'). Leave blank for all courts. | |
| limit | No | Max results (default 5, max 20). | |
| query | Yes | Search query — can be a legal concept, case name, statute, or citation (e.g. 'miranda rights', 'negligence per se', 'Title VII', '410 U.S. 113'). | |
| date_after | No | Filter to opinions filed after this date (YYYY-MM-DD). | |
| date_before | No | Filter to opinions filed before this date (YYYY-MM-DD). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It mentions the data source (CourtListener) and return fields, but lacks details on rate limits, pagination behavior, data freshness, or error handling. Adequate but not rich.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences: one for functionality, one for filters and use cases. No wasted words, front-loaded with core action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description explains return values. It covers source, filters, and use cases. Lacks mention of US-only scope or result limit, but otherwise complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% but the description claims a 'judge name' filter, which does not exist in the input schema. This is misleading. The additional context about query types (legal concept, citation) is helpful, but the discrepancy lowers the score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool searches US court opinions via CourtListener, specifying the corpus size and return fields. It distinguishes itself from siblings as no other tool in the list covers legal search.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists use cases (legal research, contract risk analysis, etc.) providing clear context for when to use. However, it does not explicitly exclude alternative tools or mention when not to use, but given no sibling overlap, this is minor.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
limitless-marketsCInspect
Returns active prediction markets from Limitless Exchange with current Yes/No probabilities. Covers short-duration crypto markets (5-min, 15-min BTC/ETH direction), plus longer-term markets. Returns title, implied yes/no prices, expiration, volume, categories, and slug. $0.006/call — free upstream, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| page | No | Page number for pagination (default 1). | |
| slug | No | If provided, fetch a specific market by its Limitless slug (e.g. 'btc-up-or-down-5-min-1780703750051'). Overrides other filters. | |
| limit | No | Number of markets to return (1–20, default 10). | |
| query | No | Optional keyword filter applied to market titles (case-insensitive). E.g. 'btc', 'eth', 'sol'. | |
| trade_type | No | Filter by trade mechanism: 'clob' (order book) or 'amm' (automated market maker). Default returns all. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must compensate. It mentions cost and that slug overrides filters, but lacks disclosure on rate limits, error handling, data freshness, or other behavioral traits beyond basic read operation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four sentences, front-loaded with purpose and coverage. Efficient and no redundancy, though could be slightly more succinct.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, so description must describe returns. It lists fields but not types or format. Adequate for a data retrieval tool but missing details for full agent understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter descriptions. The description adds little beyond summarizing output fields, so baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns active prediction markets from Limitless Exchange with Yes/No probabilities, covering short-duration crypto and longer-term markets. It lists output fields, giving a clear purpose. However, it does not differentiate from sibling tools like 'prediction-markets'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool vs alternatives. The description mentions market types and cost, but does not specify prerequisites, exclusions, or context for choosing this over similar tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
macro-briefAInspect
AI-synthesized US macroeconomic situation briefing. Gathers 7 real-time signals — HY/IG credit spreads, yield curve (10Y-3M), initial jobless claims, JOLTS openings, core PCE, and Fed Funds rate — from FRED (free, no auth) then uses GPT-4o-mini to synthesize a structured briefing: regime label (expansion/contraction/late-cycle/recovery/uncertain), dominant risk, agent implication, and a 200-word narrative. Replaces a 5+ step data assembly + LLM chain. Priced below Bloomberg macro summaries.
| Name | Required | Description | Default |
|---|---|---|---|
| style | No | Output length. 'standard' = 200-word narrative (default). 'concise' = 100-word summary. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the data sources (FRED, no auth) and the use of GPT-4o-mini for synthesis, along with output components. With no annotations, it adequately conveys the tool's operation and outputs, though it could mention potential inaccuracy or data lag due to AI synthesis.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with four sentences, each serving a purpose: stating the tool's function, listing signals, describing output, and providing value proposition. It is front-loaded and efficient, with no redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multiple signals, AI synthesis) and lack of output schema, the description adequately explains the output structure (regime label, dominant risk, narrative) and data sources. It could be more complete by mentioning data freshness or limitations, but overall it covers the essential context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The sole parameter 'style' is fully described in the input schema (standard vs concise, with defaults). The description does not add extra meaning beyond the schema, but schema coverage is 100%, so a baseline of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'AI-synthesized US macroeconomic situation briefing' that gathers 7 real-time signals and uses GPT-4o-mini to produce a structured output. It names specific components (regime label, dominant risk, etc.) and distinguishes itself from siblings by highlighting its synthesis step and all-in-one nature.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies when to use the tool: it 'replaces a 5+ step data assembly + LLM chain' and is 'priced below Bloomberg macro summaries,' suggesting it is a quick, cost-effective alternative. However, it does not explicitly state when not to use it or compare to specific sibling tools like 'macro-indicators', so some guidance is missing.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
macro-indicatorsAInspect
Returns current US macroeconomic indicators: Fed Funds Rate, CPI (with year-over-year inflation %), Unemployment Rate, and Real GDP. Sourced from FRED (St. Louis Federal Reserve), updated monthly. One call establishes the policy and economic backdrop for any risk-pricing or multi-asset workflow. Pair with market-overview for the full macro picture.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description carries full burden. Discloses data source (FRED) and update frequency (monthly), giving moderate transparency. Does not mention rate limits, availability, or any side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, each adding value: what it returns, source/update, and use case. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema and no annotations, the description adequately covers purpose, data, and usage context. Could be improved by noting any limitations or real-time aspects.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
No parameters, so baseline is 4. Description adds meaning by explaining what the tool returns, which is more than the empty schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it returns US macroeconomic indicators and lists specific ones (Fed Funds Rate, CPI, Unemployment, Real GDP). Mentions source and update frequency. Does not explicitly differentiate from other macro tools like macro-brief among siblings, but suggests pairing with market-overview.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides context on when to use (establish policy/economic backdrop for risk-pricing workflows) and suggests pairing with market-overview. However, lacks explicit when-not-to-use guidance or alternatives among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
manufacturing-briefAInspect
AI-synthesized US manufacturing & industrial sector briefing. Gathers 7 FRED signals (free, no auth): Industrial Production, Capacity Utilization, Durable Goods Orders, Manufacturing Output, Manufacturing Employment, Inventory/Sales Ratio, and PPI All Commodities. Uses GPT-4o-mini to synthesize: manufacturing regime (expanding/growing/stagnant/contracting), dominant risk, agent implication, and 200-word narrative. Completes the macro intelligence suite alongside energy-brief, labor-brief, and consumer-brief.
| Name | Required | Description | Default |
|---|---|---|---|
| style | No | Output length. 'standard' = 200-word narrative (default). 'concise' = 100-word summary. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Despite no annotations, the description discloses data sources (7 FRED signals, free, no auth) and the synthesis method (GPT-4o-mini). It does not cover rate limits or data freshness, but the transparency is good for a briefing tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no waste: first states purpose, second details data and output. Front-loaded with key information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers data sources, synthesis method, and output components (regime, risk, etc.), providing sufficient context for a briefing tool with no output schema. Minor omission: whether output is structured or plain text.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and the parameter description is clear. The tool description adds value by detailing the output structure (regime, risk, implication, narrative) beyond the schema, improving understanding of the single parameter's effect.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it synthesizes a US manufacturing briefing from 7 specific FRED signals, and distinguishes itself from sibling tools like energy-brief and labor-brief by explicitly naming them as completing the macro intelligence suite.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for manufacturing sector analysis and mentions siblings as alternatives, but lacks explicit guidance on when to use this tool vs others or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
market-intelligenceAInspect
Returns settlement-verified x402 endpoint intelligence. Shows which endpoints have genuine organic payer breadth and live pricing — sourced from on-chain USDC settlements, not just catalog listings. Filter by category, price range, and minimum unique payers. Useful before wiring a workflow to an external x402 endpoint: confirms it is active with multiple independent payers, not private infrastructure or a dead listing.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max results (default 20, max 50). | |
| sort_by | No | Sort order. Default: payers (highest organic breadth first). | |
| category | No | Filter by endpoint category (e.g., 'ai', 'data', 'finance', 'search'). Omit for all. | |
| min_payers | No | Minimum unique payer count (default 3). Higher = more proven organic demand. | |
| max_price_usd | No | Maximum endpoint price in USD. Omit for no ceiling. | |
| min_price_usd | No | Minimum endpoint price in USD (default 0.01). Use 0.50 to see PRIMARY_RANGE only. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, but description explains data source (on-chain USDC settlements), filtering, and purpose. Could mention read-only nature and latency, but it is fairly transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Paragraph of 5 sentences, well-structured with purpose first, then filters, then use case. Slightly verbose but informative.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, yet description does not detail the return format. Agent may not know what fields are returned (e.g., endpoint names, payers, prices). Could benefit from a brief sentence on output.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with good descriptions. Description adds value by explaining min_payers as 'higher = more proven organic demand' and min_price_usd use for PRIMARY_RANGE.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns settlement-verified x402 endpoint intelligence, distinguishing from catalog listings. It specifies filtering and the use case of confirming active endpoints with multiple independent payers.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly mentions when to use: 'before wiring a workflow to an external x402 endpoint'. Does not discuss alternatives or when not to use, but the context is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
market-moversAInspect
Today's top market movers — equity gainers, losers, most-active, and crypto gainers/losers by 24h change. Sourced from Yahoo Finance screener and CoinGecko (free, no API key). Returns symbol, name, price, % change, volume, and market cap. Filter by asset class (equities, crypto, or both) and mover type (gainers, losers, active). Use for pre-trade context, on-chain event correlation, and market-regime detection.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of results per category (1–20, default 10). | |
| mover_type | No | Which mover category. 'active' returns US equities by volume (no crypto equivalent). Default 'all'. | |
| asset_class | No | Which asset class to return. Default 'both'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Discloses data sources (Yahoo Finance, CoinGecko), free access, no API key required. Lists returned fields. Lacks info on data freshness or limitations, but adequately transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single paragraph with clear structure: purpose, sources, output fields, filters, use cases. Every sentence adds value. No redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, so description must explain return data. Does so adequately (lists fields). For a simple listing tool with good parameter descriptions, this is sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameters with descriptions. Description adds value by explaining each filter in context (e.g., 'active' returns US equities by volume) and default values. Enhances understanding beyond raw schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool returns top market movers including equity gainers/losers/most-active and crypto gainers/losers. Identifies specific resources and sources. Does not explicitly differentiate from sibling tools like 'crypto-top-movers' but implies broader scope.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides explicit use cases: pre-trade context, on-chain event correlation, market-regime detection. Mentions data sources and filter options. Does not include when-not-to-use or alternatives, but context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
market-overviewAInspect
Single-call market snapshot: SPY, QQQ, IWM, and DIA price + intraday % change, VIX fear gauge, 10-year Treasury yield (^TNX), and a derived risk-posture signal (RISK_ON / NEUTRAL / RISK_OFF / RISK_OFF_ELEVATED). Replaces 5–6 individual price calls with one structured payload useful for position-sizing, regime detection, or pre-trade context. Sourced from Yahoo Finance public data — live during market hours.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the data source (Yahoo Finance public data) and availability (live during market hours). It does not mention rate limits or error handling, but for a read-only snapshot tool, this is sufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences long, front-loaded with the key payload, and uses efficient phrasing. Every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema or annotations, the description covers the core fields, data source, and use cases. It could elaborate on the derivation of the risk signal, but overall it is sufficiently complete for the tool's simplicity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has zero parameters, so the description does not need to add parameter meaning. The description correctly identifies the tool as a single-call snapshot with no required inputs. Baseline 4 for zero parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns a market snapshot with specific assets (SPY, QQQ, IWM, DIA), VIX, Treasury yield, and a risk signal. It explicitly mentions it replaces 5-6 individual calls, distinguishing it from sibling tools like stock-brief or stock-price-multi.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains it is useful for position-sizing, regime detection, or pre-trade context, and that it replaces multiple individual calls. However, it does not explicitly state when not to use it or name alternative tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
market-sentimentAInspect
Combined crypto market sentiment signal: Crypto Fear & Greed Index (alternative.me, 0–100, 30-day trend) plus BTC and ETH Implied Volatility from Deribit options market (annualized %). Free upstream sources, no keys. Returns current FGI value, classification (Extreme Fear to Extreme Greed), 7-day FGI trend, IV trend (RISING/FALLING/FLAT), and a composite regime label (CAPITULATION_ZONE, OVERHEATED, COMPLACENT_GREED, CALM_MARKET, etc.). Use before making large DeFi positions, calibrating yield farming risk, or routing agent spend in volatile conditions.
| Name | Required | Description | Default |
|---|---|---|---|
| include_iv | No | Whether to fetch Deribit BTC/ETH implied volatility (adds ~500ms). Default true. | |
| history_days | No | Number of past daily FGI readings to include in response (1–30). Default 7. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries the burden. It discloses free upstream sources, no keys, latency impact of include_iv, and output components. This is sufficient for a non-destructive tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph but packs essential information without fluff. It is front-loaded with the main purpose and includes usage guidance efficiently.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but description enumerates return fields and example regime labels. For a non-nested tool with two optional params, this provides enough context for an agent to use it correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%. The description adds value beyond schema: notes that include_iv 'adds ~500ms' and history_days accepts '1–30'. This helps the agent understand parameter impact.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides a combined crypto sentiment signal (Fear & Greed Index + implied volatility) and lists specific output fields. However, it does not differentiate from siblings like 'crypto-fear-greed' or 'defi-market-pulse', which could cause ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides concrete usage scenarios: 'before making large DeFi positions, calibrating yield farming risk, or routing agent spend in volatile conditions.' This is helpful, though it does not explicitly state when not to use or list alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
meme-generatorBInspect
Generate a meme image URL from text. Provide text_top and text_bottom (the two lines of the meme), and optionally a template name. If no template is given, supply a topic keyword and a fitting template will be chosen automatically. Returns a direct image URL you can embed in messages, posts, or web pages. 211 templates available including drake, doge, distracted, buzz, fry, success, gru, oprah, and more. Free upstream: memegen.link (no API key, unlimited requests). Common agent use cases: social media content generation, reaction images, marketing copy, presentation humor.
| Name | Required | Description | Default |
|---|---|---|---|
| style | No | Optional style variant (e.g. 'animated', 'dark', 'no'). Only supported by some templates. Defaults to template default. | |
| topic | No | Topic keyword used to auto-select a template when 'template' is omitted. Examples: 'choice', 'crypto', 'success', 'fire', 'ai'. Ignored if template is specified. | |
| width | No | Output image width in pixels. Defaults to 600. | |
| height | No | Output image height in pixels. Defaults to 450. | |
| template | No | Meme template ID. Examples: drake, doge, distracted, buzz, fry, gru, success, oprah, astronaut, brain, fine, picard. Omit to auto-select from topic. | |
| text_top | No | Top line of the meme (the setup or rejected option). Max 120 chars. | |
| text_bottom | No | Bottom line of the meme (the punchline or preferred option). Max 120 chars. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Without annotations, the description carries the full burden. It mentions the source (memegen.link) and that it's free with no API key, but does not disclose error handling, rate limits, or behavior on invalid inputs. It lacks explicit statements about read-only or destructive nature.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph of four sentences, each adding value: core action, inputs, output format, template availability, source, and use cases. It is front-loaded with the main purpose and efficiently conveys key information without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a tool with 7 parameters and no output schema, the description covers the main aspects: how to use, template selection, auto-selection via topic, output type, and source. It mentions use cases and max character limits. While it lacks details on error responses, it is fairly complete for a simple image generation tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds value by explaining the roles of text_top (setup) and text_bottom (punchline), mentioning max lengths (120 chars), and providing examples for template and topic. It clarifies the auto-selection logic when template is omitted.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool generates a meme image URL from text, specifying required inputs (text_top, text_bottom) and optional parameters (template, topic). It covers the output and use cases. However, it does not explicitly differentiate from the sibling tool 'generate-meme'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains how to use the tool: provide top/bottom text, optionally a template, or supply a topic for auto-selection. It lists common use cases. However, it does not provide guidance on when not to use it or alternatives to consider.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
news-sentimentAInspect
Returns global news coverage and sentiment for any company, stock ticker, or topic. Primary source: GDELT Project v2 (250M+ articles, ML tone scoring). Fallback: Google News RSS with keyword heuristic. Returns article count, avg sentiment tone (−100 negative → +100 positive), top positive/negative headlines, and top news domains. Lookback: 1–30 days (default 3). Results cached 10 min.
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | Lookback window in days (1–30). Default: 3. | |
| query | Yes | Company name, stock ticker, or news topic (e.g. 'NVIDIA', 'Apple earnings', 'Bitcoin regulation', 'Fed interest rates'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Despite no annotations, the description discloses key behaviors: primary and fallback sources, caching (10 min), lookback range, and return fields. However, it does not mention rate limits or potential delays. Good, but not perfect.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four dense sentences, each serving a purpose: purpose, sources, outputs, configuration. Front-loaded with the core action. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but description adequately explains return data: article count, sentiment tone, top headlines, top domains. Caching and lookback are covered. One might want more detail on headline selection criteria, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameters with descriptions. The description adds value by providing examples of query values and clarifying the default for 'days' (3). This enriches understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool returns global news coverage and sentiment for companies, tickers, or topics. Verb 'Returns' is specific, and the resource (news sentiment) is well-defined. Among many news-related siblings, this description distinguishes via unique sources (GDELT, Google News RSS) and output metrics.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance. The description implies usage for news sentiment analysis but does not differentiate from sibling tools like 'equity-sentiment' or 'market-sentiment'. A brief comparison would improve this.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nft-metadataAInspect
Fetch NFT metadata, traits, image URL, and collection floor price for any ERC-721 or ERC-1155 token. Returns name, description, all attributes/traits, cached image CDN URL, collection name, OpenSea floor price, token type, and mint block. Supports Ethereum, Polygon, Base, Arbitrum mainnet. Useful for NFT valuation research, portfolio analysis, content generation, and collection intelligence. Free upstream: Alchemy NFT API.
| Name | Required | Description | Default |
|---|---|---|---|
| network | No | Blockchain network. Options: ethereum, polygon, base, arbitrum, optimism. Default: ethereum. | |
| contract | Yes | NFT contract address (0x hex, 42 chars). Example: 0xBC4CA0EdA7647A8aB7C2061c2E118A18a936f13D (BAYC). | |
| token_id | Yes | Token ID as a string or integer. Example: '42' or '1000'. ERC-1155 token IDs also accepted. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the data source (Alchemy NFT API) and that it is free, but does not mention rate limits, auth requirements, or caching behavior. The description adds some value but could be more transparent about potential limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, front-loaded with the main action, and efficiently lists supported networks, return fields, and use cases. Every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description comprehensively lists return fields (name, description, attributes, image URL, floor price, etc.) and supported networks. Also mentions use cases, making it complete for an agent to understand what the tool does and returns.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds marginal value by providing an example contract address and clarifying that ERC-1155 token IDs are accepted, but mostly repeats schema info. No additional parameter semantics beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description starts with a clear verb 'Fetch' and specifies the resource 'NFT metadata, traits, image URL, and collection floor price for any ERC-721 or ERC-1155 token'. It lists return fields and supported networks, distinguishing it from sibling tools like 'erc20-snapshot' which focus on different token standards.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions use cases ('NFT valuation research, portfolio analysis, content generation, and collection intelligence'), providing context for when to use it. However, it does not explicitly state when not to use it or compare with alternatives, but given no obvious sibling for NFT metadata, this is acceptable.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
npi-lookupAInspect
US NPI registry lookup — find any licensed US healthcare provider or organization by NPI number, name, state, or specialty. Returns NPI, entity type, name, credentials, specialty/taxonomy, license states, practice address, and active status. 7M+ records from CMS. Use for provider credentialing, healthcare due diligence, billing verification, or AML/KYB screening on medical payments.
| Name | Required | Description | Default |
|---|---|---|---|
| npi | No | 10-digit NPI number for direct lookup (fastest, most precise). | |
| limit | No | Max results to return (1–50, default 10). | |
| state | No | 2-letter US state code to filter by practice location (e.g. 'CA', 'NY'). | |
| last_name | No | Provider last name (individual providers). Supports wildcard with '*' suffix (e.g. 'Smi*'). | |
| specialty | No | Taxonomy/specialty description to filter (e.g. 'Internal Medicine', 'Cardiology', 'Psychiatry'). | |
| first_name | No | Provider first name (individual providers only). | |
| postal_code | No | 5-digit ZIP code to filter by practice location. | |
| organization | No | Organization name for NPI-2 (group practices, hospitals, labs, etc.). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description carries the full burden. It describes the return fields and data source (7M+ records from CMS) but does not disclose any behavioral traits like rate limits, authentication needs, or side effects. The description is adequate but not beyond the obvious.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences. The first sentence front-loads the core purpose and search capabilities; the second adds return fields and use cases. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 8 parameters and no output schema, the description covers key return fields, data source, and use cases. It lacks mention of pagination or error handling, but overall it is complete for a lookup tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, and the description adds value by noting that 'npi' is fastest/most precise and that 'last_name' supports wildcard with '*' suffix. This enriches understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it performs a US NPI registry lookup, listing specific data fields and use cases, which distinguishes it from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly lists four use cases (provider credentialing, healthcare due diligence, billing verification, AML/KYB screening). It provides clear context for when to use but does not mention when not to use or compare to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
npm-lookupAInspect
Node.js / JavaScript package metadata from the npm registry. Returns latest version, description, license, keywords, direct dependencies, weekly download count, publish date, GitHub repository, and all recent versions. Also supports looking up a specific version. Use before adding an npm package as a dependency: verify it's maintained, check its license, assess how many dependencies it pulls in. Free upstream: npm registry API (no key, public).
| Name | Required | Description | Default |
|---|---|---|---|
| package | Yes | npm package name (e.g. 'express', 'react', '@anthropic-ai/sdk'). Scoped packages like '@scope/name' are supported. | |
| version | No | Specific version to look up (e.g. '4.18.2'). Defaults to latest stable. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description adequately explains the tool's behavior: it fetches metadata from the npm registry, returns multiple fields, and supports version lookups. It also notes 'Free upstream: npm registry API (no key, public)', implying no rate limits or authentication needed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: three sentences clearly stating purpose, list of returned data, usage advice, and upstream info. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description covers the return values well. It lacks details on error handling or rate limits, but for a simple lookup tool, it is sufficient. Mention of the free API adds useful context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptive parameter names and descriptions. The description adds context about what the tool returns (e.g., all recent versions) but does not significantly enhance parameter meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns npm package metadata and lists specific fields (version, description, license, etc.). It distinguishes from siblings like pypi-lookup by explicitly mentioning npm registry.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit guidance: 'Use before adding an npm package as a dependency' and mentions verification, license checking, dependency assessment. It does not explicitly state when not to use or compare with other tools, but context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
options-chainAInspect
CBOE delayed options chain for any US equity or index — returns stock price, per-contract IV, greeks (delta/gamma/theta/vega), OI, volume, and bid/ask. Filterable by expiration date and call/put. Free CBOE data, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| exp | No | Filter to a specific expiration date YYYY-MM-DD. Omit to return the nearest max_expirations dates. | |
| type | No | Filter by option type. Omit to return both calls and puts. | |
| ticker | Yes | US equity or index ticker. Examples: AAPL, SPY, QQQ, TSLA, NVDA. | |
| near_atm_only | No | If true, only return strikes within 20% of the current underlying price. Default false. | |
| max_expirations | No | Maximum number of expiration dates to include when no specific exp is set. Default 4, max 12. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so the description must cover behavioral traits. It mentions 'delayed' data and 'free, no API key', but fails to disclose rate limits, pagination, error handling for invalid tickers, or any destructive actions. The lack of annotations makes this an incomplete disclosure.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the most critical information (what it does and returns). Every word adds value without redundancy or filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 5 parameters, no output schema, and no annotations, the description covers the core functionality but omits details about output structure, data latency (how delayed?), maximum strikes, and error conditions. It is adequate but not fully comprehensive.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters described. The description adds context beyond the schema by explaining default behaviors (e.g., omitting exp returns nearest max_expirations dates, omitting type returns both). This extra guidance slightly elevates the score above the baseline 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns a CBOE delayed options chain for any US equity or index, specifying exact data points (stock price, IV, greeks, OI, volume, bid/ask). It distinguishes from siblings like 'options-snapshot' by detailing the comprehensive output.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains the data source and cost (free, no API key), implying general use for options chain retrieval. However, it lacks explicit guidance on when to use this tool over alternatives like 'options-snapshot' or 'credit-spreads', which are present in sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
options-snapshotAInspect
Options intelligence snapshot for any US equity — IV30, put/call volume ratio, top calls and puts by trading volume, and unusual-volume flags. Free CBOE delayed data (15-min delay during trading hours), no API key required. Complements us-stock-price and equity-technicals with the options-layer sentiment layer agents need for complete trade context. $0.015/call.
| Name | Required | Description | Default |
|---|---|---|---|
| top_n | No | Number of top calls and top puts to return (by volume). Default: 5. | |
| ticker | Yes | US equity ticker symbol (e.g. AAPL, TSLA, NVDA, SPY). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description fully discloses behavioral traits: data source (CBOE delayed), delay (15-min during trading hours), cost ($0.015/call), and no API key required. This provides all necessary transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with purpose and key data, then data source/cost, then complementary role. Every sentence adds value with no redundancy or fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Comprehensive for a simple 2-parameter tool with no output schema: describes all output fields (IV30, ratio, top calls/puts, unusual volume), data source, delay, cost, and complementary tools. No gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described. Description adds minor value via examples (AAPL, TSLA) and clarifies case-insensitivity and default for top_n, but does not significantly extend schema meaning.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Options intelligence snapshot for any US equity' with specific data points (IV30, put/call ratio, top trades, unusual volume). Distinguishes from sibling tools by mentioning it's the 'options-layer sentiment' complement to us-stock-price and equity-technicals.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states it complements us-stock-price and equity-technicals for complete trade context, implying when to use. Mentions free delayed data and no API key, but does not explicitly list when not to use (e.g., real-time needs) or directly reference the sibling options-chain.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
page-intelAInspect
Extracts structured content from any public URL: page title, meta description, H1-H3 headings, all links (with text and internal/external flag), and a 500-character text preview. Useful for research agents following link chains from on-chain data, auditing page structure, or seeding downstream text-generation calls.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Public HTTP/HTTPS URL to fetch. Redirects are followed. Max 256 KB of response read. | |
| link_limit | No | Maximum number of links to return (default 50, max 200). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description must carry the transparency burden. It does not disclose behavioral traits such as authentication needs, rate limits, error handling, or caching. The only behavioral detail (redirect following) appears in the schema, not the description. The description is minimally transparent for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the primary action and output, then usage context. Every word adds value; no redundancy or filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple extraction tool with 2 parameters and no output schema, the description adequately lists all returned fields. However, it does not specify the exact JSON structure or ordering, which might require the agent to infer. Still, the information is sufficient for most agent scenarios.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description does not add any parameter-specific details beyond the schema; the schema already clearly explains 'url' and 'link_limit'. The description adds no extra semantic value.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses a specific verb ('extracts') and defines the resource ('structured content from any public URL') with a detailed list of extracted fields (title, meta description, headings, links, text preview). It clearly distinguishes the tool from siblings like 'page-links' and 'readable-content' by specifying its structured output.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases: research agents following link chains, auditing page structure, seeding text-generation. However, it does not mention when not to use this tool versus alternatives like 'page-links' or 'web-scrape-links', leaving some ambiguity for the agent.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
page-linksAInspect
Extracts all hyperlinks from a webpage. Fetches the target URL, resolves relative links to absolute URLs, and classifies each as internal (same domain) or external. Filter by all/external/internal, cap results with limit. Returns page title, total link count before filtering, and a structured array of {href, text, is_external, domain}. Priced at $0.004 — 20% below orbisapi web-scrape-links ($0.005/call). Upstream: direct HTTP fetch, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Webpage URL to extract links from (http or https). | |
| limit | No | Max links to return (1–200). Default: 100. | |
| filter | No | Which links to return: all (default), external-only (different domain), or internal-only (same domain). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses behavior: fetches URL, resolves relative links, classifies links, returns page title and count, and provides pricing and upstream details. No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (5 sentences) with no superfluous information. It covers purpose, process, options, output, pricing, and upstream in a structured manner.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description explains the output format (page title, total count, array with fields) and filter options, making it complete for an agent to understand usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds meaningful context beyond the schema (e.g., resolving relative links, classification logic, output structure).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly defines the tool's purpose: extracting all hyperlinks from a webpage, with specific actions (fetch, resolve, classify). It distinguishes from the sibling 'web-scrape-links' via pricing and downstream capabilities.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context (filter by internal/external, cap results) but does not explicitly state when to use this tool versus alternatives like 'web-scrape-links'. No clear when-not or exclusion criteria.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
pingAInspect
Liveness + echo probe. Pays back a timestamp and echoes msg. Use to verify the x402 rail and Bazaar listing end to end.
| Name | Required | Description | Default |
|---|---|---|---|
| msg | No | optional string to echo back |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. It discloses that it returns a timestamp and echoes msg, which implies no side effects, but it does not explicitly state idempotency or safety. For a ping tool, this is adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with key terms 'Liveness + echo probe.' No wasted words. Perfectly concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with one optional parameter and no output schema, the description fully covers what the tool does and when to use it. No additional details are needed.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (the msg parameter has a description). The description adds the word 'echoes' but does not provide additional semantic nuance beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is a liveness probe and echo service that returns a timestamp and echoes the msg parameter. It distinguishes itself from siblings (all other tools are domain-specific) as a general end-to-end verification tool.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states when to use it: 'to verify the x402 rail and Bazaar listing end to end.' It does not provide alternative tools or exclusions, but given its unique role as a liveness probe, the guidance is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
place-detailsAInspect
Enriched place and business details by name (OSM Nominatim). Returns website, phone, email, opening hours, operator, brand, cuisine, social media links, full address, and coordinates. Use when you need the business metadata behind a location — not just where it is, but who runs it and how to reach it. Cheaper than Google Maps place details.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max results (default 1, max 5). Use >1 when the query may match multiple locations. | |
| query | Yes | Business or place name, optionally with city/region for disambiguation. Examples: 'Starbucks Times Square New York', 'Eiffel Tower Paris', 'Golden Gate Bridge San Francisco'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It lists return fields and the data source. Lacks details on error handling or limitations, but overall transparent for a read-only search tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three succinct sentences: purpose, returns, usage guidance. No fluff, every sentence adds value. Well-structured and easy to parse.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Adequate context given no output schema the tool returns field names implying a structured response. Could mention response format or error handling, but sufficient for a simple lookup tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description does not add significant meaning beyond the schema's existing descriptions and examples. The mention of disambiguation is already in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool returns enriched business details (website, phone, email, etc.) via OSM Nominatim. It distinguishes itself from sibling tools like geocode by focusing on business metadata beyond coordinates.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear guidance: 'Use when you need the business metadata behind a location — not just where it is, but who runs it and how to reach it.' Also mentions it's cheaper than Google Maps, implying an alternative.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
policy-impact-mapperAInspect
Analyzes regulatory and policy text to map its impact across industry sectors. Extracts compliance requirements, change types (new mandates, prohibitions, exemptions), effective dates, affected entities, and assigns sector-level impact scores (HIGH/MEDIUM/LOW). Pure pattern analysis — no LLM, instant results. Useful for compliance agents, regulatory monitoring workflows, and policy change digests.
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Policy, regulation, or legislative text to analyze. Can be a full document, a section, or a press release summarizing a policy change. Max 100,000 chars. | |
| title | No | Optional title or name of the policy/regulation (e.g. 'EU AI Act Article 13'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses that analysis is pattern-based, not LLM-based, and produces results instantly. It does not explicitly mention read-only nature, side effects, or authorization needs. Adequate but not comprehensive for a tool with no annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the main purpose, followed by details and characteristics. Every sentence adds value; no fluff or redundancy. Highly efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The tool has no output schema, so the description compensates by listing extracted items (compliance requirements, change types, etc.) and impact scores. It mentions 'instant results' but does not specify output format or error handling. Sufficient for a mapper tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described. The description reinforces the purpose but adds minimal new parameter-specific meaning beyond the schema (e.g., 'Max 100,000 chars' is already in schema). Baseline 3 is appropriate as schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly defines the tool's purpose: analyzing regulatory/policy text to map impact across sectors, listing specific outputs (compliance requirements, change types, etc.). It distinguishes itself from siblings by emphasizing 'pure pattern analysis — no LLM, instant results,' which differentiates it from other tools that may use AI.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides usage context, stating it is useful for compliance agents, regulatory monitoring, and policy change digests. It does not explicitly state when not to use it or point to alternatives, but the context is sufficient for an agent to decide. Lacks explicit exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
polymarket-accuracy-scoreAInspect
Historical Polymarket crowd accuracy score: % of markets where the final crowd majority correctly predicted the outcome, plus Brier score (calibration quality). Breakdowns by category — crypto, politics, sports, macro, equities, ai. Filter by category and lookback days. $0.004/call — 20% below closest x402 competitor. Source: Polymarket public API (no key required).
| Name | Required | Description | Default |
|---|---|---|---|
| category | No | Restrict analysis to one category. Omit for all categories. | |
| days_back | No | Lookback window in days for resolved markets (1–90, default 30). | |
| min_volume | No | Minimum market trading volume in USDC to include (default 0). Use 1000 to focus on liquid markets. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry the burden. It mentions no authentication required and a cost, but does not explicitly state it is read-only or disclose rate limits or data freshness. Adequate but could be improved.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with the key purpose, uses four focused sentences with no wasted words. Each sentence adds value, including pricing and source information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema is present, but the description adequately explains the return values (percentage correct, Brier score, breakdowns by category). It could mention output format (e.g., JSON) but is sufficient for an agent to understand what to expect.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, providing a baseline of 3. The description adds value by listing example categories (crypto, politics, sports, etc.) and mentioning 'lookback days' and default 30, which enriches the schema's descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool computes historical Polymarket crowd accuracy, including percentage correct and Brier score, with breakdowns by specific categories. It distinguishes itself from siblings like 'polymarket-category-performance' by focusing on crowd accuracy metrics.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides context (cost, source, no key required) but does not explicitly state when to use this tool versus alternatives like 'polymarket-category-performance' or 'polymarket-sentiment-shift'. However, the purpose is clear enough that the agent can infer usage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
polymarket-category-performanceAInspect
Polymarket category activity breakdown: volume, liquidity, market count, and top market per category (crypto, politics, sports, ai, macro, equities). Shows where trading activity is concentrated. Optionally filter to one category. $0.004/call — 20% below closest x402 competitor. Source: Polymarket public API (no key required).
| Name | Required | Description | Default |
|---|---|---|---|
| category | No | Filter to a single category. Omit to return all categories ranked by volume. | |
| min_liquidity | No | Only include markets with at least this much liquidity (USD). Default: 1000. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so description carries full burden. It discloses pricing ($0.004/call, 20% cheaper than competitor) and source (public API, no key required). However, it omits details on rate limits, data freshness, or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with three sentences that front-load the purpose and include pricing and source. No extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists; description lists output fields (volume, liquidity, market count, top market) but lacks structure details. It adequately specifies what the tool returns but doesn't fully compensate for missing output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds marginal value by listing example categories and mentioning the default for min_liquidity, but does not provide new information beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides a breakdown of Polymarket categories with specific metrics (volume, liquidity, market count, top market). It distinguishes from sibling tools like polymarket-accuracy-score by focusing on category activity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for high-level category concentration analysis but does not explicitly state when to use vs. alternatives or when not to use. No exclusions are given.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
polymarket-intelAInspect
Top active Polymarket prediction markets ranked by trading volume — question, Yes probability, 24h volume, liquidity, 1d/1wk price change, resolution date. No API keys. $0.003/call.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of markets to return (1–25). Default: 10. | |
| period | No | Volume period used for ranking (default: 24h). | |
| min_liquidity | No | Filter out markets with liquidity below this USD threshold (default: 1000). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description mentions no API keys and cost per call but does not explicitly state it is a read-only fetch. It implies read behavior but lacks depth on data freshness or limitations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence that efficiently conveys the tool's purpose, output fields, and key facts (no API keys, cost). No redundant words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description outlines the return fields (question, Yes probability, etc.). It covers the output structure adequately but omits details like response format (JSON) and pagination.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description does not add extra meaning beyond the schema. Parameters 'limit', 'period', and 'min_liquidity' are self-explanatory from the schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns top active Polymarket prediction markets ranked by trading volume, listing specific output fields. It distinguishes from sibling tools like 'prediction-markets' by focusing on Polymarket and ranking by volume.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not provide guidance on when to use this tool vs alternatives like 'polymarket-accuracy-score' or 'polymarket-category-performance'. It only states what it does, leaving the agent to infer usage context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
polymarket-sentiment-shiftAInspect
Returns Polymarket prediction markets with the biggest recent probability shifts — useful for detecting sudden consensus changes on elections, crypto prices, and macro outcomes. Sorted by absolute 1-week change by default. Each result includes current probability, price change, volume, and resolution date. $0.008/call — free upstream.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max markets to return (1–20, default 10). | |
| window | No | Time window for price change ranking: '1d' (24h), '1w' (7-day, default), '1m' (30-day). | |
| direction | No | Filter by direction of shift: 'up' (probability rising), 'down' (falling), or 'all' (default). | |
| min_volume | No | Minimum total USDC trading volume. Default 10000. Higher = more liquid markets. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It discloses cost ($0.008/call) and mentions output fields, but lacks details on destructive behavior, authentication needs, rate limits, or safety. The description adds some behavioral context (sorting, output fields) but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences pack purpose, use case, content, and cost. No wasted words; front-loaded with action and resource.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description competently lists output fields (probability, price change, volume, resolution date). It covers sorting and filtering but lacks error handling or rate limit info. For a data retrieval tool, this is mostly adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds value by explaining the default sorting (absolute 1-week change) and listing output fields. It also mentions minimum volume default (10000 USDC) implicitly. This provides meaning beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns prediction markets with the biggest probability shifts, sorted by change, and lists included fields (probability, price change, volume, resolution date). It distinguishes itself by focusing on sentiment shifts rather than broad market data, though not explicitly differentiating from siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for detecting consensus changes on elections, crypto prices, and macro outcomes, but does not explicitly state when to use this tool over siblings like prediction-markets or polymarket-accuracy-score. No when-not or alternatives provided.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
portfolio-rebalanceAInspect
Pure-math portfolio rebalancing calculator. Given current holdings (asset names and their current USD values) and target allocations (percentages summing to 100), returns the exact buy/sell dollar amounts needed to reach target weights. Handles partial rebalancing with a drift threshold. No external API — deterministic and instant. Useful for DeFi agents, robo-advisory flows, and automated rebalancing triggers.
| Name | Required | Description | Default |
|---|---|---|---|
| targets | Yes | Target allocation percentages. Must sum to 100. | |
| holdings | Yes | Current portfolio positions. | |
| cash_injection | No | Optional additional cash (USD) to deploy during rebalancing (positive = buying, negative = withdrawing). | |
| drift_threshold_pct | No | Minimum drift percentage before an asset is flagged as needing rebalance (default 1.0). Positions within threshold are marked 'in_range'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden of disclosure. It states the tool is deterministic, instant, and handles partial rebalancing with a drift threshold, which conveys key behavioral traits. However, error conditions (e.g., targets not summing to 100) are not addressed.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is four sentences long, front-loaded with the core purpose, and free of redundant or filler content. Every sentence contributes meaningful information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description provides a reasonable overview of output behavior (e.g., 'exact buy/sell dollar amounts' and 'in_range' flag). It covers key functionality, use cases, and the drift threshold feature, but could be more precise about the output structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for all four parameters, so baseline is 3. The description adds conceptual context (e.g., 'drift threshold' for partial rebalancing) but does not elaborate on parameter syntax or format beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly identifies the tool as a 'Pure-math portfolio rebalancing calculator' that calculates buy/sell amounts for given holdings and targets. It distinguishes itself from siblings by emphasizing its deterministic, no-API nature, but does not explicitly name alternative tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions use cases like 'DeFi agents, robo-advisory flows, and automated rebalancing triggers,' providing context for when to use it. However, it lacks explicit guidance on when not to use it or comparisons to sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
prediction-marketsAInspect
Returns top active Polymarket prediction markets sorted by trading volume. Includes crowd-sourced outcome probabilities (0–1), USDC volume, liquidity, and resolution date. Filter by keyword. Use this to gauge market consensus on events before making decisions. $0.05/call — free upstream, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max markets to return (1–25, default 10). | |
| query | No | Optional keyword filter. Returns only markets whose question contains this string (case-insensitive). E.g. 'bitcoin', 'election', 'fed rate'. | |
| min_volume | No | Minimum USDC trading volume to include (default 1000). Higher = more liquid and reliable signal. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It mentions cost ($0.05/call) and no API key requirement, but does not state whether the tool is read-only or non-destructive, nor does it address caching or data freshness. The description is moderately transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (four sentences) and front-loaded with the core action. Each sentence adds value: what the tool returns, included data items, filtering, usage context, and cost. No extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the moderate complexity (3 optional parameters, no output schema), the description adequately covers what is returned, how to filter, and pricing. It lacks mention of pagination or explicit sorting order details, but these are minor omissions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with detailed parameter descriptions (e.g., limit range, query case-insensitivity, min_volume default). The description only reiterates 'Filter by keyword' without adding new semantic context beyond the schema, meeting the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Returns top active Polymarket prediction markets sorted by trading volume.' It specifies the verb ('Returns'), the resource ('Polymarket prediction markets'), and the sorting criterion, differentiating it from sibling tools like polymarket-accuracy-score.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides a clear usage guideline: 'Use this to gauge market consensus on events before making decisions.' While it does not explicitly list when not to use it or mention alternatives, the stated use case is straightforward and actionable.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
prediction-stock-pulseAInspect
One call returns prediction market sentiment (Limitless Exchange) + live equity price for a specified ticker. Collapses the prediction-market → stock-price agent chain into a single x402 payment. $0.016 vs $0.024 bought individually. Inputs: ticker (required), query (optional keyword filter for prediction markets).
| Name | Required | Description | Default |
|---|---|---|---|
| query | No | Optional keyword to filter prediction markets (e.g. 'btc', 'eth', 'rate cut'). If omitted returns top 5 markets by volume. | |
| ticker | Yes | US stock ticker (e.g. AMD, NVDA, SPY). Case-insensitive. | |
| market_limit | No | Number of prediction markets to return (1–10, default 5). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions returning sentiment and price and a specific exchange, but does not disclose limitations like market_limit behavior, which is in the schema but not in the description. More detail on what 'sentiment' means would be helpful.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-structured with three sentences: core action, value proposition, and input list. It is concise and front-loaded with the main purpose, though slightly repetitive with the schema.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 3 parameters and no output schema, the description should describe the output format more thoroughly. It mentions 'returns prediction market sentiment' but not its structure, and omits the 'market_limit' parameter entirely, limiting completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description repeats parameter info from the schema without adding new meaning. It fails to mention the 'market_limit' parameter, so it does not improve understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool returns prediction market sentiment and live equity price for a specified ticker, differentiating it from siblings that only do one. It also mentions collapsing a two-step agent chain into one call, which strongly conveys its unique purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
While the description explains that the tool combines two functions and saves cost, it does not explicitly state when to use alternatives like 'us-stock-price' or 'prediction-markets' if only one is needed. The guidance is present but incomplete.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
protocol-revenue-leadersAInspect
Returns DeFi protocols ranked by daily fees (revenue generated). Covers 1000+ protocols across chains — DEXes, lending markets, derivatives, stablecoins. Includes 1d/7d/30d trend context, category breakdown, and chain presence. Use to identify which protocols are capturing the most economic activity, screen for fundamental DeFi strength, or compare protocol revenue trajectories.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of protocols to return. Default 20, max 100. | |
| sort_by | No | Ranking metric. 'daily_fees' = 24h fees (default). '7d_fees' = 7-day total. '30d_fees' = 30-day total. '1d_change' = biggest 1-day fee growth (%). | |
| category | No | Filter by protocol category (e.g. 'Dexes', 'Lending', 'Derivatives', 'CDP', 'Liquid Staking', 'Bridge'). Case-insensitive partial match. | |
| min_daily_fees | No | Minimum 24h fees in USD to include a protocol (e.g. 10000 for $10K+). Filters out noise. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description mentions outputs (trend context, category breakdown, chain presence) and coverage (1000+ protocols) but lacks details like data update frequency, pagination, or read-only nature. No annotations exist to supplement.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences long, front-loads the core function, and every sentence adds value. No superfluous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 4 optional parameters, no output schema, and no annotations, the description provides sufficient context about functionality and use cases. Minor gap: does not specify output format explicitly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema descriptions cover all 4 parameters (limit, sort_by, category, min_daily_fees) with clear explanations. The tool description adds no additional parameter-level context, so baseline 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns DeFi protocols ranked by daily fees, covering 1000+ protocols across multiple categories and chains. It distinguishes itself from sibling tools like 'defi-market-pulse' or 'defi-portfolio' by focusing specifically on revenue rankings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides concrete use cases: identifying protocols capturing economic activity, screening for DeFi strength, comparing revenue trajectories. However, it does not explicitly state when not to use this tool or suggest alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
pypi-lookupAInspect
Python package metadata from PyPI. Returns latest version, summary, author, license, Python version requirement, install dependencies, release date, and download URLs. Also supports fetching a specific version. Use before integrating a Python library: check if it's actively maintained, what license it uses, and whether it's compatible with your Python version. Free upstream: PyPI JSON API (no key, no rate limit for normal use).
| Name | Required | Description | Default |
|---|---|---|---|
| package | Yes | PyPI package name (e.g. 'requests', 'numpy', 'anthropic', 'langchain'). Case-insensitive. | |
| version | No | Specific version to look up (e.g. '2.31.0'). If omitted, returns the latest stable release. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description notes that the upstream PyPI JSON API is free with no key and no rate limit, which is helpful. However, it does not mention potential errors (e.g., package not found) or any other behavioral details.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (two sentences plus a note) and front-loaded with the tool's purpose. Every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description thoroughly lists the return fields (latest version, summary, author, license, etc.) and covers version support, making it complete for a lookup tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%. The description adds that package names are case-insensitive and provides examples. It explains that omitting version returns the latest stable release, adding value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Python package metadata from PyPI' and lists the specific data returned. It distinguishes itself from sibling tools like 'npm-lookup' by being Python-specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly advises using the tool before integrating a Python library to check maintenance, license, and compatibility. It provides clear context but does not mention alternatives or when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
readable-contentAInspect
Fetches any public URL and returns the full readable article text as clean Markdown, stripped of navigation, ads, and boilerplate. Returns title, published date (if available), and the complete body ready for LLM summarization, analysis, or RAG ingestion. A $0.004 alternative to exa.ai/contents ($0.007) and web-read ($0.016) for the same full-text extraction primitive.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Public HTTP/HTTPS URL to extract readable content from. Redirects are followed. | |
| no_cache | No | If true, forces a fresh fetch bypassing Jina's cache (default false). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description takes on the burden of disclosure. It clearly states behavior: fetches public URLs, strips boilerplate, returns Markdown with title/date/body, follows redirects, and supports optional cache bypass. It lacks details on failure modes or rate limits but covers the core behavior adequately.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, front-loaded with the action and result. Every sentence adds value: what it does, what it returns, and cost comparison. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the return format (title, date, body as Markdown) and mentions the output is ready for LLM tasks. For a simple fetch tool without output schema, this is sufficient. It also provides pricing context. Minor missing details on error handling, but acceptable.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% and both parameters have clear descriptions matching the description. The tool description adds no extra semantics beyond what the schema already provides. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description uses a specific verb 'Fetches' and clearly identifies the resource (public URL, full readable article text as Markdown). It distinguishes from siblings by naming alternatives (exa.ai/contents, web-read) and highlighting cost advantage, making the tool's unique value proposition explicit.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description states when to use the tool (for full-text extraction) and explicitly compares it to cheaper alternatives, implying it's the preferred choice for that use case. However, it does not specify when not to use it (e.g., if raw HTML or only metadata is needed).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
reddit-intelAInspect
Searches Reddit posts and/or comments by keyword. Returns top results with score, comment count, author, subreddit, URL, and timestamp. Filter by subreddit, sort by score or date. Supports separate post and comment search. Sourced from PullPush public API — no auth required. Ideal for competitive sentiment analysis, trend detection, and community research.
| Name | Required | Description | Default |
|---|---|---|---|
| q | Yes | Search query. Supports quoted phrases (e.g. "AI agents") and boolean operators. | |
| mode | No | What to search: 'posts' (default), 'comments', or 'both'. | |
| sort | No | Sort field. Default: score. | |
| limit | No | Max results per mode (1–25). Default: 10. | |
| subreddit | No | Restrict search to this subreddit (omit 'r/' prefix). Omit for all subreddits. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses that no authentication is required and that data is sourced from PullPush API. However, it lacks details on rate limits, pagination, error handling, and data freshness. With no annotations, this is adequate but incomplete.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and front-loaded with the primary purpose. It uses multiple sentences but each serves a purpose. Could be slightly more streamlined, but it is clear and well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite lacking an output schema, the description lists the returned fields (score, comment count, author, etc.) and explains filtering and sorting. It covers the essential context for using the tool, though pagination and result ordering details are missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All 5 parameters have schema descriptions (100% coverage). The description adds value beyond the schema by explaining quoted phrases and boolean operators for 'q', clarifying defaults for 'sort' and 'limit', and detailing mode options. This enhances the schema's information.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool searches Reddit posts and/or comments by keyword, listing returned fields and filtering options. It is specific to Reddit, but does not explicitly differentiate from similar search tools like hn-search or social-intel.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions ideal use cases (sentiment analysis, trend detection, community research) but provides no explicit guidance on when not to use the tool or how it compares to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
regex-testerAInspect
Safe regex testing and extraction. Validates a pattern, finds all matches (with capture groups), replaces text, and explains what the pattern matches. Zero external API calls — instant, deterministic. Useful for agents generating or debugging regex patterns mid-task, validating extracted data, or transforming text with precision.
| Name | Required | Description | Default |
|---|---|---|---|
| flags | No | Regex flags: g (global), i (case-insensitive), m (multiline), s (dotAll), u (unicode). Default: 'g'. | |
| input | Yes | Text to test the pattern against. Max 50,000 chars. | |
| pattern | Yes | Regular expression pattern (without delimiters). Example: '(\\d{3})-?(\\d{4})' | |
| replace_with | No | Optional replacement string. Uses $1, $2, etc. for capture groups. When provided, returns replaced output. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the burden. It states zero external API calls, instant deterministic execution, and lists operations. It implies return of validation, matches, explanation, and replacement but does not explicitly describe the return structure or error handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph of about 80 words, efficiently front-loading the purpose and following with use cases. No redundant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema or annotations, the description covers purpose, behavior, and use cases well. It lacks explicit listing of return values and error scenarios, but the implied functionality is sufficient for most agents.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description adds minimal value. It repeats pattern without delimiters and default flags from schema but does not deepen understanding beyond what schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly identifies the tool as a regex tester and extractor, listing specific operations (validate, find matches, replace, explain). It distinguishes itself from the diverse sibling tools by focusing on regex operations, which are unique among them.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases for agents generating or debugging regex patterns, validating data, or transforming text. However, it does not mention when not to use the tool or compare it to alternatives, though siblings are unrelated.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
research-paper-searchAInspect
Academic paper search across 250M+ works via OpenAlex (free, no key). Returns top papers with title, authors, year, DOI, citation count, open-access status, and primary research topic. Covers all disciplines: AI/ML, medicine, physics, economics, law, biology, and more. Supports relevance, citation-count, and recency sorting; open-access filtering; and year-range constraints. Use for literature review, prior-art search, citation building, or finding the seminal papers in any field.
| Name | Required | Description | Default |
|---|---|---|---|
| sort | No | Sort order. relevant = OpenAlex relevance score; cited = most-cited first; recent = newest first. Default: relevant. | |
| limit | No | Number of papers to return (1–10). Default: 5. | |
| query | Yes | Search query. Natural-language or keyword (e.g. 'transformer attention mechanism', 'CRISPR gene editing cancer', 'bitcoin game theory'). | |
| min_year | No | Only return papers published this year or later (e.g. 2020). Optional. | |
| open_access_only | No | If true, only return open-access papers with freely available PDFs. Default: false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses read-only nature (search, free, no key), sorting/filtering options, and return fields. Does not mention rate limits or errors, but acceptable for a search tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four sentences, well-structured: purpose first, then return fields, then coverage, then use cases. No redundant wording, efficient front-loading.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but description enumerates return fields (title, authors, year, DOI, citation count, open-access status, topic). Mentions free API and coverage. Lacks pagination details or error handling, but sufficient for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions. The tool description paraphrases schema info (sort, filtering, year range) without adding new meaning. No extra detail beyond what schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is an academic paper search tool across 250M+ works via OpenAlex, listing return fields and disciplines. It distinguishes from siblings like legal-search or hn-search by specifying 'academic paper search' and 'all disciplines'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly suggests use cases: literature review, prior-art search, citation building, finding seminal papers. Does not provide when-not-to-use or alternatives, but the use cases are clear and adequate.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
research-synthesisAInspect
AI-synthesized intelligence report for any query — aggregates Hacker News, OpenAlex academic papers, Reddit, arXiv preprints, and DuckDuckGo in parallel, then distills into a structured report: executive summary, key findings, market sentiment, emerging trends, and recommendations. Useful for market research, competitive intelligence, technology landscape analysis, and due diligence. $0.200/call — 20% below nearest competitor. Up to 5 source categories queried per call.
| Name | Required | Description | Default |
|---|---|---|---|
| focus | No | Optional focus direction for synthesis (e.g. 'technical implementation details', 'market adoption', 'risks and challenges'). Narrows the analytical lens. | |
| query | Yes | Research query or topic. Be specific for best results (e.g. 'AI agent payment protocols 2025' rather than 'AI'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, but description compensates by detailing AI-synthesized, parallel aggregation, structured output, cost per call, and source category limit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four sentences that are concise, front-loaded with purpose, and include key details without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, description fully explains output structure, source count, pricing, and competitor comparison, meeting completeness needs for a 2-param tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage with descriptions, but description adds value by advising specificity for query and explaining focus parameter's role in narrowing analysis.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it aggregates multiple sources (Hacker News, OpenAlex, etc.) and distills into a structured report with specific sections (executive summary, key findings, etc.), distinguishing it from sibling tools like research-paper-search.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides explicit use cases (market research, competitive intelligence, etc.) and pricing context, but lacks explicit when-not-to-use or alternative tool suggestions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
roastBInspect
Witty AI roast of any target — person, company, product, code snippet, or concept. Returns 3-5 sentences of sharp, clever humor. Style: dry (default), savage, sarcastic, or gentle. 75% below anchor-x402.com/v1/roast.
| Name | Required | Description | Default |
|---|---|---|---|
| style | No | Roast style. Default: dry. | |
| target | Yes | What to roast: a name, company, product, code snippet, or brief description (max 500 chars). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description must disclose behavioral traits. It states output length and style options, but omits critical details like authentication requirements, rate limits, error handling, or whether the tool is idempotent. The cryptic '75% below anchor-x402.com/v1/roast' adds confusion without clear explanation.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is efficient, with three sentences covering purpose, output, and style. However, the final sentence about '75% below...' is unclear and somewhat distracting, slightly reducing conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity and lack of output schema, the description adequately explains target and output format. However, it does not mention error conditions, input validation, or exact return structure, leaving moderate gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema covers both parameters with descriptions. The description goes further by listing the available styles ('dry (default), savage, sarcastic, or gentle'), which is not in the schema's enum. This adds concrete value for parameter selection.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: generating a witty AI roast for any target. It specifies the nature of the output (3-5 sentences of humor) and mentions style options, distinguishing it from the diverse sibling tools which are mainly data or analysis oriented.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives, nor does it mention any prerequisites or restrictions (e.g., appropriate contexts, limitations of use). The agent is left to infer usage from the description alone.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
rss-readerAInspect
Fetches and parses any public RSS 2.0 or Atom 1.0 feed. Returns feed metadata (title, description, language, last updated) and structured items (title, link, 400-char summary, author, published date, GUID). Useful for monitoring news, blog posts, GitHub release notes, Reddit RSS, arXiv category feeds, or security advisories.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Public RSS or Atom feed URL to fetch. | |
| limit | No | Maximum number of items to return (default 20, max 100). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, so the description must cover behavioral traits. It explains read-only nature and return format with 400-char summaries. However, it omits potential issues like rate limits, error handling on invalid feeds, or authentication requirements.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with action and value. No redundant words: each sentence adds specific information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Without an output schema, the description thoroughly explains returned feed metadata and item structure. All required context for a simple fetch tool is present, given low complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described. The description adds no parameter-specific meaning beyond the schema, but overall context is provided. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Fetches and parses any public RSS 2.0 or Atom 1.0 feed.' with specific verbs and resource. It distinguishes from siblings like news-sentiment or reddit-intel by focusing on raw feed fetching.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists diverse use cases (news, blog posts, GitHub release notes, etc.), showing when to use it. It does not explicitly exclude scenarios or mention alternatives, but context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
sanctions-screeningAInspect
OFAC SDN sanctions screening — checks whether a person, company, vessel, or aircraft appears on the US Treasury Specially Designated Nationals list. Returns match score, sanctions program(s), and entity type. Covers ~19,000 entries including RUSSIA-EO14024, SDGT, IRAN, DPRK, TCO, and 30+ programs. Use for payment compliance, KYB/KYC, AML checks, and counterparty due diligence.
| Name | Required | Description | Default |
|---|---|---|---|
| name | Yes | Entity name to screen (person, company, vessel, or aircraft). Full name preferred; partial name also works. | |
| type | No | Optional type filter. Omit to search all types. | |
| limit | No | Maximum number of hits to return. Default 10, max 50. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses return values (match score, program, entity type) and mentions coverage, which is helpful. However, with no annotations, it does not state whether the tool is read-only, rate limits, or authentication requirements, leaving some behavioral traits unspecified.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences, each concise and informative. No redundancy; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers purpose, input parameters, output, and use cases. Without an output schema, it should detail return format more fully; it mentions match score, program, and entity type but not whether aliases or IDs are returned. Slightly incomplete but adequate for a simple screening tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All three parameters are described in the input schema with full coverage. The description adds only minimal extra context for the 'name' parameter ('full name preferred; partial also works'). Baseline 3 is appropriate since the schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'checks' and the specific resource 'OFAC SDN sanctions list', detailing what entities are screened (person, company, vessel, aircraft). It also specifies the output (match score, program, entity type) and use cases, making the purpose unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly lists use cases: 'payment compliance, KYB/KYC, AML checks, and counterparty due diligence.' It does not provide when not to use or alternatives, but given the tool's specificity and the absence of a similar sibling, this is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
sec-filing-intelAInspect
Real-time SEC EDGAR filing lookup by ticker or CIK. Returns company profile plus recent filings (form type, date, description, EDGAR URL). Supports 8-K (material events), 10-K/10-Q (earnings), Form 4 (insider trades), DEF14A (proxy), and all other EDGAR form types. Authoritative US government data, no API key, updated within minutes of filing.
| Name | Required | Description | Default |
|---|---|---|---|
| cik | No | SEC Central Index Key (CIK). Provide as numeric string or zero-padded 10-digit form. Use instead of ticker if ticker is unknown. | |
| limit | No | Max number of filings to return. Default 10, max 25. | |
| ticker | No | US public company ticker symbol (e.g. AAPL, MSFT, TSLA). Case-insensitive. Provide either ticker or cik. | |
| form_type | No | Filter results to a specific form type (e.g. 8-K, 10-K, 10-Q, 4, DEF14A, S-1). Case-insensitive. If omitted, returns all recent filings. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must bear the burden. It states real-time updates, authoritative source, no API key, but lacks detail on rate limits, error handling, or behavior when ticker/CIK is invalid. A 3 is appropriate given the missing behavioral disclosures beyond what is stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single paragraph, front-loaded with key information (real-time, by ticker/CIK), no fluff. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 4 parameters, no output schema, and no annotations, description adequately covers input but does not explain output structure or error cases. Could mention pagination or response format for better completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 4 parameters with 100% coverage. Description adds practical context: cik should be used when ticker unknown, limit defaults to 10 max 25, form_type is case-insensitive and omission returns all. This goes beyond schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly identifies the tool as an SEC EDGAR filing lookup by ticker or CIK, specifies the returned data (company profile, recent filings with details), and lists supported form types, distinguishing it from sibling tool 'sec-insider-trades' which is more specific.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for SEC filings retrieval and mentions flexibility in input (ticker or CIK) and filtering by form_type, but does not explicitly state when not to use this tool or suggest alternatives like sec-insider-trades for insider-trade-specific queries.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
sec-insider-tradesCInspect
SEC EDGAR Form 4 insider trading data for any US public company — shows recent insider buys, sells, awards, and exercises with shares, price, and post-transaction ownership. No API key. EDGAR primary source.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of recent Form 4 filings to retrieve and parse. Default 10, max 25. | |
| ticker | Yes | US stock ticker symbol. Examples: AAPL, NVDA, TSLA, MSFT. | |
| include_derivatives | No | Include derivative transactions (option exercises, conversions). Default true. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description bears full burden. It mentions 'No API key' and 'EDGAR primary source' but lacks details on rate limits, data freshness, pagination, or any side effects. The operation is read-only, but this is not explicitly stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, efficient and front-loaded with key purpose and source. However, it lacks structured formatting (e.g., bullet points) that could improve scanability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, so the description must explain return values. It lists fields (shares, price, ownership) and transaction types, providing a basic understanding. However, it does not detail exact output structure or field names, leaving some ambiguity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% description coverage for all three parameters. The description adds minimal value beyond schema by mentioning 'recent' transactions and giving examples of derivatives (awards, exercises), but does not directly explain how 'limit' controls recency or how 'include_derivatives' affects results.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves SEC EDGAR Form 4 insider trading data for US public companies, listing specific transaction types (buys, sells, awards, exercises) and data points (shares, price, ownership). However, it does not differentiate from the sibling tool 'insider-trades', which may cause ambiguity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool vs alternatives. The description implies it is for recent insider trades of a US public company, but does not specify limitations or compare to similar tools like 'insider-trades'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
sector-rotationAInspect
S&P 500 sector rotation: relative performance of all 11 GICS sectors (XLK XLF XLE XLV XLI XLY XLP XLB XLRE XLU XLC) vs SPY benchmark. Returns 1D, 5D, 1M, and 3M absolute and relative returns, a rotation signal per sector (LEADING, CATCHING_UP, FALLING_BEHIND, LAGGING), and 1M leadership ranking. Parameterizable: sort by any timeframe. Free Yahoo Finance source, no API keys. Use for sector allocation, macro-regime interpretation, or screening rotation momentum.
| Name | Required | Description | Default |
|---|---|---|---|
| rank_by | No | Timeframe to rank sectors by relative performance. Default: 1m. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full responsibility. It discloses: free Yahoo Finance source, no API keys needed, and the specific return metrics (1D, 5D, 1M, 3M returns, rotation signals, ranking). It does not mention rate limits, data freshness, or potential limitations, but for a non-destructive data retrieval tool, the disclosure is adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: two sentences with no fluff. The first sentence packs key details (sectors, benchmark, returns, signals, ranking), and the second adds parameter and usage context. Every word earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description fully explains what the tool returns (1D, 5D, 1M, 3M absolute and relative returns, rotation signal values like LEADING, and leadership ranking). It also mentions the parameter and source. For a tool with one optional parameter, this is complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% (one parameter 'rank_by' with description 'Timeframe to rank sectors by relative performance. Default: 1m.'). The tool description adds 'Parameterizable: sort by any timeframe,' which reinforces but does not add new meaning beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool's function: analyzing S&P 500 sector rotation by comparing relative performance of all 11 GICS sectors against the SPY benchmark. It specifies the verb ('sector rotation'), resource ('S&P 500 sectors'), and outputs (returns, rotation signals, ranking). This clearly distinguishes it from sibling tools like 'equity-brief' or 'market-movers'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear usage context: 'Use for sector allocation, macro-regime interpretation, or screening rotation momentum.' This informs when to use the tool, but it does not mention alternatives or when not to use it. Given the long list of sibling tools, explicit exclusions would be helpful, but the guidance is still strong.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
short-volume-intelAInspect
Daily FINRA consolidated short-sale volume for any US equity ticker: short volume, total volume, and short ratio (short/total) for the last N trading days. Useful for detecting crowded short positions and short-squeeze setups. Free FINRA CDN upstream, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| days | No | Number of recent trading days to fetch (1–10, default 5). | |
| ticker | Yes | US stock ticker symbol (e.g. AMD, GME, NVDA). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It indicates the data is from FINRA and free, but does not mention rate limits, error handling, or update frequency. Adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loaded with the core purpose and output, followed by use case and cost info. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description fully explains what the tool returns (short volume, total volume, short ratio) and the time range. For a simple data retrieval tool, this is complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters well-described. The description adds little beyond the schema (e.g., 'last N trading days' and 'any US equity ticker'), so it meets the baseline for high coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns daily FINRA short-sale volume for any US equity ticker, specifying the data (short volume, total volume, short ratio) and time window (last N days). It distinguishes itself from sibling tools by focusing on short volume analysis.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly mentions usefulness for detecting crowded short positions and short-squeeze setups, providing clear context. It also notes the free data source, but does not exclude scenarios or compare to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
solana-token-riskAInspect
Rug-pull and risk scanner for Solana SPL tokens. Input a mint address; returns mint/freeze authority status, top-10 holder concentration, liquidity depth, token age, and a composite safety score (0–100, higher = safer) with risk level (safe/moderate/risky/danger) and green/warning flags. Uses Solana public RPC and DexScreener — no API keys required. Useful for agents vetting a memecoin or new token before trading, swapping, or recommending.
| Name | Required | Description | Default |
|---|---|---|---|
| mint | Yes | Solana SPL token mint address (base58, 32–44 chars). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It discloses data sources (Solana public RPC, DexScreener), no API keys required, and lists specific checks (mint/freeze authority, holder concentration, etc.). However, it does not mention rate limits, error handling, or behavior for invalid inputs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with no waste. First sentence states purpose and output, second lists metrics, third adds data source and use case. Front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but description details return values (mint/freeze authority, holder concentration, liquidity depth, token age, composite score, risk level, flags). Missing error cases or edge conditions, but adequate for typical use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with one parameter 'mint' described as 'Solana SPL token mint address (base58, 32–44 chars)'. The description repeats 'Input a mint address' without adding substantial new meaning, so it meets baseline but does not exceed.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose as a 'Rug-pull and risk scanner for Solana SPL tokens'. It specifies the input (mint address) and output (risk metrics, composite score, flags), making it distinct from sibling tools like token-top-holders or evm-token-security.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions it's 'useful for agents vetting a memecoin or new token before trading, swapping, or recommending', implying when to use. However, it does not explicitly state when not to use or provide alternatives, leaving usage context somewhat implicit.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
solana-tx-explainerAInspect
Given a Solana transaction signature, returns a decoded breakdown: fee payer, programs invoked (Jupiter, Raydium, Pump.fun, SPL Token, etc.), SPL token balance changes with deltas, transaction fee in SOL and USD, block time, and a one-sentence human-readable summary. Uses public Solana mainnet RPC — no API key required. PROSPECTOR-class: 6 distinct wallets observed paying for this capability in organic x402 traffic. $0.07/call.
| Name | Required | Description | Default |
|---|---|---|---|
| signature | Yes | Solana transaction signature (base58, 87–88 characters). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Description discloses behavioral traits: uses public Solana mainnet RPC, no API key required, cost $0.07/call, and traffic observation. With no annotations provided, it carries full burden and does so thoroughly without contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is concise (three sentences) and front-loaded with key details. Every sentence adds value: what it does, output specifics, usage context, and cost. No fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has only one parameter, full schema coverage, and no output schema, the description fully compensates by detailing the return fields (fee payer, programs, token changes, fee, block time, summary). It is complete and self-sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema has full coverage (100%) with one parameter 'signature' described as base58 87-88 characters. Description adds context beyond schema by specifying the exact format and length, which aids correct invocation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it decodes Solana transactions given a signature, listing specific outputs (fee payer, programs, token changes, fee, block time, summary). Differentiates from siblings like 'tx-explainer' and 'tx-intel' by specifying Solana and the detailed breakdown.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implicitly states when to use: when you need a decoded Solana transaction breakdown. It does not explicitly provide when-not-to-use or alternatives, but the context is clear enough for an agent.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
solar-intelAInspect
Solar irradiance analysis and 7-day forecast for any location. Returns GHI (global horizontal irradiance), DNI (direct normal), DHI (diffuse), peak sun hours, cloud cover, sunrise/sunset, and panel yield estimate for a 1 kW system. Useful for rooftop solar feasibility, agricultural planning, EV charging optimization, and energy market analysis. Free upstream: Open-Meteo (no API key, no rate limits). Undercuts stableenrich.dev/api/google-maps/solar by 31%.
| Name | Required | Description | Default |
|---|---|---|---|
| latitude | No | Latitude in decimal degrees (-90 to 90). Use with longitude instead of location name. | |
| location | No | City, region, or address (e.g. 'Phoenix AZ', 'London', 'Sydney Australia'). Use this OR latitude+longitude. | |
| longitude | No | Longitude in decimal degrees (-180 to 180). Use with latitude instead of location name. | |
| forecast_days | No | Number of forecast days (1–16). Default 7. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses that the tool is free and has no rate limits, adding some transparency. However, it does not describe the data source (Open-Meteo), potential error behavior, or confirm it is read-only.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise: three sentences covering purpose, outputs, use cases, and a competitor mention. Information is front-loaded, and every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description explains what the tool returns (list of variables) and covers use cases. Without an output schema, it adequately sets expectations, though details on forecast granularity (hourly/daily) are missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with clear parameter descriptions. The tool description adds value by explaining what the tool returns and use cases, but does not provide additional parameter-level guidance beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: solar irradiance analysis and 7-day forecast. It lists specific outputs (GHI, DNI, DHI, etc.) and distinguishes itself from a competitor API, providing clear differentiation among sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists multiple use cases (rooftop solar feasibility, agricultural planning, etc.) and highlights that it is free with no API key or rate limits. However, it does not explicitly mention when to avoid using this tool or name sibling alternatives like 'weather'.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
sports-predictionAInspect
Returns today's (or a given date's) sports games with team win-loss records, venue, scheduled time, and live score. Supports MLB, NBA, NFL, NHL, NCAAF, NCAAB. Sourced from ESPN public API — no key required. $0.005/call. Use before prediction-markets or sports-content tasks to get accurate team context.
| Name | Required | Description | Default |
|---|---|---|---|
| date | No | Date in YYYY-MM-DD format (default: today UTC). Use for historical or upcoming schedules. | |
| team | No | Optional filter: team name or abbreviation (case-insensitive substring match). E.g. 'Cubs', 'CHC', 'Lakers'. | |
| sport | Yes | League code: mlb | nba | nfl | nhl | ncaaf | ncaab |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses the data source (ESPN public API), no key requirement, and cost per call. However, it omits details on rate limits, pagination, or error handling, which are important for agents.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is brief yet informative, with two sentences covering core functionality and a note on source/cost. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description fails to explain the return format or potential errors. It provides a list of fields but lacks details on ordering, pagination, or response structure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 100% coverage, and the description adds useful context such as date format default, case-insensitive team filtering, and league codes. This enhances understanding beyond the schema alone.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns sports games with team records, venue, time, and score for a given date, supporting multiple leagues. However, it does not explicitly differentiate from the sibling tool 'sports-scores', which may cause confusion.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description recommends using this tool before prediction-markets or sports-content tasks, providing clear context. It also mentions the cost per call, but does not specify when to avoid using it or alternative tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
sports-scoresAInspect
Live and recent sports scores for NBA, NFL, MLB, NHL, MLS, EPL, La Liga, Bundesliga, Serie A, Champions League, and more. Returns game status, current score, venue, period/clock, and broadcast info. Uses ESPN's free public scoreboard API. Optional date filter (YYYYMMDD) for historical or upcoming schedule.
| Name | Required | Description | Default |
|---|---|---|---|
| date | No | Date in YYYYMMDD format (e.g. '20240115'). Defaults to today. | |
| limit | No | Max games to return (default 10, max 30). | |
| league | No | League code. Default: 'nba'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses use of ESPN's free public API and enumerates return fields, but lacks information on rate limits, authentication, caching, or error behavior. No annotations available.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words, front-loading key information about leagues and return data.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Provides sufficient context for a scores retrieval tool given no output schema and no annotations. Could mention error handling or pagination but adequate overall.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage with descriptions. Description adds no additional parameter semantics beyond restating the date filter; baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it returns live and recent sports scores for multiple major leagues, listing specific leagues and return fields. Distinguishes from sibling tool 'sports-prediction'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Mentions optional date filter for historical/upcoming schedules and the underlying API source. However, no explicit when-not-to-use or mention of sibling alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
ssl-certAInspect
Inspects the TLS/SSL certificate of any HTTPS host. Returns validity window (not-before, not-after, days remaining), issuer (CA name, organization), subject (CN), Subject Alternative Names, SHA-256 fingerprint, serial number, and an expiry status (HEALTHY/OK/WARNING/CRITICAL/EXPIRED). Useful for monitoring certificate expiry, auditing TLS configuration, and verifying SAN coverage.
| Name | Required | Description | Default |
|---|---|---|---|
| host | Yes | Hostname or IP address to inspect (e.g. 'github.com', 'api.example.com'). | |
| port | No | TLS port number (default: 443). | |
| servername | No | SNI server name override (defaults to 'host'). Useful when connecting to an IP that hosts multiple domains. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It correctly implies a read-only inspection but does not disclose potential side effects, rate limits, timeout behavior, or any prerequisites (e.g., network connectivity). The description is straightforward but lacks depth beyond basic functionality.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is two sentences: first defines core functionality, second enumerates return fields and use cases. It is front-loaded and efficient, though slightly dense. Could benefit from bullet points or clearer separation.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately lists return fields (validity window, issuer, subject, SAN, fingerprint, serial, expiry status). It covers key use cases. However, it omits details about error handling or what happens on connection failure.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for all three parameters (host, port, servername). The description adds no additional parameter meaning beyond what the schema already provides, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states the tool inspects TLS/SSL certificates and lists all returned fields. It distinguishes itself from networking siblings like dns-lookup, ping, http-headers by focusing specifically on certificate inspection.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description provides explicit use cases: monitoring certificate expiry, auditing TLS configuration, and verifying SAN coverage. It implies when to use but does not mention when not to use or alternative tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
stablecoin-watchAInspect
Real-time depeg monitor for top USD stablecoins (USDT, USDC, DAI, USDS, and others ranked by market cap). Returns current price, peg deviation %, depeg status (PARITY / MILD_DEPEG / MODERATE_DEPEG / SEVERE_DEPEG), supply trend, and a composite alert level (GREEN / YELLOW / ORANGE / RED). Sourced from DeFiLlama public API — no key required, updates every call. Useful for DeFi risk management, collateral health checks, and pre-trade regime detection.
| Name | Required | Description | Default |
|---|---|---|---|
| top_n | No | Number of top stablecoins to return, ranked by circulating supply. Default 20, max 50. Ignored if symbol is specified. | |
| symbol | No | Filter to a specific stablecoin symbol (e.g. USDT, USDC, DAI). Case-insensitive. If omitted, returns top coins by market cap. | |
| alert_only | No | If true, only return coins with MILD_DEPEG or worse. Useful for alert-driven workflows. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description discloses key behavioral traits: data source (DeFiLlama public API), no API key required, updates every call, and output components (price, peg deviation %, depeg status, supply trend, alert level). It does not mention rate limits or failure modes, but the disclosed details are sufficient for basic usage.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, front-loading purpose and output in the first sentence, and adding data source and use cases in the second. No wasted words, every sentence serves a purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers input parameters, output fields, data source, and suggested use cases. No output schema exists, but the description lists return fields clearly. It lacks details on error handling or edge cases, but for a simple monitoring tool, it is sufficiently complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with adequate parameter descriptions. The tool description does not add new information beyond the schema; it only restates that omitting symbol returns top coins. Since the schema already includes this, no extra value is added, matching the baseline of 3.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose as a real-time depeg monitor for top USD stablecoins, specifying the action (monitor), resource (stablecoins), and output fields. It distinguishes itself from sibling tools like crypto-pulse or defi-market-pulse by focusing specifically on stablecoin depeg detection.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides use cases: 'DeFi risk management, collateral health checks, and pre-trade regime detection.' This gives context but does not explicitly exclude alternatives or compare with sibling tools. However, the specificity of the tool makes its usage clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
stock-briefAInspect
US equity snapshot + Limitless prediction market sentiment in one call. Returns current price, intraday change, volume, 52-week range for any NYSE/NASDAQ ticker, plus any active Limitless prediction markets matching the ticker keyword (direction bets, price level markets). Single-call alternative to separate limitless-markets + us-stock-price calls. Free upstream — no API key required. $0.015/call.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. AMD, AAPL, NVDA, TSLA). Case-insensitive. | |
| market_limit | No | Max Limitless prediction markets to return (1–10, default 5). Markets are filtered by ticker keyword. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries the full burden. It discloses the return data, cost ($0.015/call), and that no API key is required. It does not cover rate limits, pagination, or error behavior, but for a read-only snapshot tool, this is minimally adequate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, dense sentence that front-loads the purpose and immediately follows with the returned data and key differentiator. It includes cost and API key requirement with no wasted words. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the moderate complexity (2 params, no output schema, no annotations), the description covers outputs, cost, and differentiation. It lacks details on data freshness and market matching logic, but is otherwise fairly complete for a snapshot tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already fully documents both parameters. The description adds marginal value by mentioning ticker keyword matching and the types of prediction markets (direction bets, price level markets), but does not significantly extend beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns a US equity snapshot and Limitless prediction market sentiment in one call. It lists specific data (price, change, volume, 52-week range) and differentiates itself as a single-call alternative to the separate limitless-markets and us-stock-price sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context for when to use this tool (when both equity snapshot and prediction markets are needed) and explicitly frames it as an alternative to making separate calls. It does not include specific when-not scenarios, but the context is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
stock-ohlcvAInspect
Returns historical OHLCV (open/high/low/close/volume) candlestick data for a stock, ETF, or index. Supports intervals from 1-minute to monthly and ranges from 1 day to max history. Use for chart analysis, trend detection, and quantitative backtesting. $0.010/call — free upstream, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Max candles to return (1–500, default 60). Applied from most recent. | |
| range | No | Lookback period. Default: '1mo'. Note: intraday intervals cap at 60d max. | |
| ticker | Yes | Stock ticker symbol (e.g. 'AAPL', 'SPY', 'BTC-USD', '^VIX'). Case-insensitive. | |
| interval | No | Candlestick interval. Intraday ('1m'–'1h') limited to last 60 days. Default: '1d'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It mentions pricing but does not disclose behavioral traits like read-only nature, error handling, or rate limits. For a data retrieval tool, this is insufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences: defines function, lists features, mentions use cases and cost. No wasted words, front-loaded, and easy to scan.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, so description should explain return format (e.g., array with date, O, H, L, C, V). It only states 'returns... data' without structure. Incomplete for a 4-param tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so parameters are well-documented in the schema. The description adds context about interval and range limits (e.g., intraday cap at 60 days) but largely repeats schema info. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns historical OHLCV candlestick data for stocks, ETFs, or indices, which is a specific verb and resource. It distinguishes from sibling tools like stock-brief or stock-price-multi by focusing on historical candlestick data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists use cases such as chart analysis, trend detection, and quantitative backtesting, providing clear context. However, it does not explicitly compare with alternatives or state when not to use it.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
stock-price-multiAInspect
Returns current US equity prices for up to 5 tickers in one call — STRC, AMD, MSTR, SLV, USO, or any NYSE/NASDAQ symbol. Each call returns price, change %, volume, day range, and 52-week range per ticker. Sourced from Yahoo Finance public data, no API key. A single $0.018 call replaces 3-5 separate blockrun.ai queries that total $0.066–$0.220.
| Name | Required | Description | Default |
|---|---|---|---|
| tickers | Yes | Up to 5 US stock ticker symbols (e.g. ["STRC","AMD","MSTR"]). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description covers key behavioral aspects: data source (Yahoo Finance), no API key required, case-insensitive tickers, and returned fields. It omits details about error handling for invalid tickers, rate limits, or data latency, but for a simple price query it is reasonably transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with four sentences, each adding value: purpose, return fields, data source, and cost comparison. It is front-loaded with the core functionality. No redundant or vague statements.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description explains the return data (price, change %, volume, day range, 52-week range). It covers constraints (max 5 tickers, case insensitivity), source, and missing API key. It is sufficiently complete for an agent to decide and invoke the tool, though it lacks details on response format or error scenarios.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% as the parameter 'tickers' is described in both schema and description. The description adds examples and clarifies the 5-ticker limit and case insensitivity, which are also in the schema. Thus it adds minimal extra meaning beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns current US equity prices for up to 5 tickers in one call, with specific example tickers and data fields (price, change %, volume, day range, 52-week range). This distinguishes it from sibling tools like us-stock-price (likely single ticker) by emphasizing the batch capability.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description indicates when to use this tool: for querying up to 5 US stock prices simultaneously. It highlights cost savings compared to separate queries. However, it does not explicitly state when not to use it or name alternative sibling tools, leaving some ambiguity if only one ticker is needed.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
strategy-signalBInspect
Technical analysis signal for any US equity, ETF, or crypto. Returns RSI(14), MACD(12/26/9), Bollinger Bands(20), volume trend, directional posture (STRONG_BUY/BUY/NEUTRAL/SELL/STRONG_SELL), and key price levels. Richer output than comparable services at $0.090. Free upstream: Yahoo Finance (equities), CoinGecko (crypto).
| Name | Required | Description | Default |
|---|---|---|---|
| symbol | Yes | Ticker or crypto symbol. Examples: AAPL, SPY, QQQ, BTC, ETH, SOL, NVDA. | |
| include_bars | No | If true, include last 5 OHLCV bars in response. Default false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must convey behavioral traits. It discloses the output indicators, pricing ($0.090 per use), and upstream data sources (free). However, it omits important details such as rate limits, authentication requirements, or any destructive side effects. The cost and source info add value but do not fully cover the expected behavioral profile.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise with three sentences that progress logically: purpose, outputs, and additional context (pricing, sources). It front-loads the key information and avoids unnecessary verbosity, though the pricing detail could be seen as tangential.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's moderate complexity and lack of output schema, the description provides a reasonable overview of inputs and outputs but lacks details on response format, interpretation of directional posture, or threshold conditions. It is sufficient for a basic understanding but not fully self-contained.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the input schema already describes parameters adequately. The description adds nothing new to parameter semantics beyond listing output indicators; it does not clarify parameter usage further. Thus, it meets the baseline expectation for a well-covered schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides technical analysis signals for US equities, ETFs, and crypto, listing specific indicators (RSI, MACD, Bollinger Bands, etc.). This makes the tool's purpose unambiguous. However, it does not differentiate itself from sibling tools like 'equity-technicals', which may offer similar functionality, so it lacks explicit sibling differentiation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use when technical analysis is needed but provides no explicit guidance on when to use this tool versus alternatives. It mentions pricing and data sources but does not state when not to use or compare with similar tools like equity-technicals. Usage context is vague.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
timezoneAInspect
Timezone intelligence using the IANA database (418 zones) built into Node.js. Returns current local time, UTC offset, DST status, and the long timezone name for any IANA timezone. Can also convert an ISO timestamp to one or more target timezones. Useful for scheduling agents, global operations, and time-aware data enrichment. Zero external API calls — instant response.
| Name | Required | Description | Default |
|---|---|---|---|
| search | No | Return a list of timezone names matching this substring (e.g. 'America', 'Paris'). Use to discover valid timezone identifiers. | |
| timezone | No | IANA timezone name to look up (e.g. 'America/Chicago', 'Europe/London', 'Asia/Tokyo'). Omit to return UTC. | |
| timezones | No | Batch: list of IANA timezone names (max 20). If provided, 'timezone' is ignored. | |
| convert_from_iso | No | ISO 8601 timestamp to convert (e.g. '2026-06-06T15:00:00Z'). If omitted, uses current UTC time. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses behavior: uses Node.js built-in IANA database, zero external API calls, instant response, and specific output fields. It also explains the two modes (lookup and conversion).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Four sentences, each providing essential information: purpose, outputs, conversion capability, and performance characteristic. No filler, well-organized.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description sufficiently explains return values and covers all functions (single lookup, batch, search, conversion). Users can understand what to expect.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds meaningful context: 'search' for discovery, 'timezone' vs 'timezones' for batch, and 'convert_from_iso' for conversion. This enhances understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides timezone intelligence using the IANA database, listing specific outputs (current local time, UTC offset, DST status, long name). It also mentions conversion of ISO timestamps. No sibling tool covers timezone, so differentiation is implicit.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases: 'scheduling agents, global operations, and time-aware data enrichment.' It does not explicitly state when not to use, but given the tool's uniqueness and clear scope, guidance is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
token-top-holdersAInspect
Returns top holders for any Ethereum ERC-20 token (by contract address), with concentration metrics. Includes each holder's share%, human-readable balance (with decimal conversion), and whale analysis. Reports: top-10 concentration, whale count (>1% holders), Herfindahl-Hirschman Index for concentration risk. Use to assess token distribution health, identify institutional-grade holders, or screen for rug-pull risk before entering a position.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of top holders to return. Default 50, max 100. | |
| token_address | Yes | Ethereum ERC-20 contract address (0x-prefixed). Example: '0xA0b86991c6218b36c1d19D4a2e9Eb0cE3606eB48' for USDC. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden for behavioral disclosure. It describes outputs but does not mention rate limits, data freshness, or potential side effects. The behavioral traits beyond output specification are missing.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loading the main purpose, detailing outputs, and concluding with use cases. No redundant information; every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite lacking an output schema, the description covers key return elements: share%, balance, whale analysis, concentration metrics. It could benefit from mentioning default limit and pagination, but overall is sufficient for a data retrieval tool with two parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds value by explaining output features like human-readable balance with decimal conversion, but does not significantly enhance parameter meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns top holders for Ethereum ERC-20 tokens with concentration metrics, using specific verbs and resources. It differentiates from siblings by focusing on token distribution health and rug-pull screening.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit use cases such as assessing token distribution health, identifying institutional holders, and screening for rug-pull risk. However, it does not mention when not to use or list direct alternatives among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
treasury-auction-calendarAInspect
Returns upcoming US Treasury auction schedule (Bills, Notes, Bonds, TIPS, FRNs) from TreasuryDirect.gov. Filter by security type and look-ahead window. Shows auction date, issue date, term, offering amount, coupon rate, and reopening status. Official government data, no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| type | No | Security type filter: 'all', 'bill', 'note', 'bond', 'tips', 'frn', 'cmb'. Default: 'all'. | |
| limit | No | Max number of auctions to return (default: 20, max: 100). | |
| days_ahead | No | How many calendar days ahead to include auctions for (1–365). Default: 30. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It states official government data, no API key, and lists return fields. It is a read-only tool, no destructive actions. However, it does not disclose potential rate limits, data freshness, or error behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with key purpose and source, then details filters and output, then confirms official data. No unnecessary words, efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple data retrieval tool with 3 optional parameters and no output schema, the description covers purpose, source, filters, output fields, and authentication. It is mostly complete, though it could mention data freshness or refresh interval. The schema covers parameter limits.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100% with each parameter having a description (type, limit, days_ahead). The tool description only adds 'Filter by security type and look-ahead window,' which reinforces the schema but does not provide additional semantic value beyond what is already in schema definitions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns upcoming US Treasury auction schedule from TreasuryDirect.gov, listing specific security types (Bills, Notes, etc.) and output fields. It distinguishes itself from siblings like treasury-yields by focusing on auction schedule rather than yields.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for retrieving auction schedules by security type and date range, but does not explicitly mention when not to use it or compare with alternatives like treasury-yields or economic-calendar. The context is clear but lacks exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
treasury-yieldsAInspect
Returns current US Treasury yield curve at 3M, 5Y, 10Y, and 30Y nodes from CBOE interest-rate indices (free, no API key). Includes 10Y-3M spread and curve shape classification. Essential for DCF discount rates, bond pricing, and recession signal monitoring.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations exist, and the description mentions it's free and no API key, but lacks details on data freshness, rate limits, or limitations. It covers basic traits but is not fully transparent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences: what it returns, additional outputs, and use cases. No fluff, front-loaded information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no parameters and no output schema, the description covers the essential aspects (inputs, source, outputs, use cases), though more detail on output format would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The tool has no parameters, so schema coverage is 100% by default. Description adds no parameter info, but baseline for 0 parameters is 4.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns US Treasury yield curve at specific nodes from CBOE indices, and distinguishes from other financial data tools by specifying the exact resource and additional outputs like spread and classification.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear use cases (DCF, bond pricing, recession monitoring) implying when to use it, but does not explicitly mention when not to use or compare with alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
tx-explainerAInspect
Given a transaction hash and chain, returns a decoded breakdown: sender, recipient, ETH value transferred, gas used, transaction fee, decoded method name (transfer/approve/swap/deposit/etc.), ERC-20 token transfer details if applicable, block number, block timestamp, and a one-sentence agent-readable summary. Supports Ethereum (default), Base, Polygon, and Arbitrum mainnet. Uses free public JSON-RPC nodes — no API key required, results in ~1-2s.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Chain to query. Default: ethereum. | |
| tx_hash | Yes | Transaction hash — 0x-prefixed, 66 characters total. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Description discloses use of free public JSON-RPC nodes, no API key required, and expected response time (~1-2s). However, no annotations exist, so it carries the full burden. It does not address potential limitations (e.g., rate limits, transaction confirmation status, error handling).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, well-structured paragraph that front-loads the primary purpose, lists returned fields, and adds technical details. Every sentence adds value, with no redundant or vague language.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's moderate complexity and lack of output schema, the description is fairly complete: it lists all returned fields, supported chains, and technical constraints. It could mention error behavior for invalid inputs or unsupported chains, but overall it covers essential aspects.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters (tx_hash and chain). The description adds value by listing supported chains beyond the schema's default note, but does not substantially enhance parameter understanding beyond what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool decodes a transaction given a hash and chain, listing all returned fields (sender, recipient, value, gas, fee, method, token transfers, block info, summary). It distinguishes itself from siblings like 'tx-intel' and 'solana-tx-explainer' by specifying supported EVM chains and detailed breakdown.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description specifies supported chains (Ethereum, Base, Polygon, Arbitrum) and default Ethereum, along with tech details (free public nodes, no API key, ~1-2s). This provides clear context for when to use, but lacks explicit when-not-to-use or alternative tool comparisons.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
tx-intelAInspect
Decode and explain any EVM transaction — in one x402 payment. Returns: transaction status, type (ETH transfer / ERC20 transfer / swap / approval / contract call), human-readable summary, token transfers parsed from logs, gas cost, block context, and explorer URL. Collapses the observed tx-explainer + onesource/block agent seam (18 wallets, 8-day persistence). Supports Base (default), Ethereum, Arbitrum, Optimism, Polygon, Avalanche, BSC. Free upstream: DRPC + public RPC nodes. $0.006/call — 40% below the x402.ottoai.services tx-explainer.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | EVM chain. Default: base. | |
| tx_hash | Yes | Transaction hash (0x-prefixed, 66 chars). | |
| include_block_context | No | Include block-level context (base_fee, tx_count, miner). Collapses the onesource/chain/block seam agents use after tx-explainer. Default: true. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full behavioral burden. It explains what the tool does and what it returns, covers pricing, and notes free upstream nodes. It does not disclose read-only nature or rate limits, but the explanation is fairly transparent for a decoding tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with purpose and outputs, then covers chain support and pricing. Every sentence adds value, though mentions like '18 wallets, 8-day persistence' add marginal detail. Could be slightly more concise.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 3 parameters, no output schema, and no annotations, the description fully covers what the tool returns, how it works, supported chains, and cost. An agent has sufficient context to select and invoke correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the description adds minimal value beyond existing parameter descriptions. It does mention supported chains for the 'chain' parameter and context about 'include_block_context' collapsing seams, but overall does not significantly enhance understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool decodes and explains any EVM transaction, listing specific outputs (status, type, summary, token transfers, gas cost, etc.). It differentiates from siblings by noting it collapses the tx-explainer and onesource/block agent seam.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists supported chains and mentions collapsing multiple agents, but does not explicitly state when to use this tool versus alternatives like 'tx-explainer' or 'block-intel'. Only implicit via the seam collapse comment.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
unit-converterAInspect
Converts between 100+ units across 12 categories: length, weight, temperature, volume, speed, area, energy, pressure, data, time, angle, frequency. Handles mixed-case inputs (km, KM, Km all work). Returns the converted value plus all common units in the same category. Zero external calls — pure math.
| Name | Required | Description | Default |
|---|---|---|---|
| to | No | Target unit. If omitted, returns all units in the same category. | |
| from | Yes | Source unit (e.g. 'kg', 'mi', 'f', 'kwh', 'mph', 'gb', 'psi'). Case-insensitive. | |
| value | Yes | Numeric value to convert. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the burden. It states 'zero external calls' and case-insensitivity, but does not disclose potential limitations like value range or precision. Lacks completeness for safety.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with no redundancy. Key information is front-loaded: purpose, scope, input tolerance, return behavior, and performance guarantee.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema exists, but description explains return format (converted value plus common units). Could mention error handling or precision, but sufficiently covers usage intent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but description adds: 'Handles mixed-case inputs' and behavior when 'to' is omitted (returns all units). This enriches understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool converts between 100+ units across 12 categories, listing them. It clearly distinguishes from siblings (no other unit converters) and uses specific verbs 'converts' and 'returns'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies when to use (conversion needed) and mentions that omitting 'to' returns all units. However, it does not explicitly contrast with alternatives, though none exist among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
usgs-earthquakeBInspect
Real-time global earthquake events from USGS. Filter by magnitude, depth, location radius, or time window. Returns location, magnitude, depth, PAGER alert, and USGS event URL. Free primary source — no API key.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Maximum number of events to return (1–100). Default 20. | |
| latitude | No | Center latitude for radius search (decimal degrees). Requires longitude and radius_km. | |
| days_back | No | How many days of events to fetch (1–30). Default 7. | |
| longitude | No | Center longitude for radius search (decimal degrees). Requires latitude and radius_km. | |
| radius_km | No | Search radius in km around lat/lon. Max 20001. Default 500 when lat/lon provided. | |
| min_magnitude | No | Minimum Richter magnitude to include. Default 4.5. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It mentions 'Real-time' and 'Free primary source — no API key,' but lacks details such as rate limits, data update frequency, or any constraints beyond what is in the schema. The description does not disclose behavioral traits like pagination or error handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three concise sentences with no fluff. It front-loads the main purpose and quickly covers filters, return fields, and source authenticity. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no annotations and no output schema, the description should provide more context. It lists return fields but not their structure or format. It does not mention error handling, rate limits, or realistic usage scenarios for the 6 parameters. The description feels incomplete for a tool with moderate complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the baseline is 3. The description adds minimal meaning beyond the schema by listing filter types (magnitude, depth, etc.) but does not elaborate on parameter behavior or provide examples.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides real-time global earthquake events from USGS, mentions filtering capabilities and return fields. However, it does not differentiate from sibling tool 'earthquake-intel', which may have a similar purpose.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage by listing filter options ('Filter by magnitude, depth, location radius, or time window') but does not explicitly state when to use this tool versus alternatives, nor does it provide any exclusions or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
us-stock-priceBInspect
Returns current US equity price and intraday metrics (change %, volume, day high/low, 52-week range) for any NYSE/NASDAQ ticker. Sourced from Yahoo Finance public data — no API key, live during market hours. Priced at $0.018, 22% below blockrun.ai's $0.023/call for the same data.
| Name | Required | Description | Default |
|---|---|---|---|
| ticker | Yes | US stock ticker symbol (e.g. AMD, AAPL, NVDA). Case-insensitive. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses that data is from Yahoo Finance public data and live during market hours, but lacks details on rate limits, error handling, or behavior for invalid tickers. Description adds moderate transparency but omits important behavioral traits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise: two sentences with no wasted words. The first sentence immediately states the core functionality, and the second adds source and pricing. Perfectly front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the simple interface (one parameter, no output schema, no annotations), the description is somewhat incomplete. It does not specify the return format, list all metrics precisely, or clarify behavior outside market hours. Pricing info is included but may be irrelevant for AI agent decisions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with a clear description for the 'ticker' parameter (case-insensitive, examples). Baseline is 3; the description does not add additional parameter information beyond the schema, so no extra credit.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns current US equity price and intraday metrics for NYSE/NASDAQ tickers, using a specific verb ('Returns') and resource ('US equity price'). It specifies data source and pricing, but does not explicitly differentiate from sibling tools like 'stock-brief' or 'equity-brief' that may offer similar data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. Mentions 'live during market hours' and pricing, but does not advise on when not to use it or recommend sibling tools for different needs.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
wallet-balanceAInspect
Returns the native token balance (ETH, POL, BNB) for any EVM wallet address. Supports Ethereum, Base, Polygon, Arbitrum, Optimism, and BSC. Includes optional USD value. $0.002/call — 60% below comparable market rate. Free upstreams: DRPC + CoinGecko, no API key required.
| Name | Required | Description | Default |
|---|---|---|---|
| address | Yes | EVM wallet address (0x-prefixed, 42 characters). | |
| network | No | Chain to query. Default: ethereum. | |
| with_usd | No | If true, fetches live USD price and returns USD value. Default: false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description discloses cost, upstream sources, and no API key requirement, but lacks information on rate limits or error handling.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three efficient sentences, front-loaded with primary purpose. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Mostly complete for a simple balance tool; missing return format details but sufficient for expected use. No output schema to supplement.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%; description repeats some parameter info (e.g., networks, USD toggle) but adds minimal beyond schema definitions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool returns native token balance for EVM wallets, listing specific chains and optional USD value. Distinct from siblings like ens-lookup or eth-block.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implicitly guides use by listing supported chains and optional features. Does not explicitly mention when not to use or compare to alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
wallet-credit-scoreAInspect
Composite credit score (0–100) for any EVM wallet. Aggregates Ethereum and Base transaction count, account balance, USDC holdings, and multi-chain footprint into a single trustworthiness score with tier label (PRIME / ESTABLISHED / ACTIVE / SPARSE / DORMANT). Use before sending payments, routing agent transactions, or assessing counterparty risk in automated workflows. Priced 46% below x402node.dev equivalent.
| Name | Required | Description | Default |
|---|---|---|---|
| address | Yes | EVM wallet address (0x…, 40 hex chars). Checksummed or lowercase accepted. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Discloses the aggregation of specific metrics and output tier labels, but omits error handling, data freshness, or behavior for inactive wallets.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Concise with two functional sentences; the pricing note is marginally useful but could be considered extraneous for tool selection.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Adequately covers input, use cases, and output format despite lacking output schema or annotations; still leaves minor gaps on error states.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The single parameter 'address' is well-described in the schema (100% coverage); the description adds no further semantic value beyond reaffirming EVM compatibility.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it computes a composite credit score (0–100) for EVM wallets using multiple data sources, distinguishing it from sibling tools like wallet-balance or wallet-screener.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly recommends use for payments, routing, and counterparty risk assessment, but lacks explicit exclusion for alternative scenarios.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
wallet-screenerAInspect
Risk screening for EVM wallet addresses. Returns a 0–100 risk score and individual flags: sanctions (OFAC/other), phishing activity, cybercrime, money laundering, darkweb transactions, mixer usage, stolen funds, fake KYC, and 12 more categories. Sourced from GoPlusLabs (free, no key, chain_id optional — defaults to checking cross-chain). Use before sending funds to an unknown address, before accepting a payment, or when validating a counterparty wallet in a DeFi workflow. Distinct from evm-token-security (which screens TOKEN contracts — this screens WALLETS).
| Name | Required | Description | Default |
|---|---|---|---|
| address | Yes | Wallet address to screen (EVM hex address, e.g. '0x1234...'). Does not need to be a contract — any wallet address. | |
| chain_id | No | Chain ID for context (optional, numeric or chain name). If omitted, GoPlus checks cross-chain records. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden. It discloses the data source (GoPlusLabs, free, no key), output format (score and flags), and optional chain_id behavior. Missing some details like error handling on invalid addresses, but substantial transparency is achieved.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and well-structured, front-loading the purpose. Every sentence adds useful information without redundancy. It fits within a few lines, making it easy for the agent to parse quickly.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the absence of an output schema, the description thoroughly explains return values (risk score and individual flags) and lists many flag categories. It also covers usage context and data source, making the tool self-contained for an AI agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with descriptions for both parameters. The description adds value by clarifying that address does not need to be a contract, and explaining the default cross-chain behavior when chain_id is omitted. It contextualizes the parameters beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool screens EVM wallet addresses for risk, returning a score and detailed flags. It explicitly distinguishes from the sibling tool evm-token-security, which screens token contracts. The verb and resource are specific and unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit usage scenarios: before sending funds, accepting payments, or validating a counterparty wallet in DeFi. It also contrasts with evm-token-security, guiding the agent on when to use this tool versus alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
weatherAInspect
Current weather conditions and 7-day daily forecast for any location worldwide. Input a city name, coordinates, or address. Returns temperature (°C), humidity, wind speed, precipitation, weather code, and a forecast with high/low temps and precipitation totals. Free upstream: Open-Meteo (no API key, no rate limits). Useful for DeFi agents tracking energy markets (cold → heating demand → gas prices), agricultural commodity prediction markets, or natural disaster risk assessment for on-chain insurance.
| Name | Required | Description | Default |
|---|---|---|---|
| latitude | No | Latitude in decimal degrees (-90 to 90). Use with longitude instead of location name. | |
| location | No | City name, region, or address (e.g. 'New York', 'London UK', 'Tokyo'). Use this OR latitude/longitude. | |
| longitude | No | Longitude in decimal degrees (-180 to 180). Use with latitude instead of location name. | |
| forecast_days | No | Number of forecast days (1–16). Default 7. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description fully discloses behavioral traits: it mentions the free upstream service Open-Meteo ('no API key, no rate limits'), which is critical for usage. It also describes the output fields (temperature, humidity, etc.). No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences with no wasted words. It front-loads the core function in the first sentence, lists returned data in the second, and adds use cases and upstream info in the third. Every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, the description adequately covers what is returned (temperature, humidity, wind speed, etc.) and the forecast details. It also explains the forecast_days parameter default. The use cases provide context for agent decision-making. No obvious gaps.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema already covers 100% of parameters with descriptions, so baseline is 3. The description adds value by clarifying that input can be 'city name, coordinates, or address' and mentions the default forecast_days=7, which goes beyond the schema's individual descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool provides 'current weather conditions and 7-day daily forecast for any location worldwide', which is a specific verb+resource combination. It clearly distinguishes from the sibling 'weather-alerts' tool by focusing on current conditions and daily forecast rather than alerts.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides concrete use cases for DeFi agents, agricultural markets, and on-chain insurance, which helps agents decide when to use it. However, it does not explicitly state when not to use it or mention the sibling 'weather-alerts' as an alternative for alert-specific queries.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
weather-alertsAInspect
Active NOAA weather alerts for any US state — tornado warnings, flash flood watches, hurricane warnings, blizzard advisories, heat alerts, and 80+ other NWS event types. Real-time data from api.weather.gov; no API key. Filterable by state, event type, or severity. Returns event, severity, certainty, urgency, affected areas, onset/expiry, headline, and protective action instructions. Complements weather.js (forecasts) — this cap covers active emergencies.
| Name | Required | Description | Default |
|---|---|---|---|
| event | No | Filter by event type (e.g. 'Tornado Warning', 'Flash Flood Watch', 'Hurricane Warning'). Partial matches not supported — use exact NWS event names. | |
| limit | No | Max alerts to return (1–100, default 25). Sorted by severity descending. | |
| state | No | 2-letter US state code to filter by (e.g. 'TX', 'FL'). Omit to get all active US alerts. | |
| severity | No | Minimum severity level to return: Extreme, Severe, Moderate, Minor. Default: no filter (all severities). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It mentions real-time data from api.weather.gov and no API key required, which is helpful. However, it does not disclose rate limits, data freshness guarantees, or error handling behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, each serving a distinct purpose: stating the resource, listing features, and clarifying differentiation. It is front-loaded with the core purpose and contains no extraneous words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the absence of an output schema, the description compensates by enumerating the returned fields (event, severity, certainty, etc.). It also explains the data source, filter options, and the tool's scope relative to forecasts. This provides a complete picture for a retrieval tool with 4 parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already describes each parameter fully. The description adds context by listing example event types and the default limit, but does not significantly enhance understanding beyond the schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides active NOAA weather alerts for any US state, listing specific event types. It explicitly differentiates from the sibling 'weather.js' tool by noting it covers active emergencies versus forecasts.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description includes 'Complements weather.js (forecasts) — this cap covers active emergencies,' which gives clear context on when to use this tool versus an alternative. However, it does not explicitly list when not to use it or other potential alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
web-change-monitorAInspect
Returns content-change signals for any public URL: ETag, Last-Modified, Content-Length, and Content-Type via HTTP HEAD. Falls back to GET + SHA-256 of the first 32 KB when the server returns no cache markers. Store the snapshot and re-call periodically to detect changes. Useful for monitoring competitor pricing, news, regulatory filings, or any public page.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | HTTPS or HTTP URL to poll. Must be a publicly accessible endpoint. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It explains the HEAD-first approach and fallback to GET+SHA-256 of the first 32KB, and advises storing snapshots. However, it omits potential rate limits, authorization needs, or behavior for non-HTTP URLs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise (3 sentences), front-loaded with the main functionality, and every sentence adds value. No fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (one parameter, no output schema), the description covers the algorithm, usage pattern, and example use cases completely.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% (one parameter with description 'HTTPS or HTTP URL to poll. Must be a publicly accessible endpoint'). The description adds 'public URL' but adds little beyond schema. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns content-change signals (ETag, Last-Modified, Content-Length, Content-Type) via HTTP HEAD, with a fallback to GET+SHA-256. It specifies the resource as any public URL and distinguishes it from siblings like http-headers (headers only) and web-scrape-links (scraping content).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states the tool is useful for monitoring competitor pricing, news, regulatory filings, or any public page, and advises storing snapshots and re-calling periodically. While it doesn't explicitly list when not to use or alternative tools, the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
web-company-intelAInspect
Extract structured company intelligence from any public website. Returns company name, description, logo, emails, phones, address, founded date, social links (Twitter/LinkedIn/GitHub/etc.), and raw OpenGraph + schema.org/Organization data. Pure HTML extraction — no external APIs. $0.003 hedge against orbisapi web-scrape-company at $0.005.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL of the company website to analyze (e.g. 'https://stripe.com'). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses the extraction method ('Pure HTML extraction') and cost, but with no annotations, it fails to mention limitations such as JavaScript rendering, rate limits, or accuracy caveats. This partial disclosure earns a mid-range score.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with no redundant information. The first sentence captures purpose and output, the second adds method and cost context. All content is earned.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple extraction tool with one parameter and no output schema, the description covers the return fields and method. It lacks mention of error handling or behavior on non-company URLs, but is mostly complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The single parameter 'url' is fully described in the schema (100% coverage). The description adds no additional semantic value beyond restating the purpose, so the baseline score of 3 applies.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Extract' and the resource 'structured company intelligence from any public website'. It lists specific return fields and distinguishes from alternative tools via a cost comparison, making the purpose unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus siblings like 'company-intel' or 'company-due-diligence'. The cost comparison implies a use case but lacks direct when-to-use or when-not-to-use instructions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
web-scrape-linksAInspect
Extracts all hyperlinks from any public webpage. Returns href URLs normalized to absolute URLs with visible link text. Filters out javascript:, mailto:, data: schemes. Optionally restrict to same-domain links, deduplicate, or include #anchor links. Useful for crawlers, sitemap builders, link graph analysis, and content audits.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL of the webpage to extract links from (http or https). | |
| limit | No | Maximum links to return (default 200, max 500). | |
| deduplicate | No | Pass "false" to allow duplicate hrefs. Default: true (each unique URL returned once). | |
| include_anchors | No | Pass "true" to include anchor-only links (#section). Default: false. | |
| same_domain_only | No | Pass "true" to return only links pointing to the same domain as the input URL. Default: false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description effectively communicates key behaviors: normalization, filtering of non-HTTP schemes, default deduplication, and optional inclusion of anchors or same-domain links. It lacks details on rate limits, error handling, or robots.txt compliance, but covers the main behavioral aspects beyond the schema.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences: the first states the core action, the second adds detail, and the third lists use cases. No unnecessary words, front-loaded with the most important information. Excellent conciseness.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the return format (normalized URLs with link text) and optional parameters adequately. However, it lacks information on error handling, response structure (array vs object), and important details for web scraping like rate limits or robots.txt adherence. Given the absence of an output schema, more detail on the return format would be beneficial.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema has 100% coverage, providing baseline parameter descriptions. The description adds value by clarifying defaults (limit 200 max 500, deduplicate true, etc.) and explaining the filtering behavior, going beyond the schema to aid understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that the tool extracts hyperlinks from a public webpage, normalizes URLs to absolute, filters out specific schemes, and provides optional parameters. It lists use cases but does not explicitly distinguish from sibling tools like 'page-links', though the detail implies a more comprehensive extraction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists common use cases (crawlers, sitemap builders, etc.) giving implicit guidance on when to use it. However, it does not explicitly state when not to use it or mention alternative tools, leaving room for ambiguity with similar siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
whale-radarAInspect
Polymarket whale intelligence for a given proxy wallet address. Returns recent prediction-market trades (market title, outcome, side, size in USDC, price, timestamp) and current open positions (title, size, avg price, current value, unrealized PnL). Infers whale tier (whale/shark/dolphin/minnow) from trade sizes. Use for copy-trading signal generation, market-sentiment cross-reference, or on-chain agent behavior profiling. Free upstream: Polymarket public data API.
| Name | Required | Description | Default |
|---|---|---|---|
| wallet | Yes | Polymarket proxy wallet address (0x, 42 hex chars). Obtain from recent Polymarket trades or on-chain Polygon activity. Example: 0x7a9dc87be2c72791fd86fad5f67be7c5dc89ba5d | |
| activity_limit | No | Max recent trades to return (1–50, default 10). | |
| include_closed | No | If true, include zero-value (closed/redeemable) positions. Default false. | |
| positions_limit | No | Max open positions to return (1–50, default 10). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, and the description does not disclose side effects, rate limits, authentication requirements, or whether the tool is read-only. It only mentions it uses free public data, leaving behavioral traits unclear.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, with two clear paragraphs that front-load the main purpose. Every sentence adds information, and there is no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers input requirements, output structure, and use cases well. However, it lacks an example or details on output format, which would improve completeness for a tool without an output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema coverage, the baseline is 3. The description adds value by explaining the wallet address format and how to obtain it, and summarizing the output structure, which goes beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it provides Polymarket whale intelligence for a given proxy wallet, listing specific outputs (recent trades, open positions, whale tier) and use cases. It distinguishes itself from sibling tools by its focus on Polymarket whale activity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases (copy-trading, sentiment analysis, behavior profiling) and mentions free upstream API. However, it does not explicitly state when not to use it or compare with alternative tools like wallet-balance or polymarket-sentiment-shift.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
world-bank-dataAInspect
World Bank open data — 1600+ development indicators for 200+ countries. Returns most-recent values and 5-year trend for any indicator by country. Covers GDP, population, inflation, unemployment, FDI, debt, exports, CO₂, life expectancy, Gini, internet penetration, ease of doing business, and more. Accepts ticker-style aliases (gdp, inflation, unemployment) or full WB indicator codes. Sourced from api.worldbank.org — free, no key required. Use for country risk, macro comparisons, policy analysis, and development economics.
| Name | Required | Description | Default |
|---|---|---|---|
| years | No | Number of most-recent years to return (1–10). Default: 5. | |
| country | Yes | ISO 2-letter country code(s), semicolon-separated for multiple (e.g. 'US', 'US;CN;DE'). Use 'WLD' for world average. | |
| indicator | Yes | Indicator alias or WB code. Aliases: gdp, gdp_growth, gdp_per_capita, population, inflation, unemployment, fdi, debt_gdp, exports_gdp, imports_gdp, co2_per_capita, internet_users, life_expectancy, gini, literacy, ease_of_biz, current_account, market_cap_gdp, forex_reserves. Or any WB indicator code like 'NY.GDP.MKTP.CD'. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility. It describes the return format (recent values and trend), data source, and that no API key is required. However, it does not disclose rate limits, error handling, data freshness, or any side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is reasonably concise, front-loading the main purpose and key details. It conveys essential information without verbosity, though it could be slightly more structured with bullet points for clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a data retrieval tool with 3 parameters and no output schema, the description covers the input semantics, return structure, and use cases adequately. It lacks examples or error handling details but is sufficient for an agent to understand and invoke the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds significant value by listing accepted aliases for the 'indicator' parameter, explaining the semicolon-separated format for country codes, and clarifying the default and range for 'years'. This aids correct usage beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves World Bank development indicators for countries, specifying the data source, coverage (1600+ indicators, 200+ countries), and output (most-recent values and 5-year trend). It differentiates itself from sibling tools by focusing on this specific dataset, though it does not explicitly name siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases ('country risk, macro comparisons, policy analysis, development economics') but does not explicitly tell when to use this tool versus alternatives like 'macro-brief' or 'country-info'. No when-not-to-use or alternative tools are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
x402-endpoint-intelAInspect
Market intelligence for any x402 endpoint or operator wallet. Returns settlement volume, unique payer count, price range, reputation tier, activity window, and endpoint description — drawn from 4.1M+ Base mainnet settlements in the Stall's PROSPECTOR archive. Use before routing agent spend, vetting a counterparty operator, or benchmarking competitor pricing. No external API. Covers 80,000+ operators across 10+ days of live traffic.
| Name | Required | Description | Default |
|---|---|---|---|
| target | Yes | Resource URL (https://…) or operator wallet address (0x…40 hex chars). Type is auto-detected. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility. It states the tool uses an internal archive with no external API and covers 80,000+ operators, implying a read-only operation. However, it does not address authorization, rate limits, or behavior on invalid targets.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is compact and well-structured: four sentences front-load core functionality, then add usage guidance and data scope. Every sentence adds value without redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a single-parameter tool, the description adequately covers inputs, outputs, and usage context. It lists return fields and data size, but lacks details on result formatting or edge cases. No output schema exists, but the description compensates sufficiently.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The sole parameter 'target' is described in the schema with details on format (URL or wallet address) and auto-detection. The description does not add meaning beyond the schema, and schema coverage is 100%, meeting the baseline for this score.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it returns market intelligence for x402 endpoints/operator wallets, listing specific data fields (settlement volume, payer count, etc.) and the data source. This differentiates it from sibling tools like address-security or wallet-balance, making its purpose unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly recommends using the tool before routing agent spend, vetting counterparty operators, or benchmarking competitor pricing. While it does not list alternatives or when not to use, the use cases are specific and actionable.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
yield-farming-activeBInspect
Returns active DeFi yield farming pools sorted by 30-day average APY. Sourced from DeFiLlama (free, no key). Each pool includes protocol, chain, symbol, TVL, current APY, 30-day mean APY, impermanent-loss risk, and stablecoin flag. Filter by chain, protocol, minimum TVL, minimum APY, or stablecoin-only pools. Use for portfolio yield research, pre-trade DeFi reconnaissance, or capital allocation decisions.
| Name | Required | Description | Default |
|---|---|---|---|
| chain | No | Filter by blockchain (e.g. 'Ethereum', 'Base', 'Polygon', 'Arbitrum', 'Solana'). Case-insensitive. Omit for all chains. | |
| limit | No | Number of pools to return (1–50, default 20). | |
| min_apy | No | Minimum APY percentage to include (based on 30-day mean). Default 0. E.g. 5 returns pools yielding ≥5% annualized. | |
| sort_by | No | Sort order: 'apy_30d' (30-day average APY, default and most stable), 'apy_current' (live APY, more volatile), 'tvl' (largest pools first). | |
| protocol | No | Filter by protocol name (e.g. 'aave-v3', 'uniswap-v3', 'curve', 'lido'). Case-insensitive substring match. | |
| min_tvl_usd | No | Minimum Total Value Locked in USD. Default 1000000 ($1M). Lower values include smaller pools with potentially higher (but riskier) yields. | |
| stablecoin_only | No | If true, returns only stablecoin pools (no impermanent loss from token price volatility). Default false. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It mentions data source (DeFiLlama, free, no key) and includes risk notes (impermanent-loss risk, stablecoin flag). But it lacks details on rate limits, data freshness, pagination, or what happens with no results. Significant gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph that covers purpose, source, fields, filters, and use cases. It is not excessively long but could be better structured for readability (e.g., bullet points or separate sentences for different aspects).
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the parameter richness (7 params, 100% schema coverage) and lack of output schema/annotations, the description adequately covers filters and intended use cases. However, it omits behavioral aspects (pagination, rate limits, data update frequency) and does not describe the output format beyond listing fields.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All 7 parameters have descriptions in the schema, yielding 100% coverage. The description adds useful context beyond the schema, such as case-insensitive substring matching for protocol, default values, and risk implications of min_tvl_usd. This helps agents set appropriate values.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns active DeFi yield farming pools sorted by 30-day average APY, sourced from DeFiLlama. It specifies included fields (protocol, chain, APY, TVL, etc.) and filtering options. However, it does not explicitly differentiate from sibling tools like defi-yields or defi-market-pulse.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description suggests use cases (portfolio yield research, pre-trade DeFi reconnaissance, capital allocation decisions) and lists filter options. However, it does not specify when not to use this tool or mention alternative tools for similar purposes.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!
social-intelAInspectReturns public profile data for any social platform account. Pass a profile URL (platform auto-detected) or platform + username. Supports GitHub, Reddit, HackerNews, Twitter/X, npm, and Open Graph fallback for any URL. Returns name, bio, follower/karma counts, creation date, and platform-specific metrics. Priced at $0.004 — 20% below comparable endpoints.
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Description discloses return fields (name, bio, follower/karma counts, creation date, platform-specific metrics), pricing, and supported platforms. No annotations provided, so description carries burden; it adequately covers the read-only nature and fallback behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences: first states core function and input methods, second lists platforms, return fields, and pricing. Front-loaded and efficient with no redundant words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Sufficient for a simple profile fetch tool with 3 optional parameters. Lists return fields and platforms. Could mention error handling or data freshness, but overall adequate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Adds value beyond schema by explaining URL auto-detection, username without @ prefix, and condition when platform is required. Schema coverage is 100%, so description enriches semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states it returns public profile data for any social platform account, with specific verb and resource. Lists supported platforms (GitHub, Reddit, etc.), distinguishing it from platform-specific sibling tools like 'reddit-intel' or 'github-repo-intel'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear guidance: pass profile URL (auto-detect) or platform+username. Does not explicitly mention when not to use or alternatives, but context is sufficient for most use cases.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.