
Agent Einstein — Autonomous Crypto Intelligence

Server Details

40 AI crypto tools: whale tracking, security scans, DeFi analytics, quantum security, and more.

Status: Healthy
Last Tested:
Transport: Streamable HTTP
URL:


Tool Definition Quality

Score is being calculated. Check back soon.

Available Tools

47 tools
arbitrage_scanner (Grade: C)

Arbitrage Scanner: Cross-chain and cross-DEX price discrepancies for arbitrage opportunities.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis
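The flattened parameter table above maps onto a JSON Schema along these lines. This is a reconstruction from the table, not the server's actual schema; the enum values for `chain` are illustrative assumptions, and only the names, types, and descriptions come from the listing.

```python
# Reconstructed input schema for the shared (chain, limit, timeperiod)
# parameter block. The 'chain' enum values are assumptions; the rest
# mirrors the documented table.
INPUT_SCHEMA = {
    "type": "object",
    "properties": {
        "chain": {
            "type": "string",
            "description": "Blockchain network to query",
            "enum": ["bitcoin", "ethereum", "base", "solana"],  # assumed values
        },
        "limit": {
            "type": "integer",
            "description": "Maximum number of results (default: 10, max: 100)",
            "default": 10,
            "maximum": 100,
        },
        "timeperiod": {
            "type": "string",
            "description": "Time period for analysis",
        },
    },
    "required": [],  # all three parameters are optional
}

def apply_defaults(args: dict) -> dict:
    """Fill in the documented default and clamp 'limit' to the documented max."""
    out = dict(args)
    limit_spec = INPUT_SCHEMA["properties"]["limit"]
    out["limit"] = min(out.get("limit", limit_spec["default"]), limit_spec["maximum"])
    return out
```

Since `required` is empty, a call with no arguments is valid, which is exactly why the reviews below keep asking what the default behavior is.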
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It fails to indicate whether the tool executes trades or merely scans, what the output format contains, data latency/freshness, or financial risk considerations. It also does not disclose whether the operation is read-only (implied by 'scanner' but not confirmed) or requires specific permissions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
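Concretely, a scanner like this could disclose its read-only behavior through MCP tool annotations. The field names below follow the MCP ToolAnnotations shape (readOnlyHint, destructiveHint, idempotentHint, openWorldHint); the values are assumptions about how a scan-only tool would plausibly be declared, not the server's actual metadata.

```python
# Sketch: an annotated tool definition that discloses the behavior the
# review finds missing. Values are assumed for a read-only scanner.
tool = {
    "name": "arbitrage_scanner",
    "description": (
        "Arbitrage Scanner: Cross-chain and cross-DEX price discrepancies "
        "for arbitrage opportunities. Read-only: scans market data and "
        "never executes trades."
    ),
    "annotations": {
        "readOnlyHint": True,      # modifies no state
        "destructiveHint": False,  # cannot spend or destroy anything
        "idempotentHint": True,    # repeated calls are safe
        "openWorldHint": True,     # queries external market data sources
    },
}
```

With annotations present, the prose description no longer has to carry the entire burden of behavioral disclosure on its own.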

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that front-loads the core concept. The 'Arbitrage Scanner:' prefix is slightly redundant with the tool name but does not significantly detract from clarity. No extraneous information is included.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given this is a financial analysis tool with no output schema and no annotations, the description is incomplete. It fails to describe the return value (e.g., opportunity details, profit calculations, risk metrics), does not mention execution versus read-only behavior, and omits safety considerations typical for arbitrage operations.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The description does not explicitly mention any parameters, but the input schema has 100% description coverage with clear enum values for 'chain' and 'timeperiod'. Since the schema fully documents all three parameters, the baseline score applies; the description adds no additional semantic context beyond what the schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool's function: scanning for price discrepancies across chains and DEXes to identify arbitrage opportunities. It uses specific verbs and resources ('scan', 'price discrepancies', 'arbitrage opportunities'). However, it fails to distinguish itself from the sibling tool 'cross_chain_arbitrage', which creates potential selection ambiguity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives, particularly the similarly-named 'cross_chain_arbitrage' sibling. It does not mention prerequisites (e.g., requiring a specific token address), nor does it indicate whether this tool is for analysis only or execution.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
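A description that carries this kind of guidance might read like the hypothetical rewrite below. The wording is illustrative only; the point is that naming the sibling tool and the disambiguation condition explicitly is what lets an agent choose correctly.

```python
# Hypothetical rewrite of the arbitrage_scanner description with
# explicit "use X instead of Y when Z" guidance. Wording is invented
# for illustration, not taken from the server.
IMPROVED_DESCRIPTION = (
    "Arbitrage Scanner: Scan for cross-chain and cross-DEX price "
    "discrepancies (read-only; never executes trades). Use this tool for "
    "a broad survey of raw price gaps; use cross_chain_arbitrage instead "
    "when you need net-profit figures that account for bridge fees."
)
```

The disambiguating clause costs one sentence but removes the selection ambiguity the review flags for both sibling tools.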

bitcoin_bridge_comparison (Grade: B)

Bitcoin Bridge Comparison: Compare Bitcoin bridges by volume, track wrapped BTC peg deviations (wBTC, cbBTC, tBTC), and view THORChain BTC pool stats.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses the data domains covered (volume metrics, peg deviations, pool stats) but lacks operational details such as data freshness, rate limiting, error handling, or whether this triggers any on-chain transactions. It implies read-only behavior but doesn't state it explicitly.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single efficient sentence that front-loads the subject matter. The colon-separated structure ('Bitcoin Bridge Comparison: Compare...') is slightly redundant with the tool name, but every clause adds specific detail about data types (volume, peg deviations, pool stats) without waste.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given that all three parameters are optional and no output schema exists, the description adequately identifies the subject matter but lacks critical operational context. It should specify default behavior when called without parameters (since required: []) and ideally hint at the return structure (e.g., 'returns comparative metrics including...').

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description mentions 'volume' and time-based tracking which loosely maps to the 'timeperiod' parameter, and specific chains map to 'chain', but it doesn't explicitly connect these concepts or explain parameter interactions (e.g., that 'chain' filters the bridge comparison to that specific network).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly specifies the verb (compare, track, view) and resources (Bitcoin bridges, wBTC/cbBTC/tBTC pegs, THORChain pools). It provides concrete asset examples that distinguish it from generic analytics tools. However, it doesn't explicitly contrast usage with siblings like 'cross_chain_arbitrage' or 'bitcoin_onchain_analytics'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided on when to use this tool versus alternatives, prerequisites for use, or when to avoid it. The description states what data is available but not the analytical scenarios where this tool is appropriate (e.g., 'use this to assess bridge risk' or 'use arbitrage_scanner instead for pricing discrepancies').

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

bitcoin_mempool_intelligence (Grade: C)

Bitcoin Mempool & Fee Intelligence: Real-time Bitcoin mempool analytics, fee estimation, and transaction tracking. Includes congestion level, recommended fee tiers, and confirmation ETA.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries the full burden. It partially compensates by disclosing what data is returned (congestion levels, fee tiers, confirmation ETA), which is helpful given the lack of an output schema. However, it omits operational details like rate limits, authentication requirements, or caching behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately concise with two front-loaded sentences. The first establishes the domain and the second lists specific capabilities. No sentences are wasted, though the first sentence functions somewhat as a duplicate title, since the tool name is already descriptive.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the lack of output schema, the description adequately explains what data is returned (congestion, fees, ETA). However, the Bitcoin-specific claim creates confusion given the multi-chain schema, and although the text says 'real-time', it does not clarify whether historical data is also available via the 'timeperiod' parameter.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

While the schema has 100% coverage (baseline 3), the description detracts value by implying Bitcoin-only usage when the 'chain' parameter explicitly supports Ethereum, Base, Solana, and others. It adds no clarifying context for the 'limit' or 'timeperiod' parameters beyond what the schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 3/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool provides Bitcoin mempool analytics, fee estimation, and transaction tracking with specific outputs (congestion level, fee tiers, ETA), which distinguishes it from siblings like bitcoin_onchain_analytics. However, it claims Bitcoin-specific functionality while the input schema's 'chain' parameter accepts Ethereum, Solana, and other non-Bitcoin networks, creating a significant scope ambiguity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives like bitcoin_onchain_analytics or arbitrage_scanner. It does not mention prerequisites, such as whether this requires a specific Bitcoin node connection, or when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
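One concrete fix for the Bitcoin-only vs. multi-chain mismatch flagged above is to narrow the shared schema's `chain` enum for Bitcoin-scoped tools rather than inherit it unchanged. The sketch below assumes illustrative enum values; only the mismatch itself comes from the review.

```python
# Sketch: derive a Bitcoin-scoped schema from the shared multi-chain
# schema by restricting the 'chain' enum. Enum values are assumptions.
SHARED_CHAIN_ENUM = ["bitcoin", "ethereum", "base", "solana"]  # assumed

def narrow_chain(schema: dict, allowed: list) -> dict:
    """Return a copy of `schema` with 'chain' restricted to `allowed`."""
    out = {**schema, "properties": {**schema["properties"]}}
    out["properties"]["chain"] = {**out["properties"]["chain"], "enum": allowed}
    return out

base_schema = {
    "type": "object",
    "properties": {
        "chain": {"type": "string", "enum": SHARED_CHAIN_ENUM},
        "limit": {"type": "integer", "default": 10, "maximum": 100},
    },
}

btc_schema = narrow_chain(base_schema, ["bitcoin"])
```

A narrowed enum removes the ambiguity at the schema level, so the description no longer contradicts the parameters an agent sees.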

bitcoin_onchain_analytics (Grade: C)

Bitcoin Network & Mining Analytics: Mining pool distribution, hashrate trends, difficulty adjustment tracking, block analytics, and Bitcoin address lookup.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It lists data categories returned but omits critical behavioral details: whether data is real-time or historical, latency expectations, what happens when no parameters are provided (all are optional), or whether the tool is read-only.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence is appropriately concise and front-loaded with the core topic, but includes unsupported claims (address lookup) that reduce the value of the limited space used.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Incomplete for a tool with 3 parameters and rich siblings. Fails to resolve the Bitcoin/multi-chain schema mismatch, explain the missing address parameter for claimed lookup functionality, or describe output format (no output schema provided).

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

While schema coverage is 100% (baseline 3), the description claims 'address lookup' functionality but no address parameter exists in the schema. It also fails to explain why a 'bitcoin' tool accepts non-Bitcoin chains (base, ethereum, solana) in the chain parameter, creating semantic confusion.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 3/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly lists Bitcoin-specific capabilities (mining pools, hashrate, difficulty tracking), but fails to address the contradiction between the Bitcoin-specific name/description and the multi-chain schema (which includes Ethereum, Solana, etc. in the chain parameter enum). It also doesn't distinguish from sibling tools like bitcoin_mempool_intelligence or bitcoin_staking_analysis.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to select this tool versus sibling alternatives (e.g., bitcoin_mempool_intelligence for mempool data, general_analysis for broader crypto). No prerequisites mentioned despite all parameters being optional.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

bitcoin_staking_analysis (Grade: B)

Babylon Bitcoin Staking Analytics: Babylon staking analytics including global TVL stats, finality provider leaderboards, staker delegation positions, and individual delegation tracking.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses what data is returned (global TVL, leaderboards, positions), but omits operational details like authentication requirements, rate limits, cache behavior, or whether the analysis is real-time versus historical.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence efficiently lists the four data categories provided. Minor redundancy in repeating 'Babylon ... Analytics' and 'Babylon ... analytics' with the colon structure, but otherwise no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequate for an analytics tool with no output schema—compensates by enumerating the return data types (TVL, leaderboards, etc.). However, given the complex domain (Bitcoin staking with Babylon protocol) and lack of annotations, it would benefit from explicit usage context or behavioral constraints.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage for all three parameters (chain, limit, timeperiod). The description adds no additional parameter context, but since the schema is self-documenting, this meets the baseline expectation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly identifies the tool provides Babylon Bitcoin staking analytics (TVL stats, finality provider leaderboards, delegation tracking), distinguishing it from siblings like bitcoin_onchain_analytics and brc20_analytics. However, it uses a noun phrase structure ('Analytics: ... analytics') rather than a specific action verb, slightly weakening clarity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides no guidance on when to use this tool versus the numerous sibling alternatives (bitcoin_onchain_analytics, runes_analytics, etc.) or what prerequisites might exist. No mention of when not to use it or complementary tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

brc20_analytics (Grade: C)

BRC-20 Token Analytics: BRC-20 token overview, mint progress, holder distribution, and individual token details via UniSat API.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It mentions 'via UniSat API' indicating the data source, but fails to disclose whether the tool is read-only or destructive, rate limits, authentication requirements, or error behaviors. For an analytics tool with no safety annotations, this is insufficient behavioral coverage.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single efficient sentence with no redundant words. The key resource (BRC-20) is front-loaded. However, it could benefit from structured formatting (e.g., separating capability list from API source) to improve scannability for agents parsing the text.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of an output schema, the description compensates partially by listing the types of data returned (overview, mint progress, holder distribution). However, with three optional parameters and no annotations, it should clarify default behaviors when parameters are omitted or explain the UniSat API constraints. It provides the minimum necessary context but leaves operational gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema description coverage, the baseline score applies. The description does not mention any parameters or provide context for why one might filter by 'chain' (particularly notable since BRC-20 is Bitcoin-native yet the schema allows Ethereum/Solana chains), 'limit', or 'timeperiod'. However, since the schema fully documents all three parameters, the description meets the minimum viable threshold.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool's function as BRC-20 token analytics and lists specific data points provided (overview, mint progress, holder distribution, individual token details). It distinguishes from sibling tools like 'runes_analytics' by explicitly naming the BRC-20 standard and UniSat API data source. However, it lacks explicit comparative guidance against the similar Runes analytics sibling.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives (e.g., 'runes_analytics' for the Runes token standard), nor does it mention prerequisites, required conditions, or when not to use it. Agents must infer applicability solely from the resource name.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

copy_trading_intel (Grade: C)

Copy Trading Intelligence: Analyze top-performing wallets for copy trading signals with risk-adjusted position sizing.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It mentions 'risk-adjusted position sizing' hinting at analytical complexity, but fails to clarify whether this is a read-only operation (likely), what specific data fields are returned, how risk is calculated, or any rate limiting concerns. This leaves significant behavioral gaps for a financial analysis tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that avoids verbosity. The 'Copy Trading Intelligence:' prefix functions as a conceptual header that slightly weakens the front-loading, but the core content immediately follows. No redundant or filler text is present.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of annotations and output schema, the description should explain what the analysis returns (e.g., wallet rankings, performance metrics, position size recommendations). It also fails to clarify the relationship with 'copy_trading_leaderboard'. For a complex financial tool with zero structured metadata, the description is insufficiently complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The description does not explicitly mention any input parameters (chain, limit, timeperiod). However, the input schema has 100% description coverage with clear enums and types, establishing a baseline of 3. Since the schema fully documents the parameters, the description does not need to compensate, though it adds no additional semantic context about parameter interactions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool analyzes top-performing wallets to generate copy trading signals with risk-adjusted position sizing. It specifies the verb (analyze), resource (wallets), and output type (signals with sizing). However, it does not explicitly differentiate from the sibling tool 'copy_trading_leaderboard', leaving ambiguity about when to choose this over the alternative.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives, prerequisites for use, or conditions where it should be avoided. Given the presence of 'copy_trading_leaderboard' and 'smart_money' siblings, the lack of explicit when-to-use guidance forces the agent to guess based on naming alone.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

copy_trading_leaderboard (Grade: B)

Copy Trading Leaderboard: Top wallets worth copying with performance data, win rates, and P&L metrics.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full disclosure burden. It compensates partially by listing return data types (performance metrics, win rates, P&L) since no output schema exists. However, it lacks operational details such as ranking methodology, data freshness, rate limits, or whether results are cached.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence with zero wasted words. The colon structure front-loads the concept (Leaderboard) and follows with specific data attributes, making it appropriately sized for the tool's complexity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of an output schema, the description adequately compensates by specifying the categories of returned metrics. However, with zero required parameters and three optional filters, it could better explain the default behavior when no parameters are provided or how the ranking algorithm weighs different performance factors.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema description coverage, the baseline score is 3. The description does not add semantic context to the parameters (e.g., explaining how `timeperiod` affects metric calculations or that `chain` filters the universe of wallets), but it does not need to given the comprehensive schema documentation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the resource (Copy Trading Leaderboard) and specific data returned (performance data, win rates, P&L metrics). However, it does not differentiate from the sibling tool `copy_trading_intel` or clarify when to prefer this leaderboard view over other trading analysis tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives like `copy_trading_intel`, `smart_money`, or `wallet_trust_score`. It does not mention prerequisites, filtering strategies, or limitations of the leaderboard approach.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

cross_chain_arbitrage (Grade: A)

Cross-Chain Arbitrage Scanner: Detect price discrepancies across chains and DEXes with net profit calculations after bridge fees.

Parameters (JSON Schema)

chain (optional): Blockchain network to query
limit (optional): Maximum number of results (default: 10, max: 100)
timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It successfully communicates that the tool performs calculations (net profit after bridge fees) rather than just raw price comparison, and 'detect' implies a read-only operation. However, it omits operational details like data freshness, rate limits, or authentication requirements that would typically appear in annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
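The "annotations" the rubric keeps referring to are MCP tool annotations such as `readOnlyHint` and `destructiveHint`. A sketch of how a scanner could declare itself read-only follows; the hint names come from the MCP tool-annotations specification, while the surrounding definition is illustrative.

```python
# Illustrative MCP-style tool definition with behavioral annotations.
# readOnlyHint / destructiveHint / idempotentHint / openWorldHint are
# the hint names defined by the MCP spec; the values here are assumed.
tool_definition = {
    "name": "cross_chain_arbitrage",
    "description": "Detect cross-chain price discrepancies (read-only).",
    "annotations": {
        "readOnlyHint": True,      # never mutates state or places trades
        "destructiveHint": False,
        "idempotentHint": True,
        "openWorldHint": True,     # queries external chains and DEX APIs
    },
}
```

When annotations like these are absent, as on this server, the prose description is the agent's only safety signal.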

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficiently structured sentence. The prefix 'Cross-Chain Arbitrage Scanner:' acts as an immediate identifier, followed by the specific capability. Every clause adds value, with zero redundancy or filler content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a 3-parameter tool with no required fields and simple types, the description adequately covers the functional scope. It compensates for the lack of output schema by indicating what calculations are returned (net profit). Minor gaps remain regarding operational constraints, but the description is sufficient for tool selection given the moderate complexity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage with clear enum values and types. The description does not explicitly discuss individual parameters, but with complete schema documentation, no additional parameter semantics are necessary. Baseline score applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action (detect price discrepancies), the scope (across chains and DEXes), and the unique value proposition (net profit calculations after bridge fees). This effectively distinguishes it from the generic 'arbitrage_scanner' sibling by emphasizing cross-chain analysis and fee-aware calculations.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance is provided on when to select this tool versus the 'arbitrage_scanner' sibling or other alternatives. While 'cross-chain' and 'bridge fees' imply usage for multi-chain opportunities, the description lacks explicit when-to-use/when-not-to-use criteria or references to alternative tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

crqc_proximity_benchmark (Grade: B)

CRQC Proximity Benchmark: Run standardized benchmark circuits on real IBM quantum hardware to measure current error rates and estimate how far we are from Cryptographically Relevant Quantum Computers (CRQC) that could break ECDSA.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full disclosure burden. It adds valuable context that the tool uses 'real IBM quantum hardware' (implying potential queue times and costs), but fails to specify execution time, cost implications, whether results are cached, or what the return format contains (error rates? qubit counts? time estimates?).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single efficient sentence that front-loads the tool name and immediately explains its function. The colon structure ('CRQC Proximity Benchmark: Run...') is slightly redundant with the tool name but does not significantly impact clarity. Every word conveys specific meaning about the hardware platform, measurement type, and cryptographic goal.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given high complexity (quantum computing benchmarks, cryptographic relevance), absence of output schema, and lack of annotations, the description should explain return values, execution model (async vs sync), and cost structure. It provides none of these operational details, leaving significant gaps for an agent attempting to invoke the tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% description coverage (chain, limit, timeperiod all documented). The description adds no explicit parameter guidance, so it earns the baseline score of 3. Notably, the description does not bridge the conceptual gap between running IBM quantum hardware benchmarks and the blockchain-query parameters present in the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool runs standardized benchmark circuits on IBM quantum hardware to measure error rates and estimate proximity to CRQC (Cryptographically Relevant Quantum Computers) capable of breaking ECDSA. It distinguishes itself from siblings like 'pqc_readiness_assessment' (which assesses migration readiness) by focusing on raw hardware benchmarking and cryptographic breaking estimates.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus sibling quantum tools like 'pqc_readiness_assessment', 'pqc_wallet_scan', or 'quantum_hardware_qrng'. It does not specify prerequisites (e.g., IBM Quantum account requirements) or when alternative analysis methods might be preferable.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

defi_yield_analysis (Grade: B)

DeFi Yield Opportunity Finder: Scan DeFi protocols for highest risk-adjusted yield opportunities with gas cost analysis.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Since no annotations are provided, the description carries the full burden. It adds valuable behavioral context not in the schema: 'risk-adjusted' methodology and 'gas cost analysis'. However, it fails to disclose critical operational details: whether this is read-only (implied by 'scan' but not explicit), data sources, rate limits, or output format expectations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Excellent structure—single sentence, front-loaded with functional identifier ('DeFi Yield Opportunity Finder'), followed by specific action verb and value propositions. Zero wasted words; every phrase (risk-adjusted, gas cost analysis) conveys distinct functionality.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given three simple optional parameters and no output schema, the description adequately covers the input side but remains incomplete regarding what the tool returns (e.g., yield percentages, risk metrics, protocol names, APY calculations). Without an output schema, the description should have sketched the return structure or analysis methodology.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema description coverage, the baseline is 3. The description mentions scanning 'DeFi protocols' which aligns with the 'chain' parameter, but adds no additional semantic context about how 'timeperiod' affects yield calculations (APY window vs. historical performance) or why one would adjust the 'limit' beyond the schema's default/max constraints.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool scans DeFi protocols to find yield opportunities, with specific differentiators (risk-adjusted yields, gas cost analysis). However, it lacks explicit differentiation from siblings like 'pendle_yield_trading' or 'flashloan_opportunities' that also deal with yield generation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus alternatives like 'arbitrage_scanner' or 'pendle_yield_trading'. No mention of prerequisites (e.g., specific chain selection requirements) or when this analysis is most valuable versus other investment research tools in the sibling list.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

einstein_meta_strategy (Grade: C)

Einstein AI Meta-Strategy: AI-driven autonomous meta-strategy that decides what to trade across all asset classes (crypto swaps, Hyperliquid perps/stocks, Polymarket prediction markets, DeFi yield) on each execution cycle.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis

Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden for behavioral disclosure but fails to specify whether the tool executes actual trades (destructive) or returns recommendations (read-only), what the return format is, or required authentication. For a financial trading tool, this is a critical safety gap.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence structure efficiently packs the conceptual scope. The 'Einstein AI Meta-Strategy:' prefix is slightly redundant with the tool name, but the sentence is front-loaded with the core value proposition and contains no extraneous filler.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the high complexity (autonomous AI trading across multiple asset classes), absence of output schema, and zero annotations, the description is insufficient. It omits execution semantics, return value structure, error handling, and whether human confirmation is required for the 'decisions' it makes.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, documenting 'chain', 'limit', and 'timeperiod'. The description lists asset classes covered by the strategy but does not explain how parameters filter or constrain the meta-strategy (e.g., whether 'chain' applies to all asset classes or only crypto swaps). Baseline score appropriate since schema carries the load.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states this is an 'AI-driven autonomous meta-strategy that decides what to trade' and enumerates specific asset classes (crypto swaps, Hyperliquid perps/stocks, Polymarket, DeFi yield). It stands apart from siblings by positioning itself as a comprehensive 'meta' layer across all asset classes rather than a specialized scanner.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance is provided on when to invoke this meta-strategy versus using specific sibling tools like 'arbitrage_scanner' or 'defi_yield_analysis' directly. The phrase 'on each execution cycle' implies periodic use but doesn't clarify prerequisites or selection criteria.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

flashloan_opportunities (Grade: B)

Flashloan Opportunity Scanner: Real-time scanner for MEV, liquidation, DEX arbitrage, and cross-chain flashloan opportunities with profit estimation.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full disclosure burden. It adds 'real-time' and 'profit estimation' as behavioral traits, but fails to clarify critical safety aspects: whether the tool executes transactions or is read-only, authentication requirements, rate limits, or data freshness guarantees. The term 'scanner' implies observation, but explicit safety context is missing for a high-risk financial domain.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, information-dense sentence with no filler content. It front-loads the tool identity and immediately lists specific capabilities, making efficient use of limited space.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of DeFi flashloans and the absence of both annotations and output schema, the description adequately covers the scanning scope but leaves gaps regarding return value structure, execution risks, and actionable next steps. It mentions profit estimation but does not characterize the opportunity data format.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage with clear enum values and type definitions. The description does not add parameter-specific guidance (e.g., default behaviors, valid combinations) beyond what the schema already provides, warranting the baseline score of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool as a scanner for flashloan opportunities, specifying four distinct opportunity types (MEV, liquidation, DEX arbitrage, cross-chain) and profit estimation. While it implicitly distinguishes from siblings like 'arbitrage_scanner' through specificity to flashloans, it does not explicitly contrast with similar tools in the ecosystem.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance is provided on when to use this tool versus similar siblings like 'arbitrage_scanner', 'cross_chain_arbitrage', or 'liquidation_monitor'. There are no prerequisites, exclusions, or workflow positioning hints, despite the crowded toolset with overlapping functionality.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

general_analysis (Grade: A)

General AI Analysis: Send any prompt to Agent Einstein who autonomously selects tools, analyzes data, and provides intelligent responses. Use for complex questions requiring reasoning or multi-step analysis.

Parameters (JSON Schema):
- prompt (required): The question or analysis request to send to Einstein AI
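A call to this tool travels over MCP's JSON-RPC `tools/call` method. A minimal request body might look like the following; the method and params shape follow the MCP specification, while the id and prompt text are made up for illustration.

```python
import json

# Minimal MCP `tools/call` request for general_analysis.
# Method name and params shape per the MCP spec; id and prompt
# are illustrative.
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "general_analysis",
        "arguments": {
            "prompt": "Which chains show unusual whale accumulation this week?"
        },
    },
}
wire = json.dumps(request)
```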
Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It successfully discloses the crucial behavioral trait that this tool 'autonomously selects tools,' indicating it is an orchestrator/agent rather than a direct data source. It could improve by mentioning whether responses are streamed, cached, or have side effects, but the agentic nature is well-covered.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficient sentences with zero waste. The first sentence defines the mechanism and capability; the second provides usage guidance. Information is front-loaded and appropriately sized for a single-parameter tool.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the high complexity of an agent-orchestration tool among many specific function tools, the description adequately explains its role as the general interface for Einstein. It compensates for the lack of output schema by explaining the response type ('intelligent responses'). It appropriately omits redundant parameter detail given the 100% schema coverage.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage for the single 'prompt' parameter ('The question or analysis request to send to Einstein AI'). The description text ('Send any prompt') aligns with but does not substantially extend the schema's semantics. With full schema coverage, the baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool sends prompts to 'Agent Einstein' who autonomously selects tools and analyzes data. It distinguishes itself from the 40+ specific siblings (e.g., arbitrage_scanner, whale_tracking) by emphasizing this is a general-purpose, autonomous agent for complex queries rather than a single-function data fetch.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides clear positive guidance ('Use for complex questions requiring reasoning or multi-step analysis'), effectively positioning it as the fallback for queries that don't map to specific single-purpose siblings. Lacks explicit 'when not to use' exclusions or named alternatives, but the contrast with specific sibling names makes the boundary reasonably clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

holder_concentration (Grade: C)

Holder Concentration: Token holder distribution analysis with concentration risk metrics.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis
- tokenAddress (optional): Token contract address to analyze

Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It mentions 'concentration risk metrics' hinting at the output domain, but fails to disclose operational characteristics such as whether the analysis is real-time or cached, rate limits, error behaviors for invalid addresses, or what specific metrics (e.g., Gini coefficient, HHI) are returned.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
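One of the concentration metrics mentioned above, the Herfindahl-Hirschman Index (HHI), is simple to sketch. Whether this server actually computes HHI is an assumption; the formula itself is standard.

```python
def hhi(balances: list[float]) -> float:
    """Herfindahl-Hirschman Index over holder balances.

    Sum of squared holder shares: 1/n for a perfectly even split
    across n holders, approaching 1.0 as one wallet dominates.
    """
    total = sum(balances)
    if total == 0:
        return 0.0
    return sum((b / total) ** 2 for b in balances)

# Four equal holders -> 0.25; one whale plus dust -> close to 1.0.
even = hhi([100, 100, 100, 100])
whale = hhi([9_700, 100, 100, 100])
```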

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence, but wastes space by prefixing with 'Holder Concentration:' which restates the tool name. The remaining content 'Token holder distribution analysis with concentration risk metrics' is efficient, but the tautological opening reduces the value density.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a 4-parameter analytical tool with no output schema, the description provides the minimum viable context by identifying the analysis domain. However, it lacks detail on the return format or specific concentration metrics calculated, leaving significant gaps given the absence of structured output documentation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% description coverage, establishing a baseline of 3. The description mentions 'concentration risk metrics' which contextually implies the 'tokenAddress' is the primary subject, but adds no specific guidance on parameter usage (e.g., syntax for addresses, how 'limit' affects the depth of holder analysis, or default behaviors when optional parameters are omitted).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'Token holder distribution analysis with concentration risk metrics,' specifying the resource (token holders) and output type (risk metrics). It implicitly distinguishes from sibling tools like 'whale_tracking' (individual movements) and 'wallet_portfolio' (single wallet views) through its focus on aggregate distribution and concentration risk, though it doesn't explicitly name alternatives.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives. It doesn't indicate prerequisites (e.g., needing a specific token address format), when to prefer this over 'whale_tracking' for large holder analysis, or exclusions for certain token types.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

hyperliquid_smart_money (Grade: B)

Hyperliquid Smart Money Intelligence: Access top trader leaderboard, whale positions, and smart money consensus signals from Hyperliquid. Shows what the best traders are collectively bullish or bearish on.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It describes the semantic content of returned data (bullish/bearish signals, leaderboards) but omits operational characteristics such as real-time vs cached data, rate limits, authentication requirements, or safety properties (read-only vs destructive).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description consists of two efficient sentences with no redundancy. The first sentence front-loads the core capability and data sources, while the second clarifies the value proposition (collective bullish/bearish sentiment), making every word serve a distinct purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given three optional parameters with full schema documentation and no output schema, the description adequately covers the business purpose. However, without annotations or an output schema, the description should ideally characterize the return structure (e.g., 'returns a list of traders with PnL rankings') to be complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description provides conceptual context about what data is retrieved (implying the 'limit' applies to traders or signals) but does not explicitly map parameters to functionality, provide usage examples, or explain parameter interactions (e.g., chain filtering on Hyperliquid).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool's function as accessing Hyperliquid-specific smart money data including leaderboards, whale positions, and consensus signals. It narrows its scope to Hyperliquid (differentiating it from the generic 'smart_money' sibling) by explicitly mentioning the platform twice, though it doesn't explicitly contrast usage against that sibling.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance is provided for when to select this tool versus the generic 'smart_money' sibling or other trading intelligence tools like 'copy_trading_intel'. The description implies Hyperliquid-specific use cases through naming but lacks explicit when-to-use or when-not-to-use instructions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

liquidation_monitor (Grade: A)

DeFi Liquidation Monitor: Monitor Aave/Compound positions approaching liquidation with health factors and bonus estimates.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden of behavioral disclosure. It successfully indicates the tool returns 'health factors and bonus estimates' (domain-specific outputs), and implies read-only monitoring behavior. However, it lacks explicit safety confirmation (read-only vs transactional), error behavior, or real-time vs cached data disclosure that would be critical for financial DeFi operations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
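The 'health factor' metric referenced here follows Aave's standard definition, weighted collateral value over debt. The threshold and dollar values below are illustrative, not live protocol parameters.

```python
def health_factor(collateral_usd: float,
                  liquidation_threshold: float,
                  debt_usd: float) -> float:
    """Aave-style health factor: positions with HF < 1.0 can be liquidated.

    `liquidation_threshold` is the protocol's per-asset ratio
    (e.g. 0.80 means 80% of collateral value counts toward solvency).
    """
    if debt_usd == 0:
        return float("inf")
    return (collateral_usd * liquidation_threshold) / debt_usd

# $10,000 collateral at an 80% threshold backing $6,000 of debt:
hf = health_factor(10_000, 0.80, 6_000)       # ~1.33, safe but worth watching
at_risk = health_factor(10_000, 0.80, 8_200)  # below 1.0, liquidatable
```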

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence format with category prefix 'DeFi Liquidation Monitor:' provides immediate categorization. Every clause delivers distinct value: protocol scope (Aave/Compound), function (monitoring liquidation proximity), and return data types (health factors, bonus estimates). Zero redundancy or filler.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 100% schema coverage and no output schema, the description adequately compensates by specifying what data is returned (health factors, bonus estimates). However, it misses the opportunity to note that all parameters are optional or to describe the default monitoring behavior when no parameters are provided.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
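The return-format gap noted here could be closed with an output schema. A hypothetical sketch of what one might declare; every field name below is invented for illustration, not taken from the actual server:

```python
# Hypothetical MCP outputSchema for liquidation_monitor. All field names are
# illustrative assumptions; the server publishes no output schema today.
liquidation_monitor_output_schema = {
    "type": "object",
    "properties": {
        "positions": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "protocol": {"type": "string", "enum": ["aave", "compound"]},
                    "healthFactor": {
                        "type": "number",
                        "description": "Values below 1.0 are liquidatable",
                    },
                    "liquidationBonusEstimate": {
                        "type": "number",
                        "description": "Estimated liquidator bonus as a fraction",
                    },
                },
                "required": ["protocol", "healthFactor"],
            },
        },
    },
    "required": ["positions"],
}
```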

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing baseline 3. The description mentions Aave/Compound contexts but does not add parameter-specific semantics beyond the schema (e.g., default behaviors, valid combinations, or that all parameters are optional). No clarification on whether 'chain' filters protocols or just networks.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
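To illustrate the kind of parameter semantics the review finds missing, here is a hedged sketch of enriched property descriptions; the parameter names match the published schema, while the added defaults and interaction notes are assumptions:

```python
# Illustrative enrichment of liquidation_monitor's input schema. Parameter
# names match the published schema; the extra semantics are assumptions.
enriched_properties = {
    "chain": {
        "type": "string",
        "description": (
            "Blockchain network to query. Filters to Aave/Compound deployments "
            "on that network; omit to scan all supported networks."
        ),
    },
    "limit": {
        "type": "integer",
        "default": 10,
        "maximum": 100,
        "description": "Maximum results, ordered by proximity to liquidation.",
    },
    "timeperiod": {
        "type": "string",
        "description": "Time period for analysis; omit for a current snapshot.",
    },
}
```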

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description uses specific verb 'Monitor' with explicit resources 'Aave/Compound positions' and clarifies scope with 'approaching liquidation'. The mention of 'health factors and bonus estimates' distinguishes it from sibling tools like defi_yield_analysis or flashloan_opportunities, clearly positioning it as a liquidation risk assessment tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Description provides no explicit guidance on when to use this tool versus alternatives (e.g., when to use flashloan_opportunities vs this monitor). No prerequisites, filtering guidance, or 'when-not-to-use' conditions are specified despite the tool having specific DeFi protocol limitations.

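A description carrying the missing usage guidance might read as follows; this rewrite is illustrative, not the server's actual text, and the sibling tool names are taken from the review:

```python
# Hypothetical rewrite of the liquidation_monitor description adding explicit
# "use X instead of Y when Z" guidance. Wording is illustrative only.
improved_description = (
    "DeFi Liquidation Monitor: Monitor Aave/Compound positions approaching "
    "liquidation with health factors and bonus estimates. Read-only. Use this "
    "to assess liquidation risk on lending protocols; use "
    "flashloan_opportunities for capital-sourcing strategies and "
    "defi_yield_analysis for general yield questions."
)
```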
market_movers (Grade C)

Market Movers: Detect significant price action and momentum signals across DEXes.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 2/5

No annotations are provided, so the description carries the full burden. While 'Detect' implies a read-only operation, the description doesn't confirm safety, disclose rate limits, define what constitutes 'significant' price action, or indicate data freshness/latency.

Conciseness: 5/5

Single sentence efficiently delivers the core purpose without redundancy. The 'Market Movers:' prefix substitutes for the null title effectively. No filler content.

Completeness: 2/5

For a complex financial analysis tool with no output schema and no annotations, the description is insufficient. It fails to explain the return format, what specific momentum indicators are used, or how results are ranked, leaving the agent unprepared to interpret results.

Parameters: 3/5

Input schema has 100% description coverage with clear enums for chain and timeperiod. The description adds minimal semantic value beyond the schema, though the mention of 'across DEXes' implicitly contextualizes the chain parameter. Baseline 3 is appropriate given schema completeness.

Purpose: 4/5

The description clearly states the tool detects 'significant price action and momentum signals across DEXes' with specific verbs and domain. However, it lacks explicit differentiation from siblings like 'top_gainers' or 'technical_analysis', which operate in similar domains.

Usage Guidelines: 2/5

No guidance provided on when to use this tool versus the 40+ sibling analysis tools (e.g., 'top_gainers', 'smart_money', 'whale_tracking'). No prerequisites, filters, or exclusion criteria are mentioned.

mev_detection (Grade C)

MEV Detection: Sandwich attacks, front-running, and toxic flow analysis.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 2/5

With no annotations provided, the description carries the full burden of behavioral disclosure. While it lists attack types analyzed, it fails to disclose whether the operation is read-only, what data format is returned, rate limits, or whether results are real-time vs historical. Insufficient for a complex blockchain analytics tool.

Conciseness: 3/5

The description is extremely terse (7 words) and front-loaded, but undersized for the tool's complexity. While no words are wasted, critical information about return values and sibling differentiation is missing, making it inadequately concise rather than efficiently minimal.

Completeness: 2/5

Given the complexity of MEV analysis and the absence of an output schema, the description should explain what data structure is returned (transactions, statistics, scores) and the analysis methodology. With 3 optional parameters and no return documentation, the description is incomplete for safe invocation.

Parameters: 3/5

Schema description coverage is 100%, establishing a baseline of 3. The description mentions specific MEV attack types, which provides domain context, but adds no specific guidance on parameter interactions (e.g., default behavior when all optional parameters are omitted) or valid value formats beyond the schema.

Purpose: 4/5

The description clearly identifies the tool's function using specific verbs ('Detection') and resources ('Sandwich attacks, front-running, toxic flow'), which helps distinguish it as an analysis tool. However, it does not explicitly differentiate from the sibling 'mev_protection' tool, which could cause selection confusion.

Usage Guidelines: 2/5

No guidance provided on when to use this tool versus alternatives like 'mev_protection' or 'arbitrage_scanner'. No mention of prerequisites, data freshness, or whether this requires specific blockchain node access.
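The disambiguation the review asks for could be as small as one cross-reference sentence in each description. A hedged sketch of the distinction between the two MEV tools; the wording is invented, and mev_protection's non-broadcasting behavior is an assumption inferred from its description:

```python
# Illustrative "when to use" guidance distinguishing the two MEV siblings.
# The claim that mev_protection does not broadcast is an inference from its
# description, not confirmed by the server.
mev_tool_guidance = {
    "mev_detection": (
        "Use to analyze observed MEV activity (sandwich attacks, "
        "front-running, toxic flow) on a chain."
    ),
    "mev_protection": (
        "Use to assess a planned transaction's MEV exposure and get private "
        "mempool routing recommendations; it does not broadcast transactions."
    ),
}
```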

mev_protection (Grade C)

MEV Protection Transaction Routing: Analyze transactions for MEV vulnerability and recommend private mempool routing strategies.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 2/5

With no annotations provided, the description carries the full burden but remains ambiguous on critical behavioral traits: it is unclear whether this tool actually performs transaction routing or merely recommends strategies, what the analysis criteria are, what rate limits apply, or what permissions mempool access requires.

Conciseness: 4/5

Single sentence with subtitle structure. The prefix 'MEV Protection Transaction Routing:' slightly wastes space by restating the concept implicit in the tool name, but the remainder efficiently communicates core functionality without redundancy.

Completeness: 2/5

With 3 optional parameters and no output schema, the description should clarify what default behavior occurs when called without arguments. It omits explanation of return values, analysis methodology, or how the 'recommendations' are formatted, leaving significant gaps for an agent attempting to use the tool effectively.

Parameters: 3/5

Schema description coverage is 100% (chain, limit, timeperiod all documented). The description adds no additional parameter context, but meets the baseline since the schema fully documents acceptable values (enums) and defaults.

Purpose: 4/5

The description uses specific verbs ('analyze', 'recommend') and identifies the domain (MEV vulnerability, private mempool routing). However, it fails to distinguish from sibling tool 'mev_detection', leaving ambiguity about whether this tool detects threats, provides protection analysis, or executes routing.

Usage Guidelines: 2/5

No guidance provided on when to use this versus 'mev_detection' or other analysis tools. No mention of prerequisites (e.g., transaction data requirements) or when private mempool routing is actually necessary versus standard broadcasting.

pendle_yield_trading (Grade B)

Pendle Yield Tokenization Analysis: Analyze Pendle yield markets, compare fixed vs variable yields, find best PT/YT opportunities across chains, and view yield term structures.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 3/5

No annotations are provided, placing the full burden on the description. It discloses analytical behaviors ('compare fixed vs variable yields', 'view yield term structures') but omits operational details such as whether the tool is read-only, rate limits, data freshness, or authentication requirements.

Conciseness: 4/5

The description is a single compound sentence that efficiently lists multiple capabilities. It is appropriately sized and front-loaded with the domain identifier ('Pendle Yield Tokenization Analysis'), though the colon-separated label format slightly reduces readability.

Completeness: 3/5

The description adequately covers the tool's domain-specific purpose (Pendle yield tokenization) and analytical capabilities. However, it lacks description of return values (no output schema exists) and operational constraints, leaving gaps for a tool with financial analysis capabilities.

Parameters: 3/5

Schema description coverage is 100%, providing clear descriptions for 'chain', 'limit', and 'timeperiod'. The description mentions 'across chains', which aligns with the chain parameter, but does not add semantic meaning or usage guidance beyond what the schema already provides. Baseline 3 is appropriate for high schema coverage.

Purpose: 4/5

The description uses specific action verbs ('Analyze', 'compare', 'find', 'view') and identifies the specific resource ('Pendle yield markets', 'PT/YT opportunities'). It distinguishes itself from siblings like 'defi_yield_analysis' by explicitly referencing Pendle-specific concepts (Principal/Yield Tokens, fixed vs variable yields).

Usage Guidelines: 2/5

The description provides no explicit guidance on when to use this tool versus the sibling 'defi_yield_analysis' or other yield-related tools. While the Pendle specificity implies usage for Pendle markets, there is no explicit 'when to use' or 'when not to use' guidance.
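The 'fixed vs variable yields' comparison the pendle_yield_trading description promises rests on a simple relationship: a Principal Token (PT) redeems for one unit of underlying at maturity, so its discount implies a fixed yield. A sketch with invented numbers:

```python
# Implied fixed APY from a Pendle PT price. A PT redeems for 1 unit of
# underlying at maturity, so buying at a discount locks in a fixed yield.
# The price and maturity below are invented for illustration.
def implied_fixed_apy(pt_price: float, years_to_maturity: float) -> float:
    """Annualized fixed yield implied by a discounted PT price."""
    return (1.0 / pt_price) ** (1.0 / years_to_maturity) - 1.0

apy = implied_fixed_apy(0.95, 0.5)  # PT at 0.95 with six months to maturity
assert 0.10 < apy < 0.11            # roughly 10.8% annualized
```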

polymarket_arbitrage (Grade A)

Polymarket Cross-Platform Arbitrage: Find price discrepancies between Polymarket and Kalshi prediction markets for arbitrage opportunities with confidence scoring.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 3/5

With no annotations provided, the description carries the full disclosure burden. It adds valuable behavioral context by mentioning 'confidence scoring' as a feature, but fails to clarify critical operational traits: whether the tool executes trades or merely reports opportunities, authentication requirements, or the confidence score range/format.

Conciseness: 5/5

The description is a single, efficient sentence with zero redundancy. It front-loads the domain ('Polymarket Cross-Platform Arbitrage') and immediately follows with the mechanism and distinctive feature. Every clause serves a specific purpose in defining the tool's unique value proposition.

Completeness: 3/5

Given the 100% schema coverage and absence of an output schema, the description adequately covers the tool's purpose but leaves gaps regarding return value structure (e.g., whether it returns opportunities or execution results). For a financial analysis tool with three well-documented parameters, it is minimally viable but could benefit from describing the output format.

Parameters: 3/5

The input schema has 100% description coverage with clear enums and default values documented. The description does not add semantic meaning beyond the schema (e.g., it doesn't explain why one would choose a specific chain or timeperiod), but with complete schema documentation, it meets the baseline expectation.

Purpose: 5/5

The description clearly states the specific action (find price discrepancies), the exact platforms compared (Polymarket and Kalshi), and the output characteristic (confidence scoring). It effectively distinguishes from siblings like polymarket_spread_analysis (internal spreads) and cross_chain_arbitrage (crypto chains) by explicitly naming the Kalshi cross-platform comparison.

Usage Guidelines: 3/5

While the description implies usage scenarios through the specificity of 'Cross-Platform' and naming Kalshi, it lacks explicit guidance on when to prefer this tool over arbitrage_scanner or polymarket_conviction. No 'when-not-to-use' or prerequisite conditions are mentioned, leaving agents to infer applicability.

polymarket_conviction (Grade A)

Polymarket High Conviction Scanner: Identify high-conviction betting opportunities by analyzing model-price divergence, cross-platform comparisons, and order flow signals.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 2/5

With no annotations provided, the description carries the full burden of behavioral disclosure. While it mentions the analytical approach (model-price divergence, order flow), it omits operational traits: whether results are real-time or cached, rate limiting, authentication requirements, or whether the operation is read-only. This leaves significant behavioral gaps.

Conciseness: 5/5

The description is optimally concise with a clear header ('Polymarket High Conviction Scanner:') followed immediately by the action verb and methodology. Zero redundant words; every clause serves to differentiate the tool's specific value proposition.

Completeness: 3/5

Given the tool has no output schema and no annotations, the description adequately covers the functional purpose but lacks completeness regarding return values or result structure. For a 3-parameter analysis tool, it meets minimum viability but should disclose what constitutes a 'high-conviction opportunity' in the output.

Parameters: 3/5

Schema description coverage is 100%, with all three parameters (chain, limit, timeperiod) fully documented in the schema. The description does not add semantic value beyond the schema (e.g., no examples of valid combinations or clarification on default behaviors), warranting the baseline score.

Purpose: 5/5

The description clearly states the tool identifies 'high-conviction betting opportunities' using specific methodologies (model-price divergence, cross-platform comparisons, order flow signals). It effectively distinguishes itself from sibling tools like polymarket_arbitrage and polymarket_insider_scan through its unique focus on conviction signals versus pure arbitrage or insider detection.

Usage Guidelines: 3/5

The description implies usage context (when seeking high-conviction signals with model-price divergence), but provides no explicit when-to-use guidance or comparisons to alternatives. It does not clarify why one would choose this over polymarket_arbitrage or polymarket_insider_scan despite having distinct functionality.

polymarket_insider_scan (Grade A)

Polymarket Insider Detection: Scan Polymarket for insider trading signals using 18 detection heuristics including wallet clustering, Sybil detection, and on-chain whale transfer analysis.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 3/5

With no annotations provided, the description carries the full disclosure burden. It successfully describes the analytical behavior (18 heuristics, specific detection methods) but fails to disclose operational traits like read-only status, rate limits, authentication requirements, or output format. It confirms what analysis is performed but not its safety profile or side effects.

Conciseness: 5/5

Single dense sentence with zero wasted words. Front-loaded with the core function ('Polymarket Insider Detection: Scan...'), immediately communicating value. Every phrase ('18 detection heuristics', 'wallet clustering', 'Sybil detection') adds specific methodological context without redundancy.

Completeness: 3/5

Adequate for purpose identification but incomplete given the tool's complexity and lack of output schema. For an analysis tool running 18 heuristics, the description should ideally characterize the output format (alerts? scores? clustered wallets?) or signal types returned. It mentions 'signals' but leaves significant gaps regarding the return structure.

Parameters: 3/5

Schema description coverage is 100%, establishing a baseline of 3. The description adds no parameter-specific guidance (e.g., no mention of timeperiod granularity implications or chain selection criteria), relying entirely on the schema for parameter documentation.

Purpose: 5/5

Description clearly states the tool scans Polymarket for insider trading signals, specifying the exact resource (Polymarket) and action (scan/detection). The mention of '18 detection heuristics' and specific methodologies (wallet clustering, Sybil detection, whale transfer analysis) strongly distinguishes it from sibling tools like polymarket_arbitrage or polymarket_markets.

Usage Guidelines: 4/5

While it lacks explicit 'when-not-to-use' language, the description provides clear contextual boundaries through the specific detection heuristics listed. An agent can easily infer this tool is for insider/suspicious activity detection rather than general market analysis or arbitrage, effectively differentiating it from the 9+ sibling Polymarket tools without explicit enumeration.

polymarket_markets (Grade C)

Polymarket Market Discovery: Search and discover active prediction markets on Polymarket with AI-powered semantic search, order book data, and analytics.

Parameters (JSON Schema)

Name        Required  Description
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis

Behavior: 2/5

With no annotations provided, the description carries the full burden of behavioral disclosure. While it mentions return data types (order book, analytics), it misleadingly implies semantic search functionality that the parameters cannot support. It lacks disclosure on authentication requirements, rate limits, read-only status, or pagination behavior beyond the limit parameter.

Conciseness: 4/5

The description is a single, efficient sentence with no obvious waste. The prefix 'Polymarket Market Discovery:' is slightly redundant with the tool name but serves as a subject header. The content is front-loaded with the key resource and action.

Completeness: 3/5

For a tool with 3 optional parameters and no output schema, the description is minimally adequate. It compensates for the missing output schema by mentioning return data types (order book, analytics). However, the unsupported 'semantic search' claim represents a significant gap in accuracy, and the lack of sibling differentiation leaves the tool's specific role unclear.

Parameters: 3/5

Schema description coverage is 100%, establishing a baseline of 3. The description does not add specific parameter semantics beyond the schema (e.g., valid chain values, time period formats), and the 'semantic search' claim creates confusion about how the chain/limit/timeperiod parameters function (as filters vs. search inputs).

Purpose: 3/5

The description identifies the resource (Polymarket prediction markets) and general action (search/discover), mentioning specific return data like order book and analytics. However, it claims 'AI-powered semantic search' capabilities that are unsupported by the input schema (which lacks any query/text parameter), and fails to distinguish this tool from four siblings (polymarket_arbitrage, polymarket_conviction, polymarket_insider_scan, polymarket_spread_analysis).

Usage Guidelines: 2/5

No explicit when-to-use guidance is provided, nor are alternatives named. Given the four specialized Polymarket siblings, the description misses a critical opportunity to clarify that this is the general discovery tool while siblings handle specific analyses (arbitrage, conviction, etc.).
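The unsupported 'semantic search' claim in polymarket_markets could be resolved either by removing it from the description or by exposing a text input. A hypothetical schema addition; the parameter name and wording are invented for illustration:

```python
# Hypothetical free-text parameter that would make polymarket_markets'
# "AI-powered semantic search" claim usable. Name and wording are invented,
# not taken from the actual server schema.
query_property = {
    "query": {
        "type": "string",
        "description": (
            "Free-text search over active markets, matched semantically "
            "rather than by exact keyword."
        ),
    },
}
```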

polymarket_spread_analysisBInspect

Polymarket Spread & Market Making: Analyze bid-ask spreads, completeness arbitrage (YES+NO < $1), and market making opportunities with real CLOB order book data.

ParametersJSON Schema
NameRequiredDescriptionDefault
chainNoBlockchain network to query
limitNoMaximum number of results (default: 10, max: 100)
timeperiodNoTime period for analysis
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description adds valuable behavioral context: it specifies the data source ('real CLOB order book data') and the specific arbitrage logic ('completeness arbitrage (YES+NO < $1)'). However, it lacks disclosure on rate limits, authentication requirements, or what constitutes a 'market making opportunity' in the output.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Excellent information density in a single sentence. The colon-separated structure front-loads the domain ('Polymarket Spread & Market Making') followed by specific analytical capabilities. Every phrase earns its place without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema exists, yet the description fails to indicate return value structure (e.g., spread metrics, arbitrage opportunities, or order book depth). With 3 optional parameters and no annotations, the description should also explain default behavior when no parameters are provided, which it does not.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description mentions 'real CLOB order book data' which loosely implies blockchain context, but does not explicitly reference chain, limit, or timeperiod parameters, nor does it explain default behaviors for these optional parameters.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool analyzes bid-ask spreads, completeness arbitrage (YES+NO < $1), and market making opportunities using real CLOB order book data. It effectively distinguishes from generic polymarket_arbitrage by specifying 'completeness arbitrage' and 'market making,' though it doesn't explicitly contrast use cases with siblings.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance on when to use this versus polymarket_arbitrage or other Polymarket siblings. While 'market making opportunities' implies usage for liquidity providers, there are no stated prerequisites, exclusions, or decision criteria for tool selection.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pqc_migration_planner (Grade A)

PQC Migration Planner: Generate a prioritized post-quantum migration plan for wallet addresses. Ranks by exposure (risk × balance), provides step-by-step migration instructions with destination recommendations and gas cost estimates.
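The "exposure (risk × balance)" ranking the description advertises can be sketched as below; the field names and sample wallets are hypothetical.

```python
def rank_by_exposure(wallets: list[dict]) -> list[dict]:
    """Order wallets for migration: exposure = quantum risk score x USD
    balance, highest first, so the most valuable vulnerable funds move first."""
    for w in wallets:
        w["exposure"] = w["risk"] * w["balance_usd"]
    return sorted(wallets, key=lambda w: w["exposure"], reverse=True)

plan = rank_by_exposure([
    {"address": "wallet_a", "risk": 0.9, "balance_usd": 1_000},   # exposed pubkey
    {"address": "wallet_b", "risk": 0.2, "balance_usd": 50_000},  # hashed address
])
print([w["address"] for w in plan])  # wallet_b first: 10000 exposure vs 900
```

Note the ordering effect: a lower-risk wallet can still outrank a high-risk one when its balance dominates, which is the point of multiplying rather than ranking by risk alone.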

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It adds valuable context about the ranking algorithm (risk × balance) and output components (step-by-step instructions, gas estimates), but fails to clarify whether this tool executes transactions or is read-only analysis, and does not explain how wallet addresses are determined given the lack of a wallet address parameter in the schema.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is optimally concise with two high-density sentences. The first front-loads the core purpose (Generate a prioritized post-quantum migration plan), while the second efficiently enumerates the specific deliverables (ranking methodology, instructions, recommendations, estimates) without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and no annotations, the description adequately explains what the generated plan contains conceptually, but has notable gaps: it does not clarify how wallet addresses are specified (absent from schema parameters), lacks safety/destructive behavior disclosure, and does not describe the return format structure despite the absence of an output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description does not add additional semantic context for chain, limit, or timeperiod beyond what the schema already provides, nor does it explain the relationship between these parameters and the 'wallet addresses' mentioned in the description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action (generate a prioritized post-quantum migration plan), the target resource (wallet addresses), and distinguishes from siblings like pqc_readiness_assessment (assessment) and pqc_wallet_scan (scanning) by emphasizing actionable planning outputs (step-by-step instructions, destination recommendations, gas estimates).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

While the description implies usage through specificity (generating actionable plans vs. mere assessment), it lacks explicit guidance on when to select this over pqc_readiness_assessment or pqc_wallet_scan. No 'when to use' or explicit alternative recommendations are provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pqc_readiness_assessment (Grade C)

PQC Crypto Readiness Assessment: Comprehensive ecosystem-wide quantum readiness report covering chain PQC status, NIST algorithm comparison (Dilithium, FALCON, SPHINCS+), quantum threat timeline, and optional portfolio exposure analysis.

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It fails to indicate whether this is a read-only operation (highly likely given 'assessment'), whether it requires authentication, potential rate limits for quantum analysis, or the expected response format/structure. It only describes output topics, not behavioral traits or side effects.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured as a title header followed by a comprehensive feature list. Every clause adds specific content information (algorithm names, analysis types). It avoids fluff, though the colon-separated title prefix slightly echoes the tool name without adding new information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of an output schema, the description partially compensates by listing report contents (algorithms, timeline, portfolio exposure). However, it omits the return structure (JSON vs markdown), whether the analysis is synchronous or async, and how the 'optional' portfolio analysis is triggered (no corresponding boolean parameter exists in the schema).

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, providing clear definitions for chain, limit, and timeperiod. The description adds marginal context by linking 'chain' to 'chain PQC status' but does not explain parameter interactions (e.g., whether timeperiod affects the portfolio analysis or just the threat timeline). With the schema doing the heavy lifting, this meets the baseline.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it generates a 'quantum readiness report' and lists specific content areas (NIST algorithms like Dilithium/FALCON/SPHINCS+, threat timelines, chain status). The technical specificity distinguishes it from sibling tools like pqc_migration_planner (which implies action) and pqc_wallet_scan (which implies targeted scanning), though it could more explicitly state this is an analytical/reporting tool rather than an operational one.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

There is no guidance on when to select this tool versus the related pqc_migration_planner, pqc_wallet_scan, or quantum_portfolio_monitor. The phrase 'optional portfolio exposure analysis' hints at a use case but doesn't clarify selection criteria or prerequisites (e.g., whether a specific chain must be selected for portfolio analysis to function).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pqc_wallet_scan (Grade B)

Quantum Address Resistance Scanner: Deep analysis of any Bitcoin or Ethereum wallet address for quantum resistance. Detects address type (P2PKH, P2SH, P2WPKH, P2TR, EOA, Contract), assesses vulnerability to Shor's and Grover's algorithms, and provides expert PQC migration advice.
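The address-type detection the description lists can be approximated from well-known prefix conventions, as in this sketch. Distinguishing an Ethereum EOA from a Contract actually requires an on-chain code lookup, and the example strings below are illustrative, not valid checksummed addresses.

```python
def classify_address(addr: str) -> str:
    """Coarse address-type detection from standard prefix conventions."""
    if addr.startswith("0x") and len(addr) == 42:
        return "EOA/Contract"     # Ethereum; needs eth_getCode to tell apart
    if addr.startswith("bc1p"):
        return "P2TR"             # Taproot (SegWit v1, bech32m)
    if addr.startswith("bc1"):
        return "P2WPKH/P2WSH"     # SegWit v0 (bech32)
    if addr.startswith("1"):
        return "P2PKH"            # legacy pay-to-pubkey-hash
    if addr.startswith("3"):
        return "P2SH"             # pay-to-script-hash
    return "unknown"

print(classify_address("bc1p" + "q" * 58))  # P2TR
print(classify_address("0x" + "a" * 40))    # EOA/Contract
```

Address type matters for quantum exposure because P2PKH and SegWit outputs only reveal the public key at spend time, whereas reused or spent-from addresses have already exposed theirs.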

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries the full burden. It discloses the analysis scope (detects address types, assesses vulnerabilities) and mentions specific quantum algorithms, but fails to describe output format, whether results are cached, rate limits, or if the tool makes external blockchain queries.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficient sentences with zero filler. The first acts as a descriptive title, the second lists specific technical capabilities. All content earns its place, though the leading label 'Quantum Address Resistance Scanner:' partially restates the tool name.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a complex quantum cryptography tool with 3+ parameters and no output schema, the description adequately explains the analysis purpose but lacks detail on return values or execution requirements. Given the technical complexity (Shor's algorithm assessment), more behavioral context would be appropriate.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% for the three defined parameters (chain, limit, timeperiod), establishing a baseline of 3. The description implies an address input ('wallet address') which aligns with the required 'tokenAddress' field, but does not explain how 'limit' or 'timeperiod' apply to quantum resistance analysis (e.g., transaction history analysis vs. static address assessment).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'deep analysis' for 'quantum resistance' on wallet addresses, specifying exact address types (P2PKH, P2SH, P2WPKH, P2TR, EOA, Contract) and algorithms (Shor's, Grover's). However, it does not distinguish from siblings like 'pqc_migration_planner' or 'pqc_readiness_assessment' which also deal with quantum cryptography.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus alternatives like 'pqc_migration_planner' (which offers migration advice, similar to this tool's 'PQC migration advice') or 'pqc_readiness_assessment'. No prerequisites or exclusion criteria are mentioned.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pump_fun_launches (Grade A)

Pump.fun Launches: Fresh token launches on Pump.fun with bonding curve progress (Solana only).
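Bonding-curve progress, as surfaced here, is typically reported as the fraction of the curve's token allocation already sold. This sketch assumes that formulation with hypothetical reserve figures, not Pump.fun's exact internals.

```python
def bonding_curve_progress(remaining_tokens: float, curve_supply: float) -> float:
    """Fraction of the bonding curve consumed: tokens sold divided by the
    total allocated to the curve, clamped to [0, 1]."""
    sold = curve_supply - remaining_tokens
    return max(0.0, min(1.0, sold / curve_supply))

# Hypothetical launch: 800M tokens on the curve, 200M still unsold.
print(f"{bonding_curve_progress(200_000_000, 800_000_000):.0%}")  # 75%
```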

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It adds valuable context about 'bonding curve progress' (specific to Pump.fun mechanics) and the Solana chain constraint, but omits rate limits, data freshness definitions, or read-only/destructive nature.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence with minimal waste. The leading 'Pump.fun Launches:' is slightly redundant with the tool name but serves as effective framing. Key constraints (Solana only, bonding curves) are front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequate for a data retrieval tool with full schema coverage and no output schema. Captures the domain-specific context (Pump.fun, bonding curves) necessary for an agent to understand what data is returned, though it could specify data freshness or recency.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Despite 100% schema description coverage, the description adds critical semantic value by stating 'Solana only'—directly contradicting the schema's permissive enum (base, ethereum, etc.) to clarify the actual valid input for the chain parameter.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

States it retrieves fresh token launches from Pump.fun with bonding curve progress, and specifies 'Solana only' which distinguishes it from sibling tools like zora_launches or general EVM tools. Lacks an explicit verb but the resource and scope are clear.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The 'Solana only' constraint implies when to use it (Solana/Pump.fun specific queries), but provides no explicit alternatives or when-not-to-use guidance versus siblings like token_sniping or general market_movers.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

quantum_encryption_key (Grade D)

Quantum Encryption Key Generator: Generate AES-256 encryption keys from real ANU quantum entropy. Returns 256-bit key in hex and base64 formats.
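The promised output shape (a 256-bit key in both hex and base64) can be sketched as follows; `os.urandom` stands in for the ANU quantum entropy source, since it draws from the OS CSPRNG rather than quantum hardware.

```python
import base64
import os

def generate_key_256() -> dict:
    """Return a fresh 256-bit key encoded both ways, mirroring the
    description's promised formats. The entropy source is a stand-in."""
    key = os.urandom(32)  # 32 bytes = 256 bits
    return {"hex": key.hex(), "base64": base64.b64encode(key).decode("ascii")}

k = generate_key_256()
print(len(k["hex"]), len(k["base64"]))  # 64 hex chars, 44 base64 chars
```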

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries full disclosure burden. It mentions output formats (hex and base64) and entropy source (ANU), which is helpful if accurate. However, given the parameter mismatch, this behavioral information is likely incorrect or describes a different endpoint.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured with a descriptive prefix and single explanatory sentence. No verbose filler, though the first clause acts as a title rather than content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 1/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Critical failure: the description does not reconcile the stated purpose (key generation) with the actual parameters (blockchain querying). A complete description would explain the relationship between timeperiod/limit and key generation, or acknowledge this as a blockchain analytics tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Despite 100% schema description coverage (baseline 3), the description fails to mention the actual parameters (chain, limit, timeperiod) or explain why a key generator requires blockchain network and time range filters. The domain mismatch actively confuses parameter semantics.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 2/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it generates AES-256 encryption keys from quantum entropy, but this appears to describe a different tool entirely. The input parameters (chain, limit, timeperiod) clearly indicate a blockchain data querying function, not key generation, making the description actively misleading.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 1/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus quantum-related siblings like quantum_signing_key, quantum_hardware_qrng, or quantum_wallet_generation. No mention of prerequisites or when-not-to-use constraints.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

quantum_hardware_qrng (Grade D)

Hardware-Verified Quantum Random Number Generator: Generate cryptographically secure random numbers on real IBM quantum hardware (127-qubit superconducting processor). Each result includes IBM job ID as proof of quantum origin.

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries the full burden of behavioral disclosure. It mentions IBM job IDs as proof of origin, which adds value, but fails entirely to explain the blockchain-related parameters (chain, timeperiod) or reconcile them with the quantum RNG purpose.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured with two focused sentences that front-load the key value proposition (quantum hardware verification). However, the content is inappropriate for the actual interface.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 1/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the severe mismatch between the described quantum RNG functionality and the blockchain query parameters, the description is fundamentally incomplete. It provides no context for the chain, limit, or timeperiod parameters, rendering the tool interface unexplained.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 1/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

While the schema has 100% coverage (baseline 3), the description implies quantum-specific parameters (qubits, shots, backend) but the actual parameters are for blockchain queries (chain, timeperiod, limit). This contradiction creates confusion rather than adding meaning, warranting a score of 1.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 2/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it generates quantum random numbers on IBM hardware, which distinguishes it from sibling tools. However, this purpose directly contradicts the input schema, which accepts blockchain network parameters (ethereum, solana, etc.) and time periods, suggesting the tool is actually for blockchain analytics rather than quantum RNG.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 1/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided on when to use this tool versus alternatives. Given the parameter schema suggests blockchain querying while the description claims quantum random number generation, agents have no way to determine correct usage or prerequisites.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

quantum_portfolio_monitor (Grade C)

Quantum Portfolio Risk Monitor: Scan multiple wallet addresses across chains for quantum vulnerability. Aggregates risk scores, identifies value at risk, and provides portfolio-level quantum threat assessment with risk distribution.
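The portfolio-level aggregation the description promises (aggregate risk, value at risk, risk distribution) could look like this sketch; the field names, thresholds, and risk buckets are assumptions.

```python
def portfolio_quantum_risk(wallets: list[dict]) -> dict:
    """Roll per-wallet scan results up to portfolio level: total USD value
    held in high-risk wallets, plus a count of wallets per risk bucket."""
    dist = {"high": 0, "medium": 0, "low": 0}
    value_at_risk = 0.0
    for w in wallets:
        bucket = "high" if w["risk"] >= 0.7 else "medium" if w["risk"] >= 0.4 else "low"
        dist[bucket] += 1
        if bucket == "high":
            value_at_risk += w["balance_usd"]
    return {"wallets": len(wallets),
            "value_at_risk_usd": value_at_risk,
            "risk_distribution": dist}

report = portfolio_quantum_risk([
    {"risk": 0.9, "balance_usd": 12_000},
    {"risk": 0.5, "balance_usd": 3_000},
    {"risk": 0.1, "balance_usd": 40_000},
])
print(report["value_at_risk_usd"], report["risk_distribution"])
```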

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It successfully describes expected outputs (risk scores, value at risk, threat assessment, risk distribution) but fails to clarify operational aspects like whether this is read-only, rate limits, or how the 'multiple wallet addresses' are determined given the lack of address parameters.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured in two sentences with minimal waste, though the prefix 'Quantum Portfolio Risk Monitor:' restates the tool name and could be removed. The content is appropriately front-loaded with the core action before listing specific outputs.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema exists, the description adequately compensates by detailing return value types (risk scores, distributions). However, the unexplained discrepancy between the wallet-scanning description and the address-less input schema leaves critical context gaps regarding how the tool identifies which portfolios to monitor.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

While schema description coverage is 100% (baseline 3), the description introduces confusion by referencing 'multiple wallet addresses' and 'across chains' when the schema only accepts a single chain enum and provides no wallet address input field. This mismatch between described functionality and actual parameters reduces the score.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 3/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description provides specific verbs (Scan, Aggregates, identifies) and mentions the resource (wallet addresses, portfolio-level assessment). However, it fails to differentiate from sibling tools like 'pqc_wallet_scan' or 'pqc_readiness_assessment', and critically, it claims to scan wallet addresses while the input schema provides no parameter to specify them.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided on when to use this tool versus alternatives like 'pqc_wallet_scan' (individual vs. portfolio) or 'pqc_migration_planner'. No prerequisites are mentioned despite the complex quantum cryptography domain, and there is no warning that having zero required parameters may imply a global, chain-wide scan by default.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

quantum_signing_key (Grade C)

Quantum Signing Key Generator: Generate Ed25519 or secp256k1 signing keypairs from real ANU quantum entropy. Returns private key and public key pair.

Parameters (JSON Schema)
Name        Required  Description                                          Default
chain       No        Blockchain network to query
limit       No        Maximum number of results (default: 10, max: 100)
timeperiod  No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries full burden. It discloses the entropy source (ANU quantum) and return values (private/public key pair), but omits critical behavioral details: key format encoding, whether keys are ephemeral or stored, and security warnings for handling private key material.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences with efficient front-loading. Structure is appropriate, though the content creates potential confusion given the schema mismatch.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description minimally covers return values. However, it fails to explain the parameter mismatch (why a key generator takes 'timeperiod' and 'limit'), omits security warnings critical for key generation tools, and doesn't clarify key formatting or encoding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 1/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Severe mismatch between description and schema. While the description implies curve selection (Ed25519/secp256k1), the schema provides analytics parameters (chain, limit, timeperiod). The description adds no meaning to these specific parameters, and 'limit'/'timeperiod' are nonsensical for key generation as described.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 3/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description states a clear purpose (generate Ed25519/secp256k1 keypairs from quantum entropy) with specific cryptographic resources. However, it fails to align with the input schema which contains analytics parameters (limit, timeperiod), creating confusion about actual functionality.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus siblings like quantum_wallet_generation or quantum_encryption_key. No mention of prerequisites for key generation or security considerations for handling private keys.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

quantum_wallet_generation (Grade A)

Quantum Wallet Generation: Generate BIP-39 mnemonic seed phrases from real ANU quantum entropy (vacuum fluctuation measurements). Returns 12 or 24-word mnemonics with derived addresses for EVM, Solana, and Bitcoin.
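The "12 or 24-word" figures in the description follow directly from standard BIP-39 arithmetic (this is the public specification, not anything specific to this server): each word encodes 11 bits, and the raw entropy is extended by a checksum of entropy_bits/32 bits.

```python
# Standard BIP-39 word-count arithmetic behind the "12 or 24-word" mnemonics
# the description mentions. Each word encodes 11 bits; the entropy is extended
# by a checksum of entropy_bits / 32 bits before being split into words.
def mnemonic_words(entropy_bits: int) -> int:
    checksum_bits = entropy_bits // 32
    total_bits = entropy_bits + checksum_bits
    assert total_bits % 11 == 0, "entropy size not valid for BIP-39"
    return total_bits // 11

print(mnemonic_words(128))  # 128 + 4 checksum bits = 132 bits -> 12 words
print(mnemonic_words(256))  # 256 + 8 checksum bits = 264 bits -> 24 words
```

So the two documented mnemonic lengths correspond exactly to 128-bit and 256-bit entropy draws.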

Parameters (JSON Schema)
Name         Required  Description                                        Default
chain        No        Blockchain network to query
limit        No        Maximum number of results (default: 10, max: 100)
timeperiod   No        Time period for analysis
Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It successfully discloses the entropy source (ANU vacuum fluctuation measurements) and output format (mnemonics + derived addresses for EVM/Solana/Bitcoin). It does not address whether wallets are persisted server-side, idempotency, or rate limits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured as a single informative sentence. It loses one point for the redundant prefix 'Quantum Wallet Generation:' which restates the tool name, though the core content is dense and valuable.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Critical gaps exist: no output schema is present, no annotations are provided, and the description does not explain why a wallet generation tool accepts 'timeperiod' and 'limit' parameters (which suggest analytics/monitoring functionality). For a cryptographic tool handling seed phrases, the lack of security handling disclosure and parameter rationale is a significant omission.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description does not add parameter-specific context, but this is acceptable per rubric guidelines. Notably, the description fails to reconcile the stated wallet generation purpose with the query-like parameters ('limit', 'timeperiod') present in the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool generates BIP-39 mnemonic seed phrases using ANU quantum entropy, and specifies the output (12/24-word mnemonics with derived addresses). It effectively distinguishes from siblings like 'quantum_encryption_key' and 'quantum_signing_key' by specifying wallet-specific BIP-39 mnemonics rather than generic cryptographic keys.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains what the tool does but provides no guidance on when to use it versus alternatives (e.g., when to use this versus 'pqc_migration_planner' or regular wallet generation). It lacks prerequisites, security warnings, or exclusion criteria for when NOT to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

runes_analytics (Grade B)

Runes Protocol Analytics: Runes protocol overview, mint progress, holder distribution, and individual rune details. Includes mintable status and etching data.

Parameters (JSON Schema)
Name         Required  Description                                        Default
chain        No        Blockchain network to query
limit        No        Maximum number of results (default: 10, max: 100)
timeperiod   No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full disclosure burden. It compensates somewhat by detailing what data is returned (overview, holder distribution, mint status), but omits operational details like rate limits, data freshness, or caching behavior that would help an agent understand invocation constraints.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences with minimal bloat, though the opening 'Runes Protocol Analytics: Runes protocol...' is slightly redundant. The information is front-loaded with the protocol name and specific data categories, efficiently communicating value despite the repetition.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the lack of output schema, the description appropriately lists return value categories (mint progress, holder distribution, etc.). However, it misses the opportunity to clarify the Bitcoin-specific nature of Runes relative to the multi-chain input schema, leaving a significant gap in contextual understanding for an agent selecting parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the baseline is 3. The description adds no explicit parameter guidance (e.g., it doesn't clarify that 'chain' likely should be Bitcoin for Runes data, despite the broad enum), but the schema itself adequately documents the parameters without requiring additional description support.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool's focus on 'Runes Protocol Analytics' and lists specific data points returned (mint progress, holder distribution, etching data, mintable status). It distinguishes from siblings like 'brc20_analytics' and 'bitcoin_onchain_analytics' by explicitly naming the Runes protocol, though it could strengthen this by clarifying it's specifically for Bitcoin Runes.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance on when to use this versus 'brc20_analytics' or other Bitcoin analytics tools. Critically, it fails to address that Runes is a Bitcoin-specific protocol while the input schema allows non-Bitcoin chains (Ethereum, Solana, etc.), leaving users unclear about valid parameter combinations.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

security_scan (Grade B)

Token Security Scanner: Rug pull detection and security analysis with risk scoring.

Parameters (JSON Schema)
Name           Required  Description                                        Default
chain          No        Blockchain network to query
limit          No        Maximum number of results (default: 10, max: 100)
timeperiod     No        Time period for analysis
tokenAddress   Yes       Token contract address to analyze
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It successfully indicates the analysis type (rug pull detection, risk scoring) but omits operational details like whether this is read-only, rate limits, data sources, or what the risk score format/scale looks like.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single efficient sentence with no redundancy, though the prefix 'Token Security Scanner:' partially restates the tool name. It is front-loaded with the key capability (rug pull detection). However, its extreme brevity leaves gaps given the lack of annotations and output schema.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of annotations and output schema, the description inadequately explains what the tool returns (e.g., risk score format, specific indicators checked). While it mentions 'risk scoring', a security tool with 4 parameters and no structured output documentation requires more descriptive context to be fully usable.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage for all 4 parameters, establishing a baseline of 3. The description itself does not mention any parameters or provide additional context about valid address formats or default behaviors beyond what the schema already documents.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly specifies the resource (tokens) and actions (rug pull detection, security analysis, risk scoring), providing concrete behavioral details beyond the tool name. However, it does not distinguish this from siblings like 'smart_contract_audit' or 'wallet_trust_score'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives like 'smart_contract_audit', nor does it specify prerequisites (e.g., valid token address format) or exclusion criteria. It assumes the user knows when a security scan is appropriate.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

smart_contract_audit (Grade C)

Smart Contract Security Audit: Comprehensive smart contract vulnerability scanning across 30+ patterns with AI-powered reports and remediation suggestions.

Parameters (JSON Schema)
Name         Required  Description                                        Default
chain        No        Blockchain network to query
limit        No        Maximum number of results (default: 10, max: 100)
timeperiod   No        Time period for analysis
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden but only discloses that it generates 'AI-powered reports and remediation suggestions'. It fails to clarify whether the tool performs active mutation (deploying test transactions), whether it is asynchronous, or how to specify the target contract given the schema lacks a contract address field.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single efficient sentence that front-loads the function ('Smart Contract Security Audit:') before detailing capabilities. It avoids redundancy with the tool name while conveying key deliverables without excess verbiage.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the schema lacks a contract identifier parameter yet the description implies scanning functionality, the description is incomplete regarding what exactly gets audited (e.g., recent deployments, specific addresses). Without an output schema, it should better clarify the return structure beyond 'reports'.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, documenting chain, limit, and timeperiod. The description adds no specific parameter semantics beyond the schema, though it implies the scope of analysis ('30+ patterns') which does not map to specific inputs. Baseline score applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'Smart Contract Security Audit' with 'vulnerability scanning across 30+ patterns' and specifies outputs like 'AI-powered reports'. This distinguishes it from the generic `security_scan` sibling by focusing on smart contracts specifically. However, it does not explicitly differentiate usage contexts from similar security tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to select this tool versus alternatives like `security_scan` or other analysis tools. It lacks prerequisites (e.g., whether a contract address is required) and does not indicate if it is for pre-deployment or post-deployment analysis.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
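As a sketch of the kind of definition this review keeps asking for, a hypothetical rewrite of smart_contract_audit might pair the description with an explicit target parameter and a "use X instead of Y when Z" sentence. Every value below is invented for illustration; it is not the server's actual definition.

```python
# Hypothetical rewrite illustrating the review's asks: an explicit target
# parameter and explicit usage guidance. All values are invented for
# illustration and are NOT this server's actual tool definition.
improved_tool = {
    "name": "smart_contract_audit",
    "description": (
        "Scan a deployed smart contract for known vulnerability patterns and "
        "return an AI-generated report with remediation suggestions. Read-only; "
        "no transactions are sent. Use for post-deployment contract review; "
        "use security_scan instead for token-level rug-pull risk scoring."
    ),
    "inputSchema": {
        "type": "object",
        "properties": {
            "contractAddress": {
                "type": "string",
                "description": "Address of the contract to audit (0x-prefixed hex)",
            },
            "chain": {"type": "string", "description": "Blockchain network to query"},
        },
        "required": ["contractAddress"],
    },
}

# The rewrite closes the gaps flagged above: a required target parameter,
# a read-only disclosure, and a sibling-tool disambiguation sentence.
print(improved_tool["inputSchema"]["required"])  # -> ['contractAddress']
```

Even a two-sentence description in this shape would resolve the Behavior, Parameters, and Usage Guidelines critiques simultaneously.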

smart_money (Grade B)

Smart Money Analysis: Analyze top trader wallets with flow analysis and performance metrics.

Parameters (JSON Schema)
Name            Required  Description                                        Default
chain           No        Blockchain network to query
limit           No        Maximum number of results (default: 10, max: 100)
timeperiod      No        Time period for analysis
walletAddress   No        Wallet address to analyze
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It specifies the analysis types ('flow analysis', 'performance metrics') but omits critical operational details: whether the data is real-time or cached, what defines 'smart money' in this context (profitability thresholds, volume, etc.), and whether the operation is read-only (implied but not stated).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single efficient sentence fragment, but the prefix 'Smart Money Analysis:' restates the tool name and wastes valuable context space. The core content ('Analyze top trader wallets with flow analysis and performance metrics') is appropriately dense and front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given four optional parameters with complete schema coverage and no output schema, the description meets minimum viability but is minimal for the domain complexity. It does not explain return value structure, pagination behavior for the limit parameter, or how the analysis aggregates across the specified timeperiods, leaving gaps an agent must discover through trial.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description mentions analyzing 'top trader wallets' (plural) which conceptually maps to the walletAddress parameter, but it fails to clarify the interaction: whether providing a walletAddress analyzes that specific address versus returning a list of top traders when omitted. No additional syntax, format details, or parameter dependencies are provided beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'flow analysis' and calculates 'performance metrics' on 'top trader wallets,' providing specific verbs and resources. However, it fails to differentiate from the sibling tool 'hyperliquid_smart_money' or explain what distinguishes this general smart money analysis from other wallet analysis tools like 'whale_tracking' or 'copy_trading_intel'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus the numerous sibling alternatives (e.g., 'hyperliquid_smart_money', 'wallet_portfolio', 'copy_trading_leaderboard'). It does not clarify prerequisites, such as whether a specific walletAddress is required or if the tool returns a list of top traders when called without parameters.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

strategy_backtest (Grade B)

Trading Strategy Backtest: Historical backtesting simulations with Sharpe ratio, max drawdown, win rate, and P&L metrics.
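The four metrics the description names have standard definitions, which can be made concrete with a short sketch. These are the textbook formulas computed from a per-period return series, not this server's implementation; the smoothing and annualization conventions it actually uses are undocumented.

```python
import math

# Standard definitions of the metrics the description names (Sharpe ratio,
# max drawdown, win rate), computed from a per-period return series. These
# illustrate the textbook formulas only, not this server's implementation.
def sharpe(returns, risk_free=0.0):
    excess = [r - risk_free for r in returns]
    mean = sum(excess) / len(excess)
    var = sum((r - mean) ** 2 for r in excess) / len(excess)
    return mean / math.sqrt(var) if var else float("inf")

def max_drawdown(equity_curve):
    """Largest peak-to-trough decline, as a fraction of the peak."""
    peak, worst = equity_curve[0], 0.0
    for value in equity_curve:
        peak = max(peak, value)
        worst = max(worst, (peak - value) / peak)
    return worst

def win_rate(returns):
    return sum(1 for r in returns if r > 0) / len(returns)

rets = [0.02, -0.01, 0.03, -0.02, 0.01]
print(win_rate(rets))                        # 3 winners of 5 -> 0.6
print(max_drawdown([100, 120, 90, 110]))     # (120 - 90) / 120 -> 0.25
```

A well-documented backtest tool would state which of these conventions (e.g. arithmetic vs. log returns, annualized vs. per-period Sharpe) its output follows.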

Parameters (JSON Schema)
Name         Required  Description                                        Default
chain        No        Blockchain network to query
limit        No        Maximum number of results (default: 10, max: 100)
timeperiod   No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses output metrics (Sharpe ratio, etc.) which helps, but fails to clarify whether this tool performs live computations or retrieves cached results, execution time expectations, or whether all parameters are truly optional given the empty required array.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description efficiently packs specific financial metrics into a single sentence. The 'Trading Strategy Backtest:' prefix effectively substitutes for the null title. There are no redundant phrases, though the description could specify whether the tool runs a fresh simulation or retrieves precomputed results.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Critical gap: with 0 required parameters and no output schema, the description fails to explain how the trading strategy itself is specified (missing parameter? predefined strategies?). It also doesn't describe default behavior when called with no arguments, which is essential for a tool with all-optional parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description implies that 'limit' controls result sets containing the mentioned metrics but does not add syntax details, validation rules, or semantic relationships between parameters (e.g., whether timeperiod affects data granularity).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it performs 'Historical backtesting simulations' and specifies exact output metrics (Sharpe ratio, max drawdown, win rate, P&L), which distinguishes it from generic analysis tools. However, it doesn't differentiate from sibling tools like 'einstein_meta_strategy' or 'technical_analysis'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided on when to select this tool versus alternatives like 'technical_analysis', 'general_analysis', or 'einstein_meta_strategy'. The description only states what it does, not when to use it or prerequisites for effective use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

technical_analysis (Grade C)

Technical Analysis: RSI, MACD, Bollinger Bands, EMA crossover, VWAP, ATR, Volume Spike, and Stochastic RSI analysis for any token.

Parameters (JSON Schema)
Name         Required  Description                                        Default
chain        No        Blockchain network to query
limit        No        Maximum number of results (default: 10, max: 100)
timeperiod   No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Since no annotations are provided, the description carries the full burden. It successfully enumerates the specific indicators calculated (RSI, VWAP, ATR, etc.), but omits critical behavioral context such as data freshness, caching behavior, rate limits, or whether the analysis is real-time versus historical.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Efficient single-sentence structure front-loaded with the indicator list. The colon format effectively categorizes the content. However, the mention of 'token' without corresponding schema support creates unnecessary ambiguity, slightly undermining the clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Critical gap in explaining how to specify target tokens (given 'token' is referenced but absent from schema). With no output schema provided, the description should explain return format (e.g., whether it returns buy/sell signals, raw indicator values, or trend classifications), but it does not.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, establishing a baseline of 3, but the description mentions analysis 'for any token' while the input schema contains no 'token' parameter (only chain, limit, timeperiod). This creates ambiguity about how target tokens are specified. The description adds no explanatory context for the existing parameters beyond their schema definitions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly specifies the technical indicators provided (RSI, MACD, Bollinger Bands, etc.), distinguishing it from the more generic 'general_analysis' sibling. However, it doesn't clarify the scope limitations or coverage (e.g., whether it analyzes all tokens on a chain or requires specific token selection).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance on when to use this tool versus 'general_analysis' or other market analysis siblings like 'market_movers' or 'smart_money'. Doesn't mention that all parameters are optional or suggest default workflows.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

token_sniping (Grade B)

Token Sniping Analysis: Early buyers analysis with bot detection and entry timing.

Parameters (JSON Schema)
Name           Required  Description                                        Default
chain          No        Blockchain network to query
limit          No        Maximum number of results (default: 10, max: 100)
timeperiod     No        Time period for analysis
tokenAddress   Yes       Token contract address to analyze
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses the analytical behavior (bot detection, entry timing analysis) but lacks operational details such as data freshness, rate limits, authentication requirements, or whether the analysis covers historical vs real-time data. It does not describe the return format or structure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Extremely concise single-sentence description with no redundant information. The colon structure effectively categorizes, then specifies. While efficient, it borders on under-specification given the lack of annotations and output schema.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With 100% schema coverage for 4 parameters, the input side is well handled by the schema. However, no output schema exists and the description provides no hint about return values, analysis depth, or data granularity. Adequate but incomplete for a complex analytical tool with many siblings.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, with all four parameters (tokenAddress, chain, limit, timeperiod) fully documented in the schema. The description adds no additional parameter context, but given the complete schema coverage, baseline score 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool's function as early buyer analysis with specific focus on bot detection and entry timing. While it uses the tool name in the opening ('Token Sniping Analysis'), it expands with specific analytical capabilities (bot detection, entry timing) that distinguish it from sibling tools like smart_money or whale_tracking.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus alternatives. Given the extensive sibling list including copy_trading_intel, smart_money, and holder_concentration, the description fails to specify scenarios where bot detection and sniping analysis are preferred over general holder analysis or copy trading intelligence.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

top_gainers (Grade C)

Top Gainers Analysis: Top gainers/losers analysis with volume, price change %, and market insights.

Parameters (JSON Schema)
Name         Required  Description                                        Default
chain        No        Blockchain network to query
limit        No        Maximum number of results (default: 10, max: 100)
timeperiod   No        Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It partially discloses behavioral traits by mentioning the analysis includes 'volume, price change %, and market insights', hinting at return content. However, it lacks critical safety information (read-only vs destructive), caching behavior, or data freshness guarantees.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is brief (one sentence) but structurally inefficient due to redundancy: 'Top Gainers Analysis: Top gainers/losers analysis' repeats the concept. The colon usage creates a title-like prefix that wastes space. The content after the colon is substantive but could be more direct: 'Analyze top gainers and losers with...'

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
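
The suggested fix can be made concrete; a hypothetical rewrite (the improved string is illustrative, not the server's actual text):

```python
# Hypothetical rewrite of the top_gainers description, dropping the
# "Title: title restated" prefix and leading with a direct verb.
original = ("Top Gainers Analysis: Top gainers/losers analysis with "
            "volume, price change %, and market insights.")
improved = ("Analyze top gainers and losers with volume, "
            "price change %, and market insights.")

# The rewrite is shorter and no longer restates the tool name.
assert len(improved) < len(original)
```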

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With no output schema, the description should ideally describe return values; it vaguely mentions 'market insights' but doesn't clarify data structure or format. No annotations means safety profile is undocumented. Parameter coverage is complete via schema, making this adequate but not comprehensive for a data retrieval tool with 3 optional parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% description coverage with clear enum values and defaults, establishing a baseline of 3. The description mentions 'volume, price change %' which provides context for what the 'timeperiod' parameter affects, but does not add specific semantic guidance on parameter interactions or selection criteria beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 3/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description states the tool performs analysis of top gainers/losers with specific metrics (volume, price change %, market insights). However, it creates confusion by mentioning 'losers' when the tool name is 'top_gainers', and fails to distinguish from sibling 'market_movers'. The opening 'Top Gainers Analysis:' restates the tool's purpose tautologically before adding details.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus siblings like 'market_movers', 'general_analysis', or 'technical_analysis'. No prerequisites, filtering recommendations, or exclusions are mentioned despite the tool having 0 required parameters.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

wallet_portfolio (Grade: C)

Wallet Portfolio Analysis: Complete wallet portfolio breakdown with holdings and performance.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis
- walletAddress (optional): Wallet address to analyze
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure but offers minimal detail. While it implies a read-only analysis operation, it omits critical context such as data freshness, caching behavior, rate limits, or what happens when analyzing wallets with extensive holdings (despite the presence of a 'limit' parameter).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is brief but suffers from redundancy ('Wallet Portfolio Analysis' followed by 'Complete wallet portfolio breakdown'). The colon structure front-loads the category but wastes space restating the tool name rather than adding actionable guidance.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of annotations and output schema, the description is insufficient for a 4-parameter tool with optional filtering across 7 chains and 5 time periods. It fails to describe the return structure, pagination behavior (despite the 'limit' parameter), or the specific metrics included in 'performance' analysis.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
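
A description that covers the missing return structure might look like this sketch (the field names are illustrative assumptions, not the server's documented output):

```python
# Hypothetical expanded wallet_portfolio description documenting the
# return shape the review says is missing. Field names (token, balance,
# USD value, P&L) are assumptions for illustration only.
improved = (
    "Complete wallet portfolio breakdown with holdings and performance. "
    "Returns up to `limit` holdings for the given walletAddress, each with "
    "token, balance, USD value, and P&L over the selected timeperiod."
)
```

Naming the `limit`, `walletAddress`, and `timeperiod` parameters inside the description also ties the schema to the output for free.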

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, establishing a baseline of 3. The description mentions 'performance' which loosely maps to the 'timeperiod' parameter, but adds no specific syntax guidance, format examples, or semantic relationships between parameters (e.g., how 'chain' filters the analysis).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs a 'Complete wallet portfolio breakdown' covering 'holdings and performance,' providing specific verbs and scope. However, it does not explicitly differentiate from sibling tools like 'general_analysis' or 'wallet_trust_score' that might also analyze wallet data.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided on when to use this tool versus alternatives (e.g., 'general_analysis' for broader market context or 'wallet_trust_score' for risk assessment). There are no prerequisites, warnings, or exclusion criteria mentioned.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

wallet_trust_score (Grade: B)

Wallet Trust Score: Score wallets by historical accuracy, profitability, consistency, and behavior. Identifies whales, smart money, KOLs, and bots.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It successfully discloses the scoring criteria (accuracy, profitability, consistency, behavior) and classification targets, adding valuable context. However, it lacks operational details such as score ranges, data freshness, rate limits, or whether results are ranked.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficiently structured sentences with zero redundancy. The first sentence establishes the core scoring mechanism, the second the identification capabilities—every word earns its place. Front-loaded with the most critical information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequate for a tool with 3 simple parameters and no output schema, explaining what the tool returns conceptually (scores and classifications). However, given the absence of an output schema, it should specify the return structure or score format, and it lacks differentiation guidance for overlapping sibling tools.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description mentions 'Score wallets' but adds no semantic meaning about how parameters (chain, limit, timeperiod) affect the scoring logic or result set beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool scores wallets using specific metrics (historical accuracy, profitability, consistency, behavior) and identifies specific actor types (whales, smart money, KOLs, bots). However, it loses a point because it claims to identify 'whales' and 'smart money' without distinguishing from sibling tools 'whale_tracking' and 'smart_money', potentially causing selection confusion.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus siblings with overlapping functionality (whale_tracking, smart_money, wallet_portfolio). No prerequisites, exclusions, or alternative recommendations are mentioned.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
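
The missing guidance could be folded into the description itself; an illustrative sketch (the sibling tool names are real, the guidance wording is an assumption, not the server's actual text):

```python
# Hypothetical wallet_trust_score description adding the
# "use X instead of Y when Z" guidance the review asks for.
improved = (
    "Score wallets by historical accuracy, profitability, consistency, "
    "and behavior. Identifies whales, smart money, KOLs, and bots. "
    "Use this to rate a wallet's credibility; prefer whale_tracking for "
    "token-level accumulation flows and wallet_portfolio for holdings."
)
```

Two added clauses are enough to disambiguate the three overlapping tools at selection time.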

whale_tracking (Grade: C)

Whale Intelligence: Token-centric whale accumulation view with net capital flows and smart money tracking.

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis
Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full disclosure burden. It implies read-only behavior via 'view' and hints at return data structure ('accumulation', 'net capital flows'), but omits rate limits, caching behavior, data freshness, and specific return format.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence avoids bloat, but 'Whale Intelligence:' prefix wastes space with category labeling rather than actionable information. Dense jargon ('smart money', 'net capital flows') packs information but requires domain knowledge to parse.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Insufficient for a 3-parameter blockchain analytics tool lacking output schema and annotations. Description does not clarify that all parameters are optional, what 'whale' threshold defines results, or how 'token-centric' manifests without a token input parameter.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, establishing baseline 3. Description mentions 'Token-centric' which adds domain context (results organized by token) despite no token filter parameter, but this slightly confuses the tool's input requirements. No additional syntax or format guidance provided.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 3/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description conveys the domain (whale/accumulation tracking, capital flows) but uses marketing language ('Whale Intelligence', 'smart money') rather than clear operational verbs. It fails to differentiate from sibling 'smart_money' or 'copy_trading_intel' tools, leaving ambiguity about unique value.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to prefer this tool over alternatives like 'smart_money', 'holder_concentration', or 'copy_trading_intel'. Does not mention prerequisites (e.g., whether specific chains work better) or when-not-to-use conditions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

zora_launches (Grade: C)

Zora Launchpad: Zora Launchpad token launches and volume analysis (Base only).

Parameters (JSON Schema):
- chain (optional): Blockchain network to query
- limit (optional): Maximum number of results (default: 10, max: 100)
- timeperiod (optional): Time period for analysis
Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are present, so the description carries full disclosure burden. It provides one behavioral constraint ('Base only'), but this directly contradicts the input schema which accepts multiple chain values (ethereum, bsc, solana, etc.). No information on return format, pagination, rate limits, or mutation safety is provided.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is brief (one sentence) but inefficiently structured with redundant phrasing ('Zora Launchpad: Zora Launchpad...'). The parenthetical '(Base only)' is appropriately placed at the end, but the colon-separated prefix restates the tool name without adding value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With no output schema and no annotations, the description should explain what data is returned (e.g., launch metrics, volume calculations). It fails to resolve the ambiguity between the 'Base only' claim and the multi-chain schema, leaving agents uncertain about valid chain inputs. Essential behavioral context for a financial data tool is missing.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, establishing a baseline of 3. The description attempts to add semantic constraint with 'Base only' regarding the chain parameter, though this conflicts with the schema's enum values. No additional context is provided for timeperiod granularity or limit behavior beyond the schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
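
If "Base only" is accurate, the cleanest fix lives in the schema rather than the description; a hypothetical sketch (the enum and default values are assumptions about the intended behavior):

```python
# Hypothetical schema fix for the "(Base only)" conflict: if the tool
# truly supports only Base, the chain parameter's enum should say so
# instead of advertising other networks.
chain_param = {
    "type": "string",
    "description": "Blockchain network to query (only 'base' is supported)",
    "enum": ["base"],
    "default": "base",
}
```

Restricting the enum lets validation reject unsupported chains before the call, instead of relying on agents to notice a parenthetical in the prose.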

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the resource (Zora Launchpad tokens) and actions (launches and volume analysis). It specifies 'Base only' to scope the tool's domain. However, it redundantly repeats 'Zora Launchpad' and does not explicitly distinguish from the sibling 'pump_fun_launches' tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance provided on when to use this tool versus alternatives like 'pump_fun_launches' or 'general_analysis'. No prerequisites, exclusions, or workflow context is mentioned. The description only states what the tool does, not when to invoke it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
