predictionguard

by io.tooloracle

Server Details

Polymarket integrity MCP — 28 tools for wallet risk, whales, UMA scans, SAR/STR reports.

Status: Healthy
Last Tested: 2026-05-30 16:38
Transport: Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A3.9/5.0

Tool DescriptionsA

Average 4/5 across 33 of 33 tools scored. Lowest: 3.1/5.

Server CoherenceA

Disambiguation5/5

Each tool has a clear, distinct purpose covering different aspects of prediction market integrity (market analysis, wallet analysis, AML/KYC, alerting, reporting). There is minimal overlap risk, as even related tools (e.g., pg_insider_signal_scan vs. pg_information_advantage_score) are differentiated by input (market vs. wallet) and output type.

Naming Consistency4/5

All tools share the 'pg_' prefix and use descriptive snake_case names, making the set predictable. However, the verb/noun order is inconsistent (e.g., pg_whale_add vs. pg_market_details). The pattern is still clear and functional, so minor deviation from a strict verb_noun pattern.

Tool Count4/5

With 33 tools, the set is large but well-scoped for a comprehensive platform covering market analysis, wallet intelligence, compliance, and reporting. Each tool serves a distinct function, and the count is justified by the breadth of the domain, though it pushes the upper bound of 'reasonable'.

Completeness5/5

The toolset covers the full lifecycle of prediction market integrity work: from market discovery and integrity scanning to wallet analysis, entity resolution, AML/KYC, watchlist management, alerting, and SAR reporting. There are no obvious gaps for the stated purpose.

Available Tools

41 tools

pg_adverse_media_entityAInspect

Adverse media scan — negative news on fraud, money laundering, corruption, sanctions. Delegates to AMLOracle. Core building block for evidence chains that link a wallet or trader to publicly reported concerns (always cite the source, never attribute).

ParametersJSON Schema

Name	Required	Description
`lang`	No	Optional language (e.g. en, de)
`name`	Yes	Person or entity name
`limit`	No	Max results (default upstream)

Tool Definition Quality

A3.5/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses delegation to AMLOracle and a usage rule (cite source, never attribute). Without annotations, the description provides some behavioral context but lacks details on errors, limits, or data freshness.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences, front-loaded with the core function, no unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequate for a simple tool with no output schema, but missing details on output format and how results integrate into evidence chains.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so the description adds minimal value beyond the parameter descriptions. The context of adverse media scan gives background but does not explain parameter usage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'adverse media scan' and lists types of negative news. It is distinct from siblings like pep_check_entity or sanctions_screen_entity, but does not explicitly differentiate.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Implied usage as a building block for evidence chains, with a note to cite sources. No explicit when-to-use or alternatives compared to siblings.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_classified_keyword_matchAInspect

Daily compliance triage: scans top Polymarket markets by 24h volume and flags those whose question text matches curated sensitive-keyword categories — military action, regime change, leadership death, diplomacy, intelligence ops, monetary policy, regulatory action. Returns flagged markets with monitoring severity (HIGH/MEDIUM/LOW), matched categories and keywords, and volume. Pair flagged HIGH markets with pg_insider_signal_scan for active surveillance.

ParametersJSON Schema

Name	Required	Description	Default
`limit`	No	Max markets to scan (default 25, max 100)
`category`	No	Optional: filter to one category (military_action, regime_change, leadership_death, diplomacy, intelligence, monetary_policy, regulatory_action)

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, but the description explains the scanning logic (top markets by volume, keyword matching) and output details (severity, categories, volume). It does not specify caching or real-time behavior, but overall is transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, no wasted words, well-structured with bullet-like listing of categories and return fields.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, but description explains return values (severity, categories, keywords, volume). Does not detail error handling or pagination, but sufficient for a simple scan tool with two optional params.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers both parameters with descriptions; description adds context that limit defaults to 25 and max 100, and categories are applied during scanning, enhancing understanding beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool scans top Polymarket markets by 24h volume and flags those matching sensitive-keyword categories, listing categories and return fields. It distinguishes from siblings by mentioning pairing with pg_insider_signal_scan.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Indicates 'daily compliance triage' usage and recommends pairing with another tool for active surveillance, providing clear context for when to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_evidence_bundle_marketAInspect

Complete forensic dossier for a single Polymarket market: market details + integrity scan + volume anomaly profile + Trust Layer provenance trace — aggregated into one dossier ready for manual review, press inquiry, or as input to a SAR report.

ParametersJSON Schema

Name	Required	Description	Default
`id`	No	Market id / conditionId
`slug`	No	Market slug

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It clearly describes the tool as a read-only aggregation (dossier), implying no side effects. However, it does not explicitly state that it is non-destructive, mention authentication needs, or specify rate limits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, well-structured sentence that front-loads the core concept ('Complete forensic dossier') and enumerates key components concisely. Every part adds value without unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (bundling multiple data types) and the absence of an output schema, the description adequately outlines the dossier's components and intended uses. However, it could be improved by hinting at the output format (e.g., structured report or text) to set expectations.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Both parameters ('id' and 'slug') have descriptions in the schema (100% coverage), so the baseline is 3. The description does not add additional meaning beyond what the schema provides, such as which parameter to prefer or their relationship.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: producing a 'complete forensic dossier' for a single Polymarket market. It lists specific components (market details, integrity scan, volume anomaly profile, Trust Layer provenance trace), distinguishing it from sibling tools like pg_market_details or pg_market_integrity_scan that provide only individual pieces.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description mentions use cases ('ready for manual review, press inquiry, or as input to a SAR report'), providing context for when to use the tool. However, it does not explicitly exclude situations where individual tools might be preferred or mention alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_information_advantage_scoreAInspect

Composite A-F rating for whether a wallet appears to have had non-public information on a specific market. Aggregates pg_position_timing_anomaly (50% weight) + pg_insider_signal_scan (30%) plus bonuses for being in top positioners (+10), new to market (+10), and sensitive category (+10). Returns letter grade A-F, composite score 0-100, all component scores, and interpretation guide. Pattern-matches reported cases (Van Dyke, Hyperliquid whale). Statistical signal only — not a legal determination.

ParametersJSON Schema

Name	Required	Description
`id`	No	Market id / conditionId
`slug`	No	Market slug
`wallet`	Yes	Wallet to score (0x + 40 hex)

Tool Definition Quality

A3.9/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided; description compensates by detailing computation (weights, bonuses), output format (letter grade, score 0-100, component scores, interpretation guide), and references to known cases. Good disclosure of statistical nature, though does not discuss permissions or rate limits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

One informative paragraph, front-loaded with purpose, includes essential details without extraneous text. Well-structured for quick parsing.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description explains all return elements (grade, composite score, components, guide). Sibling list is dense, but the description doesn't compare explicitly; however, it covers the tool's function sufficiently for selection.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for id (market id/conditionId), slug, wallet (0x+40 hex). The description does not add new parameter details beyond schema, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clear statement: 'Composite A-F rating for whether a wallet appears to have had non-public information on a specific market.' Identifies constituents, weights, bonuses, and outputs. Distinguishes from siblings by naming aggregated tools (pg_position_timing_anomaly, pg_insider_signal_scan).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides caveat ('Statistical signal only — not a legal determination') but does not offer explicit when-to-use/avoid or compare to siblings. Implies use for insider signal detection but lacks exclusion criteria.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_insider_alert_signedAInspect

Issue a cryptographically signed, on-chain anchored integrity alert for a Polymarket market. Combines evidence_anchor + get_signed_fact from FeedOracle Trust Layer. Alerts are anchored on Polygon with an ECDSA signature and are externally verifiable via pg_verify_alert and the JWKS at feedoracle.io/.well-known/jwks.json.

ParametersJSON Schema

Name	Required	Description	Default
`claim`	Yes	Short human-readable claim being asserted
`signals`	No	List of integrity signals triggered
`wallets`	No	Optional list of related wallet addresses
`severity`	No	Alert severity	MEDIUM
`market_slug`	Yes	Polymarket market slug

Tool Definition Quality

A3.7/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries the full burden. It discloses combination of evidence_anchor + get_signed_fact, ECDSA signature, and on-chain anchoring on Polygon. However, it does not mention side effects, permissions, rate limits, or error behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the main action, and contains no unnecessary words. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and no annotations, the description explains the tool well but fails to mention the return value or output format. It partially compensates by referencing verification tool but lacks completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description does not add extra meaning to parameters beyond their schema definitions; it only references market_slug and claim implicitly.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's function: issuing a cryptographically signed, on-chain anchored integrity alert for a Polymarket market. It specifies the technical mechanism (ECDSA, Polygon, JWKS) and distinguishes from sibling pg_verify_alert by mentioning external verification.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for integrity alerting but does not explicitly state when to use this tool versus alternatives like pg_market_integrity_scan. No when-not-to-use or prerequisites are mentioned.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_insider_signal_scanAInspect

Real-time insider-trading signal scan for a single Polymarket market. Aggregates 5 signals: (1) very large pre-event positions, (2) high ratio of wallets new to this market, (3) coordinated minute-bucket timing clusters, (4) one-sided positioning, (5) sensitive-keyword category match (military/regime/intel). Returns a 0-100 score (CLEAN/LOW/MEDIUM/HIGH/CRITICAL), triggered signals with evidence, top 5 positioners with new-wallet flag, and full methodology for audit. Default window = 72h before market end.

ParametersJSON Schema

Name	Required	Description
`id`	No	Market id / conditionId
`slug`	No	Polymarket market slug
`window_hours`	No	Pre-event window to analyze (default 72h)
`min_position_usd`	No	Min notional to flag (default $1000)

Tool Definition Quality

A3.9/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description fully explains the tool's behavior: real-time scan, 5 signals, output score, triggered signals, top positioners, methodology. It defaults window to 72h. It doesn't cover edge cases or authorization but is fairly transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single dense paragraph that front-loads purpose then covers signals and output. It is mostly concise, though a slight structural improvement (e.g., bullet points) could enhance readability.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with no output schema and 4 optional parameters, the description adequately explains the output (score, signals, positioners, methodology) and the signals in detail. It is reasonably complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents all parameters. The description mentions defaults (window, min position) but adds no new meaning beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the verb 'scan' and the resource 'insider-trading signals for a single Polymarket market'. It enumerates five specific signals, distinguishing it from sibling tools like pg_market_integrity_scan or single-signal tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description states it scans a single market and lists the aggregated signals, implying a comprehensive use case. However, it does not explicitly guide when to use it vs. alternatives or mention exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_conflict_of_interest_checkAInspect

CFTC Rule 5.17(z) conflict-of-interest pre-trade check. Analyzes a Kalshi market's metadata to detect whether trading it would expose decision-makers (election candidates, government officials, athletes, regulators) to insider-trading liability. Pattern-matches against the Kalshi April 2026 disciplinary cases (Moran/Klein/Enriquez) and the CFTC Van Dyke complaint. Returns ALLOW/MONITOR/WARN/BLOCK recommendation + 0-100 score + reference cases. Cannot identify actual trader identity (Kalshi public API anonymizes users) — pair with internal surveillance for full enforcement.

ParametersJSON Schema

Name	Required	Description
`ticker`	No	Kalshi market ticker
`event_ticker`	No	Or event_ticker for the whole event
`candidate_name`	No	Optional: specific person's name to match against title (e.g. 'Mark Moran')

Tool Definition Quality

A4.7/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Description discloses behavioral traits: pattern-matches against specific disciplinary cases, returns ALLOW/MONITOR/WARN/BLOCK + score + reference cases, and explicitly states it cannot identify trader identity due to API limitations. No annotations exist, so description carries the full burden and does so comprehensively.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is four sentences with no wasted words, front-loading the regulatory context and purpose. It efficiently conveys functionality, output, and limitations.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no annotations and no output schema, the description fully covers what the tool does, what it returns, and its limitations. It is self-contained for an agent to understand when and how to use it.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema descriptions already cover all 3 parameters (100% coverage), so baseline is 3. The description adds value by providing context for 'candidate_name' (matching against specific case references like Moran/Klein), which clarifies its purpose beyond the schema description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool performs a CFTC conflict-of-interest pre-trade check on Kalshi markets, returning a recommendation and score. It uses specific verbs and resource, and its purpose is distinct from sibling tools (e.g., market details, volume analysis).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies use for pre-trade compliance by referencing insider-trading liability and specific cases. It provides a clear limitation ('cannot identify actual trader identity') and suggests pairing with internal surveillance, offering context on when not to rely solely on this tool. No explicit differentiation from siblings, but the use case is well-defined.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_market_detailsAInspect

Detailed Kalshi market or event info. Pass 'ticker' for a single market (returns yes/no bids+asks, last price, volume, OI, spread, hours until close) or 'event_ticker' for all markets in an event (multi-outcome). Includes the rules_primary text (Kalshi's settlement criteria) which is critical for understanding resolution risk.

ParametersJSON Schema

Name	Required	Description	Default
`ticker`	No	Kalshi market ticker (e.g. KXG7LEADEROUT-45JAN01-DJT)
`event_ticker`	No	Kalshi event ticker (returns all child markets)

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description discloses what the tool returns in each mode, including critical details like rules_primary text. It implies a read-only operation. It does not mention mutual exclusivity of parameters or error behavior, but the core behavior is transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, with the first stating the purpose and the second providing parameter guidance and key output features. No unnecessary words, and it is front-loaded with the most important information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers the main use cases and outputs, and highlights the importance of rules_primary. However, it does not mention mutual exclusivity of parameters or differentiate from sibling tools like pg_kalshi_orderbook_analysis. Overall sufficient for an agent to use correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage for both parameters. The description adds value by explaining the effect of each parameter and the corresponding output, which goes beyond the schema's basic field descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it provides detailed info for Kalshi markets or events, distinguishes between two modes (single market via 'ticker' or event via 'event_ticker'), and lists the specific returned data (bids/asks, last price, etc.). It differentiates from sibling tools by mentioning Kalshi-specific and rules_primary text.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly tells the agent when to use each parameter: 'ticker' for a single market, 'event_ticker' for all markets in an event. It does not provide explicit exclusions or alternatives, but the context is clear enough for an agent to decide.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_orderbook_analysisAInspect

Forensic orderbook analysis for one Kalshi market. Detects manipulation-vulnerable patterns: (1) wide spread (>$0.10), (2) shallow depth (<10 contracts), (3) few price levels, (4) single-order dominance (>80% in top level), (5) penny-wall pattern (large bids at ≤$0.005, commonly used to fake depth). Returns 0-100 score, severity, and full level-by-level data. Kalshi returns bids only — implied asks computed via yes_bid + no_bid = $1 reciprocity.

ParametersJSON Schema

Name	Required	Description	Default
`ticker`	Yes	Kalshi market ticker

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully discloses behavioral traits: it is a read-only analysis tool, explicitly states that Kalshi returns bids only and that asks are computed via reciprocity, and describes the output (score, severity, level data). No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences long, front-loading the purpose and then listing patterns and return information. It is clear and efficient, though the numbered list could be condensed slightly.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given one simple required parameter and no output schema, the description provides complete context: what the tool does, specific patterns detected, the computed score and severity, level-by-level data, and the crucial behavioral note about bids-only and reciprocity. No essential information is missing.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The single parameter 'ticker' is already well-documented in the schema (100% coverage). The description adds context by explaining it is for one Kalshi market and how the ticker is used in the analysis, which goes beyond the schema's minimal description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'forensic orderbook analysis' and detects five specific manipulation-vulnerable patterns, using a unique verb-resource combination. It distinguishes itself from siblings like pg_market_details and pg_market_integrity_scan by focusing on orderbook manipulation detection.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for in-depth orderbook analysis of a single Kalshi market and lists explicit patterns, giving clear context. However, it does not explicitly state when not to use or directly name alternatives, which would be needed for a perfect score.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_resolution_calendarAInspect

Compliance briefing: which Kalshi markets resolve in the next N hours? Bucketed by 6h, 24h, 48h, 1 week. Flags markets in CFTC-sensitive categories (Elections, Politics). Use for daily surveillance prep — markets near resolution have highest insider-trading risk window.

ParametersJSON Schema

Name	Required	Description
`category`	No	Optional category filter
`hours_ahead`	No	Look-ahead window (default 48h, max 168h)
`min_open_interest`	No	Only markets with at least this OI

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries full behavioral burden. It mentions bucketing and CFTC-sensitive flags, adding context beyond the schema, but does not disclose performance, error handling, or exact output structure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no fluff. The first sentence front-loads the core purpose, and the second provides usage context. Every word earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Lacks output schema or description of return fields (e.g., market names, resolution times). For a compliance tool, this is a gap. No mention of prerequisites or error scenarios.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, giving a baseline of 3. The description mentions bucketing which relates to hours_ahead but does not add new semantic detail beyond what the schema provides for any parameter.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'which Kalshi markets resolve in the next N hours' with bucketing and risk flags. It distinguishes this from siblings like market details or searches by framing it as a compliance briefing for resolution timing.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Use for daily surveillance prep' and explains the risk context. However, it does not provide explicit when-not-to-use guidance or name alternative tools for different needs.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_searchAInspect

Search Kalshi prediction markets (CFTC-regulated event contracts). Returns events ranked by aggregate 24h volume across all markets in the event. Kalshi categories include Elections, Politics, Economics, Climate, Sports, Entertainment, Financials. Public data, no auth needed. Each event has one or more child markets (mutually exclusive or yes/no).

ParametersJSON Schema

Name	Required	Description	Default
`limit`	No	Max events to return (default 25, max 100)
`query`	No	Keyword search in title, sub_title, event_ticker
`status`	No		open
`category`	No	Optional category filter (Elections, Politics, Economics, ...)

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries the full burden. It discloses that the tool reads public data (no auth), returns events sorted by volume, and explains the structure of events having child markets. This adequately communicates that the operation is safe and read-only, but it lacks details on rate limits, pagination behavior, or what happens on empty results—leaving moderate gaps for complete transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is five sentences long, front-loaded with the core purpose, and every sentence adds distinct information (search function, ranking, categories, public nature, event structure). There is no redundancy or fluff, making it highly efficient for an agent to parse quickly.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has four parameters and no output schema, the description provides the essential context: what the tool returns (events ranked by volume), how to filter (query, category, status), and important background (public, CFTC-regulated, child markets). It does not mention sorting direction (assumed descending), maximum events per page beyond the default, or error scenarios, but it is sufficient for basic usage.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema describes 3 of 4 parameters (query, limit, category) with attributes; the status parameter lacks a description. The tool description adds value by listing example categories and explaining the ranking by volume, but it does not elaborate on parameter usage beyond what the schema already provides. For the undocumented status parameter, no additional guidance is given, so the description only marginally supplements the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool searches Kalshi prediction markets and returns events ranked by aggregate 24h volume. It specifies the resource (events) and the sorting criterion, distinguishing it from sibling tools like pg_kalshi_market_details (single market details) and pg_kalshi_trending (trending markets). The verb 'search' combined with 'returns events ranked by volume' makes the purpose unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides context by stating it searches by query, category, and status, and clarifies that data is public with no authentication needed. However, it does not explicitly tell an agent when to use this tool versus alternatives, such as when to use pg_kalshi_market_details for a specific market. The mention of child markets offers some implicit guidance, but explicit when-not-to-use instructions are missing.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_thin_market_alertAInspect

Scan all open Kalshi markets and flag those that are manipulation-vulnerable: closing soon (<24h), low open interest, low liquidity, wide spread, low volume. Daily triage tool. Returns markets ranked by risk_score (≥30 threshold). Best paired with pg_kalshi_resolution_calendar for full compliance briefing.

ParametersJSON Schema

Name	Required	Description	Default
`category`	No	Optional category filter
`max_results`	No	Max flagged markets (default 25, max 50)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description fully explains the tool's behavior: it flags markets based on multiple criteria and returns them ranked by risk_score with a threshold of 30. No annotations to contradict.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences, each serving a purpose: explaining the scan, noting its daily use, and suggesting a pairing. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description mentions returned markets ranked by risk_score. It covers the core behavior but omits details like exact output fields beyond risk_score. Still, it's sufficient for a scanning tool with clear intent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with clear descriptions for both parameters (category, max_results). The description adds no significant extra meaning beyond what the schema provides, so baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool scans all open Kalshi markets to flag manipulation-vulnerable ones with specific criteria (closing soon, low open interest, etc.). It distinguishes itself from siblings by suggesting a pairing with pg_kalshi_resolution_calendar.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description identifies this as a daily triage tool and suggests pairing with another tool for a full compliance briefing. It provides context for when to use it, though it doesn't explicitly state when not to use or list alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kalshi_trendingAInspect

Top trending Kalshi events ranked by aggregate 24h volume across child markets. Filterable by category. Daily-briefing-ready output. Includes the highest-volume sub-market per event.

ParametersJSON Schema

Name	Required	Description	Default
`limit`	No	Max events to return (default 25, max 50)
`category`	No	Optional category filter (Elections, Politics, ...)

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries the burden. It discloses ranking by aggregate 24h volume and inclusion of top sub-market but does not mention sorting direction, pagination, or rate limits. Adequate but could be richer.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences, front-loaded with key information, no wasted words. Efficiently conveys purpose and key features.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given simple parameters and no output schema, the description adequately explains output content (trending events, volume, top sub-market). Slightly lacking in precise return format but sufficient for selection.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with parameter descriptions already present. The tool description adds minimal extra meaning ('Filterable by category' is consistent with schema). Baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool retrieves top trending Kalshi events ranked by 24h volume, filterable by category, and includes the highest-volume sub-market. It uses specific verbs and distinguishes from sibling tools like 'pg_kalshi_search'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for daily briefings ('Daily-briefing-ready output') but does not explicitly state when to use versus alternatives or provide exclusions. Guidance is present but not directive.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_known_casesAInspect

Registry of publicly reported prediction-market integrity cases (Théo / French whale, Hyperliquid whale, Venezuela, Iran strikes, Nuclear, UMA Ukraine). Filter by category or minimum AML relevance (low/medium/high). Summaries only — no accusations against named parties.

ParametersJSON Schema

Name	Required	Description	Default
`category`	No	Optional category substring (e.g. 'geopolitical', 'oracle')
`min_aml_relevance`	No	Minimum AML relevance

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses that the tool returns summaries only and does not make accusations, implying it is a read-only information source. However, it does not mention authentication needs, rate limits, or response structure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise: two sentences that front-load the purpose and then list filters and a behavioral note. Every word earns its place with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple registry lookup with two optional parameters and no output schema, the description adequately covers the tool's purpose, filtering, and a key behavioral constraint. It lacks details on return format or error handling, but given the low complexity, it is sufficiently complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Both parameters are fully described in the schema (100% coverage). The description repeats the filtering capability but adds examples of categories (e.g., 'geopolitical', 'oracle') and clarifies the enum values for min_aml_relevance. This adds marginal value beyond the schema, justifying a baseline 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool as a registry of publicly reported prediction-market integrity cases, with concrete examples (e.g., Théo, Hyperliquid whale, Venezuela). It distinguishes itself from siblings by focusing on known cases, not other tools like pg_adverse_media_entity or pg_market_integrity_scan.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides context that this tool is for known cases and includes the note 'summaries only — no accusations against named parties,' which guides appropriate use. However, it does not explicitly state when to use this tool vs. alternatives like pg_adverse_media_entity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_kyc_bundle_entityBInspect

Full KYC bundle for a named entity: sanctions + PEP + adverse media + country risk in a single call. Delegates to AMLOracle.kyc_bundle.

ParametersJSON Schema

Name	Required	Description
`name`	Yes
`country`	No	ISO country code
`entity_type`	No	'person' or 'organization'

Tool Definition Quality

B3.2/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description must cover behavioral details. It states it is a 'single call' that 'delegates to AMLOracle.kyc_bundle', but lacks information on auth requirements, rate limits, or what happens on missing data. This is insufficient for a KYC operation.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is succinct with two sentences, front-loading the core purpose and nesting details. Every sentence adds value, though it could be slightly more structured with a usage hint.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has no output schema and three parameters, the description should provide more context about the return structure or any limitations. It only mentions the components and delegation, leaving the output format entirely unspecified.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema covers 2 of 3 parameters with descriptions (country and entity_type), but the required 'name' parameter lacks any description. The description adds no additional meaning beyond 'for a named entity', failing to specify input format or constraints for name.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly specifies the tool's purpose: a 'full KYC bundle' including sanctions, PEP, adverse media, and country risk for a named entity. It distinguishes from sibling tools like pg_sanctions_screen_entity by emphasizing it is a combined single call.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies use when a comprehensive KYC check is needed, but it does not explicitly state when to avoid this tool (e.g., if only one component is needed). No alternatives are directly compared, though siblings are individual checks.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_market_detailsAInspect

Full market info: outcomes, outcome prices, volume, liquidity, best bid/ask, spread, resolution source, conditionId. Provide 'slug' or 'id'.

ParametersJSON Schema

Name	Required	Description	Default
`id`	No	Market id / conditionId
`slug`	No	Market slug

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations, but description lists returned fields (e.g., outcomes, prices, volume). It implies read-only behavior, but doesn't explicitly state side effects, permissions, or rate limits. Adequate for a simple data retrieval.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence, front-loaded with return info and required parameters. No wasted words; highly efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and no annotations, description covers key returned fields. Lacks mention of error handling, but for a simple info endpoint, it's nearly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% and descriptions are clear. Description adds reinforcement ('Provide slug or id') but no additional format, constraints, or precedence detail. Baseline 3 appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description explicitly states 'Full market info' and lists specific fields (outcomes, volume, liquidity, etc.), clearly distinguishing it from siblings like pg_market_search (search) and pg_market_trades (trades). The verb is implicit but context makes purpose obvious.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Tells user to provide 'slug' or 'id', which is a basic usage hint. However, no guidance on when to use this vs alternative tools (e.g., searching first, or using other detail tools). Minimal context for tool selection.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_market_integrity_scanBInspect

Multi-signal integrity scan: combines thin-book flag, spread, low liquidity, geopolitical-term pattern, end-date proximity, and 24h volume concentration. Returns score 0-100, level HIGH/MEDIUM/LOW/CLEAN, triggered signals, and an operational recommendation.

ParametersJSON Schema

Name	Required	Description	Default
`id`	No
`slug`	No

Tool Definition Quality

B3.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses inputs (via schema) and outputs (score, level, signals, recommendation), but does not mention whether the tool has side effects, authorization requirements, rate limits, or idempotency. As a scan, it is likely read-only, but this is not explicit.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is a single sentence that efficiently conveys the tool's purpose and output structure. It is front-loaded with the main action and clearly lists the combined signals and return fields without unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no annotations, no output schema, and undocumented parameters, the description is minimally complete. It covers the tool's high-level function and output but lacks parameter details, usage context, and behavioral traits needed for an agent to reliably use it.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters1/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has two parameters (id, slug) with zero description coverage. The description does not mention these parameters or explain their meaning, leaving the agent to guess. Schema provides type-only info, which is insufficient for correct invocation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool performs a multi-signal integrity scan combining six signals, returns a score (0-100), level (HIGH/MEDIUM/LOW/CLEAN), triggered signals, and operational recommendation. It uses specific verbs and identifies the resource, differentiating it from siblings like pg_market_details or pg_resolution_risk_score.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance on when to use this tool versus alternatives. The description does not mention prerequisites, exclusions, or scenarios where other tools would be more appropriate. Usage must be inferred from the description alone.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_market_searchAInspect

Search Polymarket markets by keyword (e.g. 'election', 'fed', 'bitcoin'). Returns question, slug, 24h volume, liquidity, end date.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 10, max 50)
`query`	Yes	Keyword to search
`active_only`	No	Only active / non-closed markets

Tool Definition Quality

A4.3/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It correctly implies a read operation and lists return fields, but does not explicitly declare read-only behavior, rate limits, or prerequisites. It adds moderate behavioral context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence that efficiently conveys purpose, examples, and return fields. Every word earns its place with zero redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (3 parameters, no output schema), the description is complete. It explains what the tool does, what parameters are used for, and what the output contains. No gaps remain.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but the description adds value beyond the schema by providing search examples, specifying default/max for limit, default for active_only, and listing return fields. This compensates well for schema's minimal descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses a specific verb ('Search Polymarket markets') and resource, with concrete keyword examples and return fields. It clearly distinguishes from siblings like pg_market_details (specific market) and pg_trending_markets (trending), leaving no ambiguity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description states the tool is for searching by keyword, which guides usage. However, it does not explicitly mention when not to use it or alternative tools. The context of sibling tools implies differentiation, but a brief exclusion note would elevate clarity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_market_tradesAInspect

Raw trade access for a Polymarket market via the Polymarket Data API. Supports client-side filters: time range (from/to, ISO 8601), side (BUY/SELL), minimum trade size, or a specific wallet. Paginates server-side (max 25 pages × 1000 trades). Returns compact records with wallet, pseudonym, side, size, price, notional USD, outcome, timestamp, tx_hash.

ParametersJSON Schema

Name	Required	Description
`id`	No	Market id / conditionId
`to`	No	Only include trades up to this ISO timestamp
`from`	No	Only include trades from this ISO timestamp
`side`	No	Only trades on this side
`slug`	No	Polymarket market slug
`limit`	No	Max trades to return after filtering (default 100, max 5000)
`wallet`	No	Filter to a single wallet address
`min_size`	No	Minimum trade size (shares)
`max_pages`	No	How many pages of 1000 to fetch (default 2, max 25)

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description covers key behaviors: pagination (max 25 pages of 1000 trades), client-side filters, and return fields. It lacks mention of rate limits or ordering, but is otherwise transparent for a read-only data API.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences front-load the purpose and efficiently cover filters, pagination, and output format. No superfluous words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description adequately lists return fields and pagination constraints. It could mention ordering of results or error handling, but overall it provides sufficient context for a data retrieval tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but the description adds value by explaining the purpose of filters (time range, side, min size, wallet) and how pagination is handled server-side. It also describes the response structure, which is absent from the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it provides raw trade access for a Polymarket market, listing specific filters and output fields. It uniquely identifies the tool among siblings which cover different market aspects.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description details filtering options and pagination limits, giving clear context for when to use it (to access raw trades). However, it does not explicitly contrast with sibling tools or state when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_market_volume_profileBInspect

Volume anomaly heuristic: compares 24h volume vs average daily volume since market start (z-like ratio) and flags thin-book situations where 24h volume >> on-book liquidity.

ParametersJSON Schema

Name	Required	Description	Default
`id`	No
`slug`	No

Tool Definition Quality

B3.2/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It describes the core behavior (comparison and flagging) but does not disclose side effects, authentication needs, rate limits, or idempotency. The calculation approach is mentioned, providing moderate transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence that front-loads the main purpose and key details. It is concise and avoids extraneous information, though a more structured format (e.g., separating purpose and input) would improve clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With no output schema and no annotations, the description fails to explain return values, how to interpret results, or how parameters affect the analysis. The high-level purpose is clear, but operational details are missing, leaving the agent underinformed.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters1/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, and the description contains no explanation of the 'id' and 'slug' parameters. The agent cannot infer what values to provide or how they affect the analysis, making the description inadequate for parameter usage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states that the tool performs a volume anomaly heuristic, comparing 24h volume to average daily volume and flagging thin-book situations. It uses specific terms like 'z-like ratio' and clearly defines the condition, making the purpose distinct from sibling tools such as pg_market_trades or pg_market_details.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for detecting volume anomalies or thin-book conditions, but provides no explicit guidance on when to use it versus alternatives, no exclusion criteria, and no mention of prerequisites. The usage context is inferred rather than stated.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_overviewAInspect

PredictionGuard platform overview: covered platforms, tool inventory, Polymarket Gamma API ping, Polygon RPC ping (Ankr-backed), known-case count.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description must carry the full burden. It mentions pings (to Polymarket Gamma API and Polygon RPC) but does not disclose potential side effects (e.g., network timeouts, rate limits) or whether the tool is read-only. For a read-only overview, this is acceptable but could be more explicit.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence that efficiently lists the key components of the overview. Every word earns its place; no redundancy or fluff.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (no parameters, no output schema), the description adequately conveys what it does. It covers the main aspects: platforms, tool inventory, pings, and known-case count. A note on the output format (e.g., 'returns a JSON object with status fields') would push it to 5.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The tool has zero parameters, and the schema coverage is 100% (vacuously). With no parameters to describe, the description adds no param value but also does not need to. Baseline score of 4 per guidelines for 0 params.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states it provides a platform overview including specific components (covered platforms, tool inventory, pings, known-case count). The verb 'overview' combined with the listed items distinguishes this from the many specific sibling tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance on when to use this tool versus alternative siblings. While the overview nature is implied, a brief note on using it for a high-level status check before diving into specific tools would improve clarity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_pep_check_entityAInspect

Politically Exposed Person (PEP) check via Wikidata. Delegates to AMLOracle. Relevant for prediction markets where reported traders overlap with political figures.

ParametersJSON Schema

Name	Required	Description	Default
`name`	Yes	Person name
`country`	No	Optional ISO country code

Tool Definition Quality

A3.6/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description should fully disclose behavioral traits. It notes delegation to AMLOracle and Wikidata, but does not specify read-only nature, side effects, error handling, or rate limits, which are important for an external API call.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description consists of two concise sentences that are front-loaded with the core purpose. Every word contributes meaning, with no extraneous content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple lookup tool with no output schema, the description covers the essential context: what it checks, data sources, and use case. Minor gaps exist, such as unspecified return format or error behavior, but these are not critical given the tool's simplicity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema already provides descriptions for both parameters (name and country). The description adds no additional meaning beyond what the schema offers, so the baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states the tool performs a PEP check via Wikidata and delegates to AMLOracle, specifying its relevance to prediction markets. This clearly differentiates it from sibling tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description mentions relevance for prediction markets but does not provide explicit guidance on when to use this tool over alternatives like pg_sanctions_screen_entity or pg_kyc_bundle_entity. Usage context is implied but not contrasted.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_position_timing_anomalyAInspect

Statistical timing analysis of a single wallet's position on a specific market. Checks: (1) how close to resolution the wallet entered (<6h=35pts, <24h=25pts, <72h=10pts), (2) position size vs market average (>20× avg=25pts, >5× avg=10pts), (3) single-bet pattern (≤3 trades=10pts, no historical trading rationale). Returns 0-100 score, first-trade timestamp, total notional, and methodology. Pair with pg_information_advantage_score for full assessment.

ParametersJSON Schema

Name	Required	Description
`id`	No	Market id / conditionId
`slug`	No	Market slug
`wallet`	Yes	Wallet to analyze (0x + 40 hex)

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Since no annotations are provided, the description fully carries the burden. It details the scoring methodology (three checks with point values), the output format (score, timestamp, notional, methodology), and implies a read-only operation without side effects. This level of specificity is commendable for a tool without annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, but the first sentence is dense with structured points (three checks with scoring). It front-loads the core purpose and provides detailed breakdown. While effective, a slight restructuring could improve readability without sacrificing content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has three parameters, no output schema, and no annotations, the description provides a comprehensive overview: it explains what the tool does, the scoring criteria, and the returned fields (score, timestamp, notional, methodology). This is sufficient for an LLM to decide when and how to invoke the tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% coverage with descriptions for all three parameters (id, slug, wallet). The tool description adds context about the purpose (analyzing a wallet's position on a market) but does not provide additional parameter-level details beyond what the schema already offers. Thus, it meets the baseline for high schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'statistical timing analysis of a single wallet's position on a specific market' and enumerates three specific checks with scoring details. It also explicitly distinguishes itself by suggesting pairing with a sibling tool (pg_information_advantage_score), making the purpose very precise and differentiated.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides usage guidance by stating 'Pair with pg_information_advantage_score for full assessment,' which helps the agent understand when to use this tool in combination with another. However, it does not explicitly state when not to use it or list alternatives, leaving some room for interpretation.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_prediction_market_integrity_gateAInspect

Pre-trade integrity gate for AI agents and compliance teams. ONE call returns ALLOW/WARN/BLOCK decision + risk_level (0-100) + signed evidence receipt. Aggregates orderbook analysis, resolution-window risk, CFTC Rule 5.17(z) conflict-of-interest check, and (Polymarket) insider signal scan. Supports both Polymarket (slug) and Kalshi (ticker). Optional inputs: wallet (Polymarket-side checks), candidate name (Kalshi Rule 5.17(z) name match), amount_usd (amplifies risk for large bets on thin markets). Returns hashable + ECDSA-signed receipt for audit. Designed for x402 paywall pre-trade calls — agent pays once, gets binary go/no-go.

ParametersJSON Schema

Name	Required	Description
`wallet`	No	Optional: Polygon wallet address (Polymarket only)
`platform`	Yes	Which prediction market
`candidate`	No	Optional: candidate name for Kalshi Rule 5.17(z) name match
`market_id`	Yes	Polymarket slug or Kalshi ticker/event_ticker
`amount_usd`	No	Optional: bet size for risk amplification

Tool Definition Quality

A4.2/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description must disclose behavior. It describes the output (decision, risk_level, signed receipt) and sub-checks performed. Lacks explicit statements about side effects, idempotency, or rate limits, but the read-only nature is implied.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Concise single paragraph with front-loaded purpose and bullet-like details. Could benefit from clearer separation of sections, but effectively conveys all needed information without extra fluff.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given complexity and lack of output schema, description covers purpose, inputs, aggregated checks, output, and use context. Missing details about error handling or receipt format, but mostly complete for agent usage.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but description adds significant context: explains optional inputs (wallet, candidate, amount_usd) and how they influence the checks, which goes far beyond the schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it is a pre-trade integrity gate returning ALLOW/WARN/BLOCK plus risk_level and signed receipt. Aggregates multiple checks and supports both Polymarket and Kalshi, distinguishing it from sibling tools that focus on individual checks.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says it is designed for x402 paywall pre-trade calls, indicating when to use. However, it does not discuss when not to use or provide direct comparisons to sibling tools for alternative scenarios.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_pre_event_flow_analysisAInspect

Analyze pre-resolution positioning on a Polymarket market. For the N days before the market's endDate, aggregates trades by wallet, classifies wallets new to the market in that window, detects volume concentration, timing clusters (minutes where ≥5 distinct wallets trade in sync), and pre-event volume spikes vs historical average. Returns a 0–100 risk score (CRITICAL/HIGH/MEDIUM/LOW/CLEAN) with triggered signals, top 10 positioners, and top 10 new wallets. Default window = 7 days.

ParametersJSON Schema

Name	Required	Description
`id`	No
`slug`	No
`max_pages`	No	Trade pagination depth (default 10, max 25)
`days_before_end`	No	Pre-event window size in days (1–90, default 7)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description takes full responsibility. It transparently lists the analytical steps (aggregate trades, classify wallets, detect clusters, spikes), the risk score scale (CRITICAL/HIGH/MEDIUM/LOW/CLEAN), and outputs (top positioners, new wallets). It does not mention auth requirements, rate limits, or data freshness, but the behavioral core is clear.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, well-structured paragraph of 4 sentences. Every sentence adds value: first states purpose, second enumerates analyses, third specifies outputs, fourth gives default. No fluff, perfectly concise.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (multiple analysis types, risk scoring, top lists) and no output schema, the description covers the main outputs and parameters. It could be more complete by explaining how id and slug differ, or clarifying trigger criteria for signals, but it provides sufficient context for an agent to understand the tool's capabilities and boundaries.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 50%: max_pages and days_before_end have descriptions in the schema, but id and slug lack them. The tool description reinforces the 'days_before_end' parameter by mentioning 'N days' and default window, adding context. However, it does not clarify id vs slug, leaving ambiguity for the two required parameters. More value could be added.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: analyzing pre-resolution positioning on Polymarket markets. It lists specific analyses (wallet classification, volume concentration, timing clusters, volume spikes) and outputs (risk score with levels, signals, top wallets), making it distinct from sibling tools like pg_market_trades or pg_resolution_risk_score.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains when to use the tool: 'N days before the market's endDate' for pre-event analysis. It specifies a default window of 7 days. However, it does not explicitly contrast with sibling tools or state when not to use this tool, missing a clear alternative guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_resolution_risk_scoreAInspect

Multi-signal resolution-risk score (0–100, CLEAN/LOW/MEDIUM/HIGH/CRITICAL) for a Polymarket market. Combines category priors (geopolitical/election/sanctions patterns from pg_known_cases), resolution-phase proximity, market volume, resolution-source sanity check (is the UMA adapter a real deployed contract), UMA global dispute ratio, and thin-book flag. Returns triggered signals with point deductions for full auditability.

ParametersJSON Schema

Name	Required	Description	Default
`id`	No
`slug`	No

Tool Definition Quality

A3.9/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries full burden. It discloses that the tool is read-only (computes score, no side effects mentioned), returns triggered signals with point deductions, and references internal logic (pg_known_cases, UMA adapter). It stops short of specifying idempotency or auth requirements but is fairly transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two concise sentences, front-loading the score definition and following with a list of signals. No extraneous information, though the second sentence is dense and could be slightly reorganized for clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description explains the tool's purpose and internal signals but lacks details about output format (e.g., JSON structure) and parameter constraints (e.g., whether id or slug is preferred). Given no output schema and the tool's complexity, additional completion about return values would be beneficial.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has two parameters (id, slug) with 0% schema description coverage. The description mentions 'Polymarket market' but does not explicitly map id and slug to market identifiers or explain their difference. Parameter meaning is largely left to inference, so the description adds minimal value over the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it computes a multi-signal resolution-risk score (0–100 with categories) for a Polymarket market, using a specific verb and resource. It distinguishes itself from sibling tools like pg_market_details or pg_uma_market_resolution by focusing on risk scoring.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description lists the input signals (category priors, proximity, volume, etc.) implying the tool should be used when assessing resolution risk. However, it does not explicitly state when not to use it or name alternatives, though the sibling tools make the context clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_sanctions_screen_entityAInspect

Screen a named entity against global sanctions lists (OFAC SDN, EU FSF, UN SC, Interpol). Delegates to AMLOracle. Use after resolving a wallet/trader to a real name via public reporting or known-case registry.

ParametersJSON Schema

Name	Required	Description
`name`	Yes	Person or entity name
`threshold`	No	Optional fuzzy-match threshold (0..1)
`entity_type`	No	Optional: 'person' or 'organization'

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description must cover behavioral traits. It mentions delegation to AMLOracle and target lists but lacks details on output format, permissions, or side effects. Sufficient for basic understanding but incomplete.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences efficiently convey purpose and usage. No redundant information; each sentence earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description explains what and when, but without output schema or annotations, it lacks details on return values, error conditions, or behavior with absence of threshold. Adequate but not comprehensive.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so description does not need to add much. It adds no extra meaning beyond schema descriptions for name, threshold, and entity_type. Baseline score of 3 applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool screens a named entity against global sanctions lists, naming specific lists (OFAC, EU, UN, Interpol). It distinguishes from sibling tools like pg_pep_check_entity and pg_adverse_media_entity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description advises using after resolving a wallet/trader to a real name, providing clear context. However, it does not explicitly exclude other scenarios or name alternatives for when not to use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_sar_reportAInspect

Generate a draft Suspicious Activity Report (SAR/STR) from structured inputs. Supports EU_GENERIC (FATF-aligned STR, default), US_FINCEN (Form 111 structure), and DE_goAML (German FIU draft). Takes subject, market, activity window, integrity signals, and evidence references — auto-classifies the primary suspicious activity type, generates a narrative, and renders regulator-style markdown. Output is marked draft_for_review; PredictionGuard does not submit SARs directly.

ParametersJSON Schema

Name	Required	Description	Default
`filer`	No	Information about the preparing institution.
`market`	No	Optional. Polymarket market context.
`signals`	No	Integrity signals (strings or objects with 'term'/'reason'/'layer'). Pass the 'triggered_signals' array from pg_market_integrity_scan or pg_resolution_risk_score directly.
`subject`	Yes	Required. The subject of the report.
`activity`	No	Activity window and amounts.
`evidence`	No	References to prior evidence.
`narrative_addendum`	No	Optional free-text addendum from the analyst.
`filing_jurisdiction`	No	Report format. Default EU_GENERIC.	EU_GENERIC

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries full burden. It discloses that the tool generates a draft, does not submit, auto-classifies, and outputs markdown marked as draft_for_review. It does not mention error handling or validation, but the core behavior is transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is approximately 100 words, information-dense, and well-structured. Every sentence adds value, front-loading key purpose and format support. No redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (8 parameters, nested objects, no output schema), the description is remarkably complete: it covers purpose, supported formats, input sourcing, output nature (markdown draft), and constraints (not submitting). No essential information is missing.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, baseline 3. The description adds value by explaining how to use the signals parameter (directly pass arrays from other tools) and maps conceptual fields to parameters. It goes beyond the schema in providing usage context.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool generates a draft SAR from structured inputs, lists supported formats (EU_GENERIC, US_FINCEN, DE_goAML), and uses specific verbs like 'generate', 'auto-classifies', and 'renders'. It distinguishes itself from siblings as the only SAR generation tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains what the tool does and that the output is a draft for review, not submitted. It implicitly guides input sourcing by mentioning that signals can come from pg_market_integrity_scan or pg_resolution_risk_score. However, it does not explicitly state when not to use it or alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_trending_marketsAInspect

Top Polymarket markets by 24h volume. Useful for daily compliance triage / catching news catalysts before they land in mainstream coverage.

ParametersJSON Schema

Name	Required	Description	Default
`limit`	No	Max results (default 10, max 25)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses that the tool returns trending markets based on 24-hour volume, which implies a read-only, non-destructive operation. It does not detail rate limits or authentication, but for a simple data retrieval tool, the description gives sufficient behavioral context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences: the first states the core function, the second adds practical use cases. Every word earns its place; there is no redundant information. It is optimally concise.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (one optional parameter, no output schema), the description adequately covers what the tool does and when to use it. However, it could explicitly state that results are sorted by volume descending, but this is implied by 'top markets by volume.'

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100% for the single parameter 'limit,' which already documents its default and maximum. The tool description does not add any extra meaning beyond the schema. Therefore, the description adds marginal value over the structured parameter definition.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool retrieves 'Top Polymarket markets by 24h volume,' specifying the verb (get/list), resource (markets), and metric (volume). It further distinguishes itself from siblings like pg_market_search by framing the use case as daily compliance triage and catching news catalysts.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use it (daily compliance triage, catching news catalysts). While it does not list alternative tools for other scenarios, the context of sibling tools implies that for keyword or specific market searches, one would use pg_market_search or pg_market_details. This provides adequate guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_uma_market_resolutionAInspect

Resolution source and phase for a specific Polymarket market. Fetches market metadata from Gamma, extracts the resolutionSource address, verifies it's a deployed contract on Polygon, and classifies the resolution phase (active_trading / approaching_resolution / imminent_resolution_window / past_end_awaiting_resolution / resolved).

ParametersJSON Schema

Name	Required	Description	Default
`id`	No	Market id / conditionId
`slug`	No	Market slug

Tool Definition Quality

A3.9/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations exist, so the description must fully convey behavior. It details the process (fetch metadata, extract address, verify contract, classify phase) and lists phases, offering good transparency. However, it does not explicitly state that the operation is read-only.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, information-dense sentence that covers key aspects efficiently. Some structure (e.g., listing phases separately) could improve readability, but it remains concise and informative.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with 2 optional parameters and no output schema, the description adequately explains the function and expected output (phases). It could mention the exact output format or list phases more explicitly, but it is reasonably complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with clear descriptions for 'id' and 'slug'. The description adds no additional meaning beyond the schema, meriting the baseline score of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it identifies resolution source and phase for a Polymarket market, listing specific steps and possible phases. This distinguishes it from sibling tools like pg_market_details or pg_resolution_risk_score.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for retrieving resolution info but does not explicitly state when to use this tool versus alternatives like pg_resolution_risk_score, nor does it provide exclusions or prerequisites.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_uma_statusAInspect

Global UMA Optimistic Oracle health on Polygon. Scans event logs from OOv2 (and optionally OOv1) over a configurable window, tallies events by type (RequestPrice / ProposePrice / DisputePrice / Settle), computes per-hour rate, and derives a dispute/propose ratio — a leading indicator of oracle-level contention.

ParametersJSON Schema

Name	Required	Description	Default
`include_v1`	No	Also scan UMA OOv1 (legacy, default false)
`blocks_back`	No	Blocks to scan back (default 5000 ≈ 3h, max 20000)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully bears the burden. It accurately describes the non-destructive, analytical nature of the tool (scanning, tallying, computing) without mentioning destructive actions. However, it could be more explicit about being read-only.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, well-structured sentence that front-loads the main purpose and efficiently conveys all key aspects without unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description adequately explains the derived outputs (event counts, per-hour rate, ratio). It covers the main behavior but could mention the return format or data structure for completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, and this description adds meaning by explaining the role of each parameter: 'blocks_back' defines the scanning window and 'include_v1' enables legacy scanning. This enhances understanding beyond the schema alone.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly specifies the tool's function: scanning UMA OO event logs on Polygon, tallying events by type, computing per-hour rates, and deriving a dispute/propose ratio. It distinguishes itself from sibling tools (e.g., market- or wallet-focused) by focusing on oracle health.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for assessing oracle health but does not explicitly state when to use this tool versus alternatives or when not to use it. No exclusions or context about prerequisites are provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_verify_alertAInspect

Verify a previously issued signed alert (JWS or content-hash). Delegates to Trust Layer verify_signature. Returns validity, signer, and timestamp. JWKS at feedoracle.io/.well-known/jwks.json.

ParametersJSON Schema

Name	Required	Description	Default
`jws`	No	Full JWS string
`content_hash`	No	SHA-256 content hash (alternative to JWS)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries the full burden. It discloses delegation behavior, return fields (validity, signer, timestamp), and JWKS location. It implies mutual exclusivity of parameters but does not explicitly confirm it, nor does it mention error handling or side effects.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the primary action, and contains no unnecessary words. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple verification tool with two optional parameters, the description covers the main behavior and return values. It lacks mention of error cases or what happens if both parameters are provided, but overall is fairly complete given no output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema already provides 100% coverage with descriptions for both parameters. The description adds only the hint that content_hash is an alternative to JWS, which is useful but minimal. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool verifies signed alerts using JWS or content-hash, and mentions delegation to Trust Layer. This distinguishes it from sibling tools like pg_insider_alert_signed (which likely creates alerts) and others focused on media, markets, or entities.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides context by naming the delegation and JWKS location, but does not explicitly state when to use this tool vs. alternatives or when not to use it. However, the specific function implies usage for alert verification only.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_wallet_correlation_matchAInspect

Find wallets that co-trade across multiple sensitive Polymarket markets with a target wallet. Given a wallet and 2-10 market slugs, returns: which markets the target traded, top 20 cotraders ranked by overlap count, and an aggregate severity score (≥3 shared markets = MEDIUM, ≥5 = HIGH). Use case: detect coordinated trading rings, shared-info syndicates, or wallet clusters that consistently bet on the same sensitive events.

ParametersJSON Schema

Name	Required	Description	Default
`wallet`	Yes	Target wallet address (0x + 40 hex)
`markets`	Yes	2-10 market slugs to check for co-trading overlap

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description does the job by detailing returns (which markets, top 20 cotraders, severity score) and the scoring thresholds. It is read-only in nature but does not explicitly state that. No mention of errors or edge cases, but sufficient for expected behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences: first states the core purpose, second details returns and use case. It is concise, front-loaded, and contains no filler.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description explains the return structure (markets traded, top 20 cotraders, severity score) and scoring logic. It covers key aspects but lacks precise output formatting and edge case handling. Still sufficient for an agent to understand the tool's function.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions. The description reinforces the parameters and adds context (e.g., 'sensitive Polymarket markets' and the 2-10 constraint). It adds value beyond the schema by explaining the use case for the parameters.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it finds wallets co-trading across multiple sensitive markets with a target wallet. It specifies the action, subject, and scope, and distinguishes from siblings like pg_wallet_lookup and pg_market_trades by focusing on correlation analysis for detecting coordinated behavior.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides context and a use case (detect coordinated trading rings) but does not explicitly state when to use this tool over alternatives or when not to use it. Lacks comparative guidance with sibling tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_wallet_entity_resolveAInspect

Resolve a Polygon address to a known entity via PredictionGuard's Wallet Registry (Polymarket core contracts, major stablecoins, known CEX hot wallets), falling back to on-chain heuristic classification for unknown addresses. Returns entity name, type, category, AML relevance, confidence, and source-of-record URL.

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes	Polygon address (0x + 40 hex chars)

Tool Definition Quality

A3.8/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses fallback mechanism and output fields. No annotations exist, so description provides behavioral context well. Could mention potential errors or limitations, but current level is good.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Efficiently communicates purpose, mechanism, and output in two sentences. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Covers main behavior and output fields. Without output schema, description adequately describes returns. Could add edge case handling, but sufficient for most agents.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema already describes the address parameter fully (100% coverage). Description does not add extra meaning. Baseline score of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

States specific verb 'Resolve' and resource 'Polygon address to known entity'. Differentiates from sibling tools like pg_wallet_lookup and pg_wallet_risk_profile by focusing on entity identification with AML relevance.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit usage context or alternatives mentioned. Agent must infer from naming. No guidance on when this tool is preferred over pg_wallet_lookup or pg_wallet_risk_profile.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_wallet_lookupAInspect

Polygon wallet base profile via JSON-RPC (Ankr-backed when RPC_POLYGON is set, public fallback otherwise). Returns MATIC balance, outbound tx count, contract-vs-EOA detection, and a lightweight risk heuristic.

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes	Polygon address (0x + 40 hex chars)

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses the backend selection behavior (Ankr or public fallback) and specifies the return fields (MATIC balance, tx count, etc.), implying a read-only operation. However, it lacks details on failure modes, latency, or external dependencies, which is acceptable for a simple lookup.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loading the core purpose and endpoint behavior. Every word adds value, with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (single parameter, no output schema), the description is complete. It covers purpose, backend, return fields, and identification of the wallet type.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% for the single parameter 'address', which is already well-described in the schema. The description adds no additional semantic value beyond the schema, so a baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool looks up a Polygon wallet base profile, specifying the data returned (MATIC balance, tx count, contract/EOA detection, risk heuristic) and the backend (Ankr or public fallback). This distinguishes it from sibling tools like pg_wallet_entity_resolve or pg_wallet_risk_profile.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies the tool is for a basic wallet profile lookup, but it does not explicitly provide guidance on when to use this tool versus alternatives (e.g., pg_wallet_risk_profile for risk-specific data). No when-not-to-use or prerequisite conditions are mentioned.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_wallet_risk_profileAInspect

Aggregated A–F risk profile for a Polygon wallet. Combines on-chain behavior (pg_wallet_lookup), entity resolution (pg_wallet_entity_resolve), and — where the entity can be screened — AML bundle (sanctions + PEP + adverse media via AMLOracle). Grade A = clean, F = blocked. Includes triggered flags with points deductions for full auditability.

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes	Polygon address (0x + 40 hex chars)

Tool Definition Quality

A3.9/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully discloses the behavioral traits: it outputs an A–F grade with explanations (flags and points deductions). It describes the data sources and grading logic, which is sufficient for an agent to understand the tool's safety and output nature.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise at three sentences, each adding value: first defines the output, second explains the composition, third details grading and output. It is well-structured and front-loaded with key information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of an output schema, the description adequately explains the output (grade, flags, auditability) and the input (address). It covers the tool's operation and dependencies, making it complete for an agent to use correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% coverage for the single parameter 'address', providing a clear description. The tool description does not add extra semantics beyond the schema, so it meets the baseline of 3 without requiring compensation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it provides an aggregated A–F risk profile for a Polygon wallet, specifying the combination of on-chain behavior, entity resolution, and AML screening. It distinguishes itself by being a composite of related tools like pg_wallet_lookup and pg_wallet_entity_resolve, making its purpose distinct from siblings.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage when a risk profile is needed but lacks explicit guidance on when to use this tool versus alternatives. It does not state prerequisites or exclusions, leaving the agent to infer context from mentioned components.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_whale_addAInspect

Add a Polygon wallet to the PredictionGuard watchlist. Takes an initial baseline snapshot so subsequent pg_whale_scan calls can detect movement. Set alert_threshold_matic to tune sensitivity (default 100 MATIC). Tags are free-form.

ParametersJSON Schema

Name	Required	Description
`tags`	No	Free-form tags (e.g. ['polymarket','pre-election','whale'])
`alias`	No	Optional human-readable label
`notes`	No	Free-form notes
`address`	Yes	Polygon address (0x + 40 hex)
`alert_threshold_matic`	No	Balance delta threshold in MATIC for alerts (default 100)

Tool Definition Quality

A3.5/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations exist, so description must carry behavioral disclosure. It mentions baseline snapshot but omits side effects (e.g., overwrite behavior, limits). Lacks detail on return value, permissions, or error handling.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three concise sentences, each adding distinct value: purpose, snapshot behavior, and parameter tuning. No filler.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Missing output schema info—no indication of what the tool returns. Also doesn't address idempotency or duplicates. Adequate for the parameter details but incomplete for a creation tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers 100% of parameters. Description adds minor context (default threshold, free-form tags) but mostly restates schema. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it adds a Polygon wallet to a watchlist and ties to subsequent pg_whale_scan. Distinguishes from siblings like pg_whale_remove and pg_whale_list.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Implies use before pg_whale_scan but gives no explicit when-to-use or when-not-to-use guidance. No reference to alternatives or prerequisites.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_whale_alertsAInspect

Retrieve alerts from the watchlist audit log. Filter by address, severity (LOW/MEDIUM/HIGH/CRITICAL), and/or ISO timestamp 'since'. Default returns the 50 most recent alerts across the whole watchlist.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (max 500)
`since`	No	ISO 8601 timestamp (e.g. 2026-04-20T00:00:00Z)
`address`	No
`severity`	No

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided; the description accurately conveys read-only retrieval behavior and default/filtering details, but lacks depth on rate limits or auth requirements.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, no wasted words. Efficient and clear.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple retrieval tool with no output schema, the description covers core functionality and filtering. Missing return format details but acceptable given complexity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 50% schema coverage, the description adds meaning by explaining default limit and 'since' as ISO timestamp, but address and severity parameters are merely listed without extra context.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it retrieves alerts from the watchlist audit log with filtering options, distinguishing it from sibling tools like pg_whale_list and pg_whale_add.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides clear context on default behavior (50 most recent alerts) and filters, but does not explicitly state when to use alternatives or exclude cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_whale_listAInspect

List all watched wallets with their alias, threshold, tags, notes, and last snapshot. Optionally filter by tag.

ParametersJSON Schema

Name	Required	Description	Default
`tag`	No	Optional tag filter
`limit`	No	Max results (max 500)

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Lists returned fields and optional filter, but does not mention pagination or limit behavior beyond schema. No annotations, so description carries full burden.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences, front-loaded with purpose, no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple listing tool with two optional params and no output schema, the description covers purpose, returned fields, and optional filter completely.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline 3. Description adds 'optional filter by tag' but does not add new meaning beyond schema descriptions for tag and limit.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states 'List all watched wallets' with specific fields returned, distinguishing it from siblings like pg_whale_add or pg_whale_alerts.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Mentions optional tag filter, but does not explicitly state when to use vs alternatives. However, context from sibling names makes purpose clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_whale_removeAInspect

Remove a wallet from the watchlist. By default keeps historical snapshots and alerts (for audit trail). Set keep_history=false to purge everything for this address.

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes
`keep_history`	No	Keep snapshot + alert history (default true)

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses default behavior (keeps history) and purge option, but lacks information on permissions, reversibility, or side effects. No annotations to contradict.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with the primary action, no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, but for a removal tool the description is adequate; however, it could mention return state or confirmation for completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 50%; description adds value for keep_history (purge everything) but does not describe the address parameter beyond its existence.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Remove a wallet') and the resource ('from the watchlist'), making it distinct from sibling tools like pg_whale_add or pg_whale_list.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit guidance on when to use keep_history=false vs default, but does not explicitly compare to alternatives, though the context of siblings makes the use case clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

pg_whale_scanAInspect

Scan one or all watched wallets against Polygon RPC. Compares current balance + nonce to the last snapshot, writes a new snapshot, and emits an alert when the delta exceeds the wallet's configured threshold. Severity is MEDIUM / HIGH / CRITICAL by delta size. Omit 'address' to scan the entire watchlist. Designed to be called periodically via SchedulerOracle or a systemd timer.

ParametersJSON Schema

Name	Required	Description	Default
`address`	No	Optional — scan a single address. Omit to scan the whole list.

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Describes the full workflow: comparing balance/nonce, writing a snapshot, and emitting alerts with severity levels. No annotations provided, so the description carries the full burden and does so adequately.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences with no wasted words. The first sentence states the core action, followed by behavior details and usage guidance. Efficient and well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With one optional parameter and no output schema, the description covers the main behavior. Missing return format, but the scan-and-alert workflow is sufficiently described for an agent to use correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema already describes the parameter at 100% coverage. The description adds one nuance: omitting 'address' scans the whole list. Baseline is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses a specific verb ('scan') and resource ('watched wallets'), and clearly distinguishes the tool from siblings like pg_whale_add or pg_whale_remove by focusing on scanning and alerting.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to omit the 'address' parameter (to scan the entire watchlist) and mentions intended periodic usage via scheduler. Lacks explicit when-not-to-use scenarios, but the guidance is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Resources

Need Help?