Mnemom — Trust Ratings for AI Agents

Name: Mnemom — Trust Ratings for AI Agents
Author: mnemom

by io.github.mnemom

Server Details

Trust infrastructure for AI agents: read a verifiable Trust Rating, claim an identity, earn a badge.

Status: Healthy
Last Tested: 2026-07-25 15:41
Transport: Streamable HTTP
Repository: mnemom/mcp
GitHub Stars: 0

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

Grade: A (4/5.0)

Tool Descriptions: A

Average 4/5 across 16 of 16 tools scored. Lowest: 3.1/5.

Server Coherence: A

Disambiguation: 5/5

Each tool has a clearly distinct purpose, covering identity, reputation, alignment/protection, verification, and scanning. No two tools overlap significantly; even similar-sounding ones like get_reputation and verify_reputation serve different functions (read vs. attest).

Naming Consistency: 5/5

All tools follow a consistent verb_noun pattern (e.g., claim_agent, get_reputation, scan_trust). Long names like preview_compose_alignment_by_agent still adhere to the convention, and abbreviations like fn_fp are clear. No mix of casing or styles.

Tool Count: 5/5

With 16 tools, the server is well-scoped for its domain of agent trust ratings. Each tool addresses a specific need without redundancy, and the count feels appropriate for the breadth of functionality offered.

Completeness: 4/5

The tool set covers the main workflows: identity management, reputation reading/attestation, alignment/protection publication, website scanning, and verification. Minor gaps exist, such as no explicit 'get alignment' tool (though get_agent may partially cover it) and no deletion tools, but the core operations are present.

Available Tools

16 tools

claim_agent (grade A)

Claim a verifiable identity — bind an agent to your organization so its trust and accountability record is provably yours. No human in the loop.

Parameters

Name	Required	Description
`org_id`	No	Optional. The organization to claim the agent into (e.g. `org-...` or `pers-...`). The caller must be a member of this org (role floor: member). If omitted, the agent is claimed into the caller's personal org.
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)
`hash_proof`	Yes	Agent possession proof: the full 64-hex SHA-256 digest of `${apiKey}|${agentName}` (or `${apiKey}` for an unnamed singleton agent).
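
The `hash_proof` rule above can be sketched in a few lines. This is an illustration under stated assumptions, not Mnemom's reference implementation: it assumes the separator is a plain `|` (the backslash in the table reads as table escaping), that the input is UTF-8 encoded, and that the digest is lowercase hex. The function name and the sample key and agent name are hypothetical.

```python
import hashlib
from typing import Optional


def hash_proof(api_key: str, agent_name: Optional[str] = None) -> str:
    """Return the full 64-hex SHA-256 digest used as the possession proof.

    Named agents hash "apiKey|agentName"; an unnamed singleton agent
    hashes the API key alone. UTF-8 encoding is an assumption.
    """
    material = api_key if agent_name is None else f"{api_key}|{agent_name}"
    return hashlib.sha256(material.encode("utf-8")).hexdigest()


# Placeholder inputs for illustration only; never hard-code a real key.
named_proof = hash_proof("example-api-key", "example-agent")
singleton_proof = hash_proof("example-api-key")
```

The resulting string would be passed as `hash_proof` alongside `agent_id` when calling claim_agent.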

Output Schema

Name	Required	Description
`org_id`	Yes	The organization the agent was claimed into (echoes the resolved org — the supplied `org_id`, or the caller's personal org when omitted).
`claimed`	Yes
`agent_id`	Yes
`claimed_at`	Yes

Tool Definition Quality

Grade: A (3.5/5.0)

Behavior: 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations provide minimal behavioral cues (all false). The description adds context about binding and automation but does not disclose side effects, reversibility, or consequences of claiming an already-claimed agent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, no wasted words. Efficient and clear.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With output schema present, return values are covered. However, the description lacks context on prerequisites (e.g., having an API key) and the effect on existing agent bindings, leaving some gaps for a complex operation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description does not add extra parameter-level meaning beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (claim/bind), the resource (agent to organization), and the outcome (trust record provably yours). It distinguishes itself from sibling tools like get_agent and verify_agent_binding.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies automated usage ('No human in the loop') but does not explicitly state when to use this tool versus alternatives or when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_agent (grade A)

Read-only, Idempotent

Look up an agent's public identity and trust state by ID — the accountable record other agents and humans can rely on.

Parameters

Name	Required	Description	Default
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)
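
An MCP client would typically issue this lookup as a JSON-RPC `tools/call` request. The sketch below only builds the request body, assuming standard MCP framing; the server URL, session setup, and any auth headers are not given on this page and are left out. The helper name is hypothetical.

```python
import json


def tool_call_request(request_id: int, tool: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request for the named tool."""
    payload = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(payload)


# agent_id uses the example value from the parameter table.
request_body = tool_call_request(1, "get_agent", {"agent_id": "smolt-abc123"})
```

The same helper applies to the other tools on this page by swapping the tool name and arguments.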

Output Schema

Name	Required	Description
`id`	No
`name`	No	Agent name (2-32 chars, alphanumeric + hyphens). Present on all list and get responses.
`email`	No
`caller`	No	Self-describing caller context for THIS response. `org_member` callers receive the full owner record (all fields here); `anonymous`/`authenticated` (non-member) callers receive the reduced public projection (id, name, claimed, created_at, last_seen, status, avatar_url, caller). The differing field set is GOVERNED by this value — read it instead of inferring why a field is absent.
`org_id`	No	Owner projection: the agent's org binding (ADR-062 authz boundary).
`public`	No	Identity-record visibility axis — whether the agent's IDENTITY RECORD is publicly discoverable. This is DISTINCT from reputation visibility: every registered agent's reputation is public by accountability standard (see `ReputationScore.visibility`). `public` here governs only the identity record, never the Trust Rating.
`status`	No
`claimed`	No	Public projection only: whether the agent has been claimed by a user.
`user_id`	No
`last_seen`	No
`agent_hash`	No	First 16 hex chars of `SHA256(apiKey + '|' + agentName)` for named agents, or `SHA256(apiKey)` for unnamed singleton agents. The gateway computes the same value on each request and uses it as the lookup key. See [Agent Identity](https://docs.mnemom.ai/concepts/agent-identity#agent_hash--the-canonical-identity-hash).
`avatar_url`	No
`claimed_at`	No
`claimed_by`	No	Owner projection: user id that claimed the agent.
`created_at`	No
`created_by`	No	Owner projection: user id that created the agent (provenance).
`deleted_at`	No	Owner projection: soft-delete timestamp (null when live).
`key_prefix`	No	First 8 chars of the bound API key hash — useful for key-rotation debugging.
`agent_proof_hash`	No	Owner projection: captured hash_proof of the bound key (mig 263).
`billing_account_id`	No
`containment_status`	No	Containment state of the agent (ADR-053).
`aip_enforcement_mode`	No
`agent_proof_captured_at`	No
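
Because the returned field set depends on `caller`, a client can use that value to tell a withheld field from a genuinely empty one. The sketch below is one hedged way to do that. It assumes `caller` arrives as a plain string (the real response may wrap it in an object), and the helper name and sample records are hypothetical.

```python
# Fields the schema lists for the reduced public projection.
PUBLIC_PROJECTION = frozenset(
    {"id", "name", "claimed", "created_at", "last_seen", "status", "avatar_url", "caller"}
)
# Caller contexts that receive only the public projection.
REDUCED_CALLERS = frozenset({"anonymous", "authenticated"})


def withheld_by_projection(record: dict, field: str) -> bool:
    """Return True when `field` is absent because the caller only sees the public projection."""
    if field in record:
        return False
    return record.get("caller") in REDUCED_CALLERS and field not in PUBLIC_PROJECTION


# Illustrative records, not real API output.
public_view = {"id": "smolt-abc123", "name": "example-agent", "claimed": True, "caller": "anonymous"}
owner_view = {"id": "smolt-abc123", "name": "example-agent", "org_id": "org-example", "caller": "org_member"}
```

Here `withheld_by_projection(public_view, "org_id")` is true, while the same check on `owner_view` is false because an `org_member` caller receives the full owner record.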

Tool Definition Quality

Grade: A (4.3/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, idempotentHint=true, and destructiveHint=false, so the agent knows it is a safe, non-destructive operation. The description adds context about what is returned (public identity and trust state), which is helpful. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, front-loaded sentence with no unnecessary words. It conveys the essential purpose and scope efficiently.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the low complexity (1 parameter, no nested objects), the presence of an output schema, and clear annotations, the description provides sufficient context. No additional details are needed.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema covers the single parameter agent_id with 100% coverage, including an example value. The description only mentions 'by ID' without adding additional semantic meaning beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the verb 'Look up' and the specific resource: an agent's public identity and trust state by ID. It distinguishes itself from sibling tools like list_agents (which lists multiple agents) and get_reputation (which focuses on reputation only) by emphasizing the comprehensive 'accountable record' aspect.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies that this tool is the authoritative source for an agent's public identity and trust state, and that it should be used when you have a specific agent ID. However, it does not explicitly mention when not to use it or provide alternative tools for different scenarios.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_reputation (grade A)

Read-only, Idempotent

Look up an AI agent's published Trust Rating — Mnemom's portable reliability signal for autonomous software, computed from the agent's own verified activity record. Returns the rating plus the technical factors behind it. Free, public, read-only: every registered agent's rating is published by standard (the visibility field is the reputation-publication axis, distinct from identity-record visibility).

Parameters

Name	Required	Description	Default
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)

Output Schema

Name	Required	Description
`tier`	No
`grade`	Yes	AAA–D or NR.
`score`	Yes
`claimed`	No
`agent_id`	Yes
`trend_30d`	No
`agent_name`	No
`components`	Yes
`confidence`	Yes
`visibility`	Yes	Reputation-publication axis — whether this agent's Trust Rating is published. Every registered agent's reputation is `public` by accountability standard (the default; that is the whole point of a portable, verifiable rating); `private` is a rare owner opt-out that 403s the read to non-owners. This is DISTINCT from `Agent.public` (the identity-record visibility axis) — they share the word "public" but govern different things.
`computed_at`	No
`is_eligible`	Yes
`next_compute_at`	No	Next scheduled recompute — the 00/06/12/18 UTC cron slot strictly after `computed_at` (`floor(computed_at/6h)*6h + 6h`). Null when `computed_at` is null.
`checkpoint_count`	Yes
`a2a_trust_extension`	No	A2A trust extension for interop. Only present on `GET /reputation/{agent_id}` (not on batch/compare rows).
`checkpoint_accounting`	No	Structured breakdown of how checkpoints were counted toward the score. `analyzed` is the scoring population; `excluded` buckets are mutually exclusive and `analyzed + synthetic + insufficient_thinking + quarantined = total`. Null for legacy rows computed before this field existed.
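
Two of the fields above carry formulas that are easy to check client-side. The sketch below applies `floor(computed_at/6h)*6h + 6h` to a UTC timestamp and tests the `checkpoint_accounting` sum. It assumes `computed_at` is parsed into a timezone-aware datetime and that the accounting buckets sit in a flat object keyed by the names in the description; the real payload may nest them differently. Function names are hypothetical.

```python
import math
from datetime import datetime, timezone
from typing import Optional

SLOT_SECONDS = 6 * 3600  # 00/06/12/18 UTC cron slots


def next_compute_at(computed_at: Optional[datetime]) -> Optional[datetime]:
    """Return the 6-hour UTC slot strictly after `computed_at`, or None."""
    if computed_at is None:
        return None
    # The Unix epoch starts at 00:00 UTC, so multiples of 6h land on slot boundaries.
    slot_index = math.floor(computed_at.timestamp() / SLOT_SECONDS)
    return datetime.fromtimestamp((slot_index + 1) * SLOT_SECONDS, tz=timezone.utc)


def accounting_is_consistent(accounting: dict) -> bool:
    """Check analyzed + synthetic + insufficient_thinking + quarantined == total."""
    counted = (
        accounting["analyzed"]
        + accounting["synthetic"]
        + accounting["insufficient_thinking"]
        + accounting["quarantined"]
    )
    return counted == accounting["total"]


# Illustrative inputs, not real API output.
sample_next = next_compute_at(datetime(2026, 7, 25, 15, 41, tzinfo=timezone.utc))
sample_ok = accounting_is_consistent(
    {"analyzed": 90, "synthetic": 4, "insufficient_thinking": 5, "quarantined": 1, "total": 100}
)
```

A timestamp that falls exactly on a slot boundary maps to the following slot, which matches the "strictly after" wording.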

Tool Definition Quality

Grade: A (4.1/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint, idempotentHint, and destructiveHint. Description adds context: it is free, public, read-only, and explains the visibility field's role. This goes beyond the annotations but does not fully cover potential rate limits or authorization needs.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is three sentences, front-loaded with the main purpose, and every clause adds value (e.g., free, public, read-only, visibility distinction). No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple look-up tool with one parameter and an output schema (not shown), the description adequately states what is returned and key contextual information. No additional details are necessary for an agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with a description for agent_id. The tool description does not add any additional parameter context (e.g., format, examples), so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the verb 'Look up' and the resource 'published Trust Rating', and distinguishes it from sibling tools by explaining it returns both the rating and technical factors, and clarifying the visibility axis distinction.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Description implies usage (look up a reputation rating) but does not explicitly compare this tool to siblings like get_reputation_badge, search_reputation_directory, or verify_reputation. No when/when-not guidance is given.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_reputation_badge (grade A)

Read-only, Idempotent

Get an embeddable Trust Rating badge for an agent — returns the badge image URL plus ready-to-paste Markdown and HTML snippets for a README or agent card.

Parameters

Name	Required	Description	Default
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)

Output Schema

Name	Required	Description
`agent_id`	Yes	The agent the badge is for (echoed from the request).
`badge_url`	Yes	Canonical SVG Trust Rating badge image URL (always on api.mnemom.ai).
`html_embed`	Yes	Paste-ready HTML badge snippet.
`profile_url`	Yes	Human-readable reputation profile page (on www.mnemom.ai).
`verified_url`	Yes	Public cryptographic verification URL for the rating.
`markdown_embed`	Yes	Paste-ready Markdown badge snippet.

Tool Definition Quality

Grade: A (4.3/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint, idempotentHint, and destructiveHint=false, indicating a safe read operation. The description adds value by detailing the return format (badge URL and snippets), which goes beyond annotations. No contradictions noted.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, well-structured sentence that efficiently conveys the tool's purpose with no redundant information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Considering the tool's simplicity (one parameter), rich annotations, and the presence of an output schema (though not shown), the description is complete. It mentions the return content, providing sufficient context without needing further elaboration.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% for the single required parameter 'agent_id' with a clear description in the schema. The tool description adds no additional parameter semantics beyond what the schema provides, resulting in a baseline score of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool retrieves an embeddable Trust Rating badge, specifying the return includes a badge image URL and ready-to-paste snippets. This distinguishes it from siblings like 'get_reputation', which likely returns raw data, and 'verify_reputation', which is for verification.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for embedding in a README or agent card, providing clear context. However, it does not explicitly state when to use this tool versus alternatives like 'get_reputation', or when not to use it, which keeps it short of a perfect score.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_started (grade A)

Read-only, Idempotent

Zero-auth, no-args orientation: who Mnemom is, the surface map, how to authenticate and what it unlocks, and the value tools to try right now (headlining scan_trust + the reputation reads).

Parameters

Name	Required	Description	Default
`token`	No	Optional Dojo try-me invite token. When supplied and valid, returns the token-gated dojo briefing manifest (the same content as GET /v1/dojo/try-me/resolve); omit for public orientation.

Output Schema

Name	Required	Description
`who`	Yes	One-line positioning.
`verify`	Yes	How to verify signed artifacts in-band (verify, don't trust).
`try_now`	Yes	Zero-auth value tools to call right now.
`doctrine`	Yes
`skill_path`	Yes	The two-step on-ramp to declaring and advertising capabilities as A2A skills in a signed, portable AgentCard.
`value_prop`	Yes	What Mnemom does for an agent.
`surface_map`	Yes	Stable links to the canonical read-only surfaces.
`authenticate`	Yes	How to authenticate and what auth unlocks.
`developer_path`	Yes	The developer hero on-ramp: the npx one-liner plus the intent-named MCP prompt-skills (try-me, onboard_an_agent, become_sovereign). Advertisement only — no functional dependency on those prompts existing yet.
`showcase_agent`	Yes	A real Mnemom-owned agent the try_now reputation reads target, so the loop runs verbatim.
`sovereignty_path`	Yes	The five-step on-ramp to becoming a sovereign, accountable agent, composed from existing tools. Walked end to end by the become_sovereign MCP prompt.
`visibility_model`	Yes	Disambiguates the two axes that share the word 'public': reputation-publication visibility (public by standard) vs identity-record visibility (agent.public), plus the caller-context self-description.
`what_we_keep_private_and_why`	Yes

Tool Definition Quality

Grade: A (3.5/5.0)

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already mark the tool as readOnly, idempotent, non-destructive. The description adds context about zero-auth and no-args, which is helpful but does not disclose significant behavioral traits beyond what annotations provide.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence that front-loads key attributes (zero-auth, no-args) and lists content. It is concise but somewhat dense with lists and colons, which slightly impacts readability.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (no required params, has output schema), the description adequately covers its purpose and content. It mentions the key outputs and highlights related tools. Could mention that return format is documented in output schema, but overall complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% and explains the optional 'token' parameter. The description states 'no-args', implying no input needed, but adds no extra meaning beyond the schema. Baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool as a zero-auth, no-args orientation resource, listing specific content (who Mnemom is, surface map, authentication, value tools). It differentiates from sibling tools which are more specific (agents, reputation) by being a general starting point. However, the verb is implicit rather than explicit.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies use when starting out, but does not explicitly state when to use or avoid this tool versus siblings. No alternative tools are mentioned, and there is no guidance on prerequisites or exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_agents (grade A)

Read-only, Idempotent

List your agents — List all agents owned by the authenticated user. Supports pagination.

Parameters

Name	Required	Description	Default
`limit`	No	Maximum number of agents to return (max 100)
`offset`	No	Number of agents to skip for pagination
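
Walking the full list with `limit` and `offset` might look like the sketch below. It assumes each response carries an `agents` list and that a page shorter than `limit` marks the end; the page does not state how the last page is signalled. The `fake_list_agents` stub and its `scope` value are placeholders standing in for a real MCP call.

```python
from typing import Callable, Iterator

MAX_LIMIT = 100  # documented ceiling for `limit`


def iter_agents(list_agents: Callable[[int, int], dict], limit: int = MAX_LIMIT) -> Iterator[dict]:
    """Yield every agent by advancing `offset` until a short page is returned."""
    limit = max(1, min(limit, MAX_LIMIT))
    offset = 0
    while True:
        page = list_agents(limit, offset)
        agents = page.get("agents", [])
        yield from agents
        if len(agents) < limit:
            break
        offset += len(agents)


# Stub standing in for the real tool call; the data is made up for illustration.
_FAKE_AGENTS = [{"id": f"smolt-{i:03d}"} for i in range(250)]


def fake_list_agents(limit: int, offset: int) -> dict:
    return {"scope": "placeholder", "agents": _FAKE_AGENTS[offset:offset + limit]}


all_agents = list(iter_agents(fake_list_agents))
```

Clamping `limit` to 100 on the client avoids relying on how the server handles oversized values.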

Output Schema

Name	Required	Description
`scope`	Yes	Echoes the resolved listing scope.
`agents`	Yes

Tool Definition Quality

Grade: A (4.5/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint and idempotentHint; the description adds scope (owned by user) and pagination support, providing additional context beyond the annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, no unnecessary words. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the presence of an output schema, the description needs only to explain input context; it covers ownership scope and pagination, which is sufficient for a simple list tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with parameter descriptions; the description adds the pagination context linking limit and offset to pagination, adding value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'List' and the resource 'agents owned by the authenticated user', distinguishing it from siblings like 'get_agent' (single agent) and 'claim_agent' (ownership action).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It explicitly describes listing agents with pagination, allowing inference of when to use (need all agents) versus siblings like 'get_agent' (single agent), though it does not explicitly mention when not to use or list alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

preview_compose_alignment_by_agent (grade A)

Read-only, Idempotent

Preview composed alignment (dry run) — Composes the cascade against a hypothetical body at the agent layer and returns conflicts + the composed view. No DB writes. Used by the dashboard editor for live conflict markers.

Parameters

Name	Required	Description	Default
`body`	No	Unified alignment card (ADR-008/ADR-039). Authored in YAML or JSON; composed server-side with platform defaults, org template, and active exemptions before storage. This schema matches the runtime validator at src/composition/validate.ts EXACTLY — a card authored strictly to it passes `PUT /v1/agents/{id}/alignment-card` and the preview-compose endpoint. Output-only fields (card_id, issued_at, expires_at, _composition, content_hash, version) are server-assigned and must NOT be sent on a PUT.
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)

Output Schema

Name	Required	Description
`ok`	Yes
`tool`	Yes
`summary`	Yes
`conflicts`	Yes
`effective`	Yes
`full_report`	Yes
`what_changed`	Yes
`what_to_do_next`	Yes
`what_would_break`	Yes
`coherence_violations`	Yes

Tool Definition Quality

Grade: A (4.2/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true; the description adds context by confirming 'No DB writes' and detailing return values (conflicts + composed view). No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences, front-loaded with the key action and context, and contains no superfluous information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (nested input schema, rich output schema), the description covers the core purpose, behavior, and use case. It does not detail the output format, but an output schema is provided separately.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description does not add additional meaning to the two parameters beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'preview composed alignment (dry run)' and specifies the resource 'alignment by agent'. It distinguishes itself from write operations by explicitly noting 'No DB writes' and mentions specific outputs (conflicts + composed view).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage as a preview tool for the dashboard editor and contrasts with write operations via 'No DB writes', but it does not explicitly name alternative tools or provide when-not-to-use guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

preview_compose_protection_by_agent (grade A)

Read-only, Idempotent

Preview composed protection (dry run) — Composes the cascade against a hypothetical body at the agent layer and returns conflicts + the composed view. No DB writes. Used by the dashboard editor for live conflict markers.

Parameters

Name	Required	Description	Default
`body`	No	Unified protection card (ADR-037). Safe House thresholds + trusted-source policy for a single agent. Shape matches src/composition/types.ts::UnifiedProtectionCard (canonical) and what the runtime validator at src/composition/validate.ts accepts. The customer-facing docs at /concepts/protection-card and /specifications/protection-card-schema document this same shape.
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)

Output Schema

ParametersJSON Schema

Name	Required	Description
`ok`	Yes
`tool`	Yes
`summary`	Yes
`conflicts`	Yes
`effective`	Yes
`full_report`	Yes
`what_changed`	Yes
`what_to_do_next`	Yes
`what_would_break`	Yes
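
Since the preview tool performs no DB writes and reports `conflicts`, one plausible workflow is to dry-run a card and publish only when the preview is clean. The sketch below assumes a hypothetical `call_tool` callable (any MCP client wrapper that takes a tool name and an arguments dict and returns a dict), and it assumes `conflicts` is a list that is empty on a clean compose; neither is confirmed by this page.

```python
from typing import Any, Callable, Dict

# Hypothetical transport: any callable that invokes an MCP tool by name
# with an arguments dict and returns the parsed result as a dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def preview_then_put(call_tool: ToolCaller, agent_id: str, body: Dict[str, Any]) -> Dict[str, Any]:
    """Dry-run the composition first; publish only if no conflicts are reported."""
    args = {"agent_id": agent_id, "body": body}
    preview = call_tool("preview_compose_protection_by_agent", args)

    # Assumption: `conflicts` is a list, and an empty list means a clean compose.
    if not preview.get("ok") or preview.get("conflicts"):
        return {"published": False, "preview": preview}

    result = call_tool("put_protection_by_agent", args)
    return {"published": True, "preview": preview, "result": result}
```

The write tool's description also says it requires an Idempotency-Key; how that key is supplied through an MCP call is not stated here, so the sketch leaves it out.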

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint, idempotentHint, and destructiveHint. The description reinforces no DB writes and specifies the return of conflicts and composed view, adding value beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is brief and front-loaded with the key action 'Preview composed protection (dry run)'. Every word adds value with no waste.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a preview tool with an output schema and complex nested inputs, the description captures the essential functionality. It could briefly mention the return structure, but the output schema handles that.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for both parameters. The description adds context by mentioning 'hypothetical body', helping understand the body parameter's role. This justifies a score above the baseline of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a dry run that composes protection and returns conflicts + composed view, with no DB writes. It is distinguishable from sibling tools like put_protection_by_agent (which writes) and preview_compose_alignment_by_agent (which is for alignment).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description indicates it is used by the dashboard editor for live conflict markers, implying a preview context. However, it does not explicitly state when not to use it or mention alternatives like put_protection_by_agent, though the name and context make it clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

put_alignment_by_agentB

DestructiveIdempotent

Publish or replace the alignment manifest — Accepts YAML (text/yaml, application/yaml) or JSON. Body is the full UnifiedAlignmentCard; server-side composition merges it across the platform → org → team → agent cascade and writes the canonical composed card. Requires Idempotency-Key. Honor...

ParametersJSON Schema

Name	Required	Description	Default
`body`	No	Unified alignment card (ADR-008/ADR-039). Authored in YAML or JSON; composed server-side with platform defaults, org template, and active exemptions before storage. This schema matches the runtime validator at src/composition/validate.ts EXACTLY — a card authored strictly to it passes `PUT /v1/agents/{id}/alignment-card` and the preview-compose endpoint. Output-only fields (card_id, issued_at, expires_at, _composition, content_hash, version) are server-assigned and must NOT be sent on a PUT.
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)
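
The `body` description lists six output-only fields that must not be sent on a PUT. A minimal sketch of stripping them from a previously fetched card before re-submitting it follows; the helper name `strip_output_only` is hypothetical, and only the field names come from the description above.

```python
from typing import Any, Dict

# Field names taken from the `body` description: server-assigned, response-only.
OUTPUT_ONLY_FIELDS = (
    "card_id",
    "issued_at",
    "expires_at",
    "_composition",
    "content_hash",
    "version",
)


def strip_output_only(card: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of the card without the server-assigned fields."""
    return {key: value for key, value in card.items() if key not in OUTPUT_ONLY_FIELDS}
```

This only removes top-level keys; it does not validate the rest of the card against the runtime validator.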

Output Schema

ParametersJSON Schema

Name	Required	Description
`audit`	Yes
`values`	Yes
`card_id`	No	Card row id. Server-assigned on PUT (`ac-{uuid}`).
`version`	No	Response-only: monotonic card version, injected by the GET/PUT response. Server-assigned — do not send on a PUT.
`agent_id`	Yes	Target agent id. On PUT, server overwrites to match the URL path.
`autonomy`	Yes
`issued_at`	No
`principal`	No	Required object describing whose authority the agent acts under (ADR-039 Decision 10).
`conscience`	No
`expires_at`	No
`extensions`	No
`enforcement`	No	Optional ADR-039 Decision-3 user-facing knobs for unmapped-tool handling. The legacy `mode`, `unmapped_tool_action` and `fail_open` keys are REJECTED by the validator (mode → top-level autonomy_mode; fail_open → gateway env config).
`_composition`	No
`capabilities`	No
`card_version`	Yes	Card schema version (required, non-empty). Current canonical value: `unified/2026-04-26`.
`content_hash`	No	Response-only: content hash of the composed card (`sha256:<hex>`), injected by the GET/PUT response. Server-assigned — do not send on a PUT.
`autonomy_mode`	Yes	ADR-039 master switch for the action-policing pipeline (autonomy constraints). Required at the top level post-cutover; the legacy `enforcement.mode` location is rejected.
`integrity_mode`	Yes	ADR-039 master switch for the values/conscience pipeline (integrity constraints). Required at the top level post-cutover; the legacy `integrity.enforcement_mode` location is rejected.

Tool Definition Quality

B3.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds value beyond annotations by detailing server-side composition merging across the cascade, content type requirements, and Idempotency-Key requirement. Annotations already indicate idempotent and destructive hints, so the description reinforces and extends with operational context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness2/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description ends in a truncated fragment ('Honor...'), leaving it incomplete and poorly structured. It lacks proper front-loading and is not appropriately sized for a complex tool.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite having an output schema, the description omits important context such as prerequisites, validation errors, or consequences of replacement for a destructive, idempotent tool. More guidance is needed for a tool with nested parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema description coverage, the schema thoroughly describes both parameters. The description only adds that the body is the full UnifiedAlignmentCard and accepts YAML/JSON, which is helpful but not essential beyond schema-provided details.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool publishes or replaces the alignment manifest using specific verb and resource. However, it does not distinguish from the sibling tool preview_compose_alignment_by_agent, which likely handles preview without persistence.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description mentions YAML/JSON acceptance and Idempotency-Key requirement but provides no guidance on when to use this tool versus alternatives like preview_compose_alignment_by_agent or put_protection_by_agent. No prerequisites or when-not-to-use information is given.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

put_protection_by_agentA

DestructiveIdempotent

Publish or replace the protection manifest — Accepts YAML (text/yaml, application/yaml) or JSON. Body is the full UnifiedProtectionCard; server-side composition merges it across the platform → org → team → agent cascade and writes the canonical composed card. Requires Idempotency-Key. Hon...

ParametersJSON Schema

Name	Required	Description	Default
`body`	No	Unified protection card (ADR-037). Safe House thresholds + trusted-source policy for a single agent. Shape matches src/composition/types.ts::UnifiedProtectionCard (canonical) and what the runtime validator at src/composition/validate.ts accepts. The customer-facing docs at /concepts/protection-card and /specifications/protection-card-schema document this same shape.
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)

Output Schema

ParametersJSON Schema

Name	Required	Description
`mode`	Yes	Strictest-wins composition: enforce > nudge > observe > off.
`review`	No	Review-hold policy (Safe House Review, Slice 2a — MNE-920 design). gate_on is the minimum verdict band per surface that escalates to a review-hold. Composition is strictest-wins; on_timeout defaults to 'reject' (fail-closed). reviewer.kind 'endpoint' is designed for MNE-1650 and not consumed yet.
`card_id`	No
`version`	No	Response-only: monotonic card version, injected by the GET/PUT response. Server-assigned — do not send on a PUT.
`agent_id`	Yes
`issued_at`	No
`expires_at`	No
`extensions`	No	Free-form extension slot for non-canonical fields. Ignored by the composer; preserved on read for tooling that needs an audit-tail metadata bag.
`thresholds`	Yes	Score bands. Must satisfy warn <= quarantine <= block; each value in [0, 1].
`_composition`	No
`card_version`	Yes
`content_hash`	No	Response-only: content hash of the composed card (`sha256:<hex>`), injected by the GET/PUT response. Server-assigned — do not send on a PUT.
`screen_surfaces`	Yes	Which request surfaces Safe House inspects. Composed across scopes by OR-per-field (any scope requiring inspection wins).
`trusted_sources`	Yes	Sources for which detectors short-circuit (each match logged in the trace). Composed as platform->agent intersection (compliance ceiling) with org+agent union inside that ceiling — an agent cannot widen trust beyond what the platform allows.
`protected_surface`	No	Org-declared protected surface policy (MNE-830). Strengthen-only UNION across platform → org → team → agent: each scope may add entries; none may remove. The composer always emits this block; callers omit it to inherit the composed floor. See ADR-037 §protected_surface.
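
Three of the composition rules in the table above are simple enough to sketch: strictest-wins for `mode`, the ordering and range constraint on `thresholds`, and OR-per-field for `screen_surfaces`. This is an illustration of the stated rules only, not the server's composer. The threshold key names (`warn`, `quarantine`, `block`) are inferred from the constraint text, and the surface names in the usage are made up.

```python
from typing import Dict, Iterable

# Ranking from the `mode` row: enforce > nudge > observe > off.
MODE_RANK = {"off": 0, "observe": 1, "nudge": 2, "enforce": 3}


def compose_mode(modes: Iterable[str]) -> str:
    """Strictest-wins: the highest-ranked mode across all scopes."""
    return max(modes, key=lambda mode: MODE_RANK[mode])


def thresholds_valid(thresholds: Dict[str, float]) -> bool:
    """Check warn <= quarantine <= block with every value in [0, 1]."""
    warn = thresholds["warn"]
    quarantine = thresholds["quarantine"]
    block = thresholds["block"]
    in_range = all(0.0 <= value <= 1.0 for value in (warn, quarantine, block))
    return in_range and warn <= quarantine <= block


def compose_screen_surfaces(scopes: Iterable[Dict[str, bool]]) -> Dict[str, bool]:
    """OR-per-field: a surface is inspected if any scope requires it."""
    composed: Dict[str, bool] = {}
    for scope in scopes:
        for surface, enabled in scope.items():
            composed[surface] = composed.get(surface, False) or enabled
    return composed
```

For example, `compose_mode(["observe", "enforce", "nudge"])` picks `"enforce"`, and a card with `warn` above `quarantine` fails `thresholds_valid`.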

Tool Definition Quality

A3.8/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare this as a write operation (readOnlyHint=false) with destructive potential (destructiveHint=true). The description adds significant behavioral context: it accepts YAML or JSON, explains the server-side composition merge across scopes, and notes the required idempotency key. While it doesn't detail all side effects (e.g., overwrite behavior), it provides valuable transparency beyond the annotations regarding the multi-scope merging process.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, dense sentence that front-loads the core action ('Publish or replace the protection manifest') and concisely adds key details about input formats and composition behavior. It is not overly long and avoids redundancy, though it could benefit from clearer structure (e.g., separate sentences or bullet points) for readability.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of the tool (large nested body, output schema exists), the description covers the essential action, input format, composition behavior, and required header. However, it omits mention of the response (composed card) and does not address error cases or authorization needs. The output schema compensates, but the description could be more complete for a tool with this many intricate fields.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so the input schema already fully documents both parameters (agent_id and body) with types and descriptions. The description adds that 'body is the full UnifiedProtectionCard' and that it accepts YAML or JSON, but these are already implied by the schema. The additional format note is minor. Baseline 3 is appropriate given no gaps in schema documentation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Publish or replace the protection manifest'. It specifies the resource (UnifiedProtectionCard) and the action (publish/replace with server-side composition). This distinguishes it from sibling tools like 'put_alignment_by_agent' which operates on a different resource (alignment manifest) and 'preview_compose_protection_by_agent' which only previews composition without writing.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage via the composition cascade ('merges it across the platform → org → team → agent cascade') and mentions a required header ('Requires Idempotency-Key'). However, it does not explicitly state when to use this tool versus siblings like the preview tool, nor does it provide any exclusions or alternatives. The guidance is minimal and buried in technical detail.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

report_recipe_fn_fpA

Submit a false-positive / false-negative correction for one of Mnemom's automated detection rules (a 'recipe') — technical feedback that improves detection accuracy, like filing a bug report against a spam filter.

ParametersJSON Schema

Name	Required	Description
`type`	Yes	'fn' (false negative — recipe should have fired) or 'fp' (false positive — recipe fired on legitimate behaviour).
`summary`	Yes	Customer's description of what happened.
`agent_id`	No	Optional. Customer's agent id, when the report concerns a specific agent.
`evidence`	No	Optional raw payload / log excerpt the admin reviewer can inspect.
`recipeId`	Yes	The detection_recipes id the report is filed against (the recipe that misfired or failed to fire).
`checkpoint_id`	No	Optional. Related integrity_checkpoints id (helps the reviewer correlate).
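
A small sketch of assembling the arguments for this tool, based only on the parameter table above. The helper name `build_report_args` is hypothetical, and treating `evidence` as a string is an assumption, since the table describes it only as a raw payload or log excerpt.

```python
from typing import Any, Dict, Optional


def build_report_args(
    recipe_id: str,
    report_type: str,
    summary: str,
    agent_id: Optional[str] = None,
    evidence: Optional[str] = None,  # assumption: passed as text
    checkpoint_id: Optional[str] = None,
) -> Dict[str, Any]:
    """Build the arguments dict, sending optional fields only when set."""
    if report_type not in ("fn", "fp"):
        raise ValueError("type must be 'fn' (false negative) or 'fp' (false positive)")
    if not summary.strip():
        raise ValueError("summary is required")

    args: Dict[str, Any] = {"recipeId": recipe_id, "type": report_type, "summary": summary}
    optional = {"agent_id": agent_id, "evidence": evidence, "checkpoint_id": checkpoint_id}
    args.update({key: value for key, value in optional.items() if value is not None})
    return args
```

Note the mixed casing in the schema: `recipeId` is camelCase while `agent_id` and `checkpoint_id` are snake_case.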

Output Schema

ParametersJSON Schema

Name	Required	Description
`ok`	Yes
`type`	Yes
`candidate_id`	Yes
`related_recipe_id`	Yes

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description conveys that the tool submits technical feedback, but beyond the analogy to a bug report, it does not disclose behavioral traits such as side effects, persistence, or review process. Since annotations provide minimal behavioral hints (only non-destructive), the description could add more context about what happens after submission.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, well-structured sentence that immediately states the core action and resource, followed by an effective analogy. No unnecessary words or repetition.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description adequately explains the tool's purpose for a simple submission action. With full schema coverage and an output schema, the description does not need to detail return values. A minor improvement would be mentioning that a report is created for admin review, but overall it is complete enough for this tool's complexity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema covers all 6 parameters with descriptions, achieving 100% coverage. The global description adds overall context but does not supplement individual parameter details. Given the high schema coverage, a baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the action ('submit'), resource ('correction'), and specifies the two types (false-positive/false-negative), making the purpose immediately clear. It also distinguishes the tool from all sibling tools, none of which perform a similar reporting function.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies the tool is for reporting incorrect detections, akin to filing a bug report, but it does not explicitly state when to use it or when not to, nor does it provide alternatives or prerequisites. The usage context is clear but lacks explicit guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

scan_trustA

Read-onlyIdempotent

Scan a website's agent-trust-readiness and return a signed scorecard (Trust, plus an Access axis on newer rubrics). Zero-auth. Results are CACHED for up to 24h — check cached and scannedAt on the result; pass fresh: true to force a re-scan (rate-limited). Proxies to the SSRF-locked isittrustready scanner; the Ed25519 signature + permalink are preserved verbatim. Rubric + docs: https://www.isittrustready.ai/rubric and https://docs.mnemom.ai/.

ParametersJSON Schema

Name	Required	Description	Default
`url`	Yes	Domain or URL to scan, e.g. "example.com" or "https://example.com".
`fresh`	No	Force a fresh re-scan instead of the cached result (results are cached up to 24h; the engine rate-limits re-scans). Equivalent to the scanner's rescan flag.

Output Schema

ParametersJSON Schema

Name	Required	Description
`grade`	Yes	Trust letter grade (A+…F).
`score`	Yes	0–100 weighted overall TRUST score.
`access`	No	The independent Access/discoverability axis (never blended with Trust). Present from the two-axis rubric (0.3.0+).
`cached`	No	True when served from the scanner's 24h cache rather than a fresh scan.
`schema`	Yes	iitr-scan schema version string (e.g. "iitr-scan/v0.N").
`target`	Yes	Normalized host that was scanned.
`permalink`	No	Shareable /r/ permalink (only on /r/ responses; transport field).
`scannedAt`	No	When this scorecard was produced. Results are cached up to 24h — pass fresh:true to scan_trust to force a re-scan.
`signature`	Yes	Ed25519 signature over the canonical result (transport field; stripped before verify).
`categories`	No	Trust-axis categories with per-category scores + checks.
`verification`	No	Self-describing in-band verification block {alg, kid, jwks, canonicalization} — how to verify this scorecard's signature. Self-describing, so signed-EXCLUDED (stripped before verify).
`rubricVersion`	No	Rubric version (e.g. "0.4.0").
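
The description asks callers to check `cached` and `scannedAt` before deciding whether to pass `fresh: true`. A minimal sketch of that decision follows. It assumes `scannedAt` is an ISO 8601 timestamp string, which this page does not confirm, and the helper name `needs_fresh_scan` is hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Dict, Optional


def needs_fresh_scan(
    result: Dict[str, Any],
    max_age: timedelta,
    now: Optional[datetime] = None,
) -> bool:
    """Return True when a cached scorecard is older than the caller tolerates."""
    if not result.get("cached"):
        return False  # not served from cache, nothing to refresh

    scanned_at = result.get("scannedAt")
    if scanned_at is None:
        return True  # cached but of unknown age: be conservative

    # Assumption: ISO 8601 string, possibly with a trailing "Z" for UTC.
    timestamp = datetime.fromisoformat(scanned_at.replace("Z", "+00:00"))
    if timestamp.tzinfo is None:
        timestamp = timestamp.replace(tzinfo=timezone.utc)

    current = now or datetime.now(timezone.utc)
    return current - timestamp > max_age
```

Because forced re-scans are rate-limited, a caller would likely pick a `max_age` close to what it actually needs rather than always requesting a fresh scan.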

Tool Definition Quality

A4.2/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses caching behavior (24h), the fresh parameter to bypass cache, rate-limiting, proxy to SSRF-locked scanner, and preservation of Ed25519 signature and permalink. These details go well beyond the annotations (readOnlyHint, idempotentHint) to provide operational context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise at a handful of short sentences, front-loads the main action, and every sentence adds essential information without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (2 parameters, output schema exists), the description fully covers behavior: caching, fresh scans, signatures, documentation links. No gaps remain for effective agent invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% (both parameters described, including examples for 'url' and the 'fresh' flag's equivalence to the scanner's rescan flag). The description reinforces 'fresh' by tying it to the 24h cache and noting that re-scans are rate-limited, providing some value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool scans a website for agent-trust-readiness and returns a signed scorecard. It specifies the action and resource but does not differentiate from sibling tools like verify_scan or get_reputation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for obtaining a trust scorecard, mentions caching and the fresh flag, but provides no explicit guidance on when to use this tool versus alternatives or when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_reputation_directoryA

Read-onlyIdempotent

Resolve an agent name or id-prefix to a real agent_id over the PUBLIC reputation directory (only agents whose reputation visibility is public). Zero-auth. The arriving-agent entry point: discover a concrete agent_id, then call get_reputation / verify_reputation on it.

ParametersJSON Schema

Name	Required	Description	Default
`q`	No	Name search (ilike) or agent-id prefix match.
`page`	No	1-based page number for pagination. Default 1.
`sort`	No	Result ordering. Default "score" (highest-rated first); other supported keys order by recency or name.	score
`grade`	No	Filter to one grade (e.g. `AAA`, `B`, `NR`).
`per_page`	No	Number of results per page. 1–100, default 20.
`confidence`	No	Filter to agents at a given reputation-confidence level (driven by how much evidence backs the score).
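
The pagination constraints in the table (1-based `page`, `per_page` between 1 and 100, default sort `score`) can be enforced client-side before calling the tool. The sketch below is one way to do that; the helper names are hypothetical, and `page_count` assumes the `total` output field is the total number of matching agents, which the output schema does not describe.

```python
from typing import Any, Dict, Optional


def build_search_args(
    q: Optional[str] = None,
    page: int = 1,
    per_page: int = 20,
    sort: str = "score",
    grade: Optional[str] = None,
    confidence: Optional[str] = None,
) -> Dict[str, Any]:
    """Validate pagination and return the arguments dict for the search tool."""
    if page < 1:
        raise ValueError("page is 1-based")
    if not 1 <= per_page <= 100:
        raise ValueError("per_page must be between 1 and 100")

    args: Dict[str, Any] = {"page": page, "per_page": per_page, "sort": sort}
    for key, value in (("q", q), ("grade", grade), ("confidence", confidence)):
        if value is not None:
            args[key] = value
    return args


def page_count(total: int, per_page: int) -> int:
    """Number of pages needed to cover `total` results (ceiling division)."""
    return -(-total // per_page)
```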

Output Schema

ParametersJSON Schema

Name	Required	Description
`page`	Yes
`total`	Yes
`agents`	Yes
`per_page`	Yes

Tool Definition Quality

A4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Adds valuable context beyond annotations: 'only agents whose reputation visibility is public', 'zero-auth', and the resolution step. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, zero waste. Highly concise and well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With output schema available and 6 params, the description covers the essential context: public directory, zero-auth, entry point. Could mention pagination but not needed given schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so description doesn't need to add param details. It doesn't explain parameters beyond the schema, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it resolves an agent name or id-prefix to a real agent_id from the public directory. Implicitly distinguishes from siblings like get_reputation by positioning it as the entry point, but no explicit differentiation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Specifies zero-auth and that it's the entry point for discovering agent IDs before calling get_reputation/verify_reputation. Provides clear context but no explicit when-not-to-use or alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

verify_agent_bindingA

Read-onlyIdempotent

Verify that an API key is bound to this agent — confirms the holder is the accountable owner (anti-impersonation), hashing the key client-side.

ParametersJSON Schema

Name	Required	Description	Default
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)
`key_hash`	Yes	First 16 hex chars of SHA256(apiKey) or SHA256(apiKey + '\|' + agentName) for named agents.
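
The `key_hash` rule above can be computed client-side so the raw API key never leaves the caller. This is a minimal sketch using Python's standard `hashlib`; it assumes UTF-8 encoding and lowercase hex output, neither of which is stated on this page, and it reads the `\|` in the table as an escaped literal `|` separator.

```python
import hashlib
from typing import Optional


def key_hash(api_key: str, agent_name: Optional[str] = None) -> str:
    """First 16 hex chars of SHA256(apiKey), or SHA256(apiKey + '|' + agentName)."""
    material = api_key if agent_name is None else f"{api_key}|{agent_name}"
    # Assumption: the key material is hashed as UTF-8 bytes.
    digest = hashlib.sha256(material.encode("utf-8")).hexdigest()
    return digest[:16]
```

The result would be passed as `key_hash` alongside `agent_id`; only this 16-character prefix is sent, not the key itself.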

Output Schema

ParametersJSON Schema

Name	Required	Description
`bound`	Yes	True if the submitted hash matches the registered key.
`caller`	Yes	Self-describing caller context. `key_prefix` is present iff `caller` is `org_member`; `anonymous` and `authenticated` (non-member) callers receive `{ bound, caller }` only. This is BY DESIGN, not a partial or leaky response — read `caller` instead of treating an absent `key_prefix` as a leak. The boolean `bound` is an anti-impersonation confirmation that reveals nothing the caller did not already supply.
`key_prefix`	No	First 16 chars of the registered key (or null if not yet captured). Returned ONLY to callers who are members of the agent's org (`caller` === `org_member`); omitted for anonymous / cross-org callers.

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint and idempotentHint, indicating safety and idempotency. The description adds the specific behavioral detail of client-side hashing, which is valuable context beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

A single well-structured sentence that front-loads the core action ('Verify that an API key is bound to this agent') and concisely explains the purpose and method without any filler.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the annotations and presence of an output schema, the description adequately covers the tool's purpose and key behavior. It could benefit from mentioning the output format or common use cases, but these are partially covered by the output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema covers both parameters with descriptions and an example. The description adds the context that the key is hashed client-side, reinforcing the key_hash parameter's purpose. With 100% schema coverage, the description provides marginal added value.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (verify), resource (API key bound to agent), and purpose (anti-impersonation). It distinguishes itself from sibling tools like verify_reputation by focusing on key binding ownership.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies use for identity verification when impersonation is a concern, but it does not explicitly state when to use this tool versus alternatives. No when-not-to-use guidance or exclusions are provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

verify_reputationA

Read-onlyIdempotent

Attest an agent's Trust Rating — returns a Merkle-root + hash-chain attestation (hash_chain_valid) proving the rating derives from an unbroken, append-only checkpoint chain, plus a pointer to the signed integrity certificate. This is a chain-integrity attestation, NOT an in-band Ed25519 signature check (that parity is verify_scan, for website scorecards).

Parameters

Name	Required	Description	Default
`agent_id`	Yes	Agent identifier (e.g. smolt-abc123)

Output Schema

Name	Required	Description
`grade`	Yes
`score`	Yes
`agent_id`	Yes
`computed_at`	Yes
`verification`	Yes
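The input and output tables above can be exercised with a small client-side sketch. It builds a standard MCP `tools/call` JSON-RPC request for `verify_reputation` and checks that a result carries the five required output fields. The helper names are hypothetical, and the types and contents of the output fields are not specified on this page, so the sketch only checks for their presence.

```python
# Required output fields as listed in the Output Schema table.
REQUIRED_OUTPUT_FIELDS = ("grade", "score", "agent_id", "computed_at", "verification")


def build_verify_reputation_request(request_id: int, agent_id: str) -> dict:
    """Build an MCP tools/call request for verify_reputation (hypothetical helper)."""
    if not agent_id:
        raise ValueError("agent_id is required")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "verify_reputation",
            "arguments": {"agent_id": agent_id},
        },
    }


def missing_output_fields(result: dict) -> list:
    """Return the required output fields that are absent from a result."""
    return [field for field in REQUIRED_OUTPUT_FIELDS if field not in result]
```

A caller might treat any non-empty list from `missing_output_fields` as a malformed response before reading `verification`.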

Tool Definition Quality

A (4.3/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint, idempotentHint, and destructiveHint. The description adds behavioral context: it returns a Merkle-root + hash-chain attestation and a pointer to a signed integrity certificate, clarifying the nature of the attestation beyond the annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with the core purpose and output, then a clarifying distinction. Every sentence earns its place, no extraneous information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (one required parameter, no nested objects), the description is complete. It explains the output nature and the type of attestation, sufficient for an agent to understand its use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The single parameter agent_id is fully described in the schema (100% coverage). The description does not add additional meaning beyond what the schema provides, so it meets the baseline of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states that the tool attests an agent's Trust Rating, using the specific verb 'attest' and naming the resource. It distinguishes itself from verify_scan by clarifying that it is a chain-integrity attestation, not an Ed25519 signature check.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly contrasts with verify_scan, indicating when not to use this tool. However, it doesn't provide broader guidance on when to use this tool versus other siblings like get_reputation or scan_trust.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

verify_scan: A

Read-only, Idempotent

Verify a website scan scorecard's Ed25519 signature IN-BAND (verify, don't trust). Pass a scan (a scorecard from scan_trust) or a url to re-scan; returns {verified, key_id, canonicalization} checked against the public key at mnemom://iitr/jwks. Zero-auth. Spec + rubric: https://www.isittrustready.ai/rubric and https://docs.mnemom.ai/.

Parameters

Name	Required	Description	Default
`url`	No	Alternatively, a domain/URL to re-scan and then verify.
`scan`	No	A scan scorecard previously returned by scan_trust (or iitr's /r/ JSON), passed back verbatim to verify. Same shape as scan_trust's result; the signature is checked against mnemom://iitr/jwks.

Output Schema

Name	Required	Description
`key_id`	Yes	The signing key id (kid) checked.
`reason`	No	Why verification failed or could not be evaluated (absent when verified).
`verified`	Yes	True iff the signature verifies against the in-band JWKS.
`algorithm`	Yes	Always "Ed25519".
`scorecard`	No	The scorecard verified (present when re-scanned via `url`).
`canonicalization`	Yes	The exact canonicalization used (so the verdict is reproducible).
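As a rough illustration of how a client might use the two input options and the output fields listed above, the sketch below builds an MCP `tools/call` request for `verify_scan` and turns a result into a short verdict. The helper names are hypothetical. Requiring exactly one of `scan` or `url` is an assumption made here for clarity: the schema marks both as optional and does not say how the server treats a call with both or neither.

```python
REQUIRED_FIELDS = ("key_id", "verified", "algorithm", "canonicalization")


def build_verify_scan_request(request_id, *, scan=None, url=None):
    """Build an MCP tools/call request for verify_scan (hypothetical helper).

    Assumption: the caller passes exactly one of `scan` or `url`. This is a
    client-side convention in this sketch, not a documented server rule.
    """
    if (scan is None) == (url is None):
        raise ValueError("pass exactly one of scan or url")
    arguments = {"scan": scan} if scan is not None else {"url": url}
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "verify_scan", "arguments": arguments},
    }


def interpret_verify_scan(result: dict) -> str:
    """Summarize a verify_scan result using the documented output fields."""
    missing = [field for field in REQUIRED_FIELDS if field not in result]
    if missing:
        raise ValueError(f"missing required fields: {missing}")
    if result["algorithm"] != "Ed25519":
        # The output schema documents this field as always "Ed25519".
        raise ValueError(f"unexpected algorithm: {result['algorithm']}")
    if result["verified"] is True:
        return f"verified with key {result['key_id']}"
    # `reason` is optional and documented as absent when verified.
    return f"not verified: {result.get('reason', 'no reason given')}"
```

Passing the scorecard back verbatim matters in practice: the `scan` parameter is described as being checked as-is, so a client should avoid re-serializing or editing it before the call.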

Tool Definition Quality

A (4.5/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds value beyond annotations by stating the tool is zero-auth and describes the return format ({verified, key_id, canonicalization}) and verification against a public key URI. Annotations already indicate read-only, idempotent behavior, so the description enriches context without contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences long, each serving a distinct purpose: stating the action, specifying parameters, and providing reference links. No extraneous information; every sentence is justified.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of the tool (nested objects, output schema exists), the description covers the core purpose, input options, behavioral notes (zero-auth), and references for further detail. It fully equips an agent to invoke the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but the description adds context: the scan parameter originates from scan_trust, and the url parameter triggers a re-scan. This goes beyond the schema descriptions, which detail object properties but not the behavioral implications of each parameter.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool verifies the Ed25519 signature of a website scan scorecard in-band. It explicitly distinguishes itself from siblings by focusing on verification of scan scorecards, as opposed to scanning (scan_trust) or verifying other entities (verify_agent_binding, verify_reputation).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains that the tool accepts a scan from scan_trust or a URL to re-scan, which implies it should be used after a scorecard has been obtained. However, it does not explicitly state when not to use the tool or compare it directly to alternatives, which would elevate it to a 5.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
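Before publishing the file, it can help to check it locally. The sketch below parses a candidate `glama.json` and reports problems against the structure shown above. The function name is hypothetical, and the exact-match email comparison is an assumption: this page does not say whether Glama compares addresses case-sensitively or what other fields the schema allows.

```python
import json

EXPECTED_SCHEMA = "https://glama.ai/mcp/schemas/connector.json"


def check_claim_file(text: str, account_email: str) -> list:
    """Return a list of problems found in a candidate glama.json document."""
    try:
        doc = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(doc, dict):
        return ["top level must be a JSON object"]

    problems = []
    if doc.get("$schema") != EXPECTED_SCHEMA:
        problems.append("$schema does not match the expected connector schema URL")

    maintainers = doc.get("maintainers")
    if not isinstance(maintainers, list) or not maintainers:
        problems.append("maintainers must be a non-empty list")
        return problems

    # Assumption: emails are compared by exact string match.
    emails = [m.get("email") for m in maintainers if isinstance(m, dict)]
    if account_email not in emails:
        problems.append("no maintainer email matches the account email")
    return problems
```

An empty list means the file has the expected shape; it does not confirm that Glama has detected or verified it.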
