Standout
Server Details
Anti-slop design taste for AI coding agents: art directions, section code, 0-100 page critic.
- Status
- Healthy
- Last Tested
- Transport
- Streamable HTTP
- URL
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 4.1/5 across 24 of 28 tools scored. Lowest: 3.1/5.
Multiple tools have overlapping or indistinct responsibilities: build_from_reference, build_from_reference_v2, and synthesize_direction_from_references all handle reference-based direction generation. Similarly, get_design_direction and synthesize_direction_from_references both produce art directions. Critiques are split across critique_design, critique_ui, critique_similarity, and client_report, which includes a score. This overlap will likely cause an agent to select the wrong tool.
Most tools follow a consistent verb_noun pattern (e.g., critique_design, get_design_direction, list_directions). However, there are minor deviations: build_from_reference_v2 uses a preposition and version suffix, and make_it_standout breaks the pattern entirely. Overall, the naming is predictable enough for an agent to infer purpose.
With 28 tools, the set is on the heavy side. While the domain (design audit and generation) is broad, several tools could be consolidated (e.g., merging critique variants or reference-building tools). The count may cause decision fatigue for an agent, though each tool has a clear niche.
The tool set covers the full design workflow: direction selection, reference search, critiques, component generation, motion, premium assets, and client reports. Minor gaps exist, such as no direct deployment or client management tools, but the core lifecycle is well-supported.
Available Tools
28 toolsbuild_from_referenceAInspect
Reference-first build — 'show me what you like, I'll build it.' Give a reference you LIKE (a public site URL via reference_url, and/or a curated library id via reference_id from search_references) plus what YOU are building, and get back a matching StandOut direction, a reference-backed build plan, and the next call for section code. This is the front door for building from an EXAMPLE instead of a text brief: Standout reads the URL server-side for its semantic signal (title, headings, copy) and folds it into the direction match. Note: pixel-level palette/type extraction from an image is not yet supported (that is the Phase 2 vision pass) — for now it matches on the page's text/structure plus the curated library. Pass reference_id to anchor on an exact curated reference.
| Name | Required | Description | Default |
|---|---|---|---|
| vibe | No | Desired feel, e.g. premium, technical, warm, playful | |
| audience | No | Who the page must convince | |
| business | Yes | What YOU are building (not the reference), e.g. 'booking site for a Fiji dive shop' | |
| industry | No | Industry or category, e.g. developer tool, restaurant, law firm | |
| direction | No | Optional StandOut direction id to force, e.g. terminal-grade | |
| reference_id | No | Or a curated reference id from search_references to anchor the match on | |
| reference_url | No | Public URL of a site whose style you like. Standout reads it server-side for its semantic signal. | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses that URLs are read server-side for semantic signals, that pixel-level palette/type extraction is not supported, and that matching is based on text/structure plus the curated library. This provides sufficient transparency about the tool's behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is somewhat lengthy but front-loaded with a clear hook ('show me what you like, I'll build it'). Every sentence adds value, though minor redundancy could be trimmed. The structure is logical: purpose, then usage, then limitations and parameter guidance.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 8 parameters, no output schema, and no annotations, the description covers all essential aspects: workflow, parameter roles, limitations, and expected outputs. It addresses potential confusion (e.g., distinguishing business from reference) and sets expectations for what is and isn't supported.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and the description adds significant meaning beyond schema descriptions. It clarifies that 'business' means what the user is building (not the reference), explains how reference_url and reference_id interact, and notes the limitation on image extraction. This helps prevent misuse.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: building from a reference (URL or library ID) rather than a text brief. It uses specific verbs ('build', 'give', 'get back') and distinguishes from text-only approaches by explicitly calling itself 'the front door for building from an EXAMPLE.'
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explains when to use (when you have a reference), mentions limitations (no pixel-level extraction yet), and provides guidance on parameters (e.g., 'Pass reference_id to anchor on an exact curated reference'). It implicitly differentiates from text-only briefs but does not explicitly list when not to use or mention sibling alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
build_from_reference_v2AInspect
Reference Intelligence Layer — build from a site you like without cloning it. Pass a public reference_url plus what YOU are building and optional favorite_bits such as 'hero animation', 'pricing card', or 'color mood'. Standout captures server-readable HTML/CSS signals, extracts Reference DNA, names what to borrow, names what to avoid, gives a similarity-risk guardrail, then returns a reference-backed original build direction. This is the right tool when the user says 'I like this site but don't want to steal it.'
| Name | Required | Description | Default |
|---|---|---|---|
| vibe | No | Desired feel, e.g. premium, technical, warm, playful | |
| audience | No | Who the page must convince | |
| business | Yes | What YOU are building (not the reference), e.g. 'Mac app that makes websites and PDFs readable outside' | |
| industry | No | Industry or category, e.g. developer tool, Mac utility, restaurant | |
| direction | No | Optional StandOut direction id to force, e.g. terminal-grade | |
| favorite_bits | No | Specific parts the user likes, e.g. ['hero animation', 'warm colors', 'pricing card']. These become inspiration tokens, not copied components. | |
| reference_url | Yes | Public URL of a site whose style or specific parts you like | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, description carries full burden. It details the process: captures HTML/CSS signals, extracts Reference DNA, names what to borrow/avoid, gives similarity-risk guardrail, and returns a direction. Lacks disclosure on auth or side effects but appropriate for a read-analysis tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Description is moderately sized, front-loaded with purpose. Each sentence adds value, though slightly verbose. Could be trimmed without losing clarity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 8 parameters, no output schema, no annotations, the description covers core function and key parameters. Return value explained sufficiently ('reference-backed original build direction'). Missing details on error handling but acceptable.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, and description adds meaning beyond schema. Explains 'favorite_bits' as inspiration tokens not copied components, and gives examples for 'business' and 'favorite_bits'.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool's action: 'build from a site you like without cloning it'. Uses specific verb 'build' and resource 'reference-backed original build direction'. Distinguishes from siblings by mentioning it's for when user says they like a site but don't want to steal it.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly provides when to use: 'the right tool when the user says I like this site but don't want to steal it'. Does not mention when not to use or alternatives explicitly, but the scenario is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
client_reportAInspect
Generate a CLIENT-READY design audit report for an agency to send to its client: a Standout Score, what's holding the page back, what to fix and why it matters, a client-safe summary, and the page screenshots — all in plain language with NO internal jargon. Pass url for the current page; optionally pass after_url (or after_source) for a rebuilt version to get a true before/after with both screenshots and the score improvement. Pass agency_name to WHITE-LABEL it: the report text and the shareable public report page both read 'Prepared by {agency}', ready to send to the client as your own.
| Name | Required | Description | Default |
|---|---|---|---|
| url | No | The current (before) page URL to audit | |
| source | No | Raw HTML/CSS of the current page, if not published | |
| business | No | Business name/description for the report header and classification | |
| after_url | No | The rebuilt (after) page URL, for a before/after comparison | |
| agency_name | No | White-label: your agency/studio name, shown as 'Prepared by X' in the report and on the shareable report page | |
| after_source | No | Raw HTML/CSS of the rebuilt page, for a before/after comparison |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It details report contents but does not disclose if the tool creates a resource (e.g., stored report) or only returns data. It also omits any side effects or permissions needed, leaving some ambiguity for a generation tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, well-structured paragraph with no wasted words. It front-loads the purpose, then explains optional parameters, and ends with white-labeling. Every sentence earns its place, making it highly efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description adequately covers the tool's inputs and output contents for a moderate complexity (6 params, no output schema). However, it lacks explicit detail on the return format (e.g., URL to report, JSON structure, or file), which would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds value by explaining how 'after_url' and 'after_source' enable before/after comparison with score improvement, and how 'agency_name' enables white-labeling. This goes beyond the schema's basic descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool generates a 'CLIENT-READY design audit report' with specific elements like Standout Score, issues, fixes, and screenshots. It distinguishes from sibling tools like critique_ui and critique_design by emphasizing client-ready, jargon-free language and white-labeling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit guidance on required 'url' and optional parameters 'after_url', 'after_source', and 'agency_name' for white-labeling. It explains when to use before/after comparison but does not explicitly state when not to use this tool or suggest alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
critique_designAInspect
Brutal, specific design critique of an existing page: automatic AI-slop detection (banned gradients, centered-hero defaults, h-screen bugs, cliche copy) plus a prioritized fix list with exact values. When you pass a URL, the response also includes LIVE SCREENSHOTS of the rendered page at phone (390px) and desktop (1440px) widths — look at them and fix what fails visually — and saves an unlisted SHAREABLE SCORE REPORT (score + category breakdown + screenshots) whose link you should give to the user. Always prefer passing the published URL over raw source for these reasons.
| Name | Required | Description | Default |
|---|---|---|---|
| url | No | Public URL of the page to critique | |
| source | No | Raw HTML/CSS/JSX of the page | |
| direction | No | Direction pack id the page implements (e.g. ink-and-paper). ALWAYS pass this when the page was built from get_design_direction so the critique respects the design system instead of fighting it. | |
| description | No | Description of the current design if no source available |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description fully discloses key behaviors: it takes screenshots at two widths, saves an unlisted shareable score report, and expects the agent to examine visuals. No contradictions or hidden side effects are evident.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is informative but verbose, especially the sentence about screenshots and reports. It is front-loaded with the core purpose, but could be more concise while retaining necessary details.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (4 params, no annotations, no output schema), the description covers purpose, usage, behavior, and parameter guidance well. It lacks only minor specifics like output format or access requirements.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema has 100% coverage with descriptions. The tool description adds crucial context: preferring URL over source, and always passing 'direction' when built from get_design_direction. This enhances schema information significantly.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's function: providing a brutal, specific design critique with AI-slop detection and a fix list. It distinguishes from sibling critique tools (critique_similarity, critique_ui) by focusing on existing pages with screenshots and score reports.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly advises preferring URL over raw source and includes a note about passing the direction parameter for design system consistency. However, it does not explicitly state when to avoid using this tool or compare it to alternatives beyond the sibling list.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
critique_similarityAInspect
Originality guardrail — compare a built page/source/description against a reference site and flag shared recognizable fingerprints. Use after build_from_reference_v2, before showing or publishing, to avoid producing a design that is visually too close to the inspiration.
| Name | Required | Description | Default |
|---|---|---|---|
| favorite_bits | No | The same favorite bits passed to build_from_reference_v2, if any | |
| reference_url | Yes | Public URL of the reference site the user liked | |
| response_format | No | Return markdown or structured JSON | |
| candidate_source | No | Built page HTML/CSS/JSX to compare against the reference | |
| candidate_description | No | Plain-language description of the built design if source is unavailable |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. States it's an 'originality guardrail' but does not disclose whether it has side effects, access requirements, or what happens when similarities are found (e.g., returns a score, blocks output). Lacks behavioral details beyond core purpose.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with zero waste: first sentence defines purpose, second provides temporal placement. Front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a tool with 5 parameters and no output schema, the description fails to explain return format (flag vs score vs block) or behavior on failure/lack of similarity. Missing critical information for correct invocation and interpretation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with good inline descriptions. Tool description adds no extra parameter clarification beyond saying 'built page/source/description' which maps to candidate_source and candidate_description. At baseline 3 because schema already handles param semantics well.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states verb ('compare') and resource (built page vs reference site) with specific outcome ('flag shared recognizable fingerprints'). Distinct from sibling tools like critique_design by focusing on originality against a specific reference.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use after build_from_reference_v2, before showing or publishing', giving clear context. Does not mention when not to use or alternatives, but context is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
critique_uiAInspect
Deterministic critique for APPLICATION UI (dashboards, admin panels, SaaS views): runs the app-UI slop rulebook against React/JSX/HTML source (radius chaos, card-in-card, gray-on-gray text, raw palette classes, missing empty/loading/error states, clickable divs, killed focus rings) and, when a Standout app theme is installed, a theme-conformance pass (foreign colors, missing semantic token classes). Returns a 0-100 UI score with a ship verdict and a prioritized fix list. Use after building every view; re-run until the score clears 85. For marketing/landing PAGES use critique_design instead.
| Name | Required | Description | Default |
|---|---|---|---|
| view | No | What this view is, e.g. 'invoices table', for context | |
| theme | No | Installed app theme id (e.g. graphite-ledger) to also check token-law conformance | |
| source | Yes | The component or screen source to critique (React/JSX/TSX/HTML/CSS) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Describes the deterministic rulebook and enumerates checks (radius chaos, card-in-card, etc.). Explains extra theme-conformance pass when a theme is provided. Returns a score, verdict, and fix list. No annotations provided, but description is thorough.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with purpose, includes key details without fluff. Every part earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, the description mentions return value (score, verdict, fix list). It covers both base and theme passes, usage guidance, and alternative tools. Complete for a tool of this complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema descriptions already cover all parameters (100% coverage). The description adds context about when the theme parameter triggers additional checks ('when a Standout app theme is installed, a theme-conformance pass'). This adds value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states it is a 'Deterministic critique for APPLICATION UI' and lists specific checks. It distinguishes itself from critique_design by saying 'For marketing/landing PAGES use critique_design instead.'
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly advises 'Use after building every view; re-run until the score clears 85' and gives a clear alternative for other use cases: 'For marketing/landing PAGES use critique_design instead.'
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_app_themeAInspect
Get a LOCKED design-token system for application UI (dashboards, admin panels, SaaS tools): every color role contrast-checked, exactly two radii, a five-step type scale, spacing, elevation and motion tokens, plus a Tailwind v4 semantic mapping, the theme's usage law, and the app-UI slop rulebook. This is how a vibe-coded app stops looking AI-generated: install the theme once and every view the agent builds matches. Call with NO theme to browse the menu (free); call with a theme id to receive the full system. Themes share brand DNA with the website directions, so a client's site and app stay on one identity.
| Name | Required | Description | Default |
|---|---|---|---|
| theme | No | Theme id from the menu, e.g. graphite-ledger, terminal-grade-ui, porcelain-ui. Omit to browse the menu (unmetered). | |
| response_format | No | Menu format when browsing: markdown (default) or JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses that the theme is 'LOCKED' and 'contrast-checked', describes the structure (two radii, five-step type scale, etc.), and states that installing the theme once ensures all views match. It implies a read-only operation with no side effects, which is transparent enough for an AI agent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with the core purpose and details. While it includes some fluff ('how a vibe-coded app stops looking AI-generated'), the essential information is presented efficiently. It is not excessively long and each sentence serves a purpose, though slight tightening could improve.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given there is no output schema, the description adequately explains return values: a menu when browsing or the full token system when a theme id is provided. It lists the content of the theme (color, radii, type scale, etc.), which is sufficient for an agent to understand what it will receive. The tool is simple and the description covers the necessary context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The schema covers all parameters (100% description coverage). The description adds meaning beyond the schema: it explains that omitting the theme parameter provides a free menu, and gives examples of theme ids (e.g., 'graphite-ledger'). It does not add much to response_format beyond what the schema provides, but overall it supplements the parameter documentation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool returns a 'LOCKED design-token system for application UI' and enumerates its components (color roles, radii, type scale, etc.). It distinguishes itself from sibling tools like website direction tools by stating themes share brand DNA with site directions, making clear it is for app UI theming.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly tells when to omit the theme parameter (to browse the menu for free) and when to provide a theme id (to receive the full system). It also mentions that themes maintain brand consistency with site directions, hinting at when this tool is appropriate versus sibling tools. However, it does not explicitly list alternatives or when not to use this tool.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_asset_packBInspect
Ready-to-paste CSS for a direction: custom properties (palette + fonts), button/card/nav treatments, grain texture, Google Fonts import, and the matched motion recipe list.
| Name | Required | Description | Default |
|---|---|---|---|
| query | No | Or describe the business/vibe and the best pack is chosen | |
| direction | No | Direction pack id (e.g. terminal-grade) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden for behavioral disclosure. It only lists output contents but fails to mention side effects (none expected), idempotency, authentication needs, rate limits, or return format. The description does not confirm whether the tool is read-only or if it performs any mutations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence that packs many items, making it slightly dense. It is front-loaded with 'Ready-to-paste CSS' which is good, but the structure could be improved with bullet points or separate clauses for readability. Every word adds value, but conciseness is slightly compromised by the run-on nature.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool has 2 optional parameters and no output schema, the description provides a reasonable overview of what is returned. However, it does not clarify the output format (e.g., string vs. file), whether it's a complete CSS file or snippets, or how the returned data integrates with other tools. It is adequate but not thorough.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Input schema covers 100% of parameters with descriptions, so the baseline is 3. The description adds no additional meaning beyond what the schema already provides for 'direction' and 'query'. It does not tie the listed output components to specific parameter values.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description explicitly states the tool returns 'ready-to-paste CSS for a direction' and enumerates specific components (palette, fonts, treatments, texture, import, motion recipes). This clearly distinguishes it from sibling tools like get_design_direction or get_motion_recipes, which serve different purposes.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not provide guidance on when to use this tool versus alternatives. It implies usage for obtaining CSS for a direction but lacks explicit context, when-not-to-use examples, or comparisons with sibling tools. The agent must infer the appropriate scenario.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_component_codeAInspect
Get FINISHED application-UI components (React + Tailwind, zero dependencies) written against a Standout app theme's semantic tokens: app shell with sidebar/topbar, stat row with mono tabular metrics, data table with layout-matched loading skeletons + instructive empty state + inline retryable error, and a settings form with inline validation, sticky save bar and a two-step danger zone. States are built in, so the agent wires data and content instead of inventing structure. Install the theme first via get_app_theme.
| Name | Required | Description | Default |
|---|---|---|---|
| theme | No | App theme id the components bind to (default graphite-ledger) | |
| component | No | Component id: app-shell, stat-row, data-table, settings-form, or 'all' (default) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must cover behavioral traits. It explains components are 'finished' with built-in states and that the agent 'wires data and content instead of inventing structure.' This provides some context but omits details on safety, permissions, or side effects.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single paragraph of 4-5 sentences, front-loading the main purpose. It is concise without unnecessary words, though a more structured list could improve scanability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's simplicity (2 optional params, no output schema), the description covers the key aspects: what is retrieved, the component types, built-in states, and a prerequisite. Missing explicit clarification of the return format (e.g., code string, file) but otherwise complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds meaningful value by elaborating on the component IDs (e.g., 'app shell with sidebar/topbar,' 'stat row with mono tabular metrics'). This enriches the agent's understanding beyond the schema's brief descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it retrieves finished application-UI components with specific types (app shell, stat row, data table, settings form). It distinguishes from general tools like get_app_theme, but does not explicitly contrast with similar siblings like get_premium_components or get_section_code.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when needing pre-built UI components and explicitly states a prerequisite ('Install the theme first via get_app_theme'). However, it lacks guidance on when not to use this tool or what alternatives exist among siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_design_directionAInspect
Get a complete, distinctive, reference-backed art direction for a website: palette with exact hex roles, type pairing, layout DNA, hero spec, component treatments, imagery rules, matched motion recipes, and public-product reference evidence. Use FIRST, before building, so the site has a coherent direction instead of defaults. The catalog holds 14 directions: 9 with light base palettes and 5 dark — pass tone to constrain. If you only know the business name, call it anyway: the response will give you a few quick questions to ask the user, then call again with their answers as vibe/audience. REDESIGNS: if the business has an existing website, pass its URL as current_site_url — Standout fetches it server-side and returns a content inventory (services, prices, hours, contacts) plus a redesign protocol, so no questions are needed.
| Name | Required | Description | Default |
|---|---|---|---|
| tone | No | Base palette tone. Pass when the client asks for a light or dark site; matching stays inside that pool (9 light / 5 dark directions) | |
| vibe | No | Desired feel, e.g. premium, playful, technical, warm | |
| audience | No | Who the site must convince | |
| boldness | No | How adventurous the signature concept should be: safe (conservative/trust-first businesses like clinics, accountants, law firms), elevated (default), bold, or experimental. A 'safe' request never receives a bold concept. Bold/experimental dials can unlock signature SET PIECES: finished interactive hero objects (drag-to-compare reveal, live pricing calculator, product console, scroll-driven process timeline) delivered as complete code by get_section_code. | |
| business | No | What the business is, e.g. 'dive shop in Fiji running charters'. If omitted, the response returns a few quick intake questions to ask the user instead of erroring. | |
| industry | No | Industry, e.g. restaurant, law firm, saas, gym | |
| current_site_url | No | The business's EXISTING website URL, for redesigns. Standout fetches it and returns the content to preserve + the redesign protocol |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden. It discloses that the catalog has 14 directions, that passing a current_site_url triggers server-side fetch and returns a content inventory, and that omitting business returns questions. However, it doesn't explicitly state side effects (e.g., no database writes) or rate limits, leaving minor gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is thorough but efficient, with each sentence adding value. It is front-loaded with the core purpose and then covers usage variants. While on the longer side, it is well-structured and avoids fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 7 parameters, no output schema, and complex behavior (catalog, redesigns, fallback questions), the description is remarkably complete. It explains error handling, redesign protocol, content inventory return, and how boldness unlocks components. No critical information seems missing.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with good descriptions, but the tool description adds significant value: for 'boldness', it explains that safe never gets bold concepts and bold/experimental unlock set pieces; for 'current_site_url', it explains the redesign protocol; for 'business', it explains the fallback to intake questions. This goes well beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Get a complete, distinctive, reference-backed art direction for a website.' It lists specific outputs (palette, type, layout DNA, etc.) and distinguishes from sibling tools by positioning as the first step before building and by mentioning the catalog of 14 directions. Unlike siblings like 'get_reference', this tool provides a complete direction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use FIRST, before building' and provides concrete scenarios: for new projects, pass tone; for redesigns, pass current_site_url; if only business name known, call anyway for intake questions. It also explains how boldness levels affect output. This gives clear when-to-use and when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_flowBInspect
Get the full step-by-step payload for one user journey: problem, steps, user actions, system responses, related direction ids, and agent implementation notes.
| Name | Required | Description | Default |
|---|---|---|---|
| flow_id | Yes | Flow id from search_flows | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden for behavioral transparency. It describes the return content but does not disclose whether the tool is read-only, idempotent, or requires specific permissions. There is no mention of error conditions or rate limits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, well-structured sentence that efficiently communicates the tool's purpose and return content. Every component listed adds value, and there is no redundant or irrelevant information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple retrieval tool with no output schema or annotations, the description adequately specifies the return payload contents. However, it could be more complete by explicitly stating that the operation is read-only and non-destructive, which would help the agent understand behavior without requiring inference.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the baseline is 3. The description adds context about the overall return payload but does not add new information about the parameters themselves beyond what is already in the schema (e.g., flow_id comes from search_flows, response_format controls output type).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves the full step-by-step payload for a user journey, listing specific components (problem, steps, user actions, etc.). It distinguishes from siblings like search_flows (which lists flows rather than detailed payloads) and get_design_direction (which returns design direction info).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description does not provide any guidance on when to use this tool versus alternatives, nor does it mention prerequisites (e.g., needing a flow_id from search_flows). It simply describes what the tool does, leaving the agent to infer usage context.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_free_trial_keyAInspect
Self-provision a free Standout trial key (25 calls, no signup) without leaving the conversation. Call this when no license key is configured or your Standout calls return a license error. Returns the key plus the exact reconnect instructions to hand your user. Free and unmetered; issuance is capped per network, so never call it when you already have a working key.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It adds behavioral context: 'Free and unmetered; issuance is capped per network' and explains it returns key and instructions. It does not mention destructive actions, but the context is sufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences, front-loaded with purpose, then usage condition, output, and caution. No wasted words, every sentence adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 0 parameters, no output schema, and no annotations, the description is mostly complete. It explains what it does, when to use, what it returns, and a caution. Minor gap: does not specify format of the key or instructions, but sufficient for a simple tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The tool has 0 parameters and 100% schema coverage. The description adds meaning beyond the schema by explaining the purpose and usage, earning a baseline 4.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'Self-provision a free Standout trial key (25 calls, no signup) without leaving the conversation.' It uses specific verbs and resource, and it is distinct from sibling tools which are about design, assets, etc.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit guidance: 'Call this when no license key is configured or your Standout calls return a license error' and 'never call it when you already have a working key.' This clearly indicates when to use and when not to.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_hero_conceptBInspect
Get a complete hero section concept: headline + subline copy (no marketing cliches), CTA labels, exact background treatment, and copy-paste ambient motion CSS matched to the design direction.
| Name | Required | Description | Default |
|---|---|---|---|
| business | Yes | The business this hero sells | |
| direction | No | Direction pack id from get_design_direction (e.g. coastal-film) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided; description discloses what is returned and a constraint (no cliches) but does not mention side effects, idempotency, prerequisites, or rate limits. Basic behavioral disclosure with gaps.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single sentence efficiently conveys the tool's output components, including a specific exclusion. Front-loaded with the main action. No unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers the primary purpose and output components but lacks details on return format or structure (e.g., how CSS is provided). No error handling or prerequisite info. Adequate for a generation tool but not fully complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with clear descriptions for both parameters. The tool description adds context about matching to design direction but does not provide additional semantic value beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool returns a complete hero section concept with specific components (headline, subline, CTA, background, motion CSS) and explicitly excludes marketing cliches. Distinct from sibling tools like get_design_direction or get_section_code.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description implies usage context (matched to design direction) but does not explicitly state when to use this tool vs alternatives like get_motion_recipes or get_flow. No exclusions or when-not-to-use guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_motion_recipesAInspect
Production-ready motion for any stack: CSS/vanilla-JS recipes (staggered entrances, scroll reveals, marquees, grain, parallax) PLUS finished shader-grade ambient pieces - WebGL gradient mesh, image ripple displacement, live halftone, velocity-reactive type rail, scroll-scrubbed zoom portal - palette-keyed to Standout directions. Request by ids or describe what you need; pass direction for palette-filled shader code.
| Name | Required | Description | Default |
|---|---|---|---|
| ids | No | Recipe or piece ids, e.g. ['gradient-mesh-flow','fade-rise-stagger'] | |
| query | No | Or describe the desired motion, e.g. 'webgl hero shader' or 'premium dark portfolio hovers' | |
| direction | No | Direction pack id (e.g. coastal-film); fills shader piece palettes with the pack's hexes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It mentions the output includes recipes and shader code, but does not disclose behavioral traits like side effects, rate limits, or required permissions. The tool appears read-only, but that is not explicitly stated.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is informative and front-loaded with the main value proposition. It lists examples concisely, though it could be slightly shorter without losing clarity. Every sentence adds useful information, balancing detail and brevity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
There is no output schema, so the description should explain the return format. It implies output is code (recipes, shaders) but does not specify structure (e.g., JSON object, string, array). It also could clarify whether multiple items are returned or how to handle multiple requests. This is a gap for an otherwise good description.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with three parameters. The description adds concrete examples for each parameter (e.g., ids: ['gradient-mesh-flow','fade-rise-stagger'], query: 'webgl hero shader', direction: 'coastal-film') and explains how direction affects output (fills palette). This adds meaning beyond the schema's basic descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool provides motion recipes for CSS/vanilla-JS and shader-grade ambient pieces, with specific examples (staggered entrances, scroll reveals, etc.). It distinguishes from sibling tools focused on other UI components (e.g., get_component_code, get_section_code) by being uniquely about motion animations.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives clear usage instructions: request by IDs or describe your need, and optionally pass a direction pack for palette-filled shader code. It does not explicitly mention when not to use this tool or compare to siblings, but the context makes its purpose clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_referenceBInspect
Get the full structured payload for one reference: source URL, categories, tags, colors, typography, layout pattern, component rules, imagery, motion, do/don't constraints, and agent implementation notes.
| Name | Required | Description | Default |
|---|---|---|---|
| reference_id | Yes | Reference id from search_references | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are present, so the description carries full burden. It describes what the tool returns but does not disclose behavioral traits like authorization requirements, rate limits, or side effects. The listing of payload contents is helpful but insufficient.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, front-loaded sentence that efficiently lists the payload components. While slightly long due to the list, it is concise and every element adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description explains the output components, compensating for the lack of an output schema. It covers the necessary information for a simple retrieval tool, though it could mention error handling or permissions.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so the schema already documents both parameters. The description adds context about the payload but does not enhance parameter meaning beyond schema descriptions. Baseline 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves the full structured payload for a reference, listing all included components. This distinguishes it from siblings like search_references (which searches) and get_similar_references (which finds similar ones).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives. It does not specify prerequisites or context, leaving the agent to infer usage from the name and sibling list.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_section_codeAInspect
Finished, art-directed HTML/CSS page sections for a design direction: nav, hero, offer, image band, story, and footer — composed by a designer, seeded to this business (accent + expression already applied). Use AFTER get_design_direction: assemble the sections and replace only the content (copy + images), never the composition. This is what makes the result look like a $10k site instead of a template.
| Name | Required | Description | Default |
|---|---|---|---|
| business | Yes | The business, e.g. 'barbershop in Suva' - seeds the expression so same-direction sites stay distinct | |
| sections | No | Optional filter: section ids or kinds (nav, hero, offer, imageband, story, footer). Omit for the full set | |
| direction | No | Direction pack id from get_design_direction (e.g. ink-and-paper, harvest-table, aurora-glass) | |
| signature_move | No | The signature move id from get_design_direction's Signature Concept (e.g. editorial_dossier, field_report_layout). Pass it so the sections are ORDERED and FRAMED to that concept - the output then leads with the Creative Spine, Memorable Move, and hard Differentiation Rules. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. Says sections are 'finished, art-directed' and 'seeded to business', implying a read-only operation. However, does not explicitly state safety (e.g., no side effects) or mention any required permissions. Adequate but could be more explicit.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three concise sentences covering purpose, usage order, and a key selling point. No fluff, though the last sentence is slightly promotional. Good front-loading of core information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, so description should explain return format. It says 'art-directed HTML/CSS page sections' but not structure (e.g., list of objects). For a tool producing code snippets, more detail on output form would improve completeness. Still adequate for basic use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameters with descriptions. Description adds value by explaining how parameters interact: 'business seeds expression', 'direction pack id', and 'signature_move orders and frames to concept'. This enhances understanding beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it returns 'Finished, art-directed HTML/CSS page sections' for a design direction, with specific section types listed. It distinguishes from sibling tool 'get_design_direction' by using 'AFTER get_design_direction', implying it provides the concrete output after direction is set.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use AFTER get_design_direction' and advises to 'replace only the content (copy + images), never the composition'. While it doesn't contrast with all siblings, it gives clear context for when to use in the workflow.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_showpieceAInspect
The tier ABOVE the design directions: finished showpiece skeletons for scene-led, art-directed pages - world hero with environment bleed, VIDEO world hero (looping cinematic clip + liquid-glass chrome + word-by-word blur reveals), type-scene interlock (mega type behind a cutout subject), monochrome flood, scene-pinned glass annotations, ghost mega-type band - each with the scene/video-generation prompts that make the world bespoke to THIS business, contrast scrims, mobile behavior and reduced-motion built in. Call with no technique to browse the menu (free); call with a technique id for the full skeleton. Use when the brief says wow, cinematic, premium beyond a normal landing page.
| Name | Required | Description | Default |
|---|---|---|---|
| direction | No | Direction pack id (e.g. coastal-film); fills the skeleton and scene prompts with the pack's hexes | |
| technique | No | Skeleton id: world-hero, video-world-hero, type-interlock-hero, monochrome-flood-hero, annotation-scene, ghost-type-band. Omit to browse the menu (unmetered). |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It explains behavior: browsing returns menu skeletons, technique id returns full skeleton with prompts. Mentions free browsing and built-in mobile/reduced-motion behavior. Adequate transparency for a retrieval tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is verbose, with a dense list of techniques that could be streamlined. Front-loads main idea but could be more concise to improve readability.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema, description adequately explains tool behavior and return types (menu vs full skeleton with prompts). Covers modes, technique details, and mentions mobile/reduced-motion features. Fairly complete for a retrieval tool with two parameters.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline 3. Description adds extra context (e.g., direction pack fills with hexes, technique options listed), but does not fundamentally enhance understanding beyond schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool retrieves 'finished showpiece skeletons' for premium pages, lists specific techniques, and differentiates from design directions as a tier above. It uses specific verbs and resources, making purpose unambiguous.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states when to use ('when the brief says wow, cinematic, premium beyond a normal landing page') and describes two modes (browse menu without technique, get skeleton with technique). Lacks explicit alternatives or when-not-to-use, but sibling tools are listed separately.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
get_similar_referencesAInspect
Find references similar to a known reference by shared tags, categories, use cases, and StandOut direction ids. Use after finding one strong reference to broaden the evidence set.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of similar references to return, max 12 | |
| reference_id | Yes | Reference id from search_references | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It describes the function but does not disclose behavioral traits such as being read-only, destructive, or any side effects. It lacks information about authentication requirements or rate limits, but the description does not contradict any annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is extremely concise with two sentences that front-load the primary purpose and usage guide. Every sentence adds value with no redundancy or filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no output schema and moderate complexity (3 parameters, no nesting), the description is fairly complete. It covers the tool's purpose, usage context, and parameter rationale. However, it does not describe the return format or structure of the similar references, which would be helpful.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with all parameters described (reference_id, limit, response_format). The description adds context about when to use reference_id ('from search_references' and 'after finding one strong reference'), but does not significantly enhance understanding of the parameters beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'Find' and the resource 'references similar to a known reference', listing the similarity criteria (tags, categories, use cases, StandOut direction ids). It distinguishes itself from siblings like get_reference (single reference) and search_references (broad search) by specifying the use case of broadening the evidence set.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit guidance: 'Use after finding one strong reference to broaden the evidence set.' It tells the agent when to use the tool (after obtaining a reference), although it does not explicitly mention when not to use it or list alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
list_directionsAInspect
Browse the full 'pick a vibe' menu: all 14 named art directions (9 light-base, 5 dark-base) with their taglines, vibe tags, and what each is best for. Use when the user wants to SEE the available directions and choose one themselves, rather than have get_design_direction auto-pick from the business. Pick an id from the menu and pass it as the direction argument to get_section_code, get_asset_pack, get_hero_concept, or synthesize_direction_from_references. Browsing the menu does not consume a metered call.
| Name | Required | Description | Default |
|---|---|---|---|
| tone | No | Optionally show only light-base (9) or dark-base (5) directions | |
| response_format | No | Return a human-skimmable markdown menu (default) or a structured JSON array |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, but the description states that browsing does not consume a metered call, implying read-only and cost-free behavior. No contradictions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two well-structured sentences with front-loaded purpose, followed by usage instructions. Every sentence earns its place, no filler.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite no output schema, description fully explains what the tool returns (menu with taglines, vibe tags, best for). Complete context for a browse tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% but description adds meaning: 'light-base (9) or dark-base (5)' for tone, and explains response_format returns markdown (default) or JSON. Enhances parameter understanding.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool browses the 'pick a vibe' menu listing 14 art directions (9 light-base, 5 dark-base) with taglines, vibe tags, and best uses. This specific verb+resource and scope distinguishes it from siblings like get_design_direction.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says when to use (user wants to choose) vs. when to use get_design_direction (auto-pick). Also instructs how to use the selected id with other tools (get_section_code, get_asset_pack, etc.). No ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
list_watchesAInspect
List this license's active site watches with their last known scores and check times. Unmetered.
| Name | Required | Description | Default |
|---|---|---|---|
No parameters | |||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full burden. It discloses the tool is unmetered and lists only active watches, but does not mention authentication needs, rate limits, or any other behavioral characteristics beyond listing.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
A single sentence succinctly conveys the purpose and a key trait. No unnecessary words, and the crucial information is front-loaded.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given zero parameters and no output schema, the description covers the basic function and hinted return fields. However, it does not describe the return format, pagination, or whether results are ordered, which would be needed for a complete understanding.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has zero parameters with 100% coverage, so baseline is 4. The description adds value by specifying that the listing includes 'last known scores and check times', which is beyond the empty schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'List' and the resource 'active site watches', and specifies the information included (last known scores and check times). It distinguishes itself from sibling tools like 'watch_site' and 'unwatch_site' which are mutation tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description includes 'Unmetered' as a usage hint indicating no cost, but lacks explicit guidance on when to use this tool versus alternatives or any prerequisites. For such a simple tool, the context is implied but not explicit.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
make_it_standoutAInspect
THE master tool — one call to transform any site. Pass a url (or raw source, or just a business description) and Standout: (1) classifies the site type and locks the context, (2) scores it out of 100 across 8 categories with a client-ready bar at 85, (3) audits it for exact, prioritized fix patches, (4) gives it a creative spine + a reference-backed direction, (5) lays out the full-page composition, and (6) returns an LLM-ready prompt you can paste straight into Claude Code / Cursor / ChatGPT / Codex to apply everything, plus the MCP workflow and a re-audit checklist. Use this when the user just wants their page made client-ready and shouldn't have to know which individual Standout tool to call.
| Name | Required | Description | Default |
|---|---|---|---|
| url | No | Public URL to audit and improve (best input — enables the live score + screenshots) | |
| goal | No | The primary conversion goal, e.g. 'get users to run an audit and connect the MCP' | |
| vibe | No | Desired feel, e.g. premium, technical, warm | |
| source | No | Raw HTML/CSS/JSX, when the page is not yet published | |
| audience | No | Who the site must convince | |
| business | No | What the business is, e.g. 'MCP server that gives AI agents design taste'. Used to classify and build from scratch when no url/source. | |
| reference | No | A reference site URL or curated reference id whose style to lean toward |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden of behavioral transparency. It comprehensively explains the tool's actions: it takes a URL, source, or business description and performs classification, scoring, auditing, creative direction, layout, and returns a fully specified LLM-ready prompt along with a workflow and re-audit checklist. There is no contradiction or missing critical behavior.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is lengthy but well-structured with numbered steps, making it easy to parse. It is front-loaded with the key phrase 'THE master tool' and clearly outlines the sequence of operations. While not maximally concise, the structure earns a 4.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (7 parameters, no output schema, many sibling tools), the description covers the tool's purpose, inputs, and outputs well. It explains what the tool returns: an LLM-ready prompt, MCP workflow, and re-audit checklist. However, it does not detail the exact structure or format of the scoring or audit output, which would enhance completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, so baseline is 3. The description adds meaning beyond the schema by noting that 'url' enables live score and screenshots, and 'business' is used to classify and build from scratch when no URL/source is provided. This contextual value raises the score to 4.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states that this is a master tool to transform any site, listing six specific steps: classification, scoring, auditing, creative direction, layout, and prompt generation. It explicitly distinguishes itself from sibling tools by indicating that this tool should be used when the user wants a complete client-ready transformation without knowing which individual tool to call.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides explicit guidance on when to use this tool: 'Use this when the user just wants their page made client-ready and shouldn't have to know which individual Standout tool to call.' This implies that for more specific tasks, the sibling tools (e.g., critique_design, get_design_direction) should be used instead.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
search_flowsAInspect
Search reference-backed user journeys such as API key onboarding, pricing selection, hospitality booking, marketplace search, quickstart activation, and portfolio inquiry. Use when the page is part of a multi-step task, not just a visual screen.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of flows to return, max 12 | |
| query | No | Search terms like 'api key onboarding', 'pricing', 'booking', 'marketplace', or 'quickstart' | |
| category | No | Optional category filter, e.g. onboarding, pricing, booking | |
| direction | No | Optional StandOut direction id | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description must carry behavioral transparency. It only describes the purpose and usage, but does not disclose side effects, auth requirements, rate limits, or any other behavioral traits beyond the basic function.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, front-loaded with examples and usage guidance. Every phrase earns its place with no redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 5 parameters and no output schema or annotations, the description covers purpose and usage but omits behavioral details, return format (though implied by response_format parameter), and how results are structured. Adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so baseline is 3. The description adds example search terms ('api key onboarding', 'pricing') that provide context beyond the schema, but does not significantly enhance understanding of parameters like 'category' or 'direction'.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it searches for reference-backed user journeys and provides concrete examples like 'API key onboarding' and 'pricing selection'. It also distinguishes from visual screens, which helps differentiate from siblings like search_references.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly states when to use: 'Use when the page is part of a multi-step task, not just a visual screen.' No explicit when-not-to-use, but the context implies alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
search_referencesAInspect
Search Standout's curated reference library: real public-product style, screen, and component patterns with source URLs, color roles, layout notes, component rules, do/don't constraints, and agent implementation notes. Use before or during a build when you need evidence-backed visual patterns.
| Name | Required | Description | Default |
|---|---|---|---|
| type | No | Optional reference type filter | |
| limit | No | Number of references to return, max 12 | |
| query | No | Search terms like 'developer tool pricing', 'api key dashboard', 'restaurant booking', or 'dark command center' | |
| category | No | Optional category filter, e.g. pricing, onboarding, developer tool, restaurant | |
| direction | No | Optional StandOut direction id, e.g. terminal-grade, coastal-film, harvest-table | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries full behavioral burden. It lists output contents but does not disclose behavioral traits like rate limits, ordering, or result limits beyond what the parameter 'limit' implies. It is adequate but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences with no wasted words. The first sentence defines the tool's action and output, the second gives usage guidance. Information is front-loaded and every word earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a search tool with no output schema, the description explains what the tool returns (real patterns with source URLs, notes, etc.) and when to use it. It covers purpose, usage, and param hints adequately. Minor omission: no mention of ordering or pagination, but limit is provided.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
All parameters have schema descriptions (100% coverage), so baseline is 3. The description adds value by providing example query terms ('developer tool pricing', 'api key dashboard') and context for 'direction' (StandOut direction id), enhancing understanding beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Search Standout's curated reference library' with a specific verb and resource, and lists what it returns (style, screen, component patterns). It does not explicitly differentiate from sibling tools like get_reference or get_similar_references, but the verb 'search' implies a broad retrieval vs. specific get.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use before or during a build when you need evidence-backed visual patterns,' providing clear when-to-use context. It does not mention when not to use or suggest alternatives, but the context is sufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
synthesize_direction_from_referencesAInspect
Turn a business brief plus optional reference ids into a reference-backed build direction. Returns the chosen StandOut direction, matching public-product references, relevant flow patterns, a synthesized build plan, implementation lock, and next tool calls. Use when you want the agent to build from evidence instead of vibes.
| Name | Required | Description | Default |
|---|---|---|---|
| vibe | No | Desired feel, e.g. premium, technical, warm, playful | |
| query | No | Optional reference search query. Defaults to business/industry/vibe/audience | |
| audience | No | Who the page must convince | |
| business | Yes | What the website/app is for | |
| industry | No | Industry or category, e.g. developer tool, restaurant, law firm | |
| direction | No | Optional StandOut direction id to force, e.g. terminal-grade | |
| reference_ids | No | Optional exact reference ids from search_references | |
| response_format | No | Return markdown or structured JSON |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations, the description carries full burden for behavioral transparency. It lists outputs and implies a generative action, but does not disclose potential side effects, required permissions, or whether the operation is destructive or idempotent.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two well-structured sentences. The first sentence states the core transformation and outputs; the second sentence adds usage guidance. Every word is purposeful, and it is front-loaded with the most critical information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given 8 parameters and no output schema, the description outlines outputs but lacks detail on their structure or format. For example, 'next tool calls' is vague. It is adequate but could provide more specifics about the returned data shape.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with parameter descriptions, so baseline is 3. The description adds value by explaining the overall purpose and how parameters fit together (e.g., 'business brief plus optional reference ids'), enhancing understanding beyond the isolated schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it turns a business brief and optional references into a reference-backed build direction, listing specific outputs. It distinguishes from sibling tools like search_references (which only search) or get_design_direction (which likely retrieves an existing direction) by focusing on synthesis.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly says 'Use when you want the agent to build from evidence instead of vibes,' providing a clear usage context. However, it does not explicitly mention when not to use this tool or provide direct comparisons to sibling alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
unwatch_siteAInspect
Stop watching a site's design score. Pass the same URL that was given to watch_site. Unmetered.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The watched page address to stop watching |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must disclose behavior. It mentions 'Unmetered' suggesting no rate limits, but lacks details on idempotency, what happens if URL not watched, or other traits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two concise sentences with no superfluous information. Every word adds value.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple tool with one parameter and no output schema, the description provides enough context for correct invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema has 100% coverage with description for url. The description adds value by specifying it must be the same URL as from watch_site, which is not in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Directly states it stops watching a site's design score, with clear verb and resource. Distinct from sibling watch_site as it is the inverse operation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly instructs to pass the same URL as given to watch_site, providing clear contextual usage. Does not discuss when not to use, but sufficient given the inverse relationship.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
watch_siteAInspect
Watch a live site's design score (Pro). Standout re-audits the URL about once a day and emails an alert when the design regresses: a drop of more than 5 points, or falling below the 85 client-ready bar. Use it right after shipping a client site so a broken deploy is caught before the client sees it. Up to 10 active watches per license.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Full public page address to watch, e.g. https://client-site.com | |
| No | Where regression alerts go. Defaults to the email bound to this license. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description carries full burden. It explains re-auditing frequency (~1/day), alert mechanism (email), and limits (10 active watches). Missing details on auth or failure behavior, but adequate for a monitoring tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Three sentences with no waste: first sentence states the action, second explains when to use, third adds a constraint. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
No output schema, but the description explains triggers, constraints, and usage context. It's fairly complete for a simple monitoring tool, though could mention how to stop watching (handled by sibling).
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with both parameters described. The description adds value by explaining the email defaults to the license-bound email, beyond the schema's description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool watches a live site's design score, with specific triggers for re-auditing (drop >5 points or below 85). It uses a specific verb+resource and distinguishes itself from siblings like list_watches and unwatch_site.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
It provides explicit context: 'Use it right after shipping a client site so a broken deploy is caught before the client sees it.' Lacks explicit when-not or alternatives, but the scenario is clear enough.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!