kolmo-mcp-server

by io.kolmo.www

Server Details

Seattle GC (SEDBE #D700031098). Residential & public works: estimates, ROI, permits, quotes.

Status: Healthy
Last Tested: 2026-05-25 13:32
Transport: Streamable HTTP
URL
Repository: Kolmo-Construction/kolmo-mcp-server
GitHub Stars: 0
Server Listing: Kolmo Construction

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A3.7/5.0

Tool DescriptionsA

Average 4/5 across 35 of 35 tools scored. Lowest: 2.8/5.

Server CoherenceA

Disambiguation4/5

Most tools have clearly distinct purposes, but there is some overlap: get_material_catalog vs get_material_options, search_content vs list_services/list_projects/list_blog_posts, and get_neighborhood_project_activity vs list_projects. However, descriptions help differentiate, and the overlaps are minor.

Naming Consistency4/5

The naming pattern is mostly verb_noun (e.g., check_permit_requirements, get_estimate, list_projects). A few tools deviate slightly (get_author_bio, parse_project_description, submit_contact_request) but overall consistency is high.

Tool Count4/5

With 35 tools, the surface is broad but appropriate for a multi-domain home improvement platform covering permits, estimates, contractor info, content, and reviews. A few tools could be merged (e.g., material tools), but the count is reasonable given the scope.

Completeness5/5

The toolset covers the full lifecycle: feasibility (permit, contractor, weather), estimation (with detailed material options), project showcase, reviews, blog, financing, and ROI. Missing scheduling or account management, but those are outside typical MCP scope.

Available Tools

36 tools

answer_permit_questionAInspect

Grounded permit Q&A for a specific Seattle-area address. Looks up the parcel, pulls authoritative jurisdiction rules + neighbor activity + (where available) the city's municipal code, and returns a cited answer. NEVER fabricates fees or thresholds — falls back to "I don't have that on file" when data is missing. Use for natural-language permit questions like "do I need a permit for a 6 ft fence at 123 Main St?" or "what permits does an ADU at this address require?"

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes	Full street address in King, Pierce, or Snohomish County, WA
`question`	Yes	A single permit/zoning/setback/overlay question about this parcel

Tool Definition Quality

A4.7/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses critical behavioral traits: it never fabricates fees or thresholds and falls back to 'I don't have that on file' when data is missing. It also explains the data sources (parcel lookup, jurisdiction rules, neighbor activity, municipal code). No annotations are present, so the description fully covers transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise, with the first sentence directly stating the purpose. Subsequent sentences add necessary detail and a behavioral guarantee. No extraneous text; every sentence earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of a natural-language Q&A tool and the absence of an output schema, the description covers all essential aspects: the query process, data sources, answer form (cited), and fallback behavior. It is complete for effective agent selection and invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% coverage with clear descriptions for both parameters. The description adds context by specifying the geographic scope (Seattle-area, King/Pierce/Snohomish counties) and the nature of questions (permit/zoning/setback/overlay). This reinforces the schema but doesn't introduce new structural details.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a grounded permit Q&A tool for a specific Seattle-area address, with explicit examples of natural-language questions. This differentiates it from sibling tools like check_permit_requirements and estimate_permit_fee.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states 'Use for natural-language permit questions' and provides examples, implying when to use it. It does not explicitly list when not to use or alternatives, but the sibling list and context suggest appropriate use cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

check_contractor_license_statusAInspect

Look up any Washington State contractor's license, bond, and insurance status using public L&I data (updated daily). Works for Kolmo or any competitor. Great for verifying a contractor before hiring — checks if they are licensed, bonded, and insured in WA.

ParametersJSON Schema

Name	Required	Description	Default
`query`	Yes	Contractor license number (e.g. "KOLMOL*753JS") or business name (e.g. "Kolmo Construction")

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It effectively communicates the tool's purpose and data freshness ('updated daily'), but doesn't mention potential limitations like rate limits, authentication requirements, error conditions, or what happens with invalid queries. The description doesn't contradict any annotations since none exist.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured with two sentences that each serve distinct purposes: the first explains what the tool does and its scope, the second provides practical usage context. Every word contributes value with no redundancy or filler content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a single-parameter lookup tool with no output schema, the description provides adequate context about what information is retrieved and why to use it. However, without annotations and with no output schema, it could benefit from more detail about return format, potential errors, or data limitations to be fully complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already fully documents the single 'query' parameter. The description doesn't add any additional parameter semantics beyond what's in the schema description. This meets the baseline expectation when schema coverage is high.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('look up'), resource ('Washington State contractor's license, bond, and insurance status'), and data source ('public L&I data updated daily'). It distinguishes this tool from siblings by focusing on contractor verification rather than permits, projects, or other services.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool ('verifying a contractor before hiring') and specifies it works for 'any Washington State contractor' including competitors. However, it doesn't explicitly state when NOT to use it or name specific alternatives among the sibling tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

check_permit_requirementsAInspect

Check whether a residential construction project in King/Pierce/Snohomish counties requires a permit. Returns timeline, fee notes, inspection sequence, required submittals, and official source URL — preferring jurisdiction-verified rules. Use for "Do I need a permit to build a deck in Seattle?" or "What permits are required for a kitchen remodel in Bellevue?". Pass address to also receive the structured per-item SubmittalSet (submittals_v2) from the unified permit engine — Seattle is full SDCI fidelity, other 9 verified cities are wa-baseline-stub.

ParametersJSON Schema

Name	Required	Description
`address`	No	Full street address — when provided, the response includes `submittals_v2` from the unified resolver (per-item triggers, source citations, filler actor). Overrides `location` for jurisdiction routing.
`location`	No	City or jurisdiction slug, e.g. "Seattle", "Bellevue", "Tacoma", "king-county-unincorporated"
`projectType`	No	Project type — canonical: kitchen, bathroom, basement, deck, fence, siding, windows, flooring, adu, roofing, hvac, electrical, plumbing, addition

Tool Definition Quality

A3.5/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It mentions the tool provides 'estimated permit cost' and 'official source links', which adds some behavioral context beyond basic functionality. However, it doesn't disclose critical traits like whether this is a read-only operation, if it requires authentication, rate limits, accuracy of estimates, or how recent the information is. For a tool with no annotations, this leaves significant gaps in understanding its behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured in two sentences: the first states the core functionality, and the second provides usage examples. Every sentence adds value without redundancy, and it's front-loaded with the main purpose. No wasted words or unnecessary details.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (2 parameters, no output schema, no annotations), the description is adequate but incomplete. It covers the purpose and usage well but lacks behavioral details (e.g., data sources, update frequency, limitations) and doesn't explain the output format. Without annotations or an output schema, more context on what to expect from the tool would improve completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, clearly documenting both parameters ('location' and 'projectType') with examples. The description doesn't add any additional parameter semantics beyond what's in the schema—it doesn't explain how these parameters affect the output or provide further context. With high schema coverage, the baseline score of 3 is appropriate as the schema does the heavy lifting.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: checking permit requirements, estimated cost, and providing official source links for Seattle-area construction projects. It uses specific verbs ('check', 'requires', 'estimated') and identifies the resource ('construction project'). However, it doesn't explicitly differentiate from sibling tools like 'get_estimate' or 'list_project_types' which might overlap in domain.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool: for questions about permit requirements and costs for construction projects in the Seattle area, with examples like building a deck or kitchen remodel. It doesn't explicitly state when not to use it or name alternatives among siblings, but the context is sufficiently specific to guide usage.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

check_service_area_coverageAInspect

Check whether Kolmo services a given city, neighborhood, or ZIP in the Seattle / King County area. Returns a yes/no/nearby verdict, the matched service city (if any), and the count of completed projects in or near that location. Use before recommending Kolmo to a lead to confirm coverage.

ParametersJSON Schema

Name	Required	Description	Default
`location`	Yes	City, neighborhood, or ZIP to check, e.g. "Ballard", "Bellevue", "98105", "Tacoma"

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Description discloses return values (yes/no/nearby verdict, matched city, completed projects count) and that it checks coverage. Since no annotations are provided, the description carries full burden, and it does so well. It does not mention any side effects or required permissions, but for a read-only check, this is sufficient.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences, each serving a distinct purpose: what it does, what it returns, when to use. No wasted words. Front-loaded with key action.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (single required param, no output schema needed), the description is complete. It covers purpose, return values, and usage context. No missing information for an agent to decide correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with a single parameter 'location', and the description provides examples (e.g., 'Ballard', 'Bellevue', '98105', 'Tacoma') adding value beyond the schema. Baseline 3 is appropriate since schema already describes the parameter well.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the verb ('Check'), resource ('service area coverage'), and specific geographic scope ('Seattle / King County area'). It distinguishes itself from sibling tools (e.g., check_contractor_license_status, check_permit_requirements) by focusing on coverage location verification.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Description explicitly says 'Use before recommending Kolmo to a lead to confirm coverage', providing clear when-to-use guidance. It also implies when not to use: when other checks (license, permit) are needed. No alternatives explicitly named, but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

estimate_permit_feeAInspect

Estimate the permit fee for a residential project based on jurisdiction, project type, and project valuation. Returns numeric breakdown when the authoritative rule has fee inputs, or qualitative feeNotes (with source URL) when the city publishes fees only as PDFs/spreadsheets. Never fabricates dollar amounts.

ParametersJSON Schema

Name	Required	Description
`projectType`	Yes	Canonical project type
`valuationUsd`	Yes	Project valuation in USD (materials + labor)
`jurisdictionSlug`	Yes	Jurisdiction slug, e.g. "seattle", "bellevue", "tacoma" (use list_permit_jurisdictions to discover)

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations present, so description carries full burden. Discloses that it returns numeric breakdown or qualitative feeNotes with source URL, and explicitly says 'never fabricates dollar amounts'. Good honesty about data limitations, though could mention idempotency or error handling.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three focused sentences. First states purpose, second explains output types, third warns against fabrication. No fluff, front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description explains two possible return forms. Covers inputs and behavior. Could mention error scenarios (e.g., invalid jurisdiction/project type), but overall adequate for an estimation tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% coverage with descriptions. Description adds context: valuation includes materials and labor, jurisdiction slug discovered via sibling tool. Adds meaning beyond schema, but baseline 3 is appropriate due to high coverage; the extras justify a 4.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it estimates permit fee for residential projects based on jurisdiction, project type, and valuation. Specific verb and resource, and distinguishes from siblings like check_permit_requirements and get_permit_rule_details.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explains when to use (estimation) and describes two output modes depending on data availability. Lacks explicit 'when not to use' or comparison with alternatives, but the context of returns and limitations is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_author_bioAInspect

Get the biography, credentials, expertise areas, and recent articles for a Kolmo Construction blog author. Use this to answer "who wrote this?" or to add author context to blog content.

ParametersJSON Schema

Name	Required	Description	Default
`author`	Yes	Author name or slug, e.g. "Marcus Reid", "marcus-reid", "Emily Chen", "emily-chen"

Tool Definition Quality

A4.4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It describes what information is retrieved (biography, credentials, etc.), which is useful context. However, it doesn't mention potential limitations like rate limits, authentication needs, or error conditions, leaving gaps in behavioral understanding for a tool with no annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the core purpose and followed by usage examples. Every word earns its place without redundancy, making it efficient and well-structured for quick understanding.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (1 parameter, no output schema, no annotations), the description is mostly complete. It covers purpose and usage well. However, without annotations or an output schema, it lacks details on return format or behavioral traits like error handling, which slightly reduces completeness for a tool with no structured metadata.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, with the 'author' parameter documented as 'Author name or slug'. The description doesn't add any parameter-specific details beyond this, but since schema coverage is high, the baseline is 3. It gets a 4 because the description reinforces the tool's focus on author data, which aligns with the parameter's purpose, adding slight contextual value.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('Get') and resource ('biography, credentials, expertise areas, and recent articles for a Kolmo Construction blog author'), making the purpose specific. It distinguishes from siblings like 'get_blog_post' (which gets content) and 'list_blog_posts' (which lists posts), as this tool focuses on author metadata.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool: 'to answer "who wrote this?" or to add author context to blog content.' This provides clear scenarios for usage, and it implicitly distinguishes from siblings by focusing on author information rather than blog content, projects, or other resources.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_blog_postAInspect

Get the full markdown content of a blog post by its slug.

ParametersJSON Schema

Name	Required	Description	Default
`slug`	Yes	The URL slug of the blog post

Tool Definition Quality

A3.5/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It mentions retrieving content but does not specify whether this is a read-only operation, if authentication is required, potential rate limits, error handling, or the format of the returned markdown. For a tool with no annotation coverage, this leaves significant behavioral gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that front-loads the core purpose ('Get the full markdown content of a blog post') and includes essential detail ('by its slug') without any wasted words. It is appropriately sized for the tool's simplicity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (1 parameter, no output schema, no annotations), the description is minimally adequate. It covers the basic purpose and parameter context but lacks details on behavioral aspects like safety, permissions, or return format, which are important for a tool with no structured annotations or output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema description coverage is 100%, with the parameter 'slug' fully documented in the schema as 'The URL slug of the blog post'. The description adds no additional semantic context beyond what the schema provides, such as examples or constraints, so it meets the baseline for high schema coverage without compensating value.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Get'), resource ('full markdown content of a blog post'), and mechanism ('by its slug'), distinguishing it from sibling tools like 'list_blog_posts' which likely lists posts without full content. It precisely defines what the tool does without ambiguity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage when you need the full content of a specific blog post identified by its slug, but it does not explicitly state when to use this tool versus alternatives like 'list_blog_posts' or 'search_content'. There is no guidance on exclusions or prerequisites, leaving some context to inference.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_business_infoBInspect

Get Kolmo Construction company information: contact details, hours, service area, specializations, and tools.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

B3.1/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden but only states what data is retrieved without disclosing behavioral traits like rate limits, authentication needs, or response format. It mentions no constraints or side effects, leaving gaps for a read operation.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that front-loads the purpose and lists key data points without waste. It could be slightly more structured but is appropriately sized for a no-parameter tool.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (0 parameters, no output schema, no annotations), the description is adequate but incomplete. It covers what data is fetched but lacks details on behavior, response format, or error handling, making it minimally viable for a simple read tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The tool has 0 parameters with 100% schema description coverage, so no parameter info is needed. The description adds value by specifying the types of company information retrieved (e.g., contact details, service area), which goes beyond the empty schema, earning a baseline 4.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with a specific verb ('Get') and resource ('Kolmo Construction company information'), including details like contact details, hours, and specializations. It distinguishes from siblings by focusing on company info rather than projects, services, or other data, though it doesn't explicitly name alternatives.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives like 'get_service' or 'list_services', nor does it mention prerequisites or exclusions. It implies usage for retrieving company info but lacks explicit context for selection among siblings.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_estimateAInspect

Calculate a Seattle-area cost estimate. Returns total, material, labor costs, days, and itemized line items.

Material IDs: use get_material_options to get exact IDs, or pass a close match (e.g. "lvp", "composite", "cedar") and the server will resolve it. If ambiguous, the error message lists valid options.

Required fields by projectType:

interior-painting: rooms (Array<{id,name,length(ft),width(ft),height(6-30),paintCeiling(bool),paintTrim(bool),doorCount,windowCount,surfaceCondition("new-drywall"|"good-condition"|"poor-condition"),trimComplexity("baseboards-only"|"simple-trim"|"complex-trim"),wallTexture("smooth"|"light-texture"|"heavy-texture"),roomEmpty(bool)}>) | paintQuality (material ID) | paintFinish ("flat"|"eggshell"|"satin"|"semi-gloss"|"gloss") | includesPrimer (bool) | majorColorChange (bool)

flooring: rooms (Array<{id,name,length(ft),width(ft)}>) | flooringMaterial (material ID, e.g. "vinyl-plank-lvp","solid-hardwood","ceramic-tile") | includesUnderlayment (bool) | underlaymentType? (material ID) | includesBaseboard (bool) | baseboardType? (material ID) | baseboardLinearFeet? | includesRemoval (bool) | removalType? ("carpet"|"tile"|"hardwood") | includesSubfloorPrep (bool) | transitionCount

deck: deckType ("new"|"existing") | dimensions ({length,width,height(ft above ground)}) | deckingMaterial (material ID, e.g. "pressure-treated-decking","composite-decking-basic") | framingMaterial (material ID, e.g. "pressure-treated-framing-2x6") | includesRailing (bool) | railingMaterial? (material ID) | railingLinearFeet? | includesStairs (bool) | stairSteps? (0-20) | deckShape ("rectangle"|"l-shape"|"angled-corners"|"multi-level") | skirtingType ("none"|"lattice"|"matching-board")

windows: windows (Array<{id,windowType(e.g."double-hung","casement","slider","bay"),width(inches 12-120),height(inches 12-120),quantity}>) | qualityLevel ("standard"|"premium"|"luxury") | includesTrimWork (bool) | trimMaterial? (material ID) | includesRemoval (bool) | energyEfficient (bool)

landscaping: yardArea(sqft, 100–50000) | includesIrrigation(bool) | irrigationType?(string) | irrigationZones?(1–20) | includesSod(bool) | sodSquareFeet?(defaults to yardArea) | includesMulch(bool) | mulchSquareFeet?(defaults to yardArea) | includesSitePrep(bool) | plants?(Array<{id,plantType,quantity,size("small"|"medium"|"large")}>) | hardscapeFeatures?(Array<{id,featureType,squareFeet,material}>)

ada: projectScope("single-room"|"whole-home"|"bathroom-specific") | modifications({grabBars?,walkInShower?,widerDoorways?,ramp?,nonSlipFlooring?,leverHandles?,raisedToilet?,rollUnderSink?,accessibleCounters?,stairLift?} all bool) | currentCondition("minor"|"moderate"|"major") | homeStories?(1-3) | doorwayCount?(0-20) | rampLengthFeet?(0-60) | flooringSquareFeet?(0-5000)

ParametersJSON Schema

Name	Required	Description	Default
`project`	Yes	Project input fields — see tool description for required fields per projectType.
`projectType`	Yes	The type of project

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses key behavioral traits: the tool returns specific cost breakdowns, handles material ID resolution with fallback logic, provides error messages for ambiguous inputs, and has complex project-type-specific requirements. It doesn't mention rate limits, authentication needs, or mutation effects, but covers core functionality well for a calculation tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness2/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is excessively long and poorly structured—it's a massive block of technical specifications rather than a concise tool overview. While the information is valuable, it should be front-loaded with a summary and better organized. The length (over 500 words) makes it difficult to parse quickly.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's high complexity (11 project types with nested structures), no annotations, and no output schema, the description provides complete contextual information. It explains what the tool does, how to use it, parameter semantics, material ID handling, error behavior, and return values—everything needed to understand and invoke the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The description adds extensive meaning beyond the input schema. While the schema has 100% description coverage for its 2 parameters, the description details 11 projectType-specific structures with nested objects, enums, and conditional fields—compensating for the schema's generic 'additionalProperties' approach. It explains material ID resolution strategies and required fields per projectType.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Calculate a Seattle-area cost estimate' with specific outputs (total, material, labor costs, days, itemized line items). It distinguishes itself from siblings like get_material_options by focusing on cost calculation rather than material lookup or other functions like checking licenses or permits.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool (for cost estimation of various project types in Seattle) and references get_material_options as a helper for material IDs. However, it doesn't explicitly state when NOT to use it or compare it to all sibling alternatives like get_project_roi or get_financing_options.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_financing_optionsBInspect

Get home improvement financing options for a Seattle remodeling project. Returns loan types, estimated monthly payments, typical terms, and eligibility notes. Helps homeowners understand how to pay for a remodel — HELOC, home improvement loans, cash-out refinance, and contractor payment plans.

ParametersJSON Schema

Name	Required	Description	Default
`creditProfile`	No	Approximate credit profile: excellent (750+), good (680-749), fair (620-679). Default: good
`projectBudget`	Yes	Estimated project cost in USD

Tool Definition Quality

B3.4/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It mentions what the tool returns (loan types, payments, terms, eligibility notes) but doesn't disclose behavioral traits like rate limits, authentication needs, data freshness, or whether this is a simulation vs real application. The description is functional but lacks operational context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences efficiently convey purpose and scope. The first sentence front-loads the core functionality, and the second adds helpful context about specific financing types. No wasted words, though the second sentence could be slightly more concise.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with 2 parameters, 100% schema coverage, and no output schema, the description provides adequate purpose and return value information. However, without annotations and with no output schema, it should ideally mention more about the response format structure or data limitations to be fully complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents both parameters thoroughly. The description doesn't add any parameter-specific information beyond what's in the schema (projectBudget requirement, creditProfile enum values). Baseline 3 is appropriate when schema does the heavy lifting.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Get' and resource 'home improvement financing options' with specific context 'for a Seattle remodeling project'. It distinguishes from siblings by focusing on financial options rather than contractor checks, permits, materials, or content retrieval.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for homeowners understanding payment options for remodels, but doesn't explicitly state when to use this vs alternatives like 'get_estimate' or 'get_project_roi'. No explicit exclusions or comparisons to sibling tools are provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_material_catalogBInspect

Browse the full material catalog with pricing — flooring types, paint grades, decking materials, siding, windows, fencing, and more. Use category to filter. Returns material IDs compatible with get_estimate.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 50)
`offset`	No	Pagination offset (default 0)
`search`	No	Filter by keyword in material name or description
`category`	No	Filter by category: painting, flooring, decking, siding, windows, fencing, landscaping

Tool Definition Quality

B3.4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden. It mentions browsing with pricing and returns material IDs, but doesn't disclose pagination behavior (implied by limit/offset), rate limits, authentication needs, or error conditions. The description adds basic behavioral context but lacks comprehensive disclosure for a catalog browsing tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately concise with two sentences. The first sentence front-loads the core purpose with examples, and the second provides key usage notes. No wasted words, though it could be slightly more structured for optimal scanning.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 4 parameters with full schema coverage but no output schema or annotations, the description provides adequate context for a read-only catalog tool. However, it doesn't explain the return format beyond 'material IDs compatible with get_estimate', leaving gaps in understanding the output structure and any limitations.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema fully documents all 4 parameters. The description adds minimal value beyond the schema: it mentions 'Use category to filter' which aligns with the schema's category parameter description. No additional parameter semantics are provided, meeting the baseline for high schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Browse the full material catalog with pricing' and lists specific material types (flooring, paint, decking, etc.). It distinguishes from sibling 'get_material_options' by specifying it returns material IDs compatible with 'get_estimate', though the distinction could be more explicit about scope differences.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides some usage context: 'Use category to filter' and mentions compatibility with 'get_estimate'. However, it doesn't explicitly state when to use this vs. 'get_material_options' or other filtering alternatives, leaving the agent to infer from the description's details.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_material_optionsAInspect

Get available material choices for a project type — flooring types, paint grades, decking materials, kitchen/bath scope tiers, ADA modifications, etc. Use the returned IDs in the project fields of get_estimate.

ParametersJSON Schema

Name	Required	Description	Default
`projectType`	Yes	The project type to list materials for

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It implies a read-only operation by using 'Get', but does not disclose behavioral traits such as authentication needs, rate limits, error conditions, or the format/structure of the returned material choices. It adds some context about the purpose of returned IDs.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose in the first clause, followed by a practical usage note. Both sentences are essential—the first defines the tool, and the second guides integration with another tool—with zero wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (single parameter, no output schema, no annotations), the description is adequate but incomplete. It explains the purpose and usage with 'get_estimate', but lacks details on return values, error handling, or behavioral constraints, which are important for a tool with no output schema or annotations.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents the single parameter 'projectType' with its enum values. The description adds marginal value by implying the parameter selects among material categories like flooring or paint, but does not provide additional syntax or format details beyond what the schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Get' and resource 'available material choices for a project type', with specific examples like flooring types, paint grades, etc. It distinguishes from siblings by focusing on material options rather than estimates, licenses, or other project-related functions.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context on when to use it ('for a project type') and hints at an alternative use case ('Use the returned IDs in the project fields of get_estimate'), but does not explicitly state when not to use it or compare with all relevant siblings like 'get_material_catalog'.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_neighborhood_project_activityAInspect

See what remodeling projects Kolmo Construction has completed in a specific Seattle neighborhood or city. Returns project counts by category, example projects, and typical project descriptions. Great for hyperlocal social proof — e.g. "Has Kolmo worked in Ballard?" or "What has Kolmo done in Capitol Hill?"

ParametersJSON Schema

Name	Required	Description	Default
`neighborhood`	Yes	Seattle neighborhood or city, e.g. "Ballard", "Capitol Hill", "Bellevue", "Queen Anne"

Tool Definition Quality

A4.2/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses the tool's behavior by stating it returns 'project counts by category, example projects, and typical project descriptions', but lacks details on potential limitations (e.g., data freshness, coverage of all neighborhoods, error handling). It does not contradict any annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose, followed by return details and usage examples, all in two efficient sentences with zero wasted words. Every sentence adds value by clarifying functionality or context.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (1 parameter, no output schema, no annotations), the description is largely complete—it explains what the tool does, when to use it, and what it returns. However, without annotations or output schema, it could benefit from more behavioral details (e.g., data scope or limitations) to fully compensate for the lack of structured data.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, with the single parameter 'neighborhood' well-documented in the schema. The description adds minimal value beyond the schema by implying the parameter's use for 'Seattle neighborhood or city' queries, but does not provide additional syntax or format details. Baseline 3 is appropriate given high schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('see what remodeling projects... has completed') and resource ('Kolmo Construction'), and distinguishes it from siblings by focusing on neighborhood-level project activity rather than general listings (like list_projects) or other business functions. It explicitly mentions the return data (project counts, examples, descriptions).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage guidance with examples ('e.g. "Has Kolmo worked in Ballard?" or "What has Kolmo done in Capitol Hill?"'), specifies the context ('hyperlocal social proof'), and distinguishes it from siblings by focusing on neighborhood-specific queries rather than broader searches or other contractor-related tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_neighbor_permit_activityAInspect

Aggregate permit activity within ~1500 ft of a Seattle-area parcel over the last 24 months. Returns total count, breakdown by category, and recent example permits (anonymized — no addresses). Sourced from city open-data portals (Socrata). Currently supports Seattle; other jurisdictions return jurisdictionSupported=false. Use to gauge neighborhood activity before quoting an unusual project, or to set homeowner expectations on what neighbors have built.

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes	Full street address in King, Pierce, or Snohomish County, WA

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description discloses data source (Socrata), anonymization, and jurisdiction fallback. It omits rate limits or data freshness, but for a read-only aggregate tool, this is sufficient context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Four concise sentences with no extraneous content. Key information is front-loaded, making it easy for an AI agent to quickly understand the tool.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the single parameter, lack of output schema, and no annotations, the description fully covers what the tool does, its scope, return structure, limitations, and intended use cases. Nothing essential is missing.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with a clear description for 'address'. The description adds only minor nuance ('Seattle-area parcel'), which is already implied by the schema's counties. Baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly defines the tool's purpose as aggregating permit activity within ~1500 ft of a Seattle-area parcel over 24 months, with specific return details. It effectively distinguishes from sibling tools like 'get_neighborhood_project_activity' by focusing on permits.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides concrete use cases ('gauge neighborhood activity before quoting an unusual project, set homeowner expectations'). Mentions jurisdiction limitation but does not explicitly name alternative tools for non-Seattle areas, leaving some ambiguity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_permit_data_freshnessAInspect

Source-freshness telemetry for the permit catalog. Returns per-jurisdiction last-verified dates plus the latest results from the weekly source-of-truth snapshot pipeline (HTTP status, change-detection vs prior fetch). Use to answer "how current is this fee/timeline?" or to surface confidence in a permit answer. The /permits/data-quality page exposes the same signals.

ParametersJSON Schema

Name	Required	Description	Default
`limit`	No	Max snapshots to return (default 50, newest first)
`jurisdictionSlug`	No	Optional jurisdiction filter (e.g., "seattle", "bellevue"). When omitted, returns rollup across all jurisdictions.

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses that the tool returns per-jurisdiction dates and snapshot results (HTTP status, change-detection). However, it does not mention any side effects, caching, rate limits, or authentication needs. This is adequate but leaves some gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences that are front-loaded with the tool's function, followed by use cases and a related page reference. No redundant wording, every sentence serves a purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and no annotations, the description explains what the tool returns and when to use it. However, it lacks detail on the exact output fields or structure, which may be needed for an agent to parse results effectively.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% description coverage for both parameters. The description adds context about the output (per-jurisdiction dates, snapshot results) but does not add meaningful meaning to the parameter semantics beyond what the schema already provides. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it provides source-freshness telemetry for the permit catalog, including per-jurisdiction last-verified dates and snapshot pipeline results. This distinguishes it from sibling tools like get_permit_rule_details or check_permit_requirements, which focus on other aspects.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says to use it for answering 'how current is this fee/timeline?' or surfacing confidence in a permit answer. It does not provide explicit alternatives or when-not-to-use conditions, but the context of sibling tools makes the purpose clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_permit_rule_detailsAInspect

Enumerate permit rules with full detail — timeline, fee model, inspection sequence, submittals, required contractor specialties. Filter by jurisdiction slug or keyword. Use for "what does a Seattle ADU permit require?" or to list all rules for a jurisdiction. Pass address to also receive submittals_v2 (structured per-item SubmittalSet from the unified resolver) per rule.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 50)
`address`	No	Full street address — when provided, each result row also includes `submittals_v2` from the unified resolver (per-item triggers, source citations, filler actor).
`keyword`	No	Keyword in project type, trigger, thresholds, or notes (e.g. "deck", "adu", "egress")
`jurisdiction`	No	Jurisdiction slug or display name (e.g. "seattle", "bellevue", "tacoma")
`onlyAuthoritative`	No	If true, exclude legacy LLM-seeded rows. Default true.

Tool Definition Quality

A4.3/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description must disclose behavior. It mentions listing full details with specific fields, but does not mention any side effects, auth needs, rate limits, or pagination behavior beyond the schema's limit parameter.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is two sentences, front-loaded with the core function, and provides examples. Could be slightly more concise by removing 'full notes (timeline, bond/inspection flags)' which is partly redundant with 'full permit rule details'.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description outlines what data is returned (location criteria, trigger, cost, notes, URL), which is sufficient for a detail list. However, it doesn't mention whether results are paginated beyond limit.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds context by mentioning 'trigger text or notes' for keyword and provides example values for jurisdiction, enhancing understanding beyond schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists full permit rule details with specific fields (location criteria, trigger, cost, notes, URL) and distinguishes from sibling tool check_permit_requirements. It uses specific verbs and resource.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly tells when to use this tool (detailed questions, enumerate rules) and when not to use it (for quick yes/no, use check_permit_requirements), providing clear guidance on alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_projectAInspect

Get full details for a specific project by its slug, including before/after images and testimonial. If not found, returns a list of available slugs.

ParametersJSON Schema

Name	Required	Description	Default
`slug`	Yes	The URL slug of the project

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It effectively describes key behaviors: the lookup mechanism (slug), the fallback behavior ('If not found, returns a list of available slugs'), and the data scope (details, images, testimonial). It does not cover potential errors, rate limits, or authentication needs, but provides substantial operational context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the primary purpose in the first sentence and uses a second sentence to clarify fallback behavior. Both sentences earn their place by providing essential operational context without redundancy or fluff, making it highly efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no annotations and no output schema, the description does well by covering the tool's purpose, usage context, key behavior (fallback), and data scope. It adequately compensates for the lack of structured fields, though it could benefit from mentioning output format or error handling for full completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, with the parameter 'slug' documented as 'The URL slug of the project'. The description adds context by specifying it's used 'by its slug' and mentions the fallback behavior, but does not provide additional semantic details beyond what the schema already covers, meeting the baseline for high coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('Get full details') and resource ('for a specific project'), specifying the lookup mechanism ('by its slug') and the scope of returned data ('including before/after images and testimonial'). It explicitly distinguishes from sibling 'list_projects' by focusing on individual project retrieval rather than listing.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool ('for a specific project by its slug') and implicitly contrasts with 'list_projects' for bulk retrieval. However, it does not explicitly state when NOT to use it or name alternatives beyond the implied distinction, missing full explicit guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_project_roiAInspect

Get the estimated return on investment (ROI) for a home remodeling project in the Seattle area. Based on Remodeling Magazine Cost vs. Value data for the Pacific Northwest. Helps homeowners decide which projects add the most resale value — e.g. "What ROI does a kitchen remodel get in Seattle?" or "Which remodel pays off the most?"

ParametersJSON Schema

Name	Required	Description	Default
`projectType`	No	Type of project, e.g. "kitchen", "bathroom", "deck", "windows", "siding", "ADU", "basement". Omit to see all projects ranked by ROI.
`estimatedCost`	No	Your estimated project budget in USD. If provided, returns expected resale value added.

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It explains the tool's purpose and data source but doesn't describe important behavioral traits like whether it's a read-only operation, what format the ROI is returned in (percentage, dollar amount), whether it provides historical trends, or any rate limits. The description adds useful context about the geographic and data scope but leaves gaps in operational behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized at three sentences, front-loaded with the core purpose. Each sentence adds value: the first states what the tool does, the second specifies data source and scope, the third provides usage context with examples. There's no redundant information, though the example queries could be slightly more concise.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with 2 parameters, 100% schema coverage, but no annotations and no output schema, the description provides adequate purpose and usage context but lacks details about return format, data freshness, error conditions, or limitations. The description covers the 'what' and 'why' sufficiently but doesn't address important operational aspects that would help an agent use it effectively.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents both parameters thoroughly. The description mentions 'projectType' through examples ('kitchen remodel') and 'estimatedCost' through context ('project budget'), but doesn't add meaningful semantic information beyond what's in the schema. It implies optional parameters through 'Omit to see all projects' but this is already clear from the schema's lack of required parameters.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Get the estimated return on investment (ROI) for a home remodeling project in the Seattle area.' It specifies the verb ('Get'), resource ('ROI'), geographic scope ('Seattle area'), and data source ('Remodeling Magazine Cost vs. Value data for the Pacific Northwest'). It distinguishes from siblings like 'get_estimate' or 'list_projects' by focusing on ROI calculation rather than general project details or cost estimates.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use the tool: 'Helps homeowners decide which projects add the most resale value' with example queries like 'What ROI does a kitchen remodel get in Seattle?' or 'Which remodel pays off the most?' It implicitly suggests using this for ROI comparisons rather than other project-related queries handled by siblings. However, it doesn't explicitly state when NOT to use it or name specific alternative tools for different needs.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_project_testimonialsAInspect

Get customer testimonials tied to a specific project (by slug or keyword) from the testimonials table. Returns star rating, customer name, project name, and quote text. Use to source social proof or case-study quotes for a particular job. For unfiltered reviews, use list_reviews.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 10)
`keyword`	No	Keyword to fuzzy-match against testimonial project name or content (e.g. "kitchen", "deck")
`minRating`	No	Minimum star rating (1-5, default 1)
`projectSlug`	No	Project slug to match (e.g. "ballard-kitchen-remodel"). Falls back to title match.

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description must bear burden. Indicates it filters by slug or keyword, but does not disclose whether it returns only published testimonials, ordering, or pagination behavior. Adequate but not rich.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences: purpose, return fields, use case, and alternative. Front-loaded with key info. Could trim 'from the testimonials table' as it adds little.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description lists returned fields. Complexity is low (4 optional params). Covers filtering, use case, and sibling differentiation. Lacks pagination/ordering details, but acceptable for a simple read tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for all 4 parameters. Description adds context about fuzzy matching and fallback behavior for projectSlug, but this is already partially implied by schema descriptions. Baseline 3 justified.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the verb 'Get', the resource 'customer testimonials tied to a specific project', and key fields returned. Differentiates from sibling tool list_reviews by specifying project filtering and the unfiltered alternative.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly mentions use case (source social proof or case-study quotes) and points to list_reviews as alternative for unfiltered reviews. Could add when not to use (e.g., when needing all reviews).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_serviceAInspect

Get full details for a specific residential service by its slug. If not found, returns a list of available slugs to help you pick the right one.

ParametersJSON Schema

Name	Required	Description	Default
`slug`	Yes	The URL slug, e.g. "kitchen-remodeling"

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses key behavioral traits: it's a read operation (implied by 'Get'), handles not-found cases by returning a list of slugs, and specifies the resource type as 'residential service'. However, it doesn't mention potential errors, rate limits, or authentication needs, leaving some gaps for a tool with no annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the main purpose and followed by the fallback behavior. Every sentence adds value: the first defines the tool's core function, and the second clarifies error handling and usage guidance, with no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (1 parameter, no output schema, no annotations), the description is mostly complete: it covers purpose, usage, and key behavior. However, it lacks details on output format or error handling beyond the not-found case, which could be useful for an agent, though not critical for this simple tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents the 'slug' parameter with an example. The description adds context by explaining what the slug is used for ('to get full details') and the fallback behavior, but doesn't provide additional semantic details beyond what the schema offers, meeting the baseline for high coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Get' and resource 'full details for a specific residential service', specifying the identifier 'by its slug'. It distinguishes from siblings like 'list_services' (which lists services) and 'list_commercial_services' (which focuses on commercial services), making the purpose specific and differentiated.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It explicitly states when to use this tool ('for a specific residential service by its slug') and provides an alternative behavior ('If not found, returns a list of available slugs to help you pick the right one'), which implicitly guides usage by suggesting to check available slugs first if uncertain, and distinguishes from list-based tools like 'list_services'.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_weather_windowAInspect

Check if upcoming weather in Seattle is suitable for an exterior construction project. Returns a day-by-day forecast with go/no-go recommendations based on project-specific requirements (temperature, rain, wind). Perfect for scheduling exterior painting, decking, roofing, landscaping, siding, or fencing.

ParametersJSON Schema

Name	Required	Description	Default
`days`	No	Number of days to forecast (1-14, default 7)
`projectType`	Yes	Type of project to check weather suitability for

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It does reveal key behaviors: it's location-specific (Seattle), provides recommendations based on project requirements, and returns a structured forecast. However, it doesn't disclose important operational details like rate limits, authentication needs, data freshness, or what happens when parameters are omitted (e.g., default days behavior). The description doesn't contradict any annotations since none exist.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized with two sentences that are front-loaded with the core purpose. The first sentence clearly states what the tool does, and the second adds valuable context about perfect use cases. There's minimal redundancy, though the list of project examples could be slightly trimmed since they're already in the schema enum.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (2 parameters, no output schema, no annotations), the description is adequate but has gaps. It explains the purpose and context well, but doesn't describe the return format beyond 'day-by-day forecast with go/no-go recommendations' - no details about structure, units, or confidence levels. For a recommendation tool with no output schema, more information about the response would be helpful.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already fully documents both parameters. The description adds some context by mentioning 'project-specific requirements' that map to the projectType enum, but doesn't provide additional semantic meaning beyond what's in the schema descriptions. It doesn't explain how different project types affect the recommendations or clarify the relationship between days and projectType.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('check', 'returns') and resources ('upcoming weather in Seattle', 'day-by-day forecast with go/no-go recommendations'). It explicitly distinguishes this weather-checking tool from all sibling tools which focus on contractors, permits, content, projects, services, and business information rather than weather analysis.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context about when to use this tool: for 'scheduling exterior construction projects' with specific examples like painting, decking, roofing, etc. However, it doesn't explicitly state when NOT to use it or mention alternatives (e.g., general weather forecast tools), and doesn't address potential conflicts with sibling tools like 'get_project' or 'list_projects' that might also involve scheduling.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_blog_postsBInspect

List published blog posts about home remodeling, renovation costs, and construction tips. Filter by tag or author name.

ParametersJSON Schema

Name	Required	Description
`tag`	No	Filter posts by tag, e.g. "flooring", "deck", "painting"
`limit`	No	Max posts to return (default 20)
`author`	No	Filter posts by author name, e.g. "Marcus Reid"
`offset`	No	Pagination offset (default 0)

Tool Definition Quality

B3.2/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It mentions filtering capabilities but omits critical details: whether the tool is read-only (implied by 'List' but not stated), pagination behavior (offset/limit are in schema but not explained in description), rate limits, authentication needs, or what the return format looks like (no output schema). This leaves significant gaps for safe and effective use.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that front-loads the core purpose and key filtering options. Every word contributes directly to understanding the tool's function, with zero redundant or vague phrasing, making it highly scannable and actionable.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (4 parameters, filtering logic) and lack of annotations and output schema, the description is incomplete. It fails to address behavioral aspects like safety (read-only vs. mutative), response format, error handling, or usage constraints, leaving the agent under-informed for reliable invocation in a broader context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema fully documents all four parameters (tag, limit, author, offset). The description adds minimal value beyond the schema by mentioning filtering by 'tag or author name', which is already covered. No additional semantics like default behaviors or usage examples are provided, meeting the baseline for high schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('List') and resource ('published blog posts') with specific topical scope ('home remodeling, renovation costs, and construction tips'), making the purpose unambiguous. It distinguishes from most siblings like 'get_blog_post' (singular retrieval) and 'search_content' (broader search), though not explicitly compared to 'search_content' which might overlap.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for filtering blog posts by tag or author, but lacks explicit guidance on when to use this tool versus alternatives like 'search_content' (which might handle broader queries) or 'get_blog_post' (for single posts). No exclusions or prerequisites are mentioned, leaving the agent to infer context.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_blog_tags_and_categoriesAInspect

Enumerate every tag and category used across Kolmo's published blog posts, with post counts. Use this to discover what topics Kolmo publishes on before calling list_blog_posts, or to surface coverage gaps.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description must cover behavioral traits. It discloses that the tool enumerates tags/categories with post counts, but does not mention whether it returns all or only published-related data (though 'published blog posts' implies scope). No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with the core action and result ('Enumerate every tag and category... with post counts'). The second sentence adds value by stating usage context. Very concise, no waste.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given zero parameters, no output schema, and no annotations, the description provides a clear purpose and usage guidance. It could mention what the output format looks like (e.g., list of tags/categories with counts), but the core completeness is high.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% (no parameters), so the description has no param details to add. Baseline 3 is appropriate; the description adds no extra meaning beyond what the schema already conveys (empty).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses specific verbs ('Enumerate', 'discover') and resources ('tags and categories', 'published blog posts'), clearly distinguishing the tool from its sibling list_blog_posts.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly advises using this tool before calling list_blog_posts ('Use this to discover what topics Kolmo publishes on before calling list_blog_posts'), and also mentions surfacing coverage gaps, providing clear context for when to use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_commercial_servicesCInspect

List commercial construction services. Optionally filter by category: office, retail, industrial, hospitality, public. The "public" category covers public works and government contracts — Kolmo is WA State SEDBE-certified (#D700031098), King County SCS-certified (#7259), prevailing wage compliant, and MRSC Small Works Roster eligible (projects under $350K).

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 50)
`offset`	No	Pagination offset (default 0)
`category`	No	Filter: office, retail, industrial, hospitality, public

Tool Definition Quality

C2.8/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden for behavioral disclosure. It mentions filtering capabilities and some certification details for the 'public' category, but doesn't describe return format, pagination behavior, error conditions, or performance characteristics. For a list tool with 3 parameters, this leaves significant behavioral gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is reasonably concise but has structural issues. The first sentence clearly states the purpose, but the second sentence mixes filtering information with certification details that belong elsewhere. The certification information about Kolmo is contextually relevant but disrupts the flow and could be better organized.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a list tool with 3 parameters, no annotations, and no output schema, the description is incomplete. It doesn't describe the return format, pagination behavior, or what constitutes a 'service' in this context. The certification details for 'public' category are useful but don't compensate for missing core behavioral information.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents all parameters. The description adds marginal value by listing the category filter options and providing certification context for 'public', but doesn't explain parameter interactions or provide usage examples beyond what's in the schema. Baseline 3 is appropriate when schema does the heavy lifting.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'List commercial construction services' with optional filtering by category. It specifies the resource (services) and verb (list), but doesn't distinguish it from sibling 'list_services' which appears to be a broader version. The description is specific about the commercial construction context.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives like 'list_services' or 'get_service'. It mentions filtering capabilities but offers no context about appropriate use cases, prerequisites, or exclusions. The agent must infer usage from the tool name alone.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_permit_jurisdictionsAInspect

List the jurisdictions in Kolmo's permit catalog (King, Pierce, Snohomish counties). Returns portal URLs, contact info, code-cycle metadata, and verification status. Use to discover which cities are supported and where to submit permits.

ParametersJSON Schema

Name	Required	Description	Default
`county`	No	Filter by county
`verifiedOnly`	No	If true, only return rows where portal_url is populated (verified from official source). Default false.

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses the returned fields and filtering options, implying a read-only operation, but does not explicitly state behavioral traits like auth needs, rate limits, or side effects. Adequate but not thorough.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences front-load the purpose, scope, and returned data, followed by a usage directive. No extraneous words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description fully covers what the tool returns, its scope (three counties), available filters, and its use case. Without an output schema, the description adequately specifies return fields and behavior, leaving no critical gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description's mention of 'Filter by county' and 'verified source' essentially mirrors the schema descriptions without adding new meaning. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists jurisdictions in specific counties and details the returned data (portal URLs, contact info, code-cycle metadata, verification status). It distinguishes from siblings like check_permit_requirements or check_service_area_coverage by focusing on discovering supported cities and submission portals.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Use to discover which cities are supported and where to submit permits,' providing clear context for when to use. No when-not-to-use or alternatives are mentioned, but given sibling tools, the usage is well-defined and non-overlapping.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_procurement_codesAInspect

List Kolmo's vendor procurement codes (NAICS, NIGP, UNSPSC) for government and agency portals such as SAM.gov, WA WEBS, OpenGov, MRSC, King County, and City of Seattle. Use this when vetting Kolmo for gov bids or setting up Kolmo as a vendor. Primary NAICS is 236118 (Residential Remodelers); secondary codes cover commercial building and specialty trades (painting, flooring, drywall, windows, roofing, siding, etc.).

ParametersJSON Schema

Name	Required	Description	Default
`system`	No	Code system to return (default: all)

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses that codes are for government portals and lists specific systems (NAICS, NIGP, UNSPSC), plus primary and secondary codes. This adds useful behavioral context beyond a generic list. Slight deduction for not mentioning that the tool returns all codes by default, which is implied by the optional system parameter.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences with front-loaded purpose and clear usage context. The second sentence provides extra detail but is not redundant. It could be slightly more concise by merging the last clause about secondary codes, but overall efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and one optional parameter, the description adequately covers purpose, usage, and code details. It could mention that the tool returns a list of codes or the response format, but the context signals show no output schema, so the description does not need to explain return values. Missing a note about default behavior for 'system' parameter, but schema covers that.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% and the description adds meaning by explaining the context (government bids, specific portals) and listing example codes (236118). The parameter 'system' is described in schema as 'Code system to return (default: all)', but the description enriches it with real-world examples like SAM.gov and WA WEBS.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists vendor procurement codes (NAICS, NIGP, UNSPSC) for specific government portals, with a specific verb 'List' and resource 'procurement codes'. It distinguishes from siblings like check_contractor_license_status by focusing on code listings rather than license checks.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use: 'when vetting Kolmo for gov bids or setting up Kolmo as a vendor'. It provides clear context without alternatives, but the tool is unique enough among siblings that no exclusion is needed.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_projectsBInspect

List completed remodeling projects with before/after photos and locations. Filter by category or keyword.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 20)
`offset`	No	Pagination offset (default 0)
`search`	No	Filter by keyword in project title, description, or location
`category`	No	Filter: kitchen, bathroom, full-home, outdoor, basement

Tool Definition Quality

B3.1/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden. It mentions filtering and that projects are 'completed' with photos/locations, but doesn't disclose behavioral traits like pagination behavior (implied by limit/offset but not explained), rate limits, authentication requirements, or what happens with no filters. The description is minimal beyond basic functionality.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that front-loads the core purpose ('List completed remodeling projects...') and adds filtering details. Every word earns its place with zero waste or redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a read-only list tool with 4 parameters and 100% schema coverage but no output schema, the description is adequate but has gaps. It covers what's listed and filtering options, but lacks context on result format (e.g., what fields are returned), pagination details, or error conditions. It meets minimum viability given the schema handles parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema fully documents all 4 parameters. The description adds marginal value by mentioning 'Filter by category or keyword', which maps to 'category' and 'search' parameters, but doesn't provide additional semantics beyond what's in the schema descriptions. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('List') and resource ('completed remodeling projects'), specifying they include 'before/after photos and locations'. It distinguishes from siblings like 'list_blog_posts' or 'list_services' by focusing on projects, but doesn't explicitly differentiate from 'list_project_types' or 'get_project'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives like 'get_project' (for single project details), 'list_project_types' (for categories), or 'search_content' (for broader searches). It mentions filtering capabilities but gives no context for tool selection.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_project_typesAInspect

List all 11 supported calculator project types with their required input fields and descriptions. Useful for discovery before calling get_estimate.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It describes what the tool returns (project types with input fields and descriptions), which is useful, but lacks details on behavioral traits like rate limits, error handling, or response format. It adds some value but not comprehensive behavioral context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the core purpose and followed by usage guidance. Every sentence earns its place with no wasted words, making it highly efficient and well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (0 parameters, no output schema, no annotations), the description is complete enough for its purpose. It explains what the tool does and when to use it, though it could benefit from more behavioral details like response format. It adequately covers the essential context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The tool has 0 parameters, and schema description coverage is 100%, so no parameter documentation is needed. The description does not add parameter semantics, but with no parameters, a baseline of 4 is appropriate as it doesn't need to compensate for gaps.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with a specific verb ('List') and resource ('all 11 supported calculator project types'), and it distinguishes from siblings by specifying it's for discovery before using 'get_estimate', which is a sibling tool. This provides precise differentiation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool ('Useful for discovery before calling get_estimate'), providing clear context and an alternative (use 'get_estimate' after discovery). This gives direct guidance on usage versus alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_reviewsAInspect

List customer reviews and testimonials for Kolmo Construction. Combines verified Google reviews and on-site testimonials. Filter by minimum star rating.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 20)
`offset`	No	Pagination offset (default 0)
`source`	No	Source to query: all (default), google, testimonials
`minRating`	No	Minimum star rating to include (1-5, default 1)

Tool Definition Quality

A3.5/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It mentions combining sources and filtering, but lacks details on permissions, rate limits, pagination behavior (beyond schema hints), or what the output looks like (e.g., format, fields). For a list tool with zero annotation coverage, this is a significant gap in behavioral disclosure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized and front-loaded, with two concise sentences that directly state the tool's purpose and key functionality. Every sentence earns its place by providing essential information without redundancy or fluff.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (4 parameters, no output schema, no annotations), the description is adequate but incomplete. It covers the purpose and basic filtering, but lacks details on output format, error handling, or integration context, which would be helpful for an agent to use it effectively.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema fully documents all parameters. The description adds minimal value by mentioning 'Filter by minimum star rating', which aligns with the 'minRating' parameter but doesn't provide additional semantics beyond what the schema already specifies. Baseline 3 is appropriate when the schema does the heavy lifting.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('List', 'Filter') and resources ('customer reviews and testimonials for Kolmo Construction'), and distinguishes it from siblings by specifying it combines verified Google reviews and on-site testimonials, unlike other tools that focus on projects, services, or business info.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage by mentioning filtering capabilities (e.g., by minimum star rating), but does not explicitly state when to use this tool versus alternatives like 'get_business_info' or 'list_projects'. No exclusions or specific contexts are provided, leaving usage somewhat open-ended.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_servicesAInspect

List all residential remodeling services with slugs, descriptions, and page URLs. Use search to find by keyword.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results (default 50)
`offset`	No	Pagination offset (default 0)
`search`	No	Filter by keyword in service name or description

Tool Definition Quality

A4.2/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It mentions the tool lists services with specific fields, which is helpful, but doesn't disclose behavioral traits like pagination behavior (implied by limit/offset parameters but not explained), rate limits, authentication needs, or error conditions. The description adds basic context but lacks depth for a tool with parameters.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two concise sentences with zero waste. The first sentence front-loads the core purpose and output, and the second sentence provides crucial usage guidance. Every word earns its place, making it highly efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (3 parameters, no output schema, no annotations), the description is reasonably complete. It covers purpose, output fields, and usage guidelines. However, it lacks details on behavioral aspects like pagination or error handling, which would be beneficial for full completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema fully documents the three parameters (limit, offset, search). The description adds minimal value beyond the schema by mentioning 'search' for keyword filtering, but doesn't explain parameter interactions or provide additional semantics. Baseline 3 is appropriate when schema does the heavy lifting.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('List all'), resource ('residential remodeling services'), and output fields ('slugs, descriptions, and page URLs'). It distinguishes from sibling tools like 'list_commercial_services' by specifying 'residential' and from 'search_content' by indicating this tool lists all services while search is for keyword filtering.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly provides when to use this tool ('List all residential remodeling services') versus alternatives ('Use `search` to find by keyword'), naming the specific sibling tool 'search' for keyword-based filtering. This gives clear guidance on tool selection.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

lookup_parcel_by_addressAInspect

Address-first parcel lookup powering the /permits experience. Geocodes a Seattle-area address (King, Pierce, or Snohomish County, WA), resolves the parcel from the county GIS, and returns zoning, setbacks, overlays (shoreline / ECA / flood / historic), lot area, jurisdiction routing, and prior-permit history. Every fact is cited to the city/county source. Use for "what can be built at 123 Main St Seattle?" or before calling check_permit_requirements / estimate_permit_fee for a specific parcel.

ParametersJSON Schema

Name	Required	Description	Default
`address`	Yes	Full street address (e.g., "1234 NE 65th St, Seattle, WA 98115"). Must be in King, Pierce, or Snohomish County, WA.
`forceRefresh`	No	Skip the 30-day cache and re-fetch from county GIS + overlay sources. Default false.

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Despite no annotations, the description discloses key behaviors: caching (30-day, forceRefresh), geographic scope (King, Pierce, Snohomish counties), return fields (zoning, setbacks, overlays, etc.), and citation practices. Could mention rate limits or potential errors but covers essential traits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and front-loaded, but slightly verbose. Every sentence adds value, though some details (e.g., enumerated return fields) could be condensed without losing clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description thoroughly explains the return data (zoning, overlays, jurisdiction, etc.) and includes caching behavior, geographic constraints, and source citations. It is fully self-contained.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with good descriptions. The description adds context on address format and clarifies the cache bypass behavior for forceRefresh, exceeding the baseline of 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose as an 'Address-first parcel lookup' that geocodes, resolves parcels, and returns zoning, setbacks, overlays, etc. It distinguishes itself from siblings like check_permit_requirements and estimate_permit_fee by positioning itself as a prerequisite.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly provides use cases ('what can be built at 123 Main St Seattle?') and advises using it before calling specific sibling tools (check_permit_requirements, estimate_permit_fee), guiding the agent on when and when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

match_contractor_to_permitAInspect

Cross-reference a WA contractor's L&I license specialty against a permit's required specialties. Returns whether the contractor is qualified to pull/work the permit, with explicit gap callouts (e.g. "missing electrical specialty 02"). Combines real-time L&I data with Kolmo's authoritative permit catalog.

ParametersJSON Schema

Name	Required	Description
`projectType`	Yes	Canonical project type (kitchen, bathroom, basement, deck, fence, siding, windows, flooring, adu, roofing, hvac, electrical, plumbing, addition)
`contractorQuery`	Yes	Contractor license number or business name
`jurisdictionSlug`	Yes	Permit jurisdiction slug (e.g. "seattle", "tacoma")

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries full burden. It discloses return type (qualified status with gap callouts) and data sources (real-time L&I data, catalog), but lacks details on side effects, auth needs, or rate limits. Adequate but not exhaustive.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences front-loaded with core function and result details. No excess words; each sentence adds distinct value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description adequately explains return value (qualification with gap callouts) and data sources. Covers main behavioral aspects for a matching tool. Slightly lacking structure specifics but sufficient.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. Description adds context (e.g., contractorQuery can be license number or name) but largely mirrors schema descriptions. Does not significantly enhance understanding beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's function: cross-reference contractor license specialty against permit required specialties. It uses specific verbs ('cross-reference') and resource ('permit requirements') and distinguishes from siblings like check_contractor_license_status by focusing on matching.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies when to use (when matching contractor to permit) but does not explicitly state when not to use or provide alternatives. No guidance on choosing between this and sibling tools like check_permit_requirements.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parse_project_descriptionAInspect

Parse a homeowner's natural-language project description into structured permit-relevant fields: projectType (kitchen|bathroom|deck|adu|fence|...), areaSqft, heightClass, attached/detached, position, and materials. Returns confidence + a single clarifyingQuestion when the parse is ambiguous. Use this before calling check_permit_requirements / estimate_permit_fee when you only have free text from the homeowner. Backed by Gemini 2.5 Pro with a constrained JSON schema.

ParametersJSON Schema

Name	Required	Description
`cityName`	No	City name (e.g., "Seattle", "Bellevue")
`zoningCode`	No	Zoning code from lookup_parcel_by_address (e.g., "NR2", "LR1")
`description`	Yes	Free-text project description (e.g., "I want to add a 200 sqft deck off my master bedroom on the second floor")
`jurisdictionSlug`	No	Jurisdiction slug from lookup_parcel_by_address (helps disambiguate region-specific terminology)

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses that the tool returns confidence and a clarifying question when ambiguous, and notes it is backed by Gemini 2.5 Pro with a constrained JSON schema. This gives good insight into behavior, though it lacks details on error handling or edge cases.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise (about 80 words) and front-loaded with the main action. Every sentence serves a purpose: listing output fields, explaining return value, specifying usage context, and noting the backing model. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With no output schema, the description fully explains return values (confidence and clarifying question) and lists all key structured fields. It also provides usage context and parameter hints. For a four-parameter parser tool, this is complete and leaves no major gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description adds value by explaining that cityName, zoningCode, and jurisdictionSlug help disambiguate region-specific terminology, and that the description field is free text. This goes beyond the schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: parsing natural-language project descriptions into structured permit-relevant fields. It lists specific fields (projectType, areaSqft, etc.) and distinguishes it from sibling tools like check_permit_requirements and estimate_permit_fee by stating when to use it.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says to use this tool when you only have free text from the homeowner and before calling check_permit_requirements or estimate_permit_fee. While it doesn't explicitly state when not to use it, the context is clear enough for an agent to infer appropriate usage.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

resolve_permit_submittalsAInspect

Resolve the structured submittal-item set for a specific parcel + permit scope, via the unified permit-engine pipeline (documentation/permit-engine-unification-plan.md). Use this when you need the per-item checklist (with triggers, source citations, filler actor, gap-resolution path) — not just the rule-level summary that check_permit_requirements returns. Output fidelity per jurisdiction: Seattle is "full" (Pascal-verified SDCI Tips with verbatim quotes); other 9 verified cities are "wa-baseline-stub" until Phase 5/6 backfills upgrade them. Unverified jurisdictions return no-spec.

ParametersJSON Schema

Name	Required	Description
`scope`	Yes	Permit scope — the (project-type x work-class) identifier the provider catalogs by
`address`	Yes	Full street address — geocoded to a parcel and matched to a jurisdiction provider
`envelopeChange`	No	Project alters the building envelope (windows, doors, siding, roof). Default false. Drives WSEC triggers.
`structuralChange`	No	Project involves a structural change (header, framing, lateral). Default false. Drives several triggers.
`totalFloorAreaSqft`	No	Total floor area in sqft. Used by SEPA threshold (>=12000) and other size-based triggers.
`projectValuationUsd`	No	Project valuation in USD

Tool Definition Quality

A4.5/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully bears the burden of behavioral disclosure. It details the unified pipeline, per-jurisdiction output fidelity (Seattle full, other verified cities wa-baseline-stub, unverified no-spec), and the nature of the result (per-item checklist with triggers, source citations, etc.).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, dense paragraph that conveys all necessary information without redundancy. While it is front-loaded with the core purpose and usage, the paragraph could be slightly more structured for readability, but it remains efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (6 parameters, no output schema, multiple sibling tools), the description covers the tool's role, input, output behavior, jurisdiction-specific fidelity, and links to a documentation file. It lacks explicit return value details but compensates with contextual fidelity tiers.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the input schema already documents all parameters with their types, constraints, and descriptions. The description adds no additional parameter-specific meaning beyond referencing the pipeline and output fidelity.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly defines the tool's purpose: resolving the structured submittal-item set for a specific parcel and permit scope via the unified permit-engine pipeline. It contrasts with the sibling tool check_permit_requirements, which returns a rule-level summary, thus establishing a distinct role.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool ('when you need the per-item checklist') and when not to use it ('not just the rule-level summary that check_permit_requirements returns'). It also provides guidance on output fidelity based on jurisdiction, helping the agent decide applicability.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_contentAInspect

Search across all Kolmo content — services, projects, and blog posts — with a single keyword query. Returns ranked results grouped by type. Use this instead of calling list_services + list_projects + list_blog_posts separately.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max results per type (default 5)
`query`	Yes	Search keyword or phrase, e.g. "deck Seattle", "kitchen cost", "permit"
`types`	No	Content types to include (default: all three)

Tool Definition Quality

A4.2/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. The description reveals that the tool returns 'ranked results grouped by type,' which adds valuable behavioral context beyond the input schema. However, it doesn't mention other important behavioral aspects like pagination, error handling, or performance characteristics that would be helpful for a search tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly concise with just two sentences that each earn their place. The first sentence establishes purpose and scope, while the second provides crucial usage guidance. There's no wasted language, and the most important information (what the tool does and when to use it) is front-loaded effectively.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a search tool with 3 parameters, 100% schema coverage, but no annotations or output schema, the description provides good contextual completeness. It explains the tool's purpose, scope, and usage guidelines clearly. The main gap is the lack of output format details (since there's no output schema), but the description does mention that results are 'ranked' and 'grouped by type,' which provides some output context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already fully documents all three parameters. The description doesn't add any parameter-specific information beyond what's in the schema (e.g., it doesn't explain search algorithm details or ranking criteria). With complete schema coverage, the baseline score of 3 is appropriate as the description doesn't enhance parameter understanding but doesn't need to compensate for gaps.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('Search across all Kolmo content') and resources ('services, projects, and blog posts'). It explicitly distinguishes this tool from its siblings (list_services, list_projects, list_blog_posts) by stating it searches across all content types with a single query, making the differentiation unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage guidance by stating when to use this tool ('Use this instead of calling list_services + list_projects + list_blog_posts separately'). It clearly identifies the alternative approach (calling multiple sibling tools) and recommends this tool as a consolidated alternative, giving the agent clear decision-making criteria.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

submit_contact_requestAInspect

Submit a contact or quote request to Kolmo Construction on behalf of a user. Set dryRun: true to preview what would be sent without actually submitting.

ParametersJSON Schema

Name	Required	Description
`name`	Yes	Full name
`email`	Yes	Email address
`phone`	No	Phone number (optional)
`dryRun`	No	If true, validate and preview without submitting (default false)
`message`	Yes	Project description or question
`service`	No	Service needed, e.g. "kitchen remodel"

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses the dryRun behavior (preview vs. actual submission), which is a key behavioral trait. However, it doesn't mention other important aspects like authentication requirements, rate limits, error handling, or what happens after submission (e.g., confirmation, follow-up). The description is accurate but incomplete for a mutation tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with the core purpose and followed by a specific usage tip for dryRun. Every sentence earns its place with no wasted words, making it highly efficient and easy to parse.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a mutation tool with no annotations and no output schema, the description is adequate but has gaps. It covers the purpose and dryRun behavior, but lacks details on authentication, response format, error conditions, or integration context. Given the complexity (submitting to a business), more completeness would be beneficial, though it meets minimum viable standards.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents all parameters thoroughly (e.g., name as 'Full name', dryRun as 'If true, validate and preview without submitting'). The description adds minimal value by mentioning dryRun's purpose, but doesn't provide additional semantic context beyond what's in the schema. Baseline 3 is appropriate when schema does the heavy lifting.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Submit a contact or quote request'), identifies the target ('Kolmo Construction'), and specifies the beneficiary ('on behalf of a user'). It distinguishes itself from sibling tools which are primarily informational/query tools (e.g., get_estimate, list_services) by being a submission/mutation tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear guidance on when to use the dryRun parameter ('Set dryRun: true to preview what would be sent without actually submitting'), which implicitly suggests using false for actual submission. However, it doesn't explicitly state when to use this tool versus alternatives like get_estimate or list_services, nor does it mention prerequisites or exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Server Details

Available Tools

Discussions

Your Connectors

Resources