ca-rate-filings

by ai.serff

Server Details

Natural-language search over California's public insurance rate, rule & form filings.

Status: Healthy
Last Tested: 2026-07-25 16:30
Transport: Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A4.7/5.0

Tool DescriptionsA

Average 4.7/5 across 17 of 17 tools scored.

Server CoherenceA

Disambiguation5/5

Every tool has a clearly distinct purpose: retrieval (get_*), listing (list_*), search (search_*), and server meta (mcp_*). The search tools are differentiated by the data source (actuarial memo, full document, summary, etc.), and filing vs. product are clearly separated. Detailed descriptions further eliminate ambiguity.

Naming Consistency5/5

Tool names follow a consistent verb_noun pattern: get_<entity>, search_<entity>, list_<entity>, mcp_<function>. The verb is always lowercase and the noun is descriptive. Even the mcp prefix is consistent for server-level operations.

Tool Count5/5

17 tools is well-scoped for a domain like insurance rate filings. It provides multiple entry points (by SERFF, by product, by content search) without overwhelming the user. Each tool earns its place, covering core retrieval, search, and meta operations.

Completeness5/5

The tool surface covers the full lifecycle of exploring rate filings: finding filings (search_filings, search_products), retrieving summaries and extracts (get_filing_summary, get_filing_extracts, get_filing_extract_meta), lineage and references, source file management, and semantic search across multiple granularities. No obvious gaps.

Available Tools

17 tools

get_filing_extract_metaGet Filing Extract MetaA

Read-onlyIdempotent

Inspect

Lists what's in each extracted artefact for a filing — section counts, item names, and the page each item came from — without returning any of the bulky factor tables, descriptions, or rate rows themselves.

Call this FIRST, before get_filing_extracts, for any "what does this filing contain" question. It costs a fraction of the tokens and tells you which file + which section you need to pull in detail. get_filing_extracts is then the targeted second call once you know the SERFF + file + section that actually answer the user's question.

Use this when the user asks:

"What forms does this filing include?" / "List the form numbers in TSIS-134726605."
"How many exclusions does it carry? What are they called?"
"What rate tables are in this filing, and which PDF page are they on?"
"List the discounts / endorsements / coverages this filing offers."
"Where in the source PDF is the territory rate table?"
Any "how many", "what are the names of", or "which page is X on" question about a filing's extracted artefacts.

Wrong surface for:

Anything that needs the actual numeric content (factor values, full rate rows, full exclusion text). Call get_filing_extracts instead, narrowing files to just the one(s) you discovered here.

Whitelist (same as get_filing_extracts):

calculations.json — example rate-calculation walk-throughs.
coverages.json — coverage definitions (perils, limits, applicability).
deductibles.json — deductible options + factors.
discounts.json — discount / surcharge schedules.
endorsements.json — optional endorsements / riders.
examples.json — worked policyholder rating examples.
exclusions.json — coverage exclusions + the conditions they apply to.
extraction_summary.json — structured filing-overview fields.
final_rating_calculation.json — canonical rating expression.
forms.json — policy form numbers + types.
rates_data.json — base rates + rate-table headers.
underwriting_guidelines.json — eligibility / UW rules.

Per item the tool returns { name, source_page? }. The item name is picked from whichever identifying field exists (name → form_number → id → key → code → coverage → label → title). source_page is the page in the source PDF where the item was extracted from, when the pipeline recorded one.

rates_data.json items additionally carry source_file — the source PDF the rate table lives in — when the filing has a single source PDF. Multi-source filings get source_file_note flagging the limit (per-item source_file on non-rate extracts needs a pipeline-side change, deferred).

Args: serff (required), files (optional — pass a subset of the whitelist to narrow; omit for all 12).

Returns: { serff, files: { "<name>": { file_name, filing_ref?, confidence?, sections: { "<key>": { count, items: [...] } }, total_items } }, count, skipped }.

ParametersJSON Schema

Name	Required	Description	Default
`files`	No	Optional subset of whitelist file names to summarise. Omit or pass empty for all 12.
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "AAIC-134567890").

Tool Definition Quality

A4.8/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Adds behavioral context beyond annotations: token efficiency, item name selection priority, special handling of rates_data.json fields, and limitations (multi-source files). Annotations already declare readOnlyHint and idempotentHint, so description complements well.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured with bullet points, examples, and clear sections. Front-loaded with purpose and usage. Slightly long but every sentence adds value; could be trimmed for extreme conciseness.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite no output schema, the description details the return shape (serff, files, count, skipped) and item structure (name, source_page). Covers all behavioral aspects for a complex tool with 12 file types and optional filtering.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with good parameter descriptions. The description adds context on serff format, files being an optional subset of whitelist, and defaults, enhancing usability without repeating schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it lists 'what's in each extracted artefact' with specific details (section counts, item names, source page) and explicitly distinguishes it from get_filing_extracts by noting it returns metadata without bulky content.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly instructs to 'Call this FIRST, before get_filing_extracts', provides concrete user question examples, and states what it is not for (actual numeric content). Also references sibling tool get_filing_extracts for targeted data.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_filing_extractsGet Filing ExtractsA

Read-onlyIdempotent

Inspect

Returns the structured-data JSON artefacts the pipeline extracted from a filing's source PDFs. Use this when the question is about rating mechanics, data tables, risk curves, calculation steps, or coverage / form definitions — anything where the narrative summary isn't enough and the LLM needs the actual structured rows.

Whitelist (and what each contains):

calculations.json — step-by-step rate calculation walk-through (base rate, factor application, final premium). One entry per documented example calculation.
coverages.json — coverage definitions: which perils / lines / risk types the filing addresses, with limits and applicability.
deductibles.json — deductible options offered, dollar amounts, and any peril-specific rules.
discounts.json — available discounts / surcharges, eligibility criteria, and the corresponding multiplicative factors.
endorsements.json — optional endorsements / riders attached to the filing.
examples.json — worked policyholder examples (sample insureds with calculated premiums).
exclusions.json — coverage exclusions and conditions under which they apply.
extraction_summary.json — structured machine-readable form of the same content get_filing_summary returns as Markdown; useful when you want filing-type / what-this-filing-does fields as JSON rather than prose.
final_rating_calculation.json — the canonical rating expression / equation the filing prescribes (base × factor1 × factor2 …).
forms.json — policy form numbers, edition dates, and the form types associated with the filing.
rates_data.json — the rate tables themselves: rows of (segment / cell / factor) values. The biggest file by far — can be hundreds of thousands of rows for territory-detailed filings. See truncation below.
underwriting_guidelines.json — eligibility and underwriting rules (e.g. credit-tier bands, prior-loss caps).

Truncation: any returned file whose JSON contains an array longer than 100 rows is truncated to the first 100 rows. The truncated file gets a _truncated envelope describing the original total. For a lighter table-of-contents view (counts, item names, source pages — no payloads) call get_filing_extract_meta instead; it's the right surface for "what's in this filing" questions. rates_data.json is the common case where truncation fires.

Args: serff (required), files (optional array — narrows the response to a subset of the whitelist; pass empty / omit for everything).

Returns: { serff, files: { "<name>": <parsed json> | { content, _truncated } }, count, skipped, truncated }.

ParametersJSON Schema

Name	Required	Description	Default
`files`	No	Optional subset of whitelist file names to return (e.g. ["rates_data.json", "calculations.json"]). Omit or pass empty for the full whitelist.
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "AAIC-134567890"). Validated against /^[A-Z]{3,5}-[A-Z0-9]{7,15}$/.

Tool Definition Quality

A4.8/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already convey readOnlyHint and idempotentHint. The description adds critical behavioral context: truncation of arrays longer than 100 rows to the first 100 with a `_truncated` envelope, and states that 'rates_data.json is the common case where truncation fires.' No contradictions with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with clear sections: purpose, whitelist details, truncation behavior, and args summary. It is somewhat lengthy but every sentence contributes necessary detail. Minor verbosity could be trimmed, but overall efficient for the amount of information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (multiple file types, truncation, optional parameter), the description is remarkably complete. It details the return structure, includes examples of file contents, and references the sibling tool for lighter queries. No output schema exists, but the description compensates by explaining the return format.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds value beyond the schema: for the `files` parameter, it explains how to narrow the response and clarifies that omitting or passing empty returns everything. For `serff`, the schema already provides regex validation, but the description does not add new info there.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description starts with a clear verb+resource: 'Returns the structured-data JSON artefacts the pipeline extracted from a filing's source PDFs.' It further clarifies the tool's scope with specific use cases and distinguishes from the sibling tool get_filing_extract_meta by noting that meta returns a lighter table-of-contents view without payloads.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool (e.g., 'when the question is about rating mechanics, data tables, risk curves, calculation steps, or coverage / form definitions') and when not (e.g., 'narrative summary isn't enough'). It also directly points to an alternative: 'call get_filing_extract_meta instead; it's the right surface for "what's in this filing" questions.'

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_filing_lineageGet Filing LineageA

Read-onlyIdempotent

Inspect

Returns the reconciled lineage chain for a SERFF id — leaf filing plus ordered predecessors back to the bureau root. Each chain entry includes the SERFF id, position (0 = leaf), role (leaf / predecessor), and a lite filing record (state, year, carrier name, product name, filing type, filing date).

Distinct from get_filing_references, which returns what the filing itself claims inside the PDF. Use this when you want the canonical chain (e.g. "what's the bureau root and prior versions for this Progressive auto programme?"); use get_filing_references when you want the carrier-stated lineage.

Walks back from any SERFF in a programme's chain — pass either the leaf or any predecessor and you get the same chain back. Returns { error: ... } if the SERFF id has not been resolved into any programme chain (the filing may be a non-rate-affecting type — Withdrawal / Correspondence — or simply not yet ingested).

Pair with search_filings using predecessor_prefix: search returns "filings that some programme adopted from bureau X"; lineage tells you, for any of those filings, the full chain it sits in.

ParametersJSON Schema

Name	Required	Description	Default
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "AAIC-134567890"). Either the leaf or a predecessor — the chain is returned regardless. Validated against /^[A-Z]{3,5}-[A-Z0-9]{7,15}$/; invalid values return an error envelope.

Tool Definition Quality

A4.8/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate read-only and idempotent behavior. The description adds context: walking back from any chain member, returning error envelope for unresolved IDs, and explaining the type of filings that cause errors. No contradictions with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with bold key terms, clear sections for purpose, distinction, and edge cases. It's slightly longer than minimal but each sentence adds value, justifying the length.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Although there's no output schema, the description adequately describes the chain entries (fields like SERFF id, position, role, lite record) and covers error behavior. It could explicitly mention the response format (array) but is still thorough for a single-parameter tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% for the single parameter, but the description adds value by explaining you can pass leaf or predecessor, validating against regex, and noting error envelope for invalid values. This goes beyond the schema's basic description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it returns the 'reconciled lineage chain' for a SERFF id, specifying the output includes ordered predecessors and leaf filing with fields. It explicitly distinguishes from sibling tool `get_filing_references`, making the purpose unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit when-to-use guidance: use this for canonical chain vs. `get_filing_references` for carrier-stated lineage. It also explains error cases (non-rate-affecting types, not yet ingested) and suggests pairing with `search_filings`.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_filing_referencesGet Filing ReferencesA

Read-onlyIdempotent

Inspect

Returns the predecessor, superseded, and companion filings that this filing itself cites in its supporting documentation. Carrier-claimed lineage extracted from inside the PDF (e.g. "supersedes XXXX-NNNN", "loss costs adopted from NCCI-NNNN").

Distinct from get_filing_lineage, which returns the reconciled chain across the corpus. The two often agree but can diverge — get_filing_references is the carrier's stated lineage; get_filing_lineage is what was actually wired together across filings. When they disagree, that is itself a signal worth surfacing.

Each entry typically carries a SERFF id, NAIC, group code, filing type, and a relationship label (predecessor / superseded / loss-cost-source). Use to answer "what does this filing claim to replace?" or "which bureau filing did this carrier adopt?".

Returns { error: ... } if no references record exists for the SERFF id (the filing has not yet been classified).

ParametersJSON Schema

Name	Required	Description	Default
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "AAIC-134567890"). Validated against /^[A-Z]{3,5}-[A-Z0-9]{7,15}$/; invalid values return an error envelope.

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations indicate readOnly and idempotent. Description adds that references are extracted from inside the PDF and that missing records return an error envelope. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three well-organized paragraphs: definition, differentiation from sibling, and usage/error info. Every sentence adds value; front-loaded with core action.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a single-parameter tool with full schema coverage, the description explains return fields (SERFF id, NAIC, etc.) and error case, making it fully self-contained without an output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description adds validation pattern and error behavior for the serff parameter. This exceeds baseline.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it returns predecessor, superseded, and companion filings cited by the filing itself. It distinguishes from the sibling get_filing_lineage by explaining the difference in scope (carrier-stated vs. reconciled chain).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly contrasts with get_filing_lineage, noting when they may diverge and that divergence is a signal. Provides specific use cases: 'what does this filing claim to replace?' and 'which bureau filing did this carrier adopt?'

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_filing_source_file_linkGet Filing Source File LinkA

Read-only

Inspect

Returns a short-lived V4-signed GCS URL for a single SOURCE file (PDF / XLSM / XLSX / DOC / ZIP) the carrier submitted for a SERFF filing. The link is intended for display to the end user — they click it in their browser to download the file.

CRITICAL: DO NOT fetch this URL yourself. Surface it to the user verbatim and stop. The URL is a signed link for the human's browser, not for the model. Fetching it pulls the entire source file (often tens of MB of PDF / XLSM) into your context window and serves no purpose the user did not already get from seeing the link.

Pair with list_filing_source_files to discover the file names first, then call this to mint a link. When you respond to the user, include the URL and the expires_at timestamp so they know how long they have to click — after that the link returns 403 and they'll need to ask for a fresh one.

Link properties: direct V4-signed GCS URL, expires after ttl_seconds (default 900 = 15 min, capped at 3600). Bypasses Cloud Run entirely. Intended for human clicks, NOT for the model to fetch.

Whitelist is dynamic, keyed off the actual contents of the filing's source-files directory — same set list_filing_source_files advertises. file_name must be a basename (no slashes, no ..) AND must appear in the listing.

Returns { serff, file_name, url, expires_at, ttl_seconds, notice }. The notice repeats the don't-fetch directive — include it in your response to the user too.

ParametersJSON Schema

Name	Required	Description
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "REGU-134742228").
`file_name`	Yes	Basename of a file present in this SERFF's source-files directory (the exact set returned by `list_filing_source_files`). No slashes, no path traversal — pure basename.
`ttl_seconds`	No	Signed-URL lifetime in seconds. Default 900 (15 minutes). Capped at 3600 (1 hour). Shorter is preferred — the link is for the human to click immediately, not for long-term storage.

Tool Definition Quality

A5/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations indicate readOnlyHint=true, and the description adds extensive behavioral details: the URL is V4-signed, bypasses Cloud Run, has a configurable TTL (default 900s, capped 3600s), and returns a 403 if expired. It also clarifies that the model should not fetch the URL, mitigating misuse.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Though lengthy, every sentence is purposeful and front-loaded with the most critical info (purpose, then the critical 'do not fetch' warning, then usage pairing). Structure is logical and efficient for an AI agent.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite no output schema, the description fully defines the return format (serff, file_name, url, expires_at, ttl_seconds, notice) and covers edge cases (expired link, security constraints). It also references the sibling tool for file discovery, ensuring complete operational context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description enriches each parameter with critical context: 'serff' shape explained, 'file_name' must be basename with no traversal, 'ttl_seconds' defaults and cap noted. This goes well beyond the schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool returns a short-lived V4-signed GCS URL for a single source file (PDF/XLSM/etc.), using specific verbs and resource types. It distinguishes itself from siblings by explicitly pairing with 'list_filing_source_files' and contrasting with other 'get_filing_*' tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit instructions: pair with 'list_filing_source_files' first, display the URL to the user verbatim, do not fetch it yourself, and include the 'expires_at' timestamp. It also warns against long-term storage of the URL, making usage context crystal clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_filing_summaryGet Filing SummaryA

Read-onlyIdempotent

Inspect

Returns an actuarial narrative summary for a single SERFF id — the Filing Type header, the "What This Filing Does" section (concrete bullet-pointed change list with page citations for Rate / Rule / Form / New Programme / Withdrawal filings), the structured Description, and the key references the summary cites.

This is the fastest route from "I have a SERFF id" to "I understand what this filing changes" — typically a few KB rather than the hundreds of KB of raw source. Page citations of the form (p. N) let a reviewer verify each claim against the source PDF.

Returns { error: ... } if no summary exists for the SERFF id (the filing has not yet been classified). Use list_filing_source_files and mcp_health to triage; do not retry.

ParametersJSON Schema

Name	Required	Description	Default
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "AAIC-134567890"). Validated against /^[A-Z]{3,5}-[A-Z0-9]{7,15}$/; invalid values return an error envelope.

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations declare readOnlyHint, idempotentHint, and openWorldHint; the description aligns by describing a read-only retrieval. It adds behavioral details: returns error if filing not classified, page citation format, and typical size (few KB vs raw source). No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise (multiple sentences but no fluff), front-loads the core purpose, and uses formatting (bold, parentheses) to structure key details. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The tool is simple (one param, no output schema) but the description covers return content, error handling, and size context. With annotations covering safety and idempotency, the agent has sufficient information to use the tool correctly. A minor gap is the lack of explicit return type structure, but the narrative description compensates.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The sole parameter 'serff' is fully described in the input schema with format validation and error envelope. The description reiterates some of this but adds no significant new meaning; with 100% schema coverage, baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool 'returns an actuarial narrative summary' for a single SERFF id, listing specific sections returned (Filing Type, 'What This Filing Does', Description, references). It distinguishes from siblings by positioning as the 'fastest route' and referencing alternative tools for troubleshooting.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use (having a SERFF id, wanting to understand changes) and when not to (if error, do not retry; instead use list_filing_source_files and mcp_health). It provides concrete triage instructions, making the decision clear for an agent.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_productGet Product DetailA

Read-onlyIdempotent

Inspect

Returns the full record for a single product — by product_id (uuid) OR leaf_serff. Each call expands the predecessor chain into a list of {position, serff, filing} where each filing is the slim metadata (state, year, carrier, product, filing type, status, date). Saves the round-trip you'd otherwise need (search_products → get_filing_summary × N).

Returns the substantive-leaf signals (leaf_likely_new, leaf_file_bytes, leaf_embed_bytes), the chain evidence score (chain_evidence_score), the pricing-lineage label (adoption_sources), and the array of sibling product ids (related_product_ids) sharing a bureau/me-too source. Pair with get_product_siblings to resolve those ids into full sibling rows in one call.

Pass either argument — not both. Returns { error: ... } if neither provided, on malformed UUID/SERFF, or when no product matches.

ParametersJSON Schema

Name	Required	Description	Default
`leaf_serff`	No	Leaf-filing SERFF id, shape PREFIX-IDENTIFIER. Mutually exclusive with `product_id`.
`product_id`	No	Canonical product UUID (e.g. "00006cf7-ae2b-4eb4-b774-2fe66debd40d"). Mutually exclusive with `leaf_serff` — pass one.

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint. Description adds details on output structure, signals, and error conditions without contradicting annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Front-loaded with main purpose, followed by benefits and detailed return structure. Every sentence adds value, no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite no output schema, description fully explains return fields and pairings. Complete for a focused read tool with annotation support.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but description adds context like 'PREFIX-IDENTIFIER' for leaf_serff and mutual exclusivity beyond schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states it returns the full record for a single product by product_id or leaf_serff. Distinguishes from siblings like search_products and get_product_siblings.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly describes when to use ('saves round-trip') and what not to do ('pass either argument — not both'), with error cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_product_ancestorsGet Product AncestorsA

Read-onlyIdempotent

Inspect

Returns the prior revisions of a product — other products in the same lineage cluster (same company_name, state, lob) with an older leaf_filing_date. Distinct from get_product_siblings, which returns peers sharing a bureau / cross-carrier source.

Use to answer "how has this programme evolved?" — walk back through each revision in chronological order.

Returns slim rows (product identity, leaf, adoption_sources, depth, root) ordered oldest → newest. If the seed product is missing any of the cluster-key fields (company_name / state / lob / leaf_filing_date), returns ancestors: [] with an explanatory note field — not an error.

Pass product_id (uuid) OR leaf_serff. Returns { ancestors: [], count: 0 } when no prior revision exists — not an error.

ParametersJSON Schema

Name	Required	Description	Default
`leaf_serff`	No	Leaf-filing SERFF id. Mutually exclusive with `product_id`.
`product_id`	No	Canonical product UUID. Mutually exclusive with `leaf_serff` — pass one.

Tool Definition Quality

A4.8/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint, idempotentHint, and openWorldHint. Description adds context about return format (slim rows, fields, ordering), non-error handling for missing fields or no ancestors, and input alternatives. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is detailed but well-structured: first paragraph explains what, second paragraph when to use, third paragraph edge cases and parameter guidance. Every sentence adds value with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description adequately explains return shape, parameter constraints, edge cases, and intended use. Covers all required aspects for effective tool invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. Description adds clarity by explaining mutual exclusivity of the two parameters and specifying that one must be provided despite not being marked required. Also describes the function of leaf_serff explicitly.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states 'Returns the prior revisions of a product' and differentiates from sibling tool 'get_product_siblings' by explaining the difference between ancestors and peers. Verb and resource are specific.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says when to use: 'Use to answer "how has this programme evolved?" — walk back through each revision in chronological order.' Also distinguishes from alternative and describes edge case behavior (returns empty array with note, not error).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_product_siblingsGet Product SiblingsA

Read-onlyIdempotent

Inspect

Returns the sibling products of a given product — other products whose chain shares a bureau filing (ISO / AAIS / NCCI / WCRT / WCRB / SURE) or a cross-carrier predecessor with the seed product. The shared filing is the "source" both products adopted from — multiple carriers' parallel adoptions of the same ISO programme, or multiple successors of a single me-too source.

Resolves the seed's stored related_product_ids array into slim sibling rows (product identity, leaf, LOB, state, adoption_sources, depth, root). Cheaper than a full search_products predicate when you already have a known product and want its peers.

Use to:

find every product riding on a specific ISO loss-cost filing (start from one bureau-adopting product, get all the others adopting the same root),
spot programmes that look proprietary but actually share a cross-carrier source,
explore me-too clusters around a single source filing.

Pass product_id (uuid) OR leaf_serff. Returns { siblings: [], count: 0 } when the product has no siblings — not an error.

ParametersJSON Schema

Name	Required	Description	Default
`leaf_serff`	No	Leaf-filing SERFF id. Mutually exclusive with `product_id`.
`product_id`	No	Canonical product UUID. Mutually exclusive with `leaf_serff` — pass one.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint. The description adds context about resolving related_product_ids, returning slim rows, and handling empty results with a non-error response. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is structured into clear paragraphs: concept, resolution, use cases, parameter/return notes. Slightly long but efficient for the complexity. No redundant sentences.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of the sibling concept, complete schema coverage, and no output schema, the description fully explains the return shape (siblings array with count) and edge case (empty non-error). Also explains the cheaper aspect and mutual exclusivity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with clear descriptions for both parameters. The description does not add new semantic info beyond confirming mutual exclusivity, which is already in the schema. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool returns sibling products of a given product, explaining the concept of siblings via shared filing or cross-carrier predecessor. It distinguishes from siblings like search_products (cheaper, specific use case) and get_product_ancestors.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly lists 'Use to:' scenarios and implies when not to use (full search) with mention of 'cheaper than a full search_products predicate'. Alternatives are clearly identified.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_filing_source_filesList Filing Source FilesA

Read-onlyIdempotent

Inspect

Lists the source files (PDFs, XLS spreadsheets, DOC manuals, ZIP archives) ingested for a SERFF id. Returns metadata only — name, size in bytes, MIME-class type (pdf / spreadsheet / document / csv / archive / other), file extension, modified timestamp.

Pair with get_filing_source_file_link to mint a signed download link the user can click — list names here, mint a link there.

Use this to:

triage a filing whose summary looks thin ("did we even ingest the right files?"),
discover the XLSM rater / rate manual PDF / rating-samples spreadsheet for a filing,
confirm which artefacts a filing actually shipped (e.g. is there a separate rate manual XLS, or just the PDF?).

Returns { error: ... } if no source files exist for the SERFF id.

ParametersJSON Schema

Name	Required	Description	Default
`serff`	Yes	Canonical SERFF id, shape PREFIX-IDENTIFIER (e.g. "AAIC-134567890"). Validated against /^[A-Z]{3,5}-[A-Z0-9]{7,15}$/; invalid values return an error envelope.

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true. The description adds that the tool only returns metadata, not file content, and that it returns an error if no files exist. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose, uses structured bullet points for use cases, and every sentence adds value. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given low complexity (1 parameter, simple list), the description fully covers what the tool does, what it returns, and when to use it. No output schema needed as return types are described.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds value beyond the schema by detailing the return metadata types (pdf, spreadsheet, etc.) and explaining the pairing with the sibling tool, making the parameter's purpose clearer in context.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists source files ingested for a SERFF id, returning metadata only. It explicitly differentiates from sibling `get_filing_source_file_link` by noting it provides names while the sibling mints download links.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage scenarios (triage, discover files, confirm artifacts) and pairs with a sibling tool. It also states the error case for missing files, giving clear when-to-use guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

mcp_accountMCP Account DetailsA

Read-onlyIdempotent

Inspect

Returns the resolved identity behind the current MCP bearer — email, company_name, account_type (free vs production), and company_reference. Quota-exempt: this is an identity probe, not a value-bearing call. Returns nulls for fields mono has no value for. Useful for an MCP client to confirm "who am I talking to mono as" without burning the user's monthly quota.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnly and idempotent. Description adds that it is quota-exempt and returns nulls for missing fields, which are valuable behavioral insights beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences with no wasted words. Front-loaded with returned fields, then adds quota-exempt and null behavior. Perfectly structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description adequately explains return fields, null handling, and quota exemption. Could mention any prerequisites or limits, but overall complete for a simple identity probe.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters exist, so the description does not need to add parameter information. Schema coverage is 100% vacuously. Baseline for 0 params is 4.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the tool returns the resolved identity (email, company_name, account_type, company_reference) of the current MCP bearer, distinguishing it from sibling tools which focus on filings and products.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says it is useful for confirming identity without burning quota, providing context for when to use. However, it does not explicitly mention alternatives or when not to use, though siblings are clearly different.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

mcp_healthMCP Server HealthA

Read-onlyIdempotent

Inspect

Diagnostic snapshot of the deployed MCP server: build identifier, server_version (1.0. tag), boot time, advertised tool names, a hash of the tool surface, and corpus_updated_at (freshest watermark across the filings pipeline). Call this first when you suspect the connector is showing a stale tool list or you want to detect whether code or data has changed since your last call — compare tools_advertised against what your client lists, server_version for code, corpus_updated_at for data.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnlyHint and idempotentHint, so safe behavior is clear. Description adds context about return fields and their meanings, which goes beyond annotations. Could mention if there are any rate limits or costs, but overall transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with purpose and fields, then usage scenarios. It could be slightly shorter, but every sentence adds value. No redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Without an output schema, the description compensates by detailing the return fields (build identifier, server_version, boot time, advertised tool names, hash, corpus_updated_at) and their purpose. This is comprehensive for a health endpoint.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters are needed; the description handles this naturally by not mentioning any. The tool has zero parameters, so the description adds no confusion and is consistent with the empty input schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a diagnostic snapshot of the deployed MCP server and lists specific fields like build identifier, server_version, boot time, advertised tool names, hash, and corpus_updated_at. It distinguishes itself from sibling tools which focus on filings and products.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly recommends calling this tool first when suspecting stale tool lists or detecting code/data changes. Provides concrete comparisons: tools_advertised vs client list, server_version for code, corpus_updated_at for data.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_actuarial_embedsSemantic Search — Actuarial MemosA

Read-onlyIdempotent

Inspect

Pure vector search over per-filing actuarial-memorandum embeddings (extract_embeds where kind='actuarial_memo'). Each hit is a filing whose memo is semantically closest to your query, with the matching excerpt and lite filing metadata.

Cost: one query-embedding call + one indexed Postgres lookup. Bounded, cheap, fast. No LLM planning, no LLM composition.

This is the right tool any time the question is actuarial-shape. Reach for it — not search_summary_embeds and not search_filing_embeds — when the user is asking about:

Rate adequacy: headline rate change, indicated vs selected, off-balance, capping.
Loss trends: severity trend, frequency trend, pure-premium trend, projected ultimates, LDFs, IBNR development.
Credibility / experience: experience period, weight assigned to own experience vs class-plan / bureau, credibility tables.
Expense / profit provisions: permissible loss ratio, target combined ratio, profit & contingency loading, expense ratio, investment-income offset.
Reason codes / drivers: reinsurance cost, weather/cat load, severity-driven rate need, mix shift, frequency reductions from telematics.
Anything where the answer would be a number from the actuarial memo rather than a description of what the filing does.

The memo is where actuaries put the numerics; the extraction summary is where the pipeline puts the prose. If the question reaches for numbers, hit this surface first.

Wrong surface for:

Content questions ("filings discussing wildfire scoring", "telematics programmes", "parametric triggers") — those discuss what the filing is about, not actuarial numerics. Use search_summary_embeds (broader coverage).
Concrete-filter questions ("Filings from carrier NAIC 12345 in 2024") — use search_filings.
Filings with no actuarial memo. Memos are typically attached to Rate filings; Form, Rule, and Withdrawal filings often have none. Coverage is narrower than search_summary_embeds for that reason — most of the 2026 corpus is covered, prior years are backfilling.

How to combine:

"Personal auto filings in California whose indicated rate exceeds selected by 5+ points" → search_filings (state=CA, product_type="Personal Auto", filing_type="Rate") to scope a candidate set, then this tool over the candidates' memos.
"Carriers citing severity-driven rate need in 2025" → this tool first; get_filing_summary on the top hits to read in full.

Returns top-K hits, each with {serff, similarity, excerpt, meta}. Default topK=10, max 50. Excerpt is the first 800 chars of the matching memo.

ParametersJSON Schema

Name	Required	Description	Default
`topK`	No	Number of top filings to return. Defaults to 10; capped at 50.
`query`	Yes	Natural-language query. Pass the user's actuarial question verbatim — short, specific queries (5-30 words) match best. The query is embedded and cosine-compared against per-filing actuarial-memo embeddings.

Tool Definition Quality

A4.9/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint; description adds cost characteristics (one query-embedding call + indexed Postgres lookup, bounded cheap fast), return format (top-K hits with similarity, excerpt, meta), excerpt length (800 chars), and coverage limitations. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Relatively lengthy but well-structured with sections (cost, when to use, wrong surfaces, how to combine, returns). Front-loaded with key purpose. Every section adds value, but could be slightly more concise overall.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Comprehensive description covering embedding source, search mechanism, coverage notes, return fields, and integration with other tools. No output schema but description explains return structure sufficiently. Handles all relevant context for this complex tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100% (both query and topK described). Description adds valuable context: for query, recommends short specific queries (5-30 words) and pass verbatim; for topK, states default 10, max 50. Enhances usability beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The title and description clearly identify it as a semantic search over actuarial memos. It distinguishes itself from sibling tools like search_summary_embeds and search_filing_embeds by specifying the kind of content (actuarial numerics vs content description vs filing metadata).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use the tool (questions about actuarial numerics like rate adequacy, loss trends, etc.) and when not to use it (content questions, concrete filters, filings without actuarial memos). Provides specific alternatives and examples of combining tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_filing_embedsSemantic Search — Filing BodyA

Read-onlyIdempotent

Inspect

Pure vector search over per-chunk full-document embeddings (filing_embeds, ~12.4M rows across ~65K filings — each filing sliced into ~190 paragraph-sized chunks). The most granular semantic surface in the corpus.

Cost: one query-embedding call + one indexed Postgres lookup. No LLM planning, no LLM composition.

Right surface for:

"Find the exact passage discussing X" — granular text-search where you need the paragraph not just the filing.
"Find filings whose body text mentions X" when the summary-level surface (search_summary_embeds) might miss a topic buried in a long PDF.
"Drill into this specific filing semantically" — pass serff to restrict the cosine search to a single filing. Without scoping, commodity-vocabulary chunks from other filings can out-rank your target filing; scoping eliminates that.

Wrong surface for:

Filing-level questions where multiple hits per filing are noise — use search_summary_embeds (one match per filing).
Concrete-filter questions like "Filings from carrier NAIC 12345 in 2024" — use search_filings.

aggregate: true (default) collapses to top-K filings by best-chunk similarity (one row per filing, the best matching paragraph as excerpt). aggregate: false returns top-K raw chunks (may include several from the same filing) — use when the user asked to see the actual paragraphs. When serff is set, aggregate is forced to false (every hit is the same filing already).

Returns top-K hits, each with {serff, chunk_index, similarity, excerpt, meta}. Default topK=10, max 50. Excerpt is the first 800 chars of the matching chunk.

ParametersJSON Schema

Name	Required	Description
`naic`	No	Exact NAIC carrier identifier (5-digit string).
`topK`	No	Number of top hits to return. Defaults to 10; capped at 50. If filters narrow the candidate set below topK you get what's there, no silent fallback to cross-filing matches.
`year`	No	Exact filing year. Mutually exclusive with year_from/year_to.
`query`	Yes	Natural-language query. Pass the user's question verbatim when you can — short, specific queries (5-30 words) match best. The query is embedded and cosine-compared against per-chunk body embeddings.
`serff`	No	Optional SERFF id to scope the chunk search to a single filing (shape PREFIX-IDENTIFIER, e.g. "REGU-134742228"). Use this when you already know which filing you want to read semantically — e.g. "find the territory factor table in REGU-134742228".
`state`	No	Two-letter US state code, uppercase. Corpus currently covers CA only.
`date_to`	No	Upper bound on filing date (ISO YYYY-MM-DD).
`year_to`	No	Upper bound on filing year, inclusive.
`aggregate`	No	When true (default), collapse to top-K filings by best-chunk similarity. When false, return top-K raw chunks (may include multiple chunks from the same filing). Ignored (forced to false) when `serff` is set — scoping to one filing always returns raw chunks.
`date_from`	No	Lower bound on filing date (ISO YYYY-MM-DD).
`year_from`	No	Lower bound on filing year, inclusive.
`filing_type`	No	Wildcard match on filing type ("Rate", "Rule", "Form", etc.). Substring match.
`product_type`	No	Wildcard match on product type. Substring match — "Auto" matches Personal Auto and Commercial Auto.
`predecessor_prefix`	No	Bureau / org SERFF prefix ("ISOF", "NCCI", "AAIS", "MSO").

Tool Definition Quality

A4.9/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint and idempotentHint, and the description adds cost details, query behavior, aggregate parameter behavior (forced false with serff), and return format. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is well-structured with sections, front-loaded with core definition, costs, usage guidance, then parameter details. Every sentence adds value without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given complexity (14 parameters, no output schema), the description covers all parameters, behavior, use cases, and return format comprehensively. Complete for agent usage.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but description adds value beyond schema by explaining query length recommendations, serff format, aggregate parameter interplay, and filter behavior. Baseline 3, plus extra context.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a pure vector search over per-chunk embeddings, the most granular semantic surface. It distinguishes from siblings by mentioning search_summary_embeds for filing-level and search_filings for concrete filters.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly lists 'Right surface for' and 'Wrong surface for' sections with specific scenarios and tool alternatives, providing clear guidance on when to use this tool vs. others.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_filingsSearch FilingsA

Read-onlyIdempotent

Inspect

Search the SERFF filings corpus by carrier, NAIC, product line, state, year-range, filing type, or bureau lineage. Returns a lite row shape per match (SERFF id, state, year, NAIC, group code, carrier name, product name, filing type / status / date). For the substance of a filing, follow up with get_filing_summary once you have a SERFF id.

All filters AND together. Defaults: limit=25, capped at 100; ordered by filing date descending. Pagination via offset. The full count matching the predicate is returned in total (independent of limit/offset) so you can decide whether to paginate or narrow the predicate.

Common patterns:

"All California auto filings from 2024" → state="CA", product_type="Auto", year=2024.
"Recent rule changes in workers comp" → product_type="Workers", filing_type="Rule", year_from=2023.
"Which Progressive filings adopted ISO?" → search="PRGS", predecessor_prefix="ISOF".
"Anything mentioning telematics in the product name" → search="telematics".

predecessor_prefix answers "filings adopted from a bureau" questions — it restricts to filings that appear in some programme's adopted-from chain, so orphan bureau filings no carrier ever pulled in are excluded. Validated against /^[A-Z][A-Z0-9]{1,7}-?$/; trailing dash optional. Invalid values return { error: ... } rather than a row set.

Only filings that have been fully read and classified are returned — partial / pre-classification rows are hidden so every result is a filing you can actually reason about.

ParametersJSON Schema

Name	Required	Description
`naic`	No	Exact match on the NAIC carrier identifier (5-digit string). Use when you know the specific carrier (e.g. "24260" = Progressive Direct). One filing always belongs to exactly one NAIC.
`year`	No	Exact filing year (e.g. 2024). The corpus covers filings from 2005 onward; earlier years return no rows. Mutually exclusive with `year_from`/`year_to` — pick one form.
`limit`	No	Max rows returned in this call. Defaults to 25; capped at 100. Pair with `offset` to page. Always check `total` in the response to decide whether you need more pages.
`state`	No	Two-letter US state code, uppercase. The corpus currently covers California (`CA`) only — other state codes will return no rows until additional states are onboarded. One filing always belongs to exactly one state.
`offset`	No	Row offset for pagination. Defaults to 0. Combined with the descending filing-date ordering, `offset=N` skips the most recent N filings matching the predicate.
`search`	No	Free-text wildcard match across SERFF id, carrier name, and product name. Useful for "anything mentioning Progressive" or "filings whose product name contains 'condo'". Prefer the structured fields below — `naic`, `state`, `filing_type`, `product_type` — when you can; they are more precise and do not false-match on substrings.
`date_to`	No	Upper bound on filing date as ISO date (YYYY-MM-DD).
`year_to`	No	Upper bound on filing year, inclusive.
`date_from`	No	Lower bound on filing date as ISO date (YYYY-MM-DD). Use for finer-grained windows than `year_from` allows. Compares against the date the carrier submitted the filing, not the rate effective date.
`year_from`	No	Lower bound on filing year, inclusive. The corpus covers filings from 2005 onward. Pair with `year_to` for a range, or use alone for "everything since".
`filing_type`	No	Wildcard match on filing type. Common values: "Rate", "Rule", "Rate/Rule", "Loss Cost / Rule", "Form", "Rate/Rule/Form", "Withdrawal", "Correspondence", "Adoption". Substring matches: `filing_type="Rule"` returns Rule, Rate/Rule, and Loss Cost / Rule. Substantive (rate-affecting) filings are typically Rate, Rule, Rate/Rule, Loss Cost / Rule, or Form combinations.
`product_type`	No	Wildcard match on product type. Common values include "Personal Auto", "Homeowners", "Commercial Auto", "Workers Compensation", "Property", "Liability". Substring matches: `product_type="Auto"` returns Personal Auto and Commercial Auto.
`predecessor_prefix`	No	Bureau or organisation SERFF prefix — common values: "ISOF" (ISO Services), "NCCI" (workers comp loss costs), "AAIS" (American Association of Insurance Services), "MSO" (Mutual Service Organisation). Returns filings carriers actually adopted into a programme — orphan bureau filings nobody picked up are excluded. Validated against /^[A-Z][A-Z0-9]{1,7}-?$/ (1-8 alphanumeric chars, trailing dash optional). Invalid input returns `{ error: ... }`. Case-insensitive on the way in.

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare read-only and idempotent. Description adds: only fully read/classified filings returned, pagination with offset and total count, AND-filtering, predecessor_prefix validation with error, state limitation to CA. Rich behavioral context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured: purpose, return shape, behavior, examples, special cases. Somewhat lengthy but every sentence adds value. Front-loaded with key info.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 13 optional params and no output schema, description covers all aspects: pagination, ordering, filtering, validation, data quality. Return shape described textually. Complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers 100% of parameters with descriptions. Description adds value with examples (e.g., predecessor_prefix prefix patterns, wildcard behavior for filing_type and product_type, date vs year differentiation). Not redundant.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it searches the SERFF filings corpus by multiple criteria and mentions return shape. Distinguishes from sibling get_filing_summary for follow-up details.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit context: use for initial search, follow with get_filing_summary for substance. Includes common examples and patterns. No explicit when-not-to-use, but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_productsSearch ProductsA

Read-onlyIdempotent

Inspect

Scope warning — read before reaching for this tool. The products corpus is a curated subset of filings, not the canonical breadth. Many filings are absent by design: bureau-only rows, withdrawals, correspondences, prior revisions (collapsed into the latest), and any filing the build pipeline hasn't classified as a substantive programme. For "what filings exist", "what carriers have filed", "is carrier X in state Y" questions — use search_filings. Reach for search_products only when the question is explicitly about programmes, lineage clusters, latest-revision picks, or adoption-source labels.

Search the products corpus by carrier, state, LOB, year-range, pricing-lineage classification (adoption_sources), and chain shape. Products are programme histories — each row is one carrier programme defined by its leaf filing plus an ordered chain of predecessor SERFFs back to a root.

By default, returns ONE row per (company_name, state, lob) lineage cluster — the latest revision by leaf_filing_date. Pass all_revisions=true to get every row including prior revisions of the same programme.

Filters AND together. Defaults: limit=25 capped at 100, ordered by leaf_filing_date DESC. total is the full predicate-matching count, independent of limit/offset.

Common patterns:

"Full bureau workers-comp programmes in CA" → state="CA", lob="WORKERS COMPENSATION", adoption_sources_any=["bureau"].
"Proprietary commercial-auto programmes 2024+" → lob="AUTO", leaf_year_from=2024, adoption_sources_any=["proprietary"].
"Me-too programmes Progressive's chain feeds" → adoption_sources_any=["me_too","bureau_forms_only"] (returns either), company_name="Progressive".
"Substantive new-program leaves" → leaf_likely_new=true. These are programmes whose own filing carries the full rate manual (high-confidence price-ables).
"Strong-evidence chains" → chain_evidence_min=0.7 (cosine of leaf vs. immediate parent chunks).

adoption_sources valid values: bureau (full bureau adoption — loss costs + LCM), me_too (full adoption of another carrier's programme), proprietary (carrier owns the rates), bureau_forms_only (modifier — bureau forms with own/me-too rates). Use adoption_sources_any to match products whose array contains ANY of the listed values.

Each row returns a lite shape — product identity, leaf and root SERFFs, chain depth, adoption_sources, leaf_likely_new, chain_evidence_score. For the full row (full chain expansion + related siblings) follow up with get_product.

ParametersJSON Schema

Name	Required	Description
`lob`	No	Substring match on product LOB family (state_product_type → falls back to serff_product_type). Common values: "AUTO LIAB/PHYS DAMAGE", "WORKERS COMPENSATION", "OTHER LIABILITY", "MULTI-PERIL", "HOMEOWNERS MULTI-PERIL", "FIRE", "INLAND MARINE". `lob="AUTO"` returns all auto variants.
`depth`	No	Exact chain depth (number of predecessors). depth=0 means a standalone leaf with no chain.
`limit`	No	Max rows returned. Defaults to 25, capped at 100. Pair with `offset` to page.
`state`	No	Two-letter US state code, uppercase. The corpus currently covers California (`CA`) only — other state codes will return no rows.
`offset`	No	Row offset for pagination. Defaults to 0.
`depth_to`	No	Upper bound on chain depth, inclusive.
`leaf_year`	No	Exact leaf year (e.g. 2026). The leaf is the most recent filing in the chain — this is the programme's most recent revision. Mutually exclusive with `leaf_year_from`/`leaf_year_to`.
`root_type`	No	Substring match on the root filing's type. Common values: "New Program", "Transferred Program", "Auto Class Plan", "Class Plan", "Manual", "Rate". `root_type="New Program"` returns programmes that trace back to a clean launch.
`depth_from`	No	Lower bound on chain depth, inclusive. Use for "programmes with at least N predecessors".
`leaf_serff`	No	Direct lookup by leaf SERFF id. Exact match. Same as filtering all products that this filing is the leaf of (max one row per leaf).
`root_serff`	No	Filter to products rooted at this SERFF id. Returns all programmes whose chain terminates at the given root (typically many products share a bureau root).
`company_name`	No	Substring (ILIKE) match on the leaf-filing carrier name. Use for "Progressive programmes" type queries.
`leaf_year_to`	No	Upper bound on leaf year, inclusive.
`all_revisions`	No	When true, returns every product including prior revisions of the same lineage cluster. Default false — only the latest revision per (company_name, state, lob) is returned. Use for time-series queries (e.g. "how has Acme's CA auto rate manual evolved?") or chain audits.
`leaf_year_from`	No	Lower bound on leaf year, inclusive. Use for "programmes whose most recent revision is on or after Y".
`leaf_likely_new`	No	True when the leaf itself is a substantive programme launch — its own filing carries the rate manual (root-type label AND ≥3MB files or ≥200KB embeds). High-confidence buildables.
`chain_evidence_max`	No	Upper bound on `chain_evidence_score`. Use for "low-evidence" diagnostic queries.
`chain_evidence_min`	No	Lower bound on `chain_evidence_score` (cosine similarity of leaf-filing chunks vs immediate-parent chunks, 0-1). Use to find products with strong content-overlap chains (e.g. `chain_evidence_min=0.7`). NULL scores are excluded by any min filter.
`adoption_sources_any`	No	Match products whose `adoption_sources` array contains ANY of the listed labels (PostgreSQL `&&` overlap). E.g. `["bureau"]` returns full-bureau-adoption products; `["me_too","bureau_forms_only"]` returns products that are me-too OR carry bureau forms. Valid labels: `bureau` (full bureau — loss costs + LCM), `me_too` (full adoption of another carrier), `proprietary` (carrier owns rates), `bureau_forms_only` (modifier — bureau forms with own/me-too rates).

Tool Definition Quality

A4.9/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate read-only and idempotent. The description adds details about default behavior (one row per lineage cluster), pagination, and the meaning of fields like 'chain_evidence_score'. No contradictions with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with a warning, explanation, patterns, and parameter details. It is somewhat verbose but every sentence adds value. Slight trimming could improve conciseness.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (19 parameters, no output schema), the description covers default behavior, pagination, filtering logic, and example queries. It is comprehensive enough for effective agent use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

All 19 parameters have descriptions in the input schema (100% coverage). The description text further elaborates on several parameters, such as 'adoption_sources_any' valid values and 'chain_evidence_min' meaning, adding value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool searches the 'products corpus' by various filters and explicitly distinguishes from 'search_filings' by specifying when each should be used. The verb 'search' and resource 'products' are clear.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description includes a 'Scope warning' that explains what the products corpus does and does not contain, and directs to 'search_filings' for broader queries. It provides common patterns and explicit guidance on when to use this tool vs. alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

search_summary_embedsSemantic Search — Filing SummariesA

Read-onlyIdempotent

Inspect

Pure vector search over per-filing extraction-summary embeddings (one embedding per filing, ~59K rows total). Each hit is a filing whose extraction summary is semantically closest to your query, with the matching excerpt and lite filing metadata (state, year, company, product type, filing type, filing date).

Cost: one query-embedding call + one indexed Postgres lookup. Bounded, cheap, fast. No LLM planning, no LLM composition. Always reach for this before any LLM-driven alternative.

Right surface for what is this filing about questions:

"Show me filings discussing X" — content questions where X is not a concrete filter (wildfire scoring, telematics programmes, autonomous-vehicle exposure, ESG factors, parametric triggers, etc.).
"Find filings that mention " — when you need to discover filings by content rather than by structured metadata.
"Filings citing trend data on " — when the question is content-shaped, not numerics-shaped.

Wrong surface for:

Actuarial-shape questions like "filings with credibility under 50%", "filings whose indicated and selected rate diverge sharply", "rate filings where frequency trend is negative". Use search_actuarial_embeds — those numerics live in the actuarial memo, not the summary.
Concrete-filter questions like "Filings from carrier NAIC 12345 in 2024" or "ISOF-rooted filings carriers adopted". Use search_filings with the typed filters — much faster, no embedding cost at all.
Anything with a SERFF id already in hand — use the get_filing_* tools.

How to combine:

For "recent auto programmes in California with novel rating factors": first search_filings (state=CA, product_type="Auto", year_from=…) to get a candidate set, then call this tool over those candidates' descriptions implied by the question.
For "filings whose summary mentions X": this tool alone, then get_filing_summary on the top hits to read in full.

Returns top-K hits, each with {serff, similarity, excerpt, meta}. Default topK=10, max 50. Excerpt is the first 800 chars of the matching summary.

ParametersJSON Schema

Name	Required	Description
`naic`	No	Exact NAIC carrier identifier (5-digit string). Restricts the cosine search to that carrier.
`topK`	No	Number of top filings to return. Defaults to 10; capped at 50. The result will contain at most this many rows; if filters narrow the candidate set below topK you get what's there, no silent fallback.
`year`	No	Exact filing year. Mutually exclusive with year_from/year_to.
`query`	Yes	Natural-language query. Pass the user's question verbatim when you can — short, specific queries (5-30 words) match best. The query is embedded and cosine-compared against per-filing summary embeddings.
`serff`	No	Optional SERFF id to scope the search to a single filing's summary embedding (shape PREFIX-IDENTIFIER). Each filing has at most one summary embedding, so topK is effectively 1 when serff is set.
`state`	No	Two-letter US state code, uppercase. Corpus currently covers CA only.
`date_to`	No	Upper bound on filing date (ISO YYYY-MM-DD).
`year_to`	No	Upper bound on filing year, inclusive.
`date_from`	No	Lower bound on filing date (ISO YYYY-MM-DD).
`year_from`	No	Lower bound on filing year, inclusive.
`filing_type`	No	Wildcard match on filing type ("Rate", "Rule", "Form", "Withdrawal", etc.). Substring match.
`product_type`	No	Wildcard match on product type ("Personal Auto", "Homeowners", "Commercial Auto", "Workers Compensation", etc.). Substring match — "Auto" matches both Personal and Commercial Auto.
`predecessor_prefix`	No	Bureau / org SERFF prefix ("ISOF", "NCCI", "AAIS", "MSO"). Restricts to filings carriers actually adopted into a programme.

Tool Definition Quality

A4.9/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true. The description adds significant behavioral context: cost (one embedding call + indexed lookup), no LLM planning/composition, bounded and fast. It also explains query embedding and cosine comparison. No contradictions with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with bullet lists and clear sections, but it is somewhat lengthy. Every sentence adds value, though some repetition could be trimmed. It is front-loaded with the core purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 13 parameters, 1 required, no output schema, the description covers return format (top-K hits with fields), default/max topK, excerpt length, behavior when filters narrow candidate set, and cost. It also explains query interpretation and combination with other tools. Very complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for all 13 parameters. The description adds extra meaning: typ for naic (5-digit string), default/cap for topK, verbatim query advice, mutual exclusivity of year and year_from/to, wildcard behavior for filing_type and product_type, and serff scoping. This greatly aids correct parameter usage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states this is a pure vector search over per-filing summary embeddings. It distinguishes from siblings like search_actuarial_embeds and search_filing_embeds by specifying the embedding type and use cases. The scope (one embedding per filing, ~59K rows) and exact matching excerpt/metadata are well-defined.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly lists right surfaces (content questions, topic discovery) and wrong surfaces (actuarial-shape questions, concrete filters, SERFF id). It also provides combination strategies with other tools like search_filings and get_filing_summary, guiding the agent on when to use this tool versus alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Resources

Need Help?