UK Legal Research

Name: UK Legal Research
Author: paulieb89

by io.github.paulieb89

Ownership verified

Server Details

UK legal research — case law, legislation, Hansard, bills, votes, committees, HMRC, citations

Status: Healthy
Last Tested: 2026-05-25 11:33
Transport: Streamable HTTP
URL
Repository: paulieb89/uk-legal-mcp
GitHub Stars: 7
Server Listing: UK Legal Research MCP Server

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A4.1/5.0

Tool DescriptionsA

Average 4.4/5 across 29 of 29 tools scored. Lowest: 3.5/5.

Server CoherenceA

Disambiguation4/5

Most tools target distinct domains (bills, case law, citations, etc.) but there is some potential overlap among the case law tools (search vs. grep vs. get header/index/paragraph) and among citation parsing tools. Descriptions generally clarify boundaries, so an agent can differentiate with careful reading.

Naming Consistency5/5

All tools use a consistent underscore_separated verb_noun pattern (e.g., bills_get_bill, case_law_search, legislation_get_section). Despite some tools using domain prefix (e.g., judgment_get_header), the pattern is predictable and uniform across the set.

Tool Count4/5

29 tools is at the high end of reasonable for a comprehensive legal research server covering multiple subdomains (bills, case law, legislation, parliament, etc.). Each tool serves a distinct purpose, and the count feels justified for the breadth of functionality.

Completeness5/5

The tool surface covers the full lifecycle of legal research: search, retrieve details, navigate structure, and analyze citations for bills, case law, legislation, committees, parliament, and HMRC. There are no obvious gaps—core operations like CRUD and deeper analysis are present.

Available Tools

29 tools

bills_get_billGet Bill DetailA

Read-onlyIdempotent

Inspect

Get full detail for a specific parliamentary bill.

Returns sponsors, current stage, long title, summary, and Royal Assent date if enacted. Summary text is capped per max_summary_chars — check summary_truncated in the response to see if it was cut.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	BillDetailInput with bill_id and optional max_summary_chars.

Output Schema

ParametersJSON Schema

Name	Required	Description
`id`	Yes	Bill ID
`url`	Yes	Parliament URL for this bill
`is_act`	No	Whether the bill has received Royal Assent
`stages`	No	Legislative stages the bill has passed through
`summary`	No	Bill summary text, possibly truncated per max_summary_chars. Check summary_truncated and summary_original_length for full-text info.
`sponsors`	No	Bill sponsors
`long_title`	No	Full long title
`short_title`	Yes	Short title of the bill
`current_house`	No	House where the bill currently sits
`current_stage`	No	Current legislative stage
`originating_house`	No	House where the bill was introduced
`royal_assent_date`	No	Date Royal Assent was given
`summary_truncated`	No	True if summary was cut to fit max_summary_chars
`summary_original_length`	No	Original summary length in characters before any truncation

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already cover key behavioral traits (read-only, idempotent, non-destructive, open-world), so the bar is lower. The description adds value by specifying the return content (sponsors, stage history, etc.) and the Royal Assent condition, which provides useful context beyond annotations. No contradiction with annotations is present.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized and front-loaded: the first sentence states the core purpose, followed by specific return details and parameter guidance. Every sentence adds value with no waste, and the structure is clear and efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (1 parameter), rich annotations, and the presence of an output schema (which handles return values), the description is complete. It covers purpose, usage context, parameter details, and return content, leaving no significant gaps for the agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate. It explicitly defines the single parameter (bill_id) and its source (bills_search_bills results), adding crucial meaning not in the schema. This fully addresses the parameter semantics gap, making it highly effective.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with a specific verb ('Get') and resource ('full detail for a specific parliamentary bill'), distinguishing it from siblings like bills_search_bills (which searches) and other parliamentary tools. It explicitly lists the types of details returned, making the scope unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool: to retrieve detailed information for a specific bill, with the bill_id sourced from bills_search_bills. However, it does not explicitly state when not to use it or compare it to alternatives beyond the implied sibling relationship, missing explicit exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

bills_search_billsSearch Parliamentary BillsA

Read-onlyIdempotent

Inspect

Search UK parliamentary bills by keyword, session, house, or legislative stage.

Returns a paginated page of bill summaries including title, current stage, and whether it has become an Act. Use bills_get_bill with the bill ID for full detail.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	BillSearchInput with query, optional session/house/stage filters, pagination.

Output Schema

ParametersJSON Schema

Name	Required	Description
`bills`	No	Matching bills. Use the integer `id` field from any bill to call bills_get_bill for full detail.
`limit`	Yes	Maximum results requested in this call
`query`	Yes	The search term that was used
`total`	No	Total results matching the query across all pages, if the upstream API reported it. None if unknown.
`offset`	Yes	Number of results skipped before this page
`has_more`	Yes	True if more results exist beyond this page. Re-call with offset=offset+returned to fetch the next page.
`returned`	Yes	Number of results actually on this page

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable context beyond annotations: it specifies the return format ('JSON array of BillSummary objects') and clarifies the scope of information returned ('bill summaries' vs 'full detail'). While annotations already indicate read-only, open-world, idempotent, and non-destructive behavior, the description provides practical implementation details that enhance transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured and front-loaded: the first sentence states the core functionality, the second describes returns, the third provides usage guidance, and the final two lines document parameters and returns. Every sentence earns its place with zero wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity, rich annotations, comprehensive input schema with excellent property descriptions, and explicit output schema reference, the description provides complete context. It covers purpose, usage guidelines, return format, and relationship to sibling tools, making it fully adequate for agent understanding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description carries the full burden of explaining parameters. It mentions the main parameter ('params (BillSearchInput)') and lists the search criteria ('keyword, session, house, or legislative stage'), but doesn't provide detailed semantics for each field. The schema itself has excellent descriptions for all properties, so the description adds only moderate value beyond what's already in the structured schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Search UK parliamentary bills') and resources ('by keyword, session, house, or legislative stage'), distinguishing it from sibling tools like 'bills_get_bill' which provides full detail. It explicitly mentions what it returns ('bill summaries including title, current stage, and whether it has become an Act'), making the purpose unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool versus alternatives: 'Use bills_get_bill with the bill ID for full detail including sponsors and stage history.' This clearly distinguishes it from its sibling tool and helps the agent understand the scope and limitations of this search function.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

case_law_grep_judgmentSearch within a UK Court JudgmentA

Read-onlyIdempotent

Inspect

Find paragraphs in a single judgment whose text matches a pattern.

Returns a list of {eId, snippet, match} hits — small per-paragraph snippets centred on the match — so the LLM can decide which full paragraphs to read via judgment://{slug}/para/{eId}.

Use this when answering content-based questions ("what did the judges say about negligence?", "find the test for foreseeability", "did this case cite Donoghue?") rather than navigating by paragraph number (which uses the index resource).

Pattern is regex; if it doesn't compile, falls back to literal substring search.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	Input schema for case_law_grep_judgment.

Output Schema

ParametersJSON Schema

Name	Required	Description
`hits`	Yes	Matching paragraphs in document order
`slug`	Yes	The judgment slug that was searched
`pattern`	Yes	The pattern that was searched for
`truncated`	Yes	True if hit count reached max_hits and more matches may exist

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond annotations: it explains the fallback behavior (regex to literal substring), describes the return format ('list of {eId, snippet, match} hits'), and provides guidance on how to use results ('so the LLM can decide which full paragraphs to read'). While annotations cover safety (readOnly, non-destructive), the description adds practical implementation details.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured with zero wasted sentences. It opens with the core purpose, immediately describes the return format and how to use results, provides clear usage guidelines with examples, and ends with important behavioral details about pattern matching. Every sentence serves a distinct purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (regex search with fallback), rich annotations (readOnly, openWorld, idempotent), comprehensive schema coverage (100%), and presence of an output schema, the description provides complete context. It explains the search behavior, result format, usage scenarios, and relationship to other resources without needing to duplicate structured information.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema description coverage, the baseline is 3. The description adds minimal parameter semantics beyond the schema - it mentions the regex/literal fallback behavior (which is also in the schema) and implies the slug identifies a 'single judgment,' but doesn't provide additional parameter meaning or usage context beyond what's already documented in the schema properties.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Find paragraphs in a single judgment whose text matches a pattern') and distinguishes it from sibling tools by contrasting with 'index resource' navigation and listing specific use cases like content-based questions. It explicitly names the resource (judgment paragraphs) and verb (search/find).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool ('when answering content-based questions') and when not to use it ('rather than navigating by paragraph number'). It gives concrete examples of appropriate questions and mentions the alternative approach (index resource).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

case_law_searchSearch UK Case LawA

Read-onlyIdempotent

Inspect

Search UK case law via the TNA Find Case Law API.

Returns paginated judgment summaries: neutral citations, court, dates, stable URIs. Use the judgment://{slug}/header resource to inspect a result, then judgment://{slug}/index to discover paragraphs and judgment://{slug}/para/{eId} to read individual paragraphs. For content-based discovery within a judgment, use case_law_grep_judgment.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CaseLawSearchInput with query, optional filters (court, judge, party, from_date, to_date), and page number.

Output Schema

ParametersJSON Schema

Name	Required	Description
`page`	Yes	Current page number (1-indexed)
`results`	Yes	Matching judgments for this page
`has_more`	Yes	Whether additional pages exist
`total_pages`	No	Total page count if available from API

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true. The description adds valuable behavioral context beyond annotations: it discloses pagination behavior ('paginated judgment summaries'), clarifies the relationship with sibling tool for full text retrieval, and describes the return format structure. However, it doesn't mention rate limits, authentication needs, or error conditions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured and concise: first sentence states purpose, second describes returns, third provides usage guidance, then clearly separated Args and Returns sections. Every sentence earns its place with zero wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has comprehensive annotations (readOnlyHint, openWorldHint, idempotentHint, destructiveHint), an output schema (Returns section details JSON structure), and clear sibling differentiation, the description provides complete context. It covers purpose, usage guidance, parameter overview, and return format without needing to repeat what's already in structured fields.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description carries full burden for parameter documentation. The description lists the parameters (query, optional filters: court, judge, party, from_date, to_date, page number) and provides some semantic context about what they filter. However, it doesn't explain parameter formats, constraints, or provide examples beyond what's implied. The schema itself has good descriptions, but with 0% coverage in the context signals, the baseline expectation is higher.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Search UK case law via the TNA Find Case Law API'), resource ('UK case law'), and scope ('returns paginated judgment summaries'). It distinguishes from sibling tool 'case_law_get_judgment' by explaining this tool returns summaries while that one fetches full text.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly provides when-to-use guidance: 'Use case_law_get_judgment with the returned uri to fetch full text.' This clearly indicates this tool is for search/summaries while the sibling is for detailed content, providing clear alternative usage.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

citations_networkGet Case Citation NetworkA

Read-onlyIdempotent

Inspect

Map all citations within a judgment — cases cited, legislation referenced, SIs, EU law.

Fetches the judgment XML from TNA and parses all OSCOLA citations within it. Returns citations grouped by type for easy analysis. Each bucket is de-duplicated and sorted.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CitationsNetworkInput with case_uri (TNA slug, e.g. 'uksc/2024/12').

Output Schema

ParametersJSON Schema

Name	Required	Description
`eu_refs`	No	Retained EU law references, e.g. 'Regulation (EU) 2016/679'
`si_refs`	No	Statutory Instrument references, e.g. 'SI 2018/1234'
`case_uri`	Yes	The judgment URI that was fetched and parsed
`law_report_refs`	No	Law report citations, e.g. '[2020] 1 WLR 100'
`total_citations`	Yes	Sum of all de-duplicated citations across every bucket
`legislation_refs`	No	Legislation section references, e.g. 's.47 Companies Act 2006'
`neutral_citations`	No	Neutral citations referenced, e.g. '[2020] UKSC 14'

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and behavior. The description adds valuable context beyond annotations by specifying that it fetches XML from TNA, parses OSCOLA citations, and groups results by type, which helps the agent understand the tool's operational flow and output structure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized and front-loaded, starting with a high-level purpose, followed by implementation details, and ending with return values. Every sentence earns its place by conveying essential information without redundancy, such as specifying the data source (TNA), parsing method (OSCOLA), and output format (JSON with grouped citations).

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (parsing citations from XML) and the presence of an output schema (implied by the Returns section detailing JSON structure), the description is complete enough. It covers purpose, usage, parameters, and return values, while annotations handle behavioral traits. No gaps remain for effective agent use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description carries full burden. It effectively compensates by clearly explaining the 'case_uri' parameter in the Args section, providing examples ('uksc/2024/12'), usage instructions ('Use the 'uri' field from case_law_search results'), and formatting rules ('Do not include the full URL prefix'). This adds significant meaning beyond the basic schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('Map all citations', 'Fetches the judgment XML', 'parses all OSCOLA citations') and resources ('within a judgment', 'cases cited, legislation referenced, SIs, EU law'). It distinguishes from sibling tools like 'citations_parse' and 'citations_resolve' by focusing on network mapping rather than parsing or resolving individual citations.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool ('Map all citations within a judgment') and implies usage by mentioning it fetches from TNA and parses OSCOLA citations. However, it does not explicitly state when not to use it or name specific alternatives among siblings, such as 'case_law_get_judgment' for raw judgment data or 'citations_parse' for individual citation parsing.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

citations_parseParse OSCOLA CitationsA

Read-only

Inspect

Extract and classify all OSCOLA legal citations from free text.

Identifies: neutral citations ([2024] UKSC 12), law reports ([2024] 1 WLR 100), legislation sections (s.47 Companies Act 2006), SIs (SI 2018/1234), and retained EU law (Regulation (EU) 2016/679).

Ambiguous citations (e.g. bare [2024] EWHC without division) are optionally disambiguated via LLM sampling. Resolves citations to TNA / legislation.gov.uk URLs.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CitationsParseInput with text (max 50,000 chars) and disambiguate flag.

Output Schema

ParametersJSON Schema

Name	Required	Description
`ambiguous`	Yes	Citations with confidence < 0.7; may have been partially disambiguated via sampling
`citations`	Yes	All successfully parsed citations (confidence >= 0.7)
`text_length`	Yes	Character length of the input text
`parse_duration_ms`	Yes	Time taken to parse, in milliseconds

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and destructiveHint=false, indicating a safe read operation. The description adds valuable behavioral context: it explains the optional LLM disambiguation for ambiguous citations, mentions resolution to URLs (TNA/legislation.gov.uk), and describes the JSON return structure. This goes beyond what annotations provide without contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured: it starts with the core purpose, lists supported citation types with examples, explains optional disambiguation, mentions URL resolution, and clearly documents parameters and return value. Every sentence adds essential information with no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (parsing multiple citation types with optional disambiguation), the description is complete: it covers purpose, supported formats, behavioral details, parameters, and return format. With an output schema present, it appropriately omits detailed return value explanations, focusing on high-level JSON structure.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description carries full burden. It effectively explains both parameters: 'text' is described with examples of supported citation types and a length constraint, and 'disambiguate' is clearly defined with its boolean role in LLM sampling. This compensates well for the lack of schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('extract and classify') and resource ('OSCOLA legal citations from free text'), with detailed examples of citation types. It distinguishes itself from sibling tools like citations_resolve or citations_network by focusing on parsing rather than resolution or network analysis.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage context by listing supported citation types, but does not explicitly state when to use this tool versus alternatives like citations_resolve or legislation_search. No guidance on prerequisites or exclusions is provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

citations_resolveResolve Single OSCOLA CitationA

Read-onlyIdempotent

Inspect

Parse and resolve a single OSCOLA citation to its canonical URL.

Supports: neutral citations, SIs, legislation sections, retained EU law. Returns parsed fields and resolved_url if resolvable. Raises ValueError if no recognised citation is found in the input.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CitationsResolveInput with a single citation string.

Output Schema

ParametersJSON Schema

Name	Required	Description
`raw`	Yes	Original citation text as found in the source
`page`	No	Starting page in the law report
`type`	Yes	Classification of the citation type
`year`	No	Year component of the citation
`court`	No	Court code: UKSC, UKPC, EWCA Civ, EWCA Crim, EWHC (KB), EWHC (Ch), EWHC (Comm), EWHC (Fam), EWHC (Pat), EWHC (IPEC), UKUT (IAC), UKUT (TCC), UKUT (AAC), UKUT (LC), EAT, UKFTT (TC), UKFTT (GRC)
`number`	No	Judgment number within the year
`volume`	No	Report volume number (for law reports)
`section`	No	Section number referenced
`si_year`	No	SI year (for SI YYYY/NNN citations)
`si_number`	No	SI number
`confidence`	Yes	Parse confidence 0.0–1.0. Citations below 0.7 are ambiguous and may have been sent for LLM disambiguation.
`resolved_url`	No	TNA Find Case Law or legislation.gov.uk URL if successfully resolved
`report_series`	No	Law report series abbreviation: WLR, AC, QB, KB, Ch, All ER, EWCA Civ, etc.
`legislation_title`	No	Title of legislation (for s.NN Act YYYY citations)

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond annotations: it specifies that the tool returns a resolved_url 'if resolvable' and may return an error otherwise, clarifying the conditional nature of the response. Annotations already indicate it's read-only, non-destructive, and idempotent, but the description enhances this by detailing the tool's scope (single citation) and output behavior, though it doesn't mention rate limits or auth needs.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose in the first sentence, followed by supporting details (scope, return behavior) and a clear Args/Returns section. Every sentence earns its place by adding necessary information without redundancy, making it efficient and well-structured for quick comprehension.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (parsing and resolving citations), the description is complete: it covers purpose, usage, parameter semantics, and output behavior. With annotations providing safety hints and an output schema likely detailing ParsedCitation structure, the description doesn't need to explain return values further, making it adequately comprehensive for the context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage (the schema only defines the parameter structure without describing the citation input), the description fully compensates by explaining the parameter's purpose: 'A single OSCOLA citation to parse and resolve', with examples like '[2024] UKSC 12'. This adds essential meaning beyond the bare schema, ensuring the agent understands what to provide.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verbs ('parse and resolve') and resource ('a single OSCOLA citation'), distinguishing it from sibling tools like citations_parse (which likely only parses) and citations_network (which likely handles multiple citations). It explicitly mentions the target formats (neutral citations, SIs, legislation sections, retained EU law) and the output (canonical URL), making the purpose unambiguous and distinct.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool: for resolving a single OSCOLA citation to a URL, with examples of supported citation types. However, it does not explicitly state when not to use it or name alternatives (e.g., citations_parse for parsing without resolution, or other tools for non-OSCOLA citations), which prevents a perfect score.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

committees_get_committeeGet Committee DetailA

Read-onlyIdempotent

Inspect

Get detail for a parliamentary committee including current membership.

Fetches committee metadata and member list in parallel.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CommitteeDetailInput with committee_id from committees_search_committees.

Output Schema

ParametersJSON Schema

Name	Required	Description
`id`	Yes	Committee ID
`url`	No	Parliament URL for this committee
`name`	Yes	Committee name
`email`	No	Contact email
`house`	No	Commons, Lords, or Joint
`phone`	No	Contact phone number
`members`	No	Current committee members

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond annotations: it mentions that metadata and member list are fetched 'in parallel,' which is not covered by annotations like readOnlyHint or idempotentHint. This enhances understanding of performance or implementation details. No contradiction with annotations exists.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized and front-loaded: the first sentence states the core purpose, followed by implementation details and parameter/return info. Every sentence adds value without redundancy, making it efficient and well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (1 parameter, nested objects, output schema), the description is complete enough. It covers purpose, usage context, behavioral traits, and parameter semantics. With an output schema present, it need not explain return values in detail, and annotations provide safety and idempotency info.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description compensates by explaining the parameter semantics: it clarifies that committee_id comes from committees_search_committees results, adding meaning beyond the bare schema. However, it does not detail the structure or format of the input object (CommitteeDetailInput), leaving some gaps.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Get detail for a parliamentary committee including current membership.' It specifies the verb ('Get'), resource ('parliamentary committee'), and scope ('including current membership'), and distinguishes it from sibling tools like committees_search_committees by focusing on detailed retrieval rather than searching.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for usage: it references committees_search_committees as the source for committee_id, indicating when to use this tool (after obtaining an ID from search). However, it does not explicitly state when not to use it or name alternatives beyond the implied search tool, keeping it from a perfect score.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

committees_search_committeesSearch Parliamentary CommitteesA

Read-onlyIdempotent

Inspect

Search or list UK parliamentary select committees.

Returns committee names, house, and active status. Use committees_get_committee with the committee ID for membership detail.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CommitteeSearchInput with optional query, house, active_only, limit.

Output Schema

ParametersJSON Schema

Name	Required	Description
`house`	No	House filter applied, or None
`query`	No	Name substring filter applied, or None
`total`	Yes	Number of committees returned in this call
`committees`	No	Matching committees. Use committees_get_committee for membership detail.
`active_only`	Yes	Whether results were restricted to currently active committees

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond what annotations provide. While annotations cover read-only, open-world, idempotent, and non-destructive characteristics, the description adds that filtering is 'client-side against committee names' and clarifies the return format ('JSON array of CommitteeSummary objects'). This provides useful implementation details not captured in annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured and concise with zero wasted words. It opens with the core purpose, immediately provides key behavioral information (what's returned), gives usage guidance, then documents parameters and return format. Every sentence serves a distinct purpose and contributes to understanding.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity, comprehensive annotations, and the presence of an output schema, the description provides complete contextual information. It covers purpose, usage guidelines, behavioral details, and parameter overview while leveraging the structured fields for technical specifications. The combination of description and structured data leaves no gaps for agent understanding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description carries the full burden of parameter documentation. It mentions 'optional query, house, active_only filters' which covers the three parameters, but provides minimal semantic detail about what each filter does. The schema itself has good descriptions for each parameter, but the description text adds only basic identification without deeper meaning.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Search or list'), resource ('UK parliamentary select committees'), and scope ('committee names, house, and active status'). It explicitly distinguishes from the sibling tool 'committees_get_committee' by indicating that tool should be used for membership detail, establishing clear differentiation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool versus alternatives: 'Use committees_get_committee with the committee ID for membership detail.' This directly addresses sibling tool differentiation and gives clear context for when to choose each tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

committees_search_evidenceSearch Committee EvidenceA

Read-onlyIdempotent

Inspect

Search oral and written evidence submitted to a parliamentary committee.

Returns ONE PAGE of evidence (default 20). Free-text titles are capped per max_title_chars; witness lists are capped at 10 per item. For committees with many submissions, re-call with offset=offset+returned while has_more is true.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	CommitteeEvidenceInput with committee_id, evidence_type, offset/limit pagination, and max_title_chars.

Output Schema

ParametersJSON Schema

Name	Required	Description
`limit`	Yes	Max evidence items requested for this page
`offset`	Yes	Number of evidence items skipped before this page
`evidence`	No	Evidence items in this page. Titles are capped per max_title_chars; witness lists are capped at 10 per item.
`has_more`	Yes	True if there may be more evidence beyond this page. Re-call with offset=offset+returned to fetch the next page. Conservative: when evidence_type='both', True if either oral or written upstream page came back full.
`returned`	Yes	Number of evidence items actually returned in this call
`committee_id`	Yes	Committee ID this page belongs to
`evidence_type`	Yes	Evidence type filter applied to this query

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true. The description adds useful context about what types of evidence are searched (oral/written/both) and what data is returned (titles, dates, witness names), which goes beyond the annotations. However, it doesn't mention rate limits, authentication needs, or pagination behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured: first sentence states purpose, second describes returns, third specifies parameters, fourth specifies return format. Every sentence adds value with zero waste, and it's appropriately sized for the tool's complexity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the annotations cover safety/behavioral aspects (read-only, non-destructive, idempotent), the input schema fully defines parameters, and there's an output schema (implied by 'Returns: str: JSON array of EvidenceItem objects'), the description provides complete context. It explains what the tool does, what it returns, and parameter basics without needing to duplicate structured data.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, but the description provides a high-level overview of parameters ('committee_id and evidence_type filter') and mentions the committee_id source. However, it doesn't explain parameter semantics in detail beyond what's implied. With 0% schema coverage, the description partially compensates but leaves gaps about specific parameter behaviors.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Search oral and written evidence'), the resource ('submitted to a parliamentary committee'), and the return data ('evidence titles, dates, and witness names'). It distinguishes itself from sibling tools like 'committees_search_committees' by focusing on evidence rather than committees themselves.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage when searching for committee evidence, but provides no explicit guidance on when to use this tool versus alternatives like 'committees_get_committee' or 'committees_search_committees'. It mentions the committee_id comes from 'committees_search_committees results', which is helpful but doesn't constitute full usage guidelines.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_promptAInspect

Get a prompt by name with optional arguments.

Returns the rendered prompt as JSON with a messages array. Arguments should be provided as a dict mapping argument names to values.

ParametersJSON Schema

Name	Required	Description	Default
`name`	Yes	The name of the prompt to get
`arguments`	No	Optional arguments for the prompt

Output Schema

ParametersJSON Schema

Name	Required	Description
`result`	Yes

Tool Definition Quality

A3.5/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries full burden. It describes the return format and that arguments are a dict, which adds some behavioral context. However, it does not disclose error handling, idempotency, or whether the operation is safe (likely read-only).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three concise sentences with no wasted words. Information is front-loaded: verb, resource, optional arguments, return format, and argument format are all covered efficiently.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the existence of an output schema, the description appropriately explains the return structure. It could mention error behavior or suggest using list_prompts to find prompt names, but overall covers the essential context for a simple retrieval tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds value by specifying that arguments should be provided as a dict mapping names to values, which clarifies the structure beyond the schema's vague 'Optional arguments'.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it gets a prompt by name with optional arguments, and specifies the return format as JSON with a messages array. It distinguishes itself from list_prompts by focusing on a single prompt retrieval, though it does not explicitly mention the sibling.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance on when to use this tool vs alternatives like list_prompts. No mention of prerequisites or context for calling, leaving the agent to infer usage from the tool name alone.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

hmrc_check_mtd_statusCheck MTD VAT StatusA

Read-onlyIdempotent

Inspect

Check a business's Making Tax Digital VAT mandate status via the HMRC API.

NOTE: Connects to the HMRC sandbox by default. Set HMRC_API_BASE env var to 'https://api.service.hmrc.gov.uk' for production. Requires HMRC_CLIENT_ID and HMRC_CLIENT_SECRET environment variables (OAuth 2.0). Returns whether the business is mandated for MTD, effective date, and trading name.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	HMRCMTDStatusInput with the 9-digit VAT Registration Number.

Output Schema

ParametersJSON Schema

Name	Required	Description
`vrn`	Yes	VAT Registration Number queried
`mandated`	Yes	Whether this business is mandated for MTD VAT
`trading_name`	No	Registered trading name if available
`effective_date`	No	Date from which MTD obligation applies

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond what annotations provide: it specifies the default sandbox environment, how to switch to production, authentication requirements (OAuth 2.0 with specific env vars), and what data is returned. While annotations cover read-only, open-world, idempotent, and non-destructive properties, the description adds practical implementation details that help the agent use the tool correctly.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured with a clear purpose statement upfront, followed by important notes about environment and authentication, then parameter and return value documentation. Every sentence serves a distinct purpose with no redundancy, and the information is well-organized for quick comprehension.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (API integration with authentication), the description provides complete context: purpose, environment configuration, authentication requirements, parameter details, and return value information. With annotations covering safety properties and an output schema presumably documenting the JSON structure, the description fills all necessary gaps without duplicating structured data.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage for the single parameter, the description fully compensates by clearly explaining the 'vrn' parameter in the Args section: it specifies it's a 9-digit VAT Registration Number, mentions GB prefix handling, and provides an example. This adds crucial meaning beyond the bare schema that only defines string constraints.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Check'), resource ('business's Making Tax Digital VAT mandate status'), and method ('via the HMRC API'). It distinguishes this tool from sibling tools like 'hmrc_get_vat_rate' and 'hmrc_search_guidance' by focusing on MTD status verification rather than VAT rates or guidance search.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context about when to use this tool (to check MTD VAT status for a business) and includes important prerequisites (environment variables for OAuth 2.0). However, it doesn't explicitly state when NOT to use it or name specific alternatives among the sibling tools, though the purpose clearly differentiates it from other HMRC tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

hmrc_get_vat_rateGet VAT Rate for CommodityA

Read-onlyIdempotent

Inspect

Look up the UK VAT rate for a commodity or service type.

Returns the rate category (standard 20%, reduced 5%, zero 0%, exempt), effective date, and any relevant conditions or exceptions. Uses a static lookup table current as of 22 Nov 2023 (Autumn Statement). Rates may have changed — always verify against GOV.UK for recent Budgets.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	HMRCVATRateInput with the commodity_code (description of goods or service).

Output Schema

ParametersJSON Schema

Name	Required	Description
`rate`	Yes	VAT rate category
`notes`	No	Any additional notes or conditions on this rate
`commodity_code`	Yes	Commodity code or description queried
`effective_from`	Yes	Date from which this rate applies
`rate_percentage`	Yes	Applicable rate as percentage: 20.0 (standard), 5.0 (reduced), 0.0 (zero/exempt)

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond what annotations provide. While annotations already indicate read-only, non-destructive, and idempotent operations, the description reveals that it 'Uses a static lookup table current as of 22 Nov 2023' and has data currency limitations. This disclosure about the static nature and potential staleness of data is crucial context not captured in annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and appropriately sized. It front-loads the core purpose, follows with return details, then important behavioral context about data currency, and finally includes parameter and return value documentation. Every sentence adds value with zero waste or redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (single parameter, static lookup), the description provides complete context. It explains what the tool does, what it returns, important limitations about data currency, and parameter semantics. With annotations covering safety aspects and an output schema presumably documenting the return structure, the description adds all necessary contextual information.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage for the single parameter, the description must compensate. It provides meaningful context by explaining that 'commodity_code' can be either 'Commodity code or plain-English description' and gives examples like 'food', 'domestic fuel', 'software'. This significantly enhances understanding beyond the bare schema, though it doesn't fully document all possible values or constraints.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose with specific verb ('Look up') and resource ('UK VAT rate for a commodity or service type'). It distinguishes itself from sibling tools by focusing on VAT rate lookup, while siblings like 'hmrc_check_mtd_status' or 'hmrc_search_guidance' serve different HMRC-related functions.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool (looking up VAT rates) and includes an important caveat about data currency ('Rates may have changed — always verify against GOV.UK for recent Budgets'). However, it doesn't explicitly state when NOT to use this tool or name specific alternatives among the sibling tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

hmrc_search_guidanceSearch HMRC GuidanceA

Read-onlyIdempotent

Inspect

Search GOV.UK for HMRC tax guidance documents.

Returns matching guidance titles, URLs, summaries, and last-updated dates. Searches the official GOV.UK content API filtered to HMRC publications.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	HMRCGuidanceSearchInput with query (topic to search).

Output Schema

ParametersJSON Schema

Name	Required	Description
`query`	Yes	The search query that was run
`total`	Yes	Number of guidance documents returned in this call
`results`	No	Matching HMRC guidance pages. Each entry's `summary` is capped per the max_summary_chars input parameter.

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and idempotency. The description adds valuable context beyond annotations by specifying it searches 'the official GOV.UK content API filtered to HMRC publications' and describes the return format (titles, URLs, summaries, dates), which helps the agent understand the tool's behavior and output structure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured and front-loaded: the first sentence states the core purpose, followed by return details, search scope, and parameter/return documentation. Every sentence earns its place with no redundant information, making it efficient and easy to parse.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (search operation with one parameter), rich annotations covering safety and behavior, and the presence of an output schema (described in the Returns section), the description is complete. It explains what the tool does, what it returns, where it searches, and provides parameter examples, leaving no significant gaps for agent understanding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, but the description provides the 'Args' section that explains the single parameter 'query' as 'topic to search' and gives examples ('VAT digital services', 'R&D tax relief SME'). This adds meaningful semantic context beyond the schema's technical constraints (string with length limits), though it doesn't fully compensate for the lack of schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Search GOV.UK for HMRC tax guidance documents') and resource ('HMRC tax guidance documents'), distinguishing it from sibling tools like 'hmrc_check_mtd_status' or 'hmrc_get_vat_rate' which perform different HMRC-related functions. It precisely defines the scope as searching official GOV.UK content filtered to HMRC publications.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage context by specifying it searches for 'HMRC tax guidance documents' on GOV.UK, suggesting it should be used when looking for official tax guidance rather than other HMRC data. However, it doesn't explicitly state when to use this vs. alternatives like 'legislation_search' or provide clear exclusions for when other tools might be more appropriate.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

judgment_get_headerGet Judgment HeaderA

Read-onlyIdempotent

Inspect

Get metadata for a UK court judgment: parties, judges, neutral citation, court, dates.

Use case_law_search to find the slug, then call this for orientation before reading specific paragraphs via judgment_get_paragraph.

ParametersJSON Schema

Name	Required	Description	Default
`slug`	Yes	Judgment slug, e.g. 'uksc/2024/12' or 'ewca/civ/2023/450'

Output Schema

ParametersJSON Schema

Name	Required	Description
No output parameters

Tool Definition Quality

A4.3/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true, so the description doesn't need to reiterate safety. The description adds context about what metadata is returned but does not disclose any further behavioral traits beyond what annotations provide.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences: first states purpose and fields, second gives usage flow. No fluff, front-loaded with key action.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a single-parameter read-only tool with annotations and output schema, the description sufficiently covers what it does, how to use it, and its role in the workflow. No gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% coverage with a description for slug including example format. The description does not add additional parameter meaning beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states it retrieves metadata for UK court judgments, listing key fields (parties, judges, neutral citation, court, dates). It distinguishes itself from sibling tools like case_law_search (for finding the slug) and judgment_get_paragraph (for reading paragraphs).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Clear usage guidance: use case_law_search first to find the slug, then this tool for orientation, then judgment_get_paragraph for specific paragraphs. This implicitly tells when not to use it (not for search or paragraph reading).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

judgment_get_indexGet Judgment Paragraph IndexA

Read-onlyIdempotent

Inspect

Get the paragraph navigation index for a UK court judgment.

Returns eId: first_line pairs for every paragraph. Use this to discover paragraph identifiers, then call judgment_get_paragraph to read specific ones.

ParametersJSON Schema

Name	Required	Description	Default
`slug`	Yes	Judgment slug, e.g. 'uksc/2024/12'

Output Schema

ParametersJSON Schema

Name	Required	Description
No output parameters

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint and idempotentHint. Description adds that it returns eId:first_line pairs for navigation, providing behavioral context beyond annotations without contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, no wasted words. Every sentence earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tool with one required parameter and an output schema, the description fully explains what it does, what it returns, and how to use it. No gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has 100% coverage for the single parameter 'slug' with description. Description adds an example format ('e.g. 'uksc/2024/12'), adding value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the verb 'Get' and resource 'paragraph navigation index' for 'UK court judgment', distinguishing it from sibling tools like 'judgment_get_paragraph' which reads paragraphs.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says to use this to discover paragraph identifiers and then call 'judgment_get_paragraph' to read specific ones, providing clear when-to-use and alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

judgment_get_paragraphGet Judgment ParagraphA

Read-onlyIdempotent

Inspect

Get a single paragraph from a UK court judgment by its LegalDocML eId.

Use judgment_get_index first to discover available eIds. Returns the paragraph XML content (400–1,700 tokens typical).

ParametersJSON Schema

Name	Required	Description	Default
`eId`	Yes	Paragraph eId from judgment_get_index, e.g. 'para_12'. Numeric strings like '12' are accepted and normalized to 'para_12'.
`slug`	Yes	Judgment slug, e.g. 'uksc/2024/12'

Output Schema

ParametersJSON Schema

Name	Required	Description
No output parameters

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint. The description adds further context about return format (XML content) and typical size (400–1,700 tokens), which is useful.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three concise sentences, each adding essential information: purpose, usage guidance, and output characteristics, with no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given two well-documented parameters, existing output schema (implied), and annotations, the description is complete for an AI agent to correctly select and invoke the tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% coverage with clear descriptions for both parameters. The description does not add significant additional meaning beyond the schema, so baseline of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool gets a single paragraph from a UK court judgment by its LegalDocML eId, distinguishing it from siblings like judgment_get_index and judgment_get_header.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It explicitly instructs to use judgment_get_index first to discover eIds, providing clear guidance on prerequisite and context.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

legislation_get_sectionGet Legislation SectionA

Read-onlyIdempotent

Inspect

Retrieve a specific section of a UK Act or Statutory Instrument.

Returns the full section text, territorial extent, in-force status, and prospective flag. Content is capped per max_chars (default 10,000, ~2,500 tokens) — raise max_chars for unusually long definition sections. Check content_truncated in the response to see if it was cut.

IMPORTANT: Always check extent — a section may apply to England & Wales but not Scotland or Northern Ireland.

Alternative: read the resource template legislation://{type}/{year}/{number}/section/{section} to get raw CLML XML directly. Use this tool when you want the parsed structured response (extent, in-force, version_date) instead of raw XML.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	type, year, number, section identifier, optional max_chars.

Output Schema

ParametersJSON Schema

Name	Required	Description
`title`	Yes	Section title or heading
`extent`	No	Territorial extent: list of 'England', 'Wales', 'Scotland', 'Northern Ireland'. Empty list means unknown — do not assume full UK extent.
`content`	Yes	Plain text content of the section, possibly truncated per max_chars. Check content_truncated and original_length for full-text information.
`in_force`	No	Whether this section is currently in force; None if unknown
`warnings`	No	Non-fatal retrieval or parsing warnings the caller should disclose where relevant.
`prospective`	No	True if this section has not yet come into force; None if unknown
`version_date`	No	Date of the version retrieved
`source_format`	No	Source parsed for this response. html_fallback means CLML XML was unavailable and text was parsed from the public HTML page.
`section_number`	Yes	Section number, e.g. '47', '12A', 'Schedule 2'
`original_length`	No	Original plain-text length in characters before any truncation
`content_truncated`	No	True if content was cut to fit max_chars

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond annotations: it warns about territorial extent limitations ('may apply to England & Wales but not Scotland or Northern Ireland') and clarifies what information is returned. While annotations cover safety (readOnlyHint, idempotentHint), the description provides domain-specific behavioral insights that annotations don't capture.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured: purpose statement first, key return values listed, important usage warning highlighted, and parameter/return format clearly specified. Every sentence adds value with zero redundancy, making it easy for an agent to parse quickly.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (legislation retrieval with territorial considerations), the description provides complete context: purpose, return values, important caveats, parameter structure, and return format. With annotations covering safety and an output schema presumably detailing the JSON structure, this description gives the agent everything needed to use the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description carries full burden for parameter semantics. It provides the structure (params object with type, year, number, section) and clarifies the return format. While it doesn't detail each parameter's purpose, it gives enough context for the agent to understand what's needed, compensating reasonably for the schema gap.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Retrieve a specific section') and resource ('UK Act or Statutory Instrument'), distinguishing it from sibling tools like legislation_search or legislation_get_toc. It precisely defines what the tool does beyond just restating the name.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context about when to use this tool ('Retrieve a specific section') and includes an important usage note about checking territorial extent. However, it doesn't explicitly contrast with alternatives like legislation_search or legislation_get_toc, which would be needed for a perfect score.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

legislation_get_tocGet Legislation Table of ContentsA

Read-onlyIdempotent

Inspect

Retrieve the table of contents for a UK Act or SI.

Returns structural elements (parts, chapters, sections, schedules) with XML id and title, e.g. 'section-47: Definitions'. When calling legislation_get_section, pass only the numeric part — use '47', not 'section-47'.

Large statutes (Companies Act 2006 has 1300+ items) are paginated via offset/limit. Check has_more and total_items on the response.

Alternative: read the resource template legislation://{type}/{year}/{number}/toc for the full TOC as a newline-separated id: title string (no pagination). Use this tool when you need the structured LegislationTOC response with offset/limit/has_more for stepping through Companies-Act-scale lists.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	LegislationGetTocInput with type, year, number, offset, limit.

Output Schema

ParametersJSON Schema

Name	Required	Description
`type`	Yes	Legislation type code echoed from the request
`year`	Yes	Year of enactment echoed from the request
`items`	No	TOC entries in XML document order, formatted as '<id>: <title>', e.g. 'section-47: Definitions'. When calling legislation_get_section pass only the numeric part ('47', not 'section-47').
`limit`	Yes	Page size applied after offset
`number`	Yes	Chapter or SI number echoed from the request
`offset`	Yes	Offset applied to the full TOC item list
`has_more`	Yes	True if more items remain beyond offset+returned
`returned`	Yes	Number of items in this response
`total_items`	Yes	Total number of structural items parsed from the XML, before offset/limit. Use this to know the full size of the TOC.

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, idempotentHint=true, and openWorldHint=true, covering safety and idempotency. The description adds valuable context about the return format ('JSON array of strings in format section-N: Title text') and the relationship with legislation_get_section, which enhances behavioral understanding beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is efficiently structured with clear sections: purpose, return format, usage note for legislation_get_section, parameters, and return type. Every sentence adds value without redundancy, and key information is front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity, rich annotations, and the presence of an output schema, the description provides complete context. It covers purpose, usage guidelines, parameter semantics, and output format, making it fully adequate for an AI agent to understand and invoke the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, but the description compensates by explaining the params object structure (type, year, number) and providing examples of type values ('ukpga', 'uksi', etc.). However, it doesn't fully document all parameter details like year range or number constraints, leaving some gaps.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Retrieve the table of contents'), resource ('for a UK Act or SI'), and output format ('structural elements with XML id and title'). It distinguishes from sibling tools like legislation_get_section by focusing on the table of contents rather than individual sections.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool vs alternatives: it specifies to use legislation_get_section for retrieving specific sections, and clarifies the parameter format difference ('use 47, not section-47'). It also references legislation_search for obtaining the type parameter value.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

legislation_searchSearch UK LegislationA

Read-onlyIdempotent

Inspect

Search UK legislation on legislation.gov.uk.

Returns ranked results: title, type, year, number, and legislation.gov.uk URL. Use legislation_get_toc to explore structure, then legislation_get_section for provisions.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	LegislationSearchInput with query, optional type filter, optional year.

Output Schema

ParametersJSON Schema

Name	Required	Description
`total`	Yes	Total number of matches
`results`	Yes	Matching legislation items

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and idempotency. The description adds valuable context beyond annotations: it specifies the source (legislation.gov.uk), describes the ranking of results, and outlines the return format (JSON with specific fields and total count).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and front-loaded: first sentence states purpose, second describes returns, third gives usage guidelines, and fourth outlines parameters and return format. Every sentence adds value with zero waste.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity, rich annotations (readOnlyHint, openWorldHint, idempotentHint), and the presence of an output schema (implied by 'Returns: str: JSON...'), the description is complete. It covers purpose, usage, parameters, and return format adequately without needing to duplicate structured data.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate. It provides the 'params' object with query, optional type filter, and optional year, which matches the input schema. While it doesn't detail enum values for 'type' or constraints for 'year' and 'query', it gives enough semantic context for basic usage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool searches UK legislation on legislation.gov.uk, specifying the exact resource (UK legislation) and action (search). It distinguishes from siblings like 'legislation_get_toc' and 'legislation_get_section' by explaining that those are for exploring structure and provisions after searching.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool vs alternatives: 'Use legislation_get_toc to explore structure, then legislation_get_section for provisions.' This clearly differentiates it from sibling tools and establishes a workflow.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_promptsAInspect

List all available prompts.

Returns JSON with prompt metadata including name, description, and optional arguments.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Output Schema

ParametersJSON Schema

Name	Required	Description
`result`	Yes

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries full burden. It discloses the return format (JSON with prompt metadata) and is a read-only operation, so no side effects expected. No contradiction with annotations (none provided).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences with no wasted words. Front-loaded with the key action ('List all available prompts.') and then return format.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 0 parameters and an output schema exists, the description fully covers the tool's purpose and return structure. No gaps detected.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 0 parameters, so no parameter information needed. The description adds nothing beyond the schema, but baseline for 0 params is 4.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists all available prompts, and the sibling tool 'get_prompt' contrasts as a single-retrieval tool, making the purpose distinct.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance on when to use this tool versus alternatives like 'get_prompt' or other listing tools. The description implies usage for browsing all prompts but lacks explicit context or exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parliament_find_memberFind Member of ParliamentA

Read-onlyIdempotent

Inspect

Search for a current or former MP or Lord by name.

Returns all members matching the name query, each with the integer id required by parliament_member_debates and parliament_member_interests, plus party, constituency, house, and current-sitting status.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	FindMemberInput with the name (full or partial).

Output Schema

ParametersJSON Schema

Name	Required	Description
`query`	Yes	The name that was searched
`total`	Yes	Number of members matching the query
`members`	No	Matching members. Use the integer `id` field from any member to call parliament_member_debates or parliament_member_interests.

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and idempotency. The description adds valuable context beyond this: it specifies the return format (JSON array of MemberResult objects with specific fields like id, name, party), mentions it handles both current and former members, and implies partial name matching. However, it doesn't detail rate limits, auth needs, or pagination behavior, keeping it from a perfect score.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose in the first sentence, followed by return details and usage guidelines. Every sentence earns its place: the first defines the tool, the second specifies returns, the third provides integration guidance, and the last details parameters and output. No wasted words, and it's appropriately sized for a simple lookup tool.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's low complexity (1 parameter), rich annotations (covering read-only, open-world, idempotent), and the presence of an output schema (implied by the Returns section), the description is complete enough. It explains what the tool does, when to use it, parameter semantics, and output structure, leaving no gaps for an AI agent to understand and invoke it correctly in context with siblings.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate for parameter documentation. It adds meaning by explaining that the 'name' parameter accepts 'full or partial member name' and provides examples ('Starmer', 'Baroness Hale'), which clarifies usage beyond the schema's basic type constraints. However, it doesn't cover edge cases like minimum characters or formatting, and with only one parameter, the baseline is high but not fully comprehensive.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('Search for') and resource ('current or former MP or Lord by name'), making the purpose specific. It distinguishes from siblings like parliament_member_debates by focusing on member lookup rather than debates, and from parliament_member_interests by not handling interests. The mention of 'current or former' adds nuance beyond just 'find member'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool: for searching MPs/Lords by name. It also specifies an alternative use case by stating 'Use the integer member_id with parliament_member_debates', indicating this tool is for initial lookup and another tool (parliament_member_debates) should be used for subsequent actions with the ID. This clearly differentiates it from siblings like parliament_search_hansard or votes_search_divisions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parliament_member_debatesGet Member DebatesA

Read-onlyIdempotent

Inspect

Retrieve Hansard contributions by a specific member, optionally filtered by topic.

Use parliament_find_member first to obtain the integer member ID. Each contribution's text field is capped at 3000 characters.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	MemberDebatesInput with member_id and optional topic filter.

Output Schema

ParametersJSON Schema

Name	Required	Description
`limit`	No	Page size requested
`topic`	No	Topic phrase filter applied, if any
`total`	Yes	Number of contributions returned in this call
`offset`	No	Skip applied to this page
`has_more`	No	True if a full page was returned (more may exist)
`member_id`	Yes	Parliament Members API member ID
`contributions`	No	Hansard contributions for the member. Each `text` field is capped at 3000 characters.

Tool Definition Quality

A4.5/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and idempotency. The description adds useful context: it clarifies that the topic filter is an 'exact phrase' search (implied in schema but emphasized here) and mentions the prerequisite of using parliament_find_member. However, it doesn't disclose additional behavioral traits like rate limits, pagination, or error handling beyond what annotations cover.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and concise: the first sentence states the purpose, the second provides usage guidance, and the Args/Returns sections are clearly formatted. Every sentence earns its place—no wasted words—and key information is front-loaded. The formatting with bullet-like sections enhances readability without verbosity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (1 required parameter, optional filtering), rich annotations (covering safety and idempotency), and the presence of an output schema (Returns specifies JSON array of HansardContribution objects), the description is complete enough. It covers purpose, prerequisites, parameter semantics, and return format, addressing gaps from the 0% schema coverage without redundancy.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate. It adds meaningful semantics: it explains that member_id is an 'integer member ID' obtained from parliament_find_member (prerequisite context) and that topic is an 'optional phrase to filter by topic' with examples ('housing benefit', 'net zero'). This clarifies parameter purposes beyond the bare schema, though it doesn't detail all schema constraints (e.g., minimum value for member_id).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Retrieve Hansard contributions by a specific member, optionally filtered by topic.' It specifies the verb ('retrieve'), resource ('Hansard contributions'), and scope ('by a specific member'), distinguishing it from siblings like parliament_search_hansard (general search) or parliament_member_interests (different resource).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage guidance: 'Use parliament_find_member first to obtain the integer member ID.' This clearly indicates a prerequisite and distinguishes it from alternatives like parliament_search_hansard (which likely doesn't require a member ID). It also mentions optional topic filtering, setting expectations for when to use this feature.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parliament_member_interestsGet Member Financial InterestsA

Read-onlyIdempotent

Inspect

Look up registered financial interests for a member of Parliament.

Returns ONE PAGE of interests (default 20, caller controls via limit). For prolific members (big donors, many directorships, extensive land holdings), re-call with offset=offset+returned while has_more is true to paginate. Description text is capped per max_description_chars; raise it for forensic provenance work that needs the full narrative.

Use parliament_find_member first to obtain the integer member_id.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	member_id, optional category filter, pagination (offset/limit), and max_description_chars content cap.

Output Schema

ParametersJSON Schema

Name	Required	Description
`limit`	Yes	Max interests requested for this page
`offset`	Yes	Number of interests skipped before this page
`category`	No	Category filter applied to this query, or None for all categories
`has_more`	Yes	True if there may be more interests beyond this page. Re-call with offset=offset+returned to fetch the next page.
`returned`	Yes	Number of interests actually returned in this call
`interests`	No	The interests in this page. `description` text is capped per the max_description_chars input parameter.
`member_id`	Yes	Parliament Members API member ID

Tool Definition Quality

A4.8/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and idempotency. The description adds valuable context beyond this: it specifies the data source ('registered financial interests'), lists the categories of interests returned, and mentions the JSON return format. However, it doesn't detail rate limits, authentication needs, or pagination behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose, followed by usage guidance, return details, and parameter explanations in a structured 'Args' and 'Returns' section. Every sentence adds value without redundancy, making it efficient and well-organized.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (1 required parameter, optional filter), rich annotations, and the presence of an output schema (which handles return values), the description is complete. It covers purpose, prerequisites, parameter semantics, and return format, leaving no gaps for the agent to infer critical usage details.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description fully compensates by explaining both parameters. It clarifies that member_id is an 'integer member ID' obtained from parliament_find_member, and it lists examples of category filters (e.g., 'employment', 'donations', 'land') and states that omitting it returns all categories. This adds essential meaning not present in the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Look up registered financial interests') and resource ('for a member of Parliament'), distinguishing it from sibling tools like parliament_find_member (which finds members) or parliament_member_debates (which gets debates). It explicitly lists the categories of interests returned, making the purpose highly specific.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit guidance on when to use this tool: 'Use parliament_find_member first to obtain the integer member ID.' This directly addresses the prerequisite and distinguishes it from alternatives by specifying the required input source. It also mentions the optional category filter for focused queries.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parliament_search_hansardSearch Hansard DebatesA

Read-onlyIdempotent

Inspect

Search Hansard for parliamentary debates, questions, and speeches.

Returns contributions from MPs and Lords including date, party, debate title, and text (capped at 3000 chars per contribution). Useful for understanding legislative intent or political context.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	HansardSearchInput with query, optional date range, optional member filter.

Output Schema

ParametersJSON Schema

Name	Required	Description
`limit`	No	Page size requested
`query`	Yes	The phrase that was searched in Hansard
`total`	Yes	Number of contributions returned in this call
`member`	No	Member name filter applied, if any
`offset`	No	Skip applied to this page (Hansard API: skip)
`to_date`	No	End date filter applied, if any
`has_more`	No	True if a full page was returned (more may exist; re-call with offset=offset+limit)
`from_date`	No	Start date filter applied, if any
`contributions`	No	Matching Hansard contributions. Each `text` field is capped at 3000 characters.

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond annotations: it specifies the return format (JSON array with specific fields), mentions the types of content returned (debates, questions, speeches), and provides usage context. Annotations already indicate read-only, open-world, idempotent, and non-destructive behavior, so the description appropriately supplements rather than contradicts.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and front-loaded: first sentence states purpose, second describes returns, third provides usage context, then clearly labeled Args and Returns sections. Every sentence adds value with zero waste.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (search with multiple filters), the description provides complete context: purpose, usage guidelines, parameter semantics, and return format. With annotations covering safety/behavioral traits and the description detailing inputs/outputs, this is comprehensive for a search tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description carries full burden for parameter documentation. It clearly explains the 'params' input structure, mentions query, optional date range, and optional member filter, and the Returns section details the output format with specific fields. This fully compensates for the schema coverage gap.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool searches Hansard for parliamentary debates, questions, and speeches, specifying it returns contributions from MPs and Lords with specific fields. It distinguishes from siblings like 'parliament_find_member' or 'parliament_member_debates' by focusing on content search rather than member-specific operations.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool ('useful for understanding legislative intent or political context'), but does not explicitly state when not to use it or name specific alternatives among siblings. It implies usage for content-based searches rather than member-focused queries.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parliament_search_petitionsSearch UK Parliament PetitionsA

Read-onlyIdempotent

Inspect

Search UK Parliament petitions by keyword.

Returns petition title, state, signature count, and dates for government response or parliamentary debate if applicable.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	PetitionSearchInput with query and optional state filter.

Output Schema

ParametersJSON Schema

Name	Required	Description
`limit`	No	Page size requested
`query`	Yes	The term that was searched in petitions
`state`	Yes	Petition state filter applied to this query
`total`	Yes	Number of petitions returned in this call
`offset`	No	Skip applied to this page
`has_more`	No	True if a full page was returned (more may exist)
`petitions`	No	Matching petitions (title, state, signature count, key dates, URL).

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, openWorldHint=true, idempotentHint=true, and destructiveHint=false, covering safety and idempotency. The description adds valuable context by specifying what data is returned ('petition title, state, signature count, and dates for government response or parliamentary debate if applicable') and the return format ('JSON array of PetitionSummary objects'), which goes beyond the annotations. No contradictions with annotations exist.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose in the first sentence, followed by return details and parameter/return specifications in a structured format. Every sentence adds value without redundancy, making it efficient and easy to parse for an AI agent.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity, rich annotations (covering safety and behavior), and the presence of an output schema (implied by 'Returns: str: JSON array of PetitionSummary objects'), the description is complete. It explains the purpose, return data, and parameters adequately, with no significant gaps for a search tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, but the input schema includes detailed descriptions for 'query' and 'state' parameters. The description mentions 'query and optional state filter' in the Args section, adding minimal semantics beyond the schema. With one parameter (a nested object) and schema doing the heavy lifting, this meets the baseline for adequate parameter information.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the specific action ('Search UK Parliament petitions by keyword') and resource ('UK Parliament petitions'), distinguishing it from sibling tools like 'parliament_search_hansard' or 'parliament_find_member'. It provides a precise verb+resource combination that leaves no ambiguity about the tool's function.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage through the phrase 'Search UK Parliament petitions by keyword', suggesting this is for keyword-based searches of petitions. However, it doesn't explicitly state when to use this tool versus alternatives like 'bills_search_bills' or 'legislation_search', nor does it provide exclusions or specific scenarios where this tool is preferred over others.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

parliament_vibe_checkParliamentary Policy Vibe CheckA

Read-only

Inspect

Assess the likely parliamentary reception of a policy proposal.

Searches Hansard for relevant debate contributions, then uses LLM sampling to classify sentiment and extract supporters, opponents, and key concerns.

Degrades gracefully if sampling is unavailable — returns contributions only.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	PolicyVibeInput with policy_text (full description) and topic (search keyword).

Output Schema

ParametersJSON Schema

Name	Required	Description
`query`	Yes	The policy query that was analysed
`key_concerns`	No	Main concerns raised in debate
`contributions`	Yes	Raw contributions retrieved from Hansard
`key_opponents`	No	Members identified as opposed or critical
`key_supporters`	No	Members identified as supportive
`sentiment_summary`	No	LLM-generated sentiment summary (None if sampling unavailable)

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description adds valuable behavioral context beyond annotations: it explains the two-stage process (search then LLM analysis), the graceful degradation when sampling is unavailable, and what gets returned in each case. Annotations cover safety (readOnlyHint=true, destructiveHint=false) and data characteristics (openWorldHint=true), but the description adds implementation details that help the agent understand tool behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and efficiently written. The first sentence states the core purpose, followed by implementation details, fallback behavior, and clear sections for Args and Returns. Every sentence adds value with no redundancy or wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (search + LLM analysis), the description provides complete context. It explains the process, fallback behavior, parameter usage, and return format. With an output schema present, the description doesn't need to detail return values, and it adequately compensates for the lack of schema descriptions through clear parameter explanations.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The description provides essential parameter semantics despite 0% schema description coverage. It explains that 'policy_text' is a 'full description' of the proposal to assess, and 'topic' is used as 'search keyword' for Hansard. The Args section clarifies the parameter structure (PolicyVibeInput containing both fields), adding significant value beyond the bare schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Assess the likely parliamentary reception of a policy proposal' with specific verbs ('searches Hansard', 'uses LLM sampling', 'classify sentiment', 'extract supporters/opponents/concerns'). It distinguishes from siblings like 'parliament_search_hansard' by adding analysis beyond simple search.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear context for when to use this tool: for assessing parliamentary reception of policy proposals. It doesn't explicitly state when NOT to use it or name alternatives, but the purpose naturally differentiates it from sibling tools like 'parliament_search_hansard' (which only searches) or 'bills_search_bills' (which deals with bills rather than policy proposals).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

votes_get_divisionGet Division DetailA

Read-onlyIdempotent

Inspect

Get full detail for a parliamentary division including how each member voted.

Voter lists are truncated to 100 per side to fit response limits. Total voter counts are always accurate regardless of truncation.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	DivisionDetailInput with division_id and house.

Output Schema

ParametersJSON Schema

Name	Required	Description
`id`	Yes	Division ID
`date`	Yes	Date of the division
`house`	Yes	Commons or Lords
`title`	Yes	Division title / motion text
`passed`	Yes	Whether the motion passed
`truncated`	No	Whether voter lists were truncated to fit response limits
`aye_voters`	No	Members who voted Aye (may be truncated)
`ayes_count`	Yes	Total Aye votes
`noe_voters`	No	Members who voted No (may be truncated)
`noes_count`	Yes	Total No votes
`total_aye_voters`	No	Total number of Aye voters before truncation
`total_noe_voters`	No	Total number of No voters before truncation
`is_government_win`	No	Whether the government won (Lords only)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate read-only, non-destructive, idempotent, and open-world operations. The description adds valuable behavioral context beyond annotations: it discloses response truncation limits ('truncated to 100 per side'), response constraints ('to fit response limits'), and data accuracy guarantees ('Total voter counts are always accurate regardless of truncation').

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured and front-loaded with the core purpose, followed by behavioral details, then parameter and return sections. Every sentence adds value: the first states the purpose, the next two explain truncation behavior, and the last two document inputs/outputs without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity, rich annotations, and presence of an output schema, the description is reasonably complete. It covers purpose, behavioral constraints, and basic I/O structure. The output schema handles return values, so the description doesn't need to detail them. Minor gaps remain in parameter documentation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, but the description provides minimal parameter guidance in the 'Args' section, naming 'division_id and house' and referencing the input type. However, it doesn't add meaningful semantics beyond what's implied by the schema structure, leaving parameters largely undocumented in both schema and description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('Get full detail') and resource ('parliamentary division'), specifying it includes 'how each member voted'. It distinguishes from sibling 'votes_search_divisions' by focusing on detailed results for a specific division rather than searching.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage context by referencing 'division_id from votes_search_divisions results', suggesting this tool follows a search. However, it doesn't explicitly state when to use this vs alternatives or provide exclusions, though the distinction from 'votes_search_divisions' is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

votes_search_divisionsSearch Parliamentary DivisionsA

Read-onlyIdempotent

Inspect

Search parliamentary divisions (votes) in the Commons or Lords.

Returns division summaries including title, date, vote counts, and whether the motion passed. Use votes_get_division with the division ID for full voter lists.

ParametersJSON Schema

Name	Required	Description	Default
`params`	Yes	DivisionSearchInput with optional query, house, date range, member filter.

Output Schema

ParametersJSON Schema

Name	Required	Description
`house`	Yes	Commons or Lords
`limit`	No	Page size requested
`query`	No	The search term, if any (None = browse recent)
`total`	Yes	Number of divisions returned in this call
`offset`	No	Skip applied to this page
`has_more`	No	True if a full page was returned (more may exist)
`divisions`	No	Matching divisions. Use the integer `id` field with votes_get_division to fetch the full voter list.

Tool Definition Quality

A4.5/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, destructiveHint=false, openWorldHint=true, and idempotentHint=true, covering safety and idempotency. The description adds useful context about what data is returned (summaries including title, date, vote counts, pass status) and the relationship to the sibling tool, but doesn't mention rate limits, authentication needs, or pagination behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is perfectly structured: first sentence states purpose, second describes returns, third provides sibling guidance, then separate Args and Returns sections. Every sentence earns its place with no wasted words, and key information is front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has comprehensive annotations (readOnly, openWorld, idempotent, non-destructive), an output schema exists (so return values are documented), and the description provides clear purpose, usage guidelines, and parameter overview, this is complete for a search tool. The description effectively complements the structured data without redundancy.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, but the description provides a helpful overview: 'optional query, house, date range, member filter.' It explains the purpose of the single 'params' parameter and its nested structure, though it doesn't detail individual sub-parameters like query format or date ranges. This adds meaningful context beyond the bare schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool searches parliamentary divisions (votes) in Commons or Lords, specifying the verb 'search' and resource 'parliamentary divisions'. It distinguishes from sibling 'votes_get_division' by indicating this returns summaries while the sibling provides full voter lists.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly provides when-to-use guidance: 'Use votes_get_division with the division ID for full voter lists.' This clearly distinguishes between this summary search tool and the detailed division tool, helping the agent choose the right alternative.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Server Details

Available Tools

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Output Schema

Discussions

Your Connectors

Resources