Citation Verification
Server Details
Real-time fact-check, citation verification, and source-freshness for AI agents.
- Status
- Unhealthy
- Last Tested
- Transport
- Streamable HTTP
- URL
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 4.1/5 across 3 of 3 tools scored.
Each tool has a clearly distinct purpose: cite_check verifies URL resolution and claim match, fact_check verifies factual claims against authoritative sources, and source_freshness checks URL liveness and modification date. No overlap or ambiguity.
All tools follow a consistent snake_case pattern with a verb (check, check, source) and noun (cite, fact, freshness), making the naming predictable and easy to understand.
With 3 tools, the server is well-scoped for its purpose of citation verification. Each tool serves a distinct function without being excessive or insufficient.
The server covers the core aspects of citation verification: URL validity, claim fact-checking, and source freshness. A minor gap might be the lack of a tool to retrieve citation metadata (e.g., author, date), but the current set is sufficient for typical use cases.
Available Tools
3 toolscite_checkAInspect
Verify that one or more cited URLs (a) resolve, (b) match the cited claim text, and (c) aren't retracted. Useful when an LLM produced inline citations and you want to confirm they're real and on-point. Pass DOIs directly (e.g. '10.1038/nature12373') or full URLs. Up to 25 per call.
| Name | Required | Description | Default |
|---|---|---|---|
| citations | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description bears full responsibility. It discloses the three checks and the batch limit, but does not mention authentication needs, rate limits, or what happens on invalid input. The output format is not described, leaving the agent to infer success/failure states.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is three sentences with no wasted words. It front-loads the core functionality, then provides usage guidance and constraints. Every sentence earns its place.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With no output schema and moderate complexity, the description lacks details about the return format (e.g., per-citation results, error handling). While it hints at a match_score, the overall output structure is missing, which is required for an agent to fully understand the tool's behavior.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema includes descriptions for the 'url' and 'claim' fields, but the tool description adds value by clarifying that DOIs can be passed directly and that up to 25 citations are allowed. It also explains that providing a claim yields a match_score, which is not in the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool verifies that cited URLs resolve, match the claim text, and are not retracted. It specifies the exact checks performed using verbs like 'verify', 'resolve', 'match', and 'aren't retracted'. This distinguishes it from siblings like fact_check and source_freshness by focusing on citation integrity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly mentions when to use the tool ('when an LLM produced inline citations and you want to confirm they're real and on-point') and provides input format guidance (DOIs or full URLs, up to 25 per call). No exclusions or alternatives are discussed, but the context is clear.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
fact_checkAInspect
Verify a factual claim against authoritative sources (Wikipedia, Wikidata, Crossref academic citations). Returns a verdict ('supported' / 'contradicted' / 'mixed' / 'unverified'), a 0–1 confidence score, and the list of sources with excerpts. Use this when an agent is about to assert a fact it isn't 100% sure of, or when post-processing LLM output to flag possibly-hallucinated claims.
| Name | Required | Description | Default |
|---|---|---|---|
| claim | Yes | A single factual claim, ideally one sentence. Longer text is truncated to 200 chars. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations provided, so description must cover behavioral traits. It mentions return values (verdict, confidence, sources) and truncation behavior from schema. However, does not disclose potential latency, required permissions, or how authoritative sources are accessed. Adequate but missing details.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences front-loading purpose and usage advice. No wasted words, efficient and clear.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a single-parameter tool with no output schema, description covers purpose, usage, and return format. Missing explanation of what happens with multiple claims or edge cases, but overall sufficient.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% with one parameter. Description adds meaning by specifying 'ideally one sentence' and noting truncation to 200 chars, which is helpful beyond the schema's description.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states verb 'verify' and resource 'factual claim against authoritative sources', listing specific sources (Wikipedia, Wikidata, Crossref). Distinguishes from siblings by focusing on verification of claims, whereas cite_check and source_freshness likely handle citations and freshness.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Explicitly says 'Use this when an agent is about to assert a fact it isn't 100% sure of, or when post-processing LLM output to flag possibly-hallucinated claims.' Provides clear context for when to use, though does not explicitly state when not to use or mention alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
source_freshnessAInspect
Check whether a source URL is still live and how recently it was modified. Returns the HTTP last-modified header (if present), the most recent Wayback Machine snapshot, and a 'changed_recently' flag (last 90 days). Use this for RAG hygiene — flagging citations that point to pages that may have changed since the agent's training data.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL to check. |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden. It discloses the returned data (last-modified, Wayback snapshot, 90-day flag) but omits details like rate limits, redirect behavior, or handling of inaccessible URLs.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two sentences, directly stating purpose and use case with no extraneous information. It is front-loaded and efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a simple one-parameter tool with no output schema, the description adequately explains purpose, return values, and a usage scenario. Minor gaps like interpreting 'changed_recently' do not significantly detract.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100% for the single 'url' parameter. The description does not add meaning beyond the schema's 'URL to check' description, so baseline score of 3 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool checks if a URL is live and how recently it was modified, listing specific return values (last-modified header, Wayback snapshot, changed_recently flag). It distinguishes itself from sibling tools by focusing on freshness and liveness.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly recommends use for RAG hygiene to flag changed citations. While it doesn't mention when not to use or alternatives, the use case is clearly stated, providing adequate guidance.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!