archive
Server Details
Archive MCP — wraps the Internet Archive APIs (free, no auth)
| Field | Value |
|---|---|
| Status | Healthy |
| Last Tested | |
| Transport | Streamable HTTP |
| URL | |
| Repository | pipeworx-io/mcp-archive |
| GitHub Stars | 0 |
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 3.5/5 across 3 of 3 tools scored.
Each tool has a clearly distinct purpose: get_metadata retrieves detailed information for a known item, search performs broad queries across the archive, and wayback_check verifies URL archiving status. There is no overlap in functionality, making tool selection straightforward for an agent.
All tool names follow a consistent snake_case pattern with clear verb_noun structures: get_metadata, search, and wayback_check. The naming is uniform and predictable across the set.
Three tools is a minimal but reasonable count for an archive server, covering key operations like retrieval, search, and Wayback Machine checks. It feels slightly thin but adequately scoped for the domain without being overwhelming.
The tools cover core archive interactions (metadata retrieval, search, and Wayback checks), but there are notable gaps such as downloading archived content, managing collections, or advanced filtering options. While basic workflows are supported, the surface is not fully comprehensive for archive operations.
Available Tools
3 tools

get_metadata (grade A)
Retrieve full metadata for an Internet Archive item by its identifier (the unique ID in the archive.org URL).
| Name | Required | Description | Default |
|---|---|---|---|
| id | Yes | Archive.org item identifier (e.g., "principleofrelat00eins", "ApolloMissionsMoonLandings") | |
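Since the server wraps the Internet Archive's public APIs, get_metadata presumably maps onto the archive.org Metadata API. A minimal sketch of the equivalent direct call, using only the Python standard library (the endpoint and response fields are those of the public Metadata API; the wrapper function name merely mirrors the tool for illustration):

```python
import json
from urllib.request import urlopen

def get_metadata(item_id: str) -> dict:
    # Public Metadata API: GET https://archive.org/metadata/{identifier}
    # Free and unauthenticated, matching this server's "free, no auth" claim.
    with urlopen(f"https://archive.org/metadata/{item_id}") as resp:
        return json.load(resp)

meta = get_metadata("principleofrelat00eins")
print(meta["metadata"]["title"])    # the item's title
print(len(meta.get("files", [])))   # number of files attached to the item
```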
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden. It states the tool retrieves metadata but doesn't disclose behavioral traits like rate limits, authentication needs, error handling, or response format. This is a significant gap for a tool with no annotation coverage.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, well-structured sentence that front-loads the purpose and includes essential details (e.g., identifier source). There's no wasted text, making it highly efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's low complexity (one parameter, no nested objects) and high schema coverage, the description is adequate for basic use. However, with no output schema and no annotations, it lacks the details on return values, error cases, and behavioral constraints that an agent needs to operate effectively.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents the single parameter 'id', with examples. The description restates the parameter's purpose ('by its identifier') but provides no syntax or format details beyond the schema. Since no parameters are left undocumented, the baseline score of 4 is appropriate.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the specific action ('Retrieve full metadata'), the resource ('an Internet Archive item'), and the method ('by its identifier'). It distinguishes from sibling tools like 'search' (which likely finds items) and 'wayback_check' (which likely checks availability).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when you have a specific identifier and need metadata, but doesn't explicitly state when to use this versus 'search' (e.g., for finding items without an ID) or 'wayback_check' (e.g., for availability). No exclusions or alternatives are mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
search (grade B)
Search the Internet Archive for texts, audio, video, software, and other items. Supports Lucene query syntax.
| Name | Required | Description | Default |
|---|---|---|---|
| limit | No | Number of results to return (1-100, default 20) | 20 |
| query | Yes | Search query (e.g., "subject:astronomy", "creator:NASA", "moon landing") | |
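The search tool presumably fronts the Archive's public Advanced Search endpoint, which accepts Lucene query syntax. A hedged sketch of the equivalent direct request (the endpoint and response shape are those of the public advancedsearch.php API; the clamping simply mirrors the tool's documented 1-100 range):

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

def search(query: str, limit: int = 20) -> list[dict]:
    # Public endpoint: GET https://archive.org/advancedsearch.php
    # q takes Lucene syntax, e.g. "subject:astronomy" or "creator:NASA".
    params = urlencode({
        "q": query,
        "fl[]": ["identifier", "title"],   # fields to include in each hit
        "rows": max(1, min(limit, 100)),   # clamp to the tool's 1-100 range
        "output": "json",
    }, doseq=True)
    with urlopen(f"https://archive.org/advancedsearch.php?{params}") as resp:
        return json.load(resp)["response"]["docs"]

for doc in search("creator:NASA", limit=5):
    print(doc["identifier"], doc.get("title", ""))
```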
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full burden of behavioral disclosure. It mentions support for Lucene query syntax, which adds some context, but it does not cover important aspects such as rate limits, authentication needs, pagination behavior, or what the response format looks like (especially since there is no output schema). This leaves significant gaps for a search tool.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and front-loaded, consisting of two sentences that efficiently convey the tool's purpose and key feature (Lucene query syntax). Every sentence earns its place without unnecessary elaboration, making it easy for an agent to parse quickly.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity of a search tool with no annotations and no output schema, the description is incomplete. It lacks details on behavioral traits (e.g., rate limits, response format) and does not compensate for the absence of structured data. While it covers the purpose well, it falls short in providing enough context for reliable tool invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the input schema already documents both parameters ('query' and 'limit') with descriptions. The description adds modest value by naming the query syntax (Lucene) but provides no details beyond what the schema offers. The baseline score of 3 is appropriate, as the schema does the heavy lifting.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose with a specific verb ('Search') and resource ('Internet Archive'), listing the types of items it searches for (texts, audio, video, software, and other items). It distinguishes itself from sibling tools like 'get_metadata' and 'wayback_check' by focusing on search functionality rather than metadata retrieval or Wayback Machine checks.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage by specifying what the tool searches for and mentioning Lucene query syntax, but it does not explicitly state when to use this tool versus alternatives like 'get_metadata' or 'wayback_check'. No exclusions or specific contexts are provided, leaving the agent to infer usage based on the purpose.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
wayback_check (grade A)
Check whether a URL has ever been archived in the Wayback Machine and retrieve the closest available snapshot.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to look up (e.g., "https://example.com/some-page") | |
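wayback_check presumably wraps the Wayback Machine's public Availability API, which returns the closest snapshot for a URL, or an empty object when the URL was never archived. A minimal sketch under that assumption:

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

def wayback_check(url: str) -> dict | None:
    # Public Availability API: GET https://archive.org/wayback/available?url=...
    # Returns {"archived_snapshots": {"closest": {...}}} when a capture exists,
    # and {"archived_snapshots": {}} when the URL was never archived.
    with urlopen(f"https://archive.org/wayback/available?{urlencode({'url': url})}") as resp:
        data = json.load(resp)
    return data.get("archived_snapshots", {}).get("closest")

snapshot = wayback_check("https://example.com/some-page")
if snapshot:
    print(snapshot["url"], snapshot["timestamp"])  # archived copy and capture time
else:
    print("No snapshot found")
```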
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries the full burden of behavioral disclosure. It mentions the tool's purpose but lacks details on error handling, rate limits, authentication needs, or what happens if no snapshot exists. This is a significant gap for a tool that interacts with an external service.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, efficient sentence that front-loads the core purpose ('check whether a URL has ever been archived') and adds necessary detail ('retrieve the closest available snapshot') without any wasted words or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's moderate complexity (interacting with an external archive service), no annotations, and no output schema, the description is minimally adequate. It states what the tool does but lacks details on behavior, output format, or error cases, which are important for effective agent use.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents the single 'url' parameter. The description adds no parameter semantics beyond what the schema provides (such as URL format constraints or further examples), which meets the baseline for high schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the specific action ('check whether a URL has ever been archived') and resource ('Wayback Machine'), and distinguishes the tool from its siblings by mentioning 'retrieve the closest available snapshot', which suggests a different function than 'get_metadata' or 'search'.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for checking URL archival status and retrieving snapshots, but provides no explicit guidance on when to use this tool versus the sibling tools 'get_metadata' or 'search', leaving the agent to infer based on the described functionality.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
```json
{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}
```

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Claiming this connector lets you:
- Control your server's listing on Glama, including description and metadata
- Access analytics and receive server usage reports
- Get monitoring and health status updates for your server
- Feature your server to boost visibility and reach more users
For users:
- Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
- Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
- Centralized credential management – store and rotate API keys and OAuth tokens in one place
- Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently

For server owners:
- Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
- Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
- Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to connect to the server. This can happen for several reasons:
- The server is experiencing an outage
- The URL of the server is wrong
- Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.