Schema | mcp-server-wayback

mcp-server-wayback

Overview Schema Related Servers Score Discussions

Server Configuration

Describes the environment variables required to run the server.

Name	Required	Description	Default
`WAYBACK_MCP_IA_ACCESS_KEY`	No	Your Internet Archive S3 access key for higher rate limits.
`WAYBACK_MCP_IA_SECRET_KEY`	No	Your Internet Archive S3 secret key for higher rate limits.

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": false }
`prompts`	{ "listChanged": false }
`resources`	{ "subscribe": false, "listChanged": false }
`experimental`	{}

Tools

Functions exposed to the LLM to take actions

Name	Description
check_availabilityA	Check if a URL has been archived by the Wayback Machine and return the closest snapshot.
lookup_snapshotsA	Return CDX snapshots for a URL, with optional date range and status-code filter. The Wayback Machine often crawls the same URL many times per day; raw CDX results would return one row per crawl. `collapse` is a server-side de-duplication: adjacent rows that share the same value in the chosen field get folded into a single representative row. By default we collapse on the first 8 digits of the timestamp (`"timestamp:8"`), which is the YYYYMMDD prefix — i.e. one row per day. This is almost always what you want for "show me snapshots of this URL"; otherwise the default limit of 50 gets eaten by 50 captures from a single hour and you see nothing about the URL's history. Override `collapse` when you need different granularity: `"digest"` — collapse on content hash, so you only see captures where the page actually changed `"timestamp:10"` — one row per hour (first 10 digits of timestamp) `""` (empty string) — disable collapsing entirely; return every capture any other CDX collapse spec is passed through verbatim `latest=True` uses CDX's fastLatest path to return the N most recent captures cheaply (much faster than a full scan over the index). Cannot be combined with `from_date`/`to_date`.
search_archiveA	Search Internet Archive collections (uploaded books, audio, video, software items) using Lucene query syntax. This is NOT a search over the Wayback Machine web crawl. It only returns items that someone has uploaded to archive.org as a discrete media item. Do NOT use this for: Current news, journalism, or recent events Government circulars, press releases, or official web pages Wikipedia articles or any live web content "What was on this website" / "what URLs are archived" — use `search_domain` or `lookup_snapshots` for those. Good uses: historical books, lecture recordings, archived films, software releases, podcast episodes, scanned magazines. Use Lucene fields when possible (e.g. `subject:"civil war"`, `creator:"NASA"`, `collection:librivoxaudio`).
search_domainC	Find archived URLs under a domain or path prefix. Auto-detects matchType from input.
get_snapshot_contentA	Fetch and extract text content from an archived web page. Returns extracted text, word count, and extraction metadata.
get_item_metadataC	Fetch rich structured metadata for any Internet Archive item by its identifier.

Prompts

Interactive templates invoked by user choice

Name	Description
`research_topic`	Research a topic across Internet Archive collections and synthesise an overview. Takes a single free-text `topic` argument. Time ranges and sample sizes are interpreted from the topic phrasing by the model (e.g. "the moon landing in the 1960s" → year range 1960–1969).
`track_site_changes`	Narrate how an archived web page changed over time using sampled snapshots. Takes a single free-text `url` argument. Date ranges are interpreted from the surrounding context by the model when present.
`audit_link_rot`	Audit a list of URLs for link rot, surfacing archived alternatives.
`setup_authentication`	Walk the user through configuring Internet Archive API keys for higher rate limits.

Resources

Contextual data attached and managed by the client

Name	Description
No resources

Server Configuration
Capabilities
Tools
Prompts
Resources

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/lakshyamehta03/wayback-machine-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server