This server provides access to the Internet Archive's Wayback Machine for retrieving historical web snapshots and searching archived content.
- List snapshots: Get available archived versions of a URL with date range filtering (`from`, `to`), result limits, and match type control (exact, prefix, host, domain)
- Fetch archived pages: Retrieve content from specific timestamps (YYYYMMDDHHMMSS format), with the option to get the original content without the Wayback Machine's rewriting
- Search archive items: Query the Internet Archive's collections with advanced filters such as media type, collection, field selection, sorting, and pagination
- Resource access: Directly fetch archived content via `wayback://{url}/{timestamp}` resource URLs
Wayback Machine MCP Server (Python)
A Model Context Protocol (MCP) server that provides access to the Internet Archive Wayback Machine: list snapshots and fetch archived pages. Inspired by the TypeScript server described in the LobeHub listing.
- Reference: Wayback MCP (listing) — link
- Reference: Internet Archive APIs — Wayback APIs index
- Reference: MCP server quickstart — Build an MCP Server
Features
- Tools
  - `get_snapshots(url, from, to, limit, match_type)`: lists snapshots via the Wayback CDX API
  - `get_archived_page(url, timestamp, original)`: fetches archived content
  - `search_items(query, mediatype, collection, fields, sort, rows, page)`: searches archive.org items
- Resource
  - `wayback://{url}/{timestamp}`: returns the archived page content
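To make the tool parameters more concrete, here is a minimal sketch of how `get_snapshots` and `search_items` arguments might map onto the two HTTP endpoints listed in the Notes section. The function names `build_cdx_url` and `build_search_url` are hypothetical and are not taken from this server's source. The query parameter names (`from`, `to`, `limit`, `matchType`, `fl[]`, `sort[]`, `rows`, `page`, `output`) are those of the public CDX and advanced search APIs. Joining the filters into the query with `AND` and the default values for `rows` and `page` are assumptions.

```python
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"
SEARCH_ENDPOINT = "https://archive.org/advancedsearch.php"


def build_cdx_url(url, from_=None, to=None, limit=None, match_type=None):
    """Build a CDX query URL; optional filters are left out when None."""
    params = {"url": url, "output": "json"}
    if from_ is not None:
        params["from"] = from_          # 1 to 14 digits of YYYYMMDDHHMMSS
    if to is not None:
        params["to"] = to
    if limit is not None:
        params["limit"] = limit
    if match_type is not None:
        params["matchType"] = match_type  # exact, prefix, host, or domain
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def build_search_url(query, mediatype=None, collection=None,
                     fields=(), sort=(), rows=50, page=1):
    """Build an advanced search URL (assumed mapping of the tool arguments)."""
    clauses = [query]
    if mediatype:
        clauses.append(f"mediatype:{mediatype}")
    if collection:
        clauses.append(f"collection:{collection}")
    # Repeated keys (fl[], sort[]) need a list of pairs rather than a dict.
    params = [("q", " AND ".join(clauses))]
    params += [("fl[]", field) for field in fields]
    params += [("sort[]", key) for key in sort]
    params += [("rows", rows), ("page", page), ("output", "json")]
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_cdx_url("example.com", from_="2020", to="2021",
                        limit=5, match_type="prefix"))
    print(build_search_url("apollo", mediatype="texts",
                           fields=["identifier", "title"]))
```

Keeping URL construction in small pure functions like these makes the mapping easy to test without any network access.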
Install
Run (stdio)
Install as CLI
Run with uvx (no install)
- Pin Python version:
- Pin package version:
Configure in MCP client
Claude Desktop settings (example):
Alternatively, using uvx (no install):
Usage examples
- Tool call (snapshots):
- Tool call (page):
- Tool call (items search):
- Resource fetch:
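As a rough illustration of what such calls look like on the wire, the sketch below builds MCP JSON-RPC requests for the tools and the resource. The `tools/call` and `resources/read` methods come from the MCP specification, and the argument names are taken from the tool signatures under Features. All argument values, the helper names, and the exact way the target URL is embedded in the `wayback://` URI are assumptions made for illustration.

```python
import json


def tool_call_request(request_id, name, arguments):
    """Build an MCP `tools/call` JSON-RPC request as a plain dict."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


def resource_read_request(request_id, uri):
    """Build an MCP `resources/read` JSON-RPC request as a plain dict."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "resources/read",
        "params": {"uri": uri},
    }


# Hypothetical argument values; only the argument names come from the README.
snapshots = tool_call_request(1, "get_snapshots", {
    "url": "example.com", "from": "20200101", "to": "20201231",
    "limit": 5, "match_type": "exact",
})
page = tool_call_request(2, "get_archived_page", {
    "url": "example.com", "timestamp": "20200101000000", "original": True,
})
items = tool_call_request(3, "search_items", {
    "query": "apollo", "mediatype": "texts", "rows": 10, "page": 1,
})
# Assumed URI shape following the wayback://{url}/{timestamp} template.
resource = resource_read_request(4, "wayback://example.com/20200101000000")

if __name__ == "__main__":
    for request in (snapshots, page, items, resource):
        print(json.dumps(request, indent=2))
```

In practice an MCP client builds these messages itself, so the sketch is mainly useful for seeing which arguments each tool expects.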
Notes
- Snapshot data via CDX API: `https://web.archive.org/cdx/search/cdx?url={url}&output=json`
- Page retrieval via Wayback: `https://web.archive.org/web/{timestamp}/{url}` (or `id_` mode for original content)
- Advanced item search endpoint: `https://archive.org/advancedsearch.php` (JSON output)
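The notes above can be sketched in a few lines of Python: formatting a timestamp, building a page URL with or without the `id_` modifier, and turning a CDX JSON response into dictionaries. The helper names are hypothetical and not taken from this server's source. The sketch relies on the CDX API returning, for `output=json`, a list of rows whose first row holds the field names. The sample payload is made up for illustration and is not real archive data.

```python
from datetime import datetime, timezone


def to_wayback_timestamp(dt):
    """Format a datetime as the 14-digit YYYYMMDDHHMMSS Wayback timestamp."""
    return dt.strftime("%Y%m%d%H%M%S")


def wayback_page_url(url, timestamp, original=False):
    """Build a page URL; `id_` after the timestamp requests unrewritten content."""
    modifier = "id_" if original else ""
    return f"https://web.archive.org/web/{timestamp}{modifier}/{url}"


def parse_cdx_rows(payload):
    """Convert CDX JSON (header row followed by data rows) into dicts."""
    if not payload:
        return []
    header, *rows = payload
    return [dict(zip(header, row)) for row in rows]


if __name__ == "__main__":
    ts = to_wayback_timestamp(datetime(2020, 1, 2, 3, 4, 5, tzinfo=timezone.utc))
    print(wayback_page_url("http://example.com/", ts))
    print(wayback_page_url("http://example.com/", ts, original=True))

    # Made-up sample in the shape of a CDX JSON response.
    sample = [
        ["urlkey", "timestamp", "original", "mimetype", "statuscode"],
        ["com,example)/", "20200102030405", "http://example.com/", "text/html", "200"],
    ]
    for snapshot in parse_cdx_rows(sample):
        print(snapshot["timestamp"], snapshot["original"])
```

Returning dictionaries keyed by the header row keeps the parser independent of which fields were requested from the CDX API.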