Fetch a subset of pages from a cached PDF
fetch_document_pagesReturn specific pages of a cached PDF document in pdf (authoritative), text (skimmable), or png (scanned) format. Up to 100 pages per call.
Instructions
Return specific pages of a PDF in one of three formats: • format='pdf' — pdf-lib page slice, preserves the original text layer and fonts (no re-encoding). This is the ONLY format that gives you byte-exact, citation-grade content. Use this for financial numbers, legal quotes, and any answer requiring precision. • format='text' — raw extracted text from pdfjs. Machine-readable but NOT authoritative — OCR errors on bad-quality text layers can silently garble digits. Use only for summarisation / light reading, and cross-check numbers by re-fetching with format='pdf'. • format='png' — page rasterization via Cloudflare Browser Rendering, for documents with text_layer='none' (scanned PDFs). Phase 6 — may return 'not implemented' in current deployment.
The response includes at most 100 pages (Anthropic document-block hard cap). Split larger ranges into multiple calls.
Requires the document's bytes to already be cached — call fetch_document on the full document first if this is a new filing.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| jurisdiction | Yes | ISO 3166-1 alpha-2 country code (uppercase). All registries are official government sources. Currently supported: AU, BE, CA, CA-BC, CA-NT, CH, CY, CZ, DE, ES, FI, FR, GB, HK, IE, IM, IS, IT, KR, KY, LI, MC, MX, MY, NL, NO, NZ, PL, RU, TW. Per-country capability, ID format, examples, status mapping, and caveats: call `list_jurisdictions({jurisdiction:'<code>'})`. To find which countries support a specific tool: `list_jurisdictions({supports_tool:'<tool>'})`. | |
| document_id | Yes | ||
| pages | Yes | Page spec like '1-5', '3,7,9', or '1,3-5'. 1-based. Max 100 pages per call. | |
| format | No | Output format. Use 'pdf' for authoritative content (default), 'text' for quick skimming, 'png' for scanned documents. | |
| dpi | No | DPI for format='png'. Default 150. 72 for thumbnails, 200+ for high-detail reading. | |
| company_id | No | OVERRIDE (rare use). Normally auto-resolved from the list_filings side-cache. | |
| transaction_id | No | OVERRIDE (rare use). Normally auto-resolved. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| queried_at | Yes | ISO-8601 + Europe/London timezone stamp for when the registry was queried. | |
| jurisdiction | No | ||
| document_id | No | ||
| pages_requested | No | ||
| chosen_format | No | ||
| size_bytes | No | ||
| bytes_base64 | No | ||
| text | No |