Open the navigation index for a cached document
get_document_navigationGet structural outline and text previews of large PDFs to locate key sections like balance sheets before fetching full pages. Returns bookmarks, per-page summaries, keyword landmarks, and source URLs. Always follow up with fetch_document_pages for authoritative content.
Instructions
For PDFs that don't fit in a single document block (>~20 MB or >100 pages) OR whenever you need to locate specific sections, call this FIRST before fetching content. Returns outline (PDF bookmarks), per-page text previews (first ~200 chars), keyword-matched landmarks (balance sheet, directors report, auditor report etc.), text_layer classification, and source URLs.
CRITICAL — these are NAVIGATION AIDS ONLY. Page previews, outline titles, landmark matches, and search snippets may be truncated, contain OCR errors, or match false positives. NEVER cite them as source material for numbers, quotes, legal text, financial figures, dates, or names. Always follow up with fetch_document_pages(pages=) to retrieve authoritative content before answering.
Requires the document bytes to already be cached — call fetch_document once first if this is a new document.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| jurisdiction | Yes | ISO 3166-1 alpha-2 country code (uppercase). All registries are official government sources. Currently supported: AU, BE, CA, CA-BC, CA-NT, CH, CY, CZ, DE, ES, FI, FR, GB, HK, IE, IM, IS, IT, KR, KY, LI, MC, MX, MY, NL, NO, NZ, PL, RU, TW. Per-country capability, ID format, examples, status mapping, and caveats: call `list_jurisdictions({jurisdiction:'<code>'})`. To find which countries support a specific tool: `list_jurisdictions({supports_tool:'<tool>'})`. | |
| document_id | Yes | ||
| fresh | No | Set true to ignore the cached index.json and re-run pdfjs against the stored source bytes. Does not re-pull from upstream. | |
| company_id | No | OVERRIDE (rare use). Normally auto-resolved from the list_filings side-cache. | |
| transaction_id | No | OVERRIDE (rare use). Normally auto-resolved. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| queried_at | Yes | ISO-8601 + Europe/London timezone stamp for when the registry was queried. | |
| jurisdiction | No | ||
| document_id | No | ||
| pages | No | ||
| headings | No | ||
| outline | No |