read_page
Retrieve the full Markdown or text content of a crawled webpage using its URL. Returns title and source. Works offline on local files with case-insensitive URL matching.
Instructions
Read the full extracted content of a specific crawled page by its URL.
Returns the complete Markdown or text content of a single page, including
its title and source URL. Use this after search_pages to read the full
content of a relevant result.
This is a read-only operation on local files — no network requests are made.
URL matching is case-insensitive and tolerates trailing slashes.
Args:
url: The exact URL of the page to read. Must match a URL from a previous
crawl. Case-insensitive. Example: "https://docs.example.com/auth".
jsonl_path: Full path to the pages.jsonl file. If empty, defaults to
<MARKCRAWL_OUTPUT_DIR>/pages.jsonl.Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | ||
| jsonl_path | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |