read_page
Fetch complete page content from crawled URLs. Retrieve full Markdown, titles, and source text from local files to analyze specific results after searching your web crawl.
Instructions
Read the full extracted content of a specific crawled page by its URL.
Returns the complete Markdown or text content of a single page, including
its title and source URL. Use this after search_pages to read the full
content of a relevant result.
This is a read-only operation on local files — no network requests are made.
URL matching is case-insensitive and tolerates trailing slashes.
Args:
url: The exact URL of the page to read. Must match a URL from a previous
crawl. Case-insensitive. Example: "https://docs.example.com/auth".
jsonl_path: Full path to the pages.jsonl file. If empty, defaults to
<WEBCRAWLER_OUTPUT_DIR>/pages.jsonl.Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | ||
| jsonl_path | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |