list_pages
List all pages from a previous crawl to get an overview of available content. Returns URLs, titles, and word counts to identify content-rich pages. Read-only operation on local files.
Instructions
List all pages from a previous crawl with their URLs, titles, and word counts.
Returns a summary of every page in the crawl index. Use this to get an
overview of available content before searching or reading specific pages.
Word counts help identify content-rich pages vs. thin landing pages.
This is a read-only operation on local files — no network requests are made.
Args:
jsonl_path: Full path to the pages.jsonl file. If empty, defaults to
<MARKCRAWL_OUTPUT_DIR>/pages.jsonl.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| jsonl_path | No | Full path to the pages.jsonl file. | `<MARKCRAWL_OUTPUT_DIR>/pages.jsonl` |
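
The default for `jsonl_path` described in the Args can be sketched as follows. This is only an illustration, not the tool's actual code: it assumes `MARKCRAWL_OUTPUT_DIR` is read from an environment variable, and both the helper name `resolve_jsonl_path` and the `"."` fallback are hypothetical.

```python
import os
from pathlib import Path


def resolve_jsonl_path(jsonl_path: str = "") -> Path:
    """Return the explicit path, or fall back to <MARKCRAWL_OUTPUT_DIR>/pages.jsonl."""
    if jsonl_path:
        # A non-empty argument is used as given.
        return Path(jsonl_path)
    # Assumption: the output directory comes from an environment variable.
    # The "." fallback is made up for this sketch.
    output_dir = os.environ.get("MARKCRAWL_OUTPUT_DIR", ".")
    return Path(output_dir) / "pages.jsonl"
```
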
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes | Summary of every page in the crawl index: URLs, titles, and word counts. | |
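
To make the behaviour concrete, here is a minimal sketch of how such a listing could be built from a local JSONL file. It is not the tool's implementation: the record keys `url`, `title`, and `text`, the whitespace-based word count, and the function name `summarize_pages` are all assumptions made for illustration.

```python
import json
from pathlib import Path


def summarize_pages(jsonl_path):
    """Return one summary dict per page: url, title, and word_count."""
    summaries = []
    # Read-only access to a local file; no network requests are involved.
    with Path(jsonl_path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            # Assumed keys: "url", "title", "text" (hypothetical field names).
            text = record.get("text", "")
            summaries.append({
                "url": record.get("url", ""),
                "title": record.get("title", ""),
                "word_count": len(text.split()),  # whitespace-separated words
            })
    return summaries
```

Sorting the returned list by `word_count` is one simple way to separate content-rich pages from thin landing pages.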