list_pages
Retrieve a structured summary of crawled web pages displaying URLs, titles, and word counts to identify content-rich pages and prioritize detailed review.
Instructions
List all pages from a previous crawl with their URLs, titles, and word counts.
Returns a summary of every page in the crawl index. Use this to get an
overview of available content before searching or reading specific pages.
Word counts help identify content-rich pages vs. thin landing pages.
This is a read-only operation on local files — no network requests are made.
Args:
jsonl_path: Full path to the pages.jsonl file. If empty, defaults to
<WEBCRAWLER_OUTPUT_DIR>/pages.jsonl.Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| jsonl_path | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |