view_page
Retrieve visible page text and stable element references to enable clicking, typing, and form filling. Use after navigation or clicking to inspect content and locate interactive elements.
Instructions
The way to see what is on the page. Call this after navigate/click/switch_tab — not capture_image. Returns text content + stable element refs (e.g. 'e5') for click/type/fill_form. Also use this to read visible text, check errors, find buttons. Default filter:'interactive' shows actionable elements; for paragraphs/table cells call view_page(ref: 'eN', filter: 'all'). Collapsed containers show as [eXX role, N items] — expand with view_page(ref:'eXX', filter:'all'). 10-30x cheaper than capture_image.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| depth | No | Nesting depth — how many tree levels to display (default: 3). Controls indentation, not visibility. Hidden sections (display: none) require clicking tabs/buttons to reveal. | |
| ref | No | Element ref (e.g. 'e5') to get subtree for | |
| filter | No | Filter mode: interactive (default), all, landmark, or visual (adds bounds/click/visibility) | interactive |
| max_tokens | No | Token budget — page content is automatically downsampled to fit. Omit for full output. |