CEL See
cel_seeObserve the current screen state to capture structured UI elements, windows, screenshots, and accessibility details. Use before taking any action to gain context. Supports waiting for elements, event-driven watching, and element inspection.
Instructions
Read and observe the current screen state. Returns structured UI elements, window lists, screenshots, CDP page content, accessibility element details, and screen change events. Always use this BEFORE acting to understand what's on screen.
Screen Context: context (elements with filter/compression — use detail 'compact' to save tokens), screenshot (PNG capture), windows (visible window list), monitors (display list).
Element Inspection: focused (high-fidelity detail for one element_id), element_at (hit-test x,y coordinates), is_settable (check if set_value works), make_reference (resilient ref that survives across snapshots), cursor_position.
Browser (CDP): cdp_status (debug targets & connection state), cdp_page (full page content as text).
Observation Recall: observation (load a persisted context snapshot by observation_id).
Waiting & Watching: wait_for_element (poll for element by type/label, default 10s timeout), wait_for_idle (poll until screen stabilizes — requires 2 consecutive stable polls), watch (event-driven — 18 event types: tree_changed, network_idle, focus_changed, value_changed, window_created, menu_opened, menu_closed, sheet_created, layout_changed, title_changed, app_activated, app_deactivated, window_moved, window_resized, window_minimized, window_restored, selection_changed, row_count_changed). Note: watch is unavailable during an active cel_perceive session.
Limits: CDP enrichment caps at 50 text_blocks, 50 interactive_elements, 3000 char body_text.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||