workspace_snapshot
Capture a comprehensive overview of your desktop workspace including display layouts, window thumbnails, and actionable UI elements with coordinates for automation.
Instructions
Purpose: Orient fully in one call — returns display layouts, all window thumbnails (WebP), and per-window actionable element lists with clickAt coords. Details: uiSummary.actionable[] per window includes: action ('click'|'type'|'expand'|'select'), clickAt {x,y} (pass directly to mouse_click), value (current text for editable fields). Runs parallel internally; latency ≈ max(single screenshot), not N×screenshots. Also resets the diffMode buffer so subsequent screenshot(diffMode=true) returns only changes (P-frame). Prefer: Use at session start or after major workspace changes. Use screenshot(detail='meta') for cheap re-orientation within a session. Use screenshot(detail='text', windowTitle=X) for a single-window update. Caveats: Thumbnails are scaled, not 1:1 — use screenshot(dotByDot=true, windowTitle=X) for pixel-accurate coords on a specific window after snapshot.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| thumbnailMaxDimension | No | Max size of per-window thumbnail images (default 400px) | |
| includeUiSummary | No | Whether to include UI element summaries for each window |