browser_ocr
Capture a screenshot of the current browser page via Chromium CDP and extract all visible text using local OCR, bypassing the need to transmit image data to the LLM.
Instructions
Take a screenshot of the current browser page (via CDP connection to a shared Chromium instance) and extract all visible text using local OCR. Requires Chromium to be running with --remote-debugging-port=9222.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| fullPage | No | Capture the full scrollable page instead of just the viewport. Default: false. |