Server Configuration
Describes the environment variables required to run the server.
| Name | Required | Description | Default |
|---|---|---|---|
| WS_PORT | No | WebSocket port for extension connection | 7225 |
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | {
"listChanged": true
} |
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| browser_navigate | Navigate to a URL in the active browser tab |
| browser_click | Click an element on the page using a ref from snapshot or a CSS selector |
| browser_type | Type text into an input element |
| browser_scroll | Scroll the page or an element. Supports pixel offsets, scrolling to elements, and named positions (top/bottom). Works with virtual scroll containers used by social media sites. |
| browser_press_key | Press a keyboard key or combination (Enter, Escape, Tab, ArrowDown, etc). Supports modifiers like Ctrl+A, Cmd+C. |
| browser_wait | Wait for a condition: element to appear, element to disappear, or a fixed delay. Useful for SPAs and dynamic content. |
| browser_snapshot | Get an accessibility tree snapshot of the page. Returns element refs you can use with click, type, and other tools. Use compact mode (default) for smaller output - only interactive elements. |
| browser_screenshot | Capture a screenshot of the visible page area |
| browser_console | Read console messages (log, warn, error) from the browser |
| browser_network | Read network requests made by the page. Filter by URL pattern. |
| browser_tabs | Manage browser tabs: list, create, close, or focus |
| browser_find | Find elements on the page using natural language (e.g. "login button", "search input"). Returns refs you can use with click/type. |
| browser_text | Extract raw text content from the page or a specific element |
| browser_hover | Hover over an element to trigger tooltips, dropdown menus, or hover states |
| browser_select | Select an option from a dropdown/select element |
| browser_evaluate | Execute JavaScript in the page and return the result. Use for DOM queries, reading page state, or any operation not covered by other tools. |
| browser_click_text | Click an element by its visible text content. Works on React dropdowns, portals, and overlays that may not appear in snapshots. CSP-safe (no eval). Prefers deepest matching element. |
| browser_handle_dialog | Handle JavaScript dialogs (alert, confirm, prompt). Dialogs block page interaction until handled. |
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
No resources | |