Server Configuration
Describes the environment variables required to run the server.
Name | Required | Description | Default |
---|---|---|---|
CAPS | No | Comma-separated list of capabilities to enable, possible values: tabs, pdf, history, wait, files, install | |
PORT | No | Port to listen on for SSE transport | |
VISION | No | Run server that uses screenshots (Aria snapshots are used by default) | |
BROWSER | No | Browser or chrome channel to use. Possible values: chrome, firefox, webkit, msedge, chrome-beta, chrome-canary, chrome-dev, msedge-beta, msedge-canary, msedge-dev | chrome |
HEADLESS | No | Run browser in headless mode (headed by default) | |
CDP_ENDPOINT | No | CDP endpoint to connect to | |
USER_DATA_DIR | No | Path to the user data directory | |
EXECUTABLE_PATH | No | Path to the browser executable |
Schema
Prompts
Interactive templates invoked by user choice
Name | Description |
---|---|
No prompts |
Resources
Contextual data attached and managed by the client
Name | Description |
---|---|
Page console |
Tools
Functions exposed to the LLM to take actions
Name | Description |
---|---|
browser_close | Close the page |
browser_wait | Wait for a specified time in seconds |
browser_resize | Resize the browser window |
browser_file_upload | Upload one or multiple files |
browser_install | Install the browser specified in the config. Call this if you get an error about the browser not being installed. |
browser_press_key | Press a key on the keyboard |
browser_navigate | Navigate to a URL |
browser_navigate_back | Go back to the previous page |
browser_navigate_forward | Go forward to the next page |
browser_pdf_save | Save page as PDF |
browser_snapshot | Capture accessibility snapshot of the current page, this is better than screenshot |
browser_click | Perform click on a web page |
browser_drag | Perform drag and drop between two elements |
browser_hover | Hover over element on page |
browser_type | Type text into editable element |
browser_select_option | Select an option in a dropdown |
browser_take_screenshot | Take a screenshot of the current page. You can't perform actions based on the screenshot, use browser_snapshot for actions. |
browser_tab_list | List browser tabs |
browser_tab_new | Open a new tab |
browser_tab_select | Select a tab by index |
browser_tab_close | Close a tab |