Skip to main content
Glama
sheriff-stuff

Integrated Browser MCP

Integrated Browser Agent Connect

Exposes VS Code's integrated browser to external agents (Claude Code, scripts, curl) via a local HTTP API and MCP server.

Every existing browser automation solution targets an external Chrome process. This extension is different: it bridges the browser already inside VS Code — with your session cookies, your localhost dev server, your DevTools — to any agent that can speak HTTP or MCP.

How it works

Claude Code / curl / scripts
    │
    │  MCP (stdio) or HTTP
    ▼
MCP Server  ──HTTP──▶  VS Code Extension  ──CDP──▶  Integrated Browser
                       localhost:3788+               (real Chromium, in-editor)

The extension uses VS Code's built-in editor-browser and the Chrome DevTools Protocol (CDP) to provide full browser automation: navigation, JavaScript evaluation, clicking, typing, screenshots, DOM access, console and network monitoring.

Related MCP server: VS Code Simple Browser MCP Server

Getting started

This is a standalone build — it's not on the VS Code Marketplace. Build and install from source:

  1. Package and install the extension:

    npm install
    npx vsce package
    code --install-extension integrated-browser-agent-connect-<version>.vsix --force

    (vsce package runs the production build itself via vscode:prepublish. On vsce 3.x add --allow-proposed-apis browser, since it refuses to package an extension declaring the browser proposal without it; vsce 2.x neither needs nor accepts the flag.)

    (Or press F5 in VS Code to run it in an Extension Development Host for development.)

  2. The HTTP server starts automatically on localhost:3788

  3. For Claude Code: the MCP server is auto-configured in ~/.claude.json on first activation

  4. The browser launches lazily on the first request — no browser tab until you need one

Usage with Claude Code

The MCP tools are available immediately. Ask Claude Code to use them by name:

use browser_navigate to open http://localhost:3000

Or reference the MCP server:

use the integrated-browser-agent-connect to open my app

The MCP server ships an instructions field that conformant clients surface to the model automatically. If your agent still picks the wrong tool (e.g. shells out to open instead of using browser_navigate), add a short hint to your project's CLAUDE.md:

For browser automation, use the integrated-browser-agent-connect MCP tools (browser_navigate, browser_screenshot, etc.) — never shell out to `open`/`xdg-open`/`start`.

Usage with curl

curl -X POST http://127.0.0.1:3788/navigate \
  -H 'Content-Type: application/json' \
  -d '{"url":"http://localhost:3000"}'

See HTTP API below for the full endpoint list.

MCP tools

All interaction tools accept an optional tabId parameter. Omit it to target the active tab.

Tool

Description

browser_navigate

Navigate to a URL

browser_eval

Execute JavaScript in the page

browser_click

Click an element by CSS selector

browser_type

Type text into an element by CSS selector

browser_scroll

Scroll the page or a specific element

browser_screenshot

Capture page as PNG. fullPage for whole-document capture; waitMs to delay capture for in-flight CSS transitions.

browser_emulate

Override viewport dimensions, DPR, mobile flag, and User-Agent. Sticky until reset:true.

browser_snapshot

Get the accessibility tree

browser_dom

Get the full page HTML

browser_console

Read buffered console output (aggregates across tabs when tabId omitted)

browser_network

Read buffered network requests (aggregates across tabs when tabId omitted)

browser_network_clear

Clear the network log

browser_url

Get the current page URL

browser_tab_open

Open a new browser tab (proposed API only)

browser_tab_close

Close a tab by id

browser_tab_list

List open tabs with their ids, URLs, titles, and active flag

browser_tab_activate

Set the default target tab

browser_status

Check bridge connection status

HTTP API

All responses follow the format { ok: true, data: ... } or { ok: false, error: "..." }.

All interaction endpoints (navigate, eval, click, type, scroll, screenshot, snapshot, dom, url) accept an optional tabId — as a ?tabId= query param on GET requests or in the JSON body on POST. Omit to target the active tab.

Method

Endpoint

Body

Description

GET

/status

Bridge health + diagnostics (transport, active tab, buffer sizes, event counts)

POST

/navigate

{ url, tabId? }

Navigate to URL

POST

/eval

{ expression, tabId? }

Run JS in page context

POST

/click

{ selector, tabId? }

Click element by CSS selector

POST

/type

{ selector, text, tabId? }

Type into element

POST

/scroll

{ deltaX, deltaY, selector?, tabId? }

Scroll page or element

GET

/screenshot

?tabId=X&fullPage=true&waitMs=N

Base64 PNG screenshot. fullPage=true captures beyond the viewport. waitMs sleeps before capture (handles CSS transitions).

POST

/emulate

{ width, height, deviceScaleFactor?, mobile?, userAgent?, reset?, tabId? }

Device-metric override. {reset:true} clears.

GET

/snapshot

?tabId=X

Accessibility tree

GET

/dom

?tabId=X

Full page outerHTML

GET

/console

?limit=N&tabId=X

Buffered console output (last 200). Aggregates across tabs when tabId omitted.

GET

/network

?limit=N&filter=x&tabId=X

Buffered network requests (last 200). Aggregates across tabs when tabId omitted.

POST

/network/clear

?tabId=X

Clear network log (one tab or all)

GET

/url

?tabId=X

Current page URL

GET

/tabs

List open tabs [{ tabId, url, title, active, state, transport }]

POST

/tab/open

{ url, makeActive? }

Open a new tab (proposed API only). Returns { tabId, url, title }

POST

/tab/close/:tabId

Close a tab

POST

/tab/activate/:tabId

Set the active (default) tab

Multi-window support and port discovery

Each VS Code window gets its own browser and HTTP server. Ports are assigned automatically starting from 3788 and incrementing if already taken.

How the MCP server finds the right window

When Claude Code calls a browser tool, the MCP server needs to know which VS Code window to talk to. It resolves this automatically:

  1. Each VS Code window registers itself at ~/.integrated-browser-agent-connect/instances/<hash>.json with its port, workspace path, and PID

  2. The MCP server reads all instance files and filters out dead processes

  3. It matches process.cwd() (Claude Code's working directory) against registered workspace paths — deepest match wins

  4. If no workspace matches, it falls back to the most recently started instance

This means when you run Claude Code inside a VS Code terminal, it automatically connects to the browser in that VS Code window.

Manual override

Set the BROWSER_BRIDGE_PORT environment variable to force a specific port:

BROWSER_BRIDGE_PORT=3789 claude

Troubleshooting

If the MCP server connects to the wrong window, check the registered instances:

cat ~/.integrated-browser-agent-connect/instances/*.json

Stale instance files from crashed VS Code windows are cleaned up automatically on the next window startup. You can also delete them manually.

Enabling worker event capture (proposed API)

By default the bridge launches the integrated browser via a VS Code debug session and talks to it through vscode-js-debug's CDP proxy. That proxy only forwards events from the main page session — so logs and network requests from web workers and service workers never reach the /console and /network buffers.

VS Code ships a proposed API (vscode.window.openBrowserTab) that bypasses vscode-js-debug entirely and gives direct multiplexed access to the CDP stream. On this path, worker and iframe events are captured and tagged with a target field.

To enable it, launch VS Code with the proposed API flag:

code --enable-proposed-api=sheriff-stuff.integrated-browser-agent-connect

The extension feature-detects the proposal at startup and uses it if available. Without the flag, the bridge falls back to the debug-session path and works exactly like before — so setting the flag is optional and safe.

Check which path you're on via the status bar tooltip (Browser MCP: Connected (proposed) vs (debug-session)), or GET /statustransport: "browserTab" vs "websocket".

Caveat: the browser proposal is still tracked upstream and its shape can change between VS Code releases. The fallback path keeps the extension usable regardless.

Multi-tab

Multi-tab support requires the proposed API (previous section). When enabled:

  • browser_tab_open("https://example.com") opens a new tab, returns its tabId.

  • browser_tab_list() shows all open tabs — the active flag marks which one receives commands by default, and the number field (1, 2, 3…) matches the (N) prefix in each tab's title. Numbers are stable per tab with reuse: close tab 3 and the next new tab gets 3, but tab 4 stays tab 4 for its lifetime.

  • Every interaction tool (browser_navigate, browser_eval, browser_click, etc.) accepts an optional tabId. Omit it to target the active tab; pass it to target a specific tab.

  • browser_console and browser_network aggregate across all tabs by default — each entry carries the tabId of the tab it came from. Pass tabId to filter.

  • Closing a tab in the VS Code UI is picked up automatically; the bridge untracks it and the tabId becomes invalid.

The (N) prefix is auto-applied even to pages without a <title> element (about:blank, raw API responses), and it re-applies after navigation. The bridge strips any prefix a prior version of the extension may have left on a pre-existing tab, so you won't see stacked markers after an upgrade.

On the debug-session fallback path, the bridge always exposes exactly one tab (synthetic id tab-main) and browser_tab_open returns an error pointing to the proposed API.

Limitations and trust model

  • HTTP server binds to 127.0.0.1 only — never network-exposed. No authentication, same trust model as VS Code's built-in terminals.

  • /eval runs arbitrary JavaScript in the open page — same trust model as the DevTools console. Don't pass untrusted input.

  • On the debug-session path (default, no proposed-API flag): only one tab, web worker and service worker events not captured, and the debug toolbar / "(1)" badge appears while the browser is active.

  • The browser tab lives in the VS Code editor area. Moving it to a side panel is fine; closing it disconnects CDP.

F
license - not found
-
quality - not tested
B
maintenance

Maintenance

Maintainers
Response time
Release cycle
Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/sheriff-stuff/vscode-integrated-browser-agent-connect'

If you have feedback or need assistance with the MCP directory API, please join our Discord server