OpenChrome
Enables AI agents to navigate, interact, and extract data from Amazon's website through the OpenChrome MCP server.
Enables AI agents to navigate, interact, and extract data from Datadog's dashboard through the OpenChrome MCP server.
Enables AI agents to navigate, interact, and extract data from eBay's website through the OpenChrome MCP server.
Enables AI agents to navigate, interact, and extract data from Stripe's dashboard through the OpenChrome MCP server.
Enables AI agents to navigate, interact, and extract data from Walmart's website through the OpenChrome MCP server.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@OpenChromecompare AirPods Pro prices on Amazon, Best Buy, and Walmart"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
What it is
OpenChrome is an MCP server that controls your real, already-logged-in Chrome through the CDP — no middleware, no separate browser, no re-authentication. One Chrome process, many isolated tabs, ~300 MB for 20 parallel lanes.
It is harness-engineered: the server doesn't just expose browser APIs, it wraps them with a hint engine, a circuit breaker, an automatic-recovery runtime, and token-efficient page serialization — so the agent makes fewer mistakes, recovers without "thinking", and burns far fewer tokens.
You: compare "AirPods Pro" prices across Amazon, eBay, Walmart, Best Buy
AI: [4 parallel lanes, already authenticated everywhere]
Best Buy $179 · Amazon $189 · Walmart $185 · eBay $172
live pages · already authenticated · past bot detectionTraditional (Playwright et al.) | OpenChrome | |
5-site task* | ~250s (login each) | ~3s (parallel) |
Memory* | ~2.5 GB (5 browsers) | ~300 MB (1 Chrome) |
Re-auth | every run | never |
Bot detection | flagged | invisible (real Chrome) |
* Illustrative of the architectural difference (one authenticated Chrome vs. N cold
browsers), not benchmarked competitive results. Quantitative speed/cost figures become
headline claims only as live or recorded-real benchmark rows — see
docs/benchmarks/benchmark-direction.md.
Quick start
Install and point your MCP client at it — one command:
npm install -g openchrome-mcp
openchrome setup # Claude Code
openchrome setup --client codex # Codex CLI
npx openchrome-mcp setup --client opencode # OpenCodeRestart your MCP client. That's it — Chrome auto-launches on first tool call.
For manual Codex CLI configuration, run openchrome config --client codex
and add the printed [mcp_servers.openchrome] block to ~/.codex/config.toml.
{
"mcpServers": {
"openchrome": {
"command": "openchrome",
"args": ["serve", "--auto-launch"]
}
}
}Run openchrome update later to refresh the CLI and client config.
Prefer no terminal? A one-click desktop app (macOS / Windows / Linux, beta) runs the server with no Node.js setup.
What you can do with it
Ask your agent in plain language — these all map to OpenChrome tools:
Parallel research — "screenshot AWS billing, GCP, Stripe, Datadog at once" → 4 lanes, one Chrome, already authenticated.
Authenticated scraping — crawl dashboards and member-only pages using your existing login. No credentials in config.
Form & flow automation — fill, click, navigate multi-step flows; the agent gets corrective hints when a step drifts.
Production UI debugging —
oc_performance_insights/oc_vitalsfor LCP/CLS,console_capture,oc_devtools_urlto attach live DevTools.Site monitoring & diffing —
oc_evidence_bundlesnapshots +oc_difffor deterministic before/after (DOM, screenshot pHash, network, console).Crawling — async
crawl_start/crawl_status/crawl_canceljobs with cursor pagination.Verifiable runs —
oc_assertchecks page state against an Outcome Contract (pass / fail / inconclusive) instead of guessing.
The default surface is 118 tools across navigation, interaction, reading,
extraction, parallel workflows, contracts, skills, recovery, and diagnostics.
Full catalogue: docs/agent/capability-map.md.
Using it conveniently
Drive it from the shell — no MCP host needed
The CLI can call the MCP surface directly. Great for scripts, CI, and debugging:
oc run navigate --arg url=https://example.com
oc run read_page --arg mode=dom --json
oc navigate https://example.com # positional sugar for common tools
oc click ref_5Declarative scenarios with oc playbook
Write a YAML scenario where every step is one tool call with an inline Outcome Contract — deterministic, no LLM judgement:
oc playbook run scenario.yaml --vars url=https://iana.org --out report.mdSee docs/cli/playbook.md.
Keep one browser warm — HTTP daemon mode
Run OpenChrome as a long-lived daemon so multiple clients (Claude Code + CI + a dashboard) share one Chrome process, and the server outlives whatever launched it (Docker, systemd, CI):
openchrome serve --http 3100 --auth-token <token> --idle-timeout 30m
curl -s http://127.0.0.1:3100/healthOne Chrome process, tabs isolated per session. Without --idle-timeout it stays
up until stopped; with it, it self-exits after the idle window. Full guide:
docs/getting-started/http-daemon.md.
Diagnose your environment
openchrome doctor # Node, disk, Chrome binary/port, orphans, perms, locks
openchrome check # verify CLI + runtime wiringToken-efficient page reads
read_page mode="dom" serializes the page into a compact text form — ~5–15x
fewer tokens than the raw DOM. Each element carries an affordance marker so the
agent knows the type at a glance:
# [142]<input type="search" .../> ★ ← # text input
$ [156]<button type="submit"/>Search ★ ← $ button / control
@ [289]<a href="/home"/>Home ★ ← @ link (% = visual target)[backendNodeId] identifiers are stable for the node's lifetime — pass 142,
node_142, or ref_N to any action tool. oc_observe goes further: it returns
a ready-to-act numbered list in one call instead of read_page → query_dom →
inspect → interact.
Why agents fail less on OpenChrome
The bottleneck in browser automation is the LLM thinking between steps — every wrong guess costs 10–15s of inference. OpenChrome's harness cuts that loop:
Subsystem | What it does |
Hint engine (30+ rules) | Catches error→recovery patterns and corrects the agent before mistakes cascade. Promotes repeated patterns to permanent rules. |
Recovery runtime | Deterministic, bounded recovery for a tool call — recover in-server, no LLM round-trip (pilot tier). |
Ralph engine | 7-strategy interaction waterfall: AX click → CSS → CDP coords → JS → keyboard → raw mouse → human escalation. |
3-level circuit breaker | Element / page / global — stops the agent burning tokens on permanently broken elements. |
Outcome classifier | Reports what actually happened after a click (SUCCESS / SILENT_CLICK / WRONG_ELEMENT). |
49 reliability mechanisms | 8 defense layers from process lifecycle to MCP gateway — no single failure hangs the server. See |
The harness is designed to cut the think-act loop: fewer LLM round-trips, faster
wall time, lower cost. Specific speed/cost multipliers are published as headline
claims only when backed by live or recorded-real benchmark rows — see
docs/benchmarks/benchmark-direction.md.
Other capabilities worth knowing
Parallel sessions — 1 Chrome, N tabs/lanes;
workerId+profileDirectorygive per-client isolation. Multiple MCP clients can share one Chrome safely when they connect through a single broker/HTTP owner (--broker/--connect-broker); independent stdio clients should use separate--port/--user-data-dirprofiles. Seedocs/mcp/topologies.md.Anti-bot / Turnstile — 3-tier auto-fallback (headless → stealth → real headed Chrome) bypasses CDN/WAF blocks. Turnstile guide.
Interactive login — headed by default since the launcher runs visible; complete 2FA/CAPTCHA once, reuse the persistent profile after.
Session persistence —
--persist-storagesaves cookies + localStorage atomically for headless reuse.Shadow DOM — open + closed roots via CDP-pierced reads;
__pierce()/__openchrome.querySelectorAllDeep()helpers injavascript_tool.Element intelligence — find elements by natural language (AX-first, CSS fallback, Korean role keywords built in:
"버튼"→ button).Core / pilot tiers — core is on by default and preserves the stable surface;
--pilotopts into contract runtime, handoff persistence, voting, and the skill curator.
Server & headless deployment
openchrome serve --server-mode # headless + auto-launch + server defaultsWorks in CI/CD and containers with no login — navigation, scraping, screenshots,
forms, and parallel workflows all run in clean sessions. A production
Dockerfile is included (docker build -t openchrome . && docker run openchrome).
Authentication (per-tenant API keys, JWT/OAuth, shared token): docs/auth.md.
Transport stability policy: docs/transport-lifecycle.md.
Host plugins
OpenChrome ships plugin manifests for Claude Code and Codex CLI so both hosts load the MCP server and skill body from the same shared source — no per-host duplication.
Claude Code
Load from the repo for local development:
claude --plugin-dir /path/to/openchromeOr install from npm after the package is published to a marketplace:
/plugin install openchrome # once listed in a marketplaceThe skill body lives under skills/openchrome/ and the /openchrome:connect
slash command is registered from the top-level commands/ directory. Run
/reload-plugins after updating to pick up new content.
Manifest: .claude-plugin/plugin.json — registers the openchrome MCP server
(openchrome serve --auto-launch). Skills and commands are auto-discovered
from skills/ and commands/.
Codex CLI
Load from the repo for local development:
codex --plugin-dir /path/to/openchromeOr follow the Codex plugin installation instructions once the package is listed in the Codex plugin registry.
Manifest: .codex-plugin/plugin.json — identical MCP server entry point and
skill path; the two manifests share the same skills/openchrome/ body.
Shared skill body
Both manifests share skills/openchrome/SKILL.md as the single source of
truth for skill content, and the slash command stub at commands/connect.md
registers /openchrome:connect. The commands/ directory sits at the plugin
root so both hosts auto-discover it.
Documentation
Topic | Link |
Architecture & reliability layers | |
Getting started walkthrough | |
Full tool catalogue | |
CLI & playbook | |
HTTP daemon mode | |
Research recipes | |
Latest release notes |
Development
git clone https://github.com/shaun0927/openchrome.git
cd openchrome
npm install && npm run build && npm testLint before submitting source changes: npm run lint -- --max-warnings=0
(or npm run lint:changed -- --base origin/develop for changed files only).
PRs target the develop branch.
License
MIT
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/shaun0927/openchrome'
If you have feedback or need assistance with the MCP directory API, please join our Discord server