Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@atlas-browser-mcpGo to news.ycombinator.com and tell me the top 3 headlines"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
π atlas-browser-mcp
Visual web browsing for AI agents via Model Context Protocol (MCP).
β¨ Features
πΈ Visual-First: Navigate the web through screenshots, not DOM parsing
π·οΈ Set-of-Mark: Interactive elements labeled with clickable
[0],[1],[2]... markersπ Humanized: Bezier curve mouse movements, natural typing rhythms
π§© CAPTCHA-Ready: Multi-click support for image selection challenges
π‘οΈ Anti-Detection: Built-in measures to avoid bot detection
π Quick Start
Installation
Use with Claude Desktop
Add to your Claude Desktop config (claude_desktop_config.json):
Then ask Claude:
"Navigate to https://news.ycombinator.com and tell me the top 3 stories"
π οΈ Available Tools
Tool | Description |
| Go to URL, returns labeled screenshot |
| Capture current page with labels |
| Click element by label ID |
| Click multiple elements (for CAPTCHA) |
| Type text, optionally press Enter |
| Scroll page up or down |
π Usage Examples
Basic Navigation
CAPTCHA Handling
π§ Configuration
Headless Mode
For servers without display:
Custom Viewport
ποΈ How It Works
Navigate: Browser loads the page
Inject SoM: JavaScript labels all interactive elements
Screenshot: Capture the labeled page
AI Sees: The screenshot shows
[0],[1],[2]... on buttons, links, inputsAI Acts: "Click
[5]" β Browser clicks the element at that positionRepeat: New screenshot with updated labels
π€ Integration
With Cline (VS Code)
Programmatic Use
π Requirements
Python 3.10+
Playwright with Chromium
π Troubleshooting
"Playwright not installed"
"Browser closed unexpectedly"
Try running with headless=False to see what's happening:
Elements not being detected
Some dynamic pages need more wait time. The browser waits 1.5s after navigation, but complex SPAs may need longer.
π License
MIT License - see LICENSE
π Credits
Built for Atlas, an autonomous AI agent.
Inspired by:
anthropic/mcp - Model Context Protocol
AskUI - Visual testing approach
Set-of-Mark prompting - Visual grounding technique