Browser Automation MCP Server

This is a Model Context Protocol (MCP) server that provides browser automation capabilities for Roo Code. It enables AI agents to control web browsers through a standardized interface.

Features

  • Browser control (launch, close)
  • Mouse interactions (click at coordinates)
  • Keyboard input (type text)
  • Page scrolling (up/down)
  • Fixed viewport size (900x600)
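Because clicks target raw coordinates and the viewport is fixed at 900x600, a caller may want to check a point before sending it. The following is a minimal sketch of such a check; `VIEWPORT` and `isWithinViewport` are hypothetical client-side names, not part of the server, and the top-left origin is an assumption.

```typescript
// Hypothetical client-side helper; not part of the server's API.
// The 900x600 size comes from the feature list above.
const VIEWPORT = { width: 900, height: 600 } as const;

// Returns true when (x, y) lies inside the fixed viewport.
// Assumes the origin is the top-left corner and coordinates are in pixels.
function isWithinViewport(x: number, y: number): boolean {
  return (
    isFinite(x) &&
    isFinite(y) &&
    x >= 0 &&
    y >= 0 &&
    x < VIEWPORT.width &&
    y < VIEWPORT.height
  );
}

console.log(isWithinViewport(450, 300)); // true
console.log(isWithinViewport(900, 300)); // false: x is one past the right edge
```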

Installation

  1. Clone the repository:
       git clone https://github.com/grapheneaffiliates/browser-automation-mcp.git
       cd browser-automation-mcp
  2. Install dependencies:
       npm install
  3. Build the project:
       npm run build

Configuration

Add the following to your Cline MCP settings file:

{
  "mcpServers": {
    "browser": {
      "command": "node",
      "args": ["path/to/browser-server/build/index.js"],
      "disabled": false,
      "alwaysAllow": []
    }
  }
}
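If the settings entry is generated rather than typed by hand, it could be built along these lines. This is only a sketch: `McpServerEntry` and `buildBrowserServerConfig` are hypothetical names, and the path passed in is the same placeholder used in the settings above, to be replaced with the real location of the built `index.js`.

```typescript
// Shape of one server entry, mirroring the keys in the settings above.
interface McpServerEntry {
  command: string;
  args: string[];
  disabled: boolean;
  alwaysAllow: string[];
}

// Hypothetical helper: builds the settings object for a given path to build/index.js.
function buildBrowserServerConfig(
  indexJsPath: string
): { mcpServers: { browser: McpServerEntry } } {
  return {
    mcpServers: {
      browser: {
        command: "node",
        args: [indexJsPath],
        disabled: false,
        alwaysAllow: [],
      },
    },
  };
}

// Placeholder path from the settings above; substitute your own.
const config = buildBrowserServerConfig("path/to/browser-server/build/index.js");
console.log(JSON.stringify(config, null, 2));
```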

Available Tools

The server provides the following MCP tools:

  • launch_browser: Launch a new browser instance at a specified URL
  • click: Click at specific x,y coordinates on the page
  • type: Type text into the page
  • scroll: Scroll the page up or down
  • close_browser: Close the browser instance
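A client that dispatches tool calls by name might guard against typos before sending a request. The sketch below uses only the five tool names listed above; `BROWSER_TOOLS`, `BrowserTool`, and `isBrowserTool` are hypothetical names introduced for illustration.

```typescript
// Tool names exactly as listed above.
const BROWSER_TOOLS = [
  "launch_browser",
  "click",
  "type",
  "scroll",
  "close_browser",
] as const;

// Union of the five literal tool names.
type BrowserTool = (typeof BROWSER_TOOLS)[number];

// Hypothetical type guard: narrows an arbitrary string to a known tool name.
function isBrowserTool(name: string): name is BrowserTool {
  return (BROWSER_TOOLS as readonly string[]).indexOf(name) !== -1;
}

console.log(isBrowserTool("scroll")); // true
console.log(isBrowserTool("hover")); // false: not one of the listed tools
```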

Usage Example

// Using the MCP tools in Roo Code
const result = await use_mcp_tool({
  server_name: "browser",
  tool_name: "launch_browser",
  arguments: { url: "https://example.com" }
});
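A longer session would typically pair each `launch_browser` with a `close_browser`, even when an intermediate step fails. The sketch below illustrates that pattern under stated assumptions: `use_mcp_tool` is replaced by a local stand-in that only records calls, `withBrowser` is a hypothetical wrapper, and the empty `arguments` object for `close_browser` is a guess, since only the `url` argument of `launch_browser` is documented here.

```typescript
// One tool invocation, matching the argument shape used in the example above.
interface ToolCall {
  server_name: string;
  tool_name: string;
  arguments: Record<string, unknown>;
}

// Stand-in for the client's use_mcp_tool so the sketch runs on its own.
// It only records each call and is NOT the real implementation.
const calls: ToolCall[] = [];
async function use_mcp_tool(call: ToolCall): Promise<{ ok: boolean }> {
  calls.push(call);
  return { ok: true };
}

// Hypothetical wrapper: launch the browser, run the caller's steps, always close.
async function withBrowser(url: string, steps: () => Promise<void>): Promise<void> {
  await use_mcp_tool({
    server_name: "browser",
    tool_name: "launch_browser",
    arguments: { url },
  });
  try {
    await steps();
  } finally {
    // Close even if a step throws, so no browser instance is left running.
    // Assumption: close_browser takes no arguments.
    await use_mcp_tool({
      server_name: "browser",
      tool_name: "close_browser",
      arguments: {},
    });
  }
}

withBrowser("https://example.com", async () => {
  // click / type / scroll calls would go here; their argument names
  // are not documented in this README, so none are shown.
}).then(() => console.log(calls.map((c) => c.tool_name).join(" -> ")));
// prints "launch_browser -> close_browser"
```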

License

MIT
