MCP Browser Use

by janspoerer

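What You Can Achieve With This MCP

This project aims to enable AI agents to perform web browsing, browser automation, and scraping using the Model Context Protocol (MCP) and Selenium.

Our mission is to let AI agents complete any web task with minimal human supervision, all based on natural language instructions.

How to Use This MCP

Please refer to the MCP documentation on modelcontextprotocol.io.

Note that all dependencies must be installed in the Python environment that your MCP config file points to. For example, if you point to the python or python3 executable, you are pointing to the global Python environment. It is usually better to point to a virtual environment, such as: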

/Users/yourname/code/mcp_browser_use/.venv/bin/python
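
To confirm which environment a given interpreter belongs to, a quick check along these lines may help. It is a minimal sketch using only the standard library; the function name `describe_environment` is made up for illustration and is not part of this project.

```python
import sys


def describe_environment() -> dict:
    """Report which interpreter is running and whether it is a virtual environment."""
    # Inside a venv, sys.prefix points at the venv directory, while
    # sys.base_prefix points at the Python installation it was created from.
    in_venv = sys.prefix != sys.base_prefix
    return {
        "executable": sys.executable,
        "prefix": sys.prefix,
        "in_virtualenv": in_venv,
    }


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

Run the script with the same interpreter path you intend to put in the config file. If `in_virtualenv` is reported as False, the dependencies would have to be installed in the global environment instead.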

If you have cloned this repository into your local code folder, your MCP config file should look like this:

{
  "mcpServers": {
    "mcp_browser_use": {
      "command": "/Users/janspoerer/code/mcp_browser_use/.venv/bin/python",
      "args": [
        "/Users/janspoerer/code/mcp_browser_use/mcp_browser_use"
      ]
    }
  }
}
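
Since the two paths in this config differ only by where the repository was cloned, the entry can be generated rather than typed by hand. The following is a sketch under the assumption that the virtual environment lives in `.venv` inside the clone and uses the macOS/Linux `bin/python` layout, as in the example above; `build_config` is a hypothetical helper, not something the project ships.

```python
import json
from pathlib import PurePosixPath


def build_config(repo_dir: str, server_name: str = "mcp_browser_use") -> dict:
    """Build the mcpServers entry for a clone located at repo_dir."""
    repo = PurePosixPath(repo_dir)
    return {
        "mcpServers": {
            server_name: {
                # Interpreter inside the repository's virtual environment
                # (assumed to be named .venv).
                "command": str(repo / ".venv" / "bin" / "python"),
                # The package directory is passed as the only argument,
                # mirroring the example config.
                "args": [str(repo / server_name)],
            }
        }
    }


if __name__ == "__main__":
    config = build_config("/Users/yourname/code/mcp_browser_use")
    print(json.dumps(config, indent=2))
```

The printed JSON can be pasted into the config file. If that file already lists other servers, merge the new entry into the existing "mcpServers" object rather than overwriting the file.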

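On macOS, the config file is located here: /Users/janspoerer/Library/Application Support/Claude/claude_desktop_config.json

Please refer to requirements.txt to see which dependencies you need to install.

Restart Claude to check whether the JSON config is valid. If something is wrong, Claude will show you the MCP error logs.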
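
Many config mistakes can be caught before restarting Claude. The sketch below parses the config text and checks the shape used in the example (an "mcpServers" object whose entries have a string "command" and a list "args"). These checks are assumptions derived from that example, not Claude's own validation rules, and `check_config` is a hypothetical name.

```python
import json
from typing import List


def check_config(text: str) -> List[str]:
    """Return a list of problems found in a claude_desktop_config.json string."""
    try:
        config = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc}"]

    if not isinstance(config, dict):
        return ["top level must be a JSON object"]

    servers = config.get("mcpServers")
    if not isinstance(servers, dict) or not servers:
        return ["missing or empty 'mcpServers' object"]

    problems = []
    for name, entry in servers.items():
        if not isinstance(entry, dict):
            problems.append(f"{name}: entry must be an object")
            continue
        if not isinstance(entry.get("command"), str):
            problems.append(f"{name}: 'command' must be a string")
        if not isinstance(entry.get("args", []), list):
            problems.append(f"{name}: 'args' must be a list")
    return problems


if __name__ == "__main__":
    sample = '{"mcpServers": {"mcp_browser_use": {"command": "python", "args": []}}}'
    print(check_config(sample) or "no problems found")
```

An empty list means the structure looks plausible. It does not prove that the interpreter path exists or that the dependencies are installed.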

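If the setup succeeded, you will see a small hammer icon at the bottom right of Claude's "New Chat" window. The number next to the hammer is the number of functions the MCP provides.

Click the hammer to see something like this: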

Available MCP Tools

Claude can use tools provided by specialized servers using Model Context Protocol. Learn more about MCP.

click_element
Click an element on the page.
Args:
  session_id: Session ID of the browser
  selector: CSS selector, XPath, or ID of the element to click
  selector_type: Type of selector (css, xpath, id)
From server: mcp_browser_use

close_browser
Close a browser session.
Args:
  session_id: Session ID of the browser to close
From server: mcp_browser_use

fill_text
Input text into an element.
Args:
  session_id: Session ID of the browser
  selector: CSS selector, XPath, or ID of the input field
  text: Text to enter into the field
  selector_type: Type of selector (css, xpath, id)
  clear_first: Whether to clear the field before entering text
From server: mcp_browser_use

navigate
Navigate to a URL.
Args:
  session_id: Session ID of the browser
  url: URL to navigate to
From server: mcp_browser_use

scroll
Scroll the page.
Args:
  session_id: Session ID of the browser
  x: Horizontal scroll amount in pixels
  y: Vertical scroll amount in pixels
From server: mcp_browser_use

send_keys
Send keyboard keys to the browser.
Args:
  session_id: Session ID of the browser
  key: Key to send (e.g., ENTER, TAB, etc.)
  selector: CSS selector, XPath, or ID of the element to send keys to (optional)
  selector_type: Type of selector (css, xpath, id)
From server: mcp_browser_use

start_browser
Start a new browser session.
Args:
  headless: Whether to run the browser in headless mode
From server: mcp_browser_use

take_screenshot
Take a screenshot of the current page.
Args:
  session_id: Session ID of the browser
From server: mcp_browser_use
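
The tool list implies a typical session: start a browser, navigate, interact, take a screenshot, close. The sketch below shows roughly what such a sequence looks like as MCP `tools/call` JSON-RPC requests, using the tool and argument names listed above. Claude normally issues these calls itself, after the protocol's initialization handshake, so this is for illustration only. The session ID, URL, and selector are placeholders: a real session ID would come from the `start_browser` response, whose format is not shown here.

```python
import json
from itertools import count

_request_ids = count(1)


def tool_call(name: str, **arguments) -> dict:
    """Wrap a tool invocation in an MCP `tools/call` JSON-RPC request."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Placeholder: the real value is whatever start_browser returns.
SESSION = "session-1"

requests = [
    tool_call("start_browser", headless=True),
    tool_call("navigate", session_id=SESSION, url="https://example.com"),
    tool_call(
        "fill_text",
        session_id=SESSION,
        selector="input[name='q']",  # hypothetical selector
        selector_type="css",
        text="hello",
        clear_first=True,
    ),
    tool_call("send_keys", session_id=SESSION, key="ENTER"),
    tool_call("take_screenshot", session_id=SESSION),
    tool_call("close_browser", session_id=SESSION),
]

if __name__ == "__main__":
    for request in requests:
        print(json.dumps(request))
```

Every call after `start_browser` carries the same `session_id`, which is how the server knows which browser a request refers to.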

Demo Video (YouTube)

security: not tested
license: A (permissive license)
quality: not tested

local-only server

The server can only run on the client's local machine because it depends on local resources.

Enables AI agents to perform web browsing, automation, and scraping tasks with minimal supervision, using natural language instructions and Selenium.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/janspoerer/mcp_browser_use'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.