Skip to main content
Glama

Selenium MCP Server

Selenium MCP Server

A Model Context Protocol (MCP) server implementation for Selenium WebDriver automation using Python. This server allows you to control web browsers programmatically through Claude's desktop application.

Features

  • Browser automation with Chrome and Firefox support
  • Comprehensive set of Selenium WebDriver operations
  • Easy integration with Claude desktop application
  • Support for various browser interactions:
    • Navigation
    • Element interaction (click, send keys, etc.)
    • Screenshot capture
    • Window and iframe handling
    • Local storage management
    • And much more!

Prerequisites

  • Python 3.7+
  • Chrome or Firefox browser installed
  • Claude desktop application

Installation

  1. Clone this repository:
git clone https://github.com/Jyothishkumarav/selenium-mcp-server-python.git cd selenium-mcp-server-python
  1. Install the required dependencies:
pip install -r requirements.txt
  1. Install the MCP server in Claude:
mcp install server.py

Usage

  1. Start the server:
python server.py
  1. In Claude desktop application:
    • Go to Settings > Developer Options
    • You should see "MCP-Selenium" listed as an available server
    • Enable the server to start using Selenium automation features

Available Tools

The server provides the following automation capabilities:

  • Browser Management:
    • Start browser sessions
    • Close sessions
    • Switch between windows
  • Navigation:
    • Navigate to URLs
    • Refresh pages
    • Wait for page loads
  • Element Interaction:
    • Find elements
    • Click elements
    • Send keys
    • Clear fields
    • Double click
    • Right click
  • Element Information:
    • Get element text
    • Get element attributes
    • Check element presence
    • Check element visibility
    • Check element selection
  • Page Operations:
    • Take screenshots
    • Get page content
    • Scroll pages
    • Manage local storage
  • Frame Handling:
    • Switch to iframes
    • Return to default content

Example Usage

Here's a simple example of how to use the server with Claude:

  1. Start a browser session
  2. Navigate to a website
  3. Find and interact with elements
  4. Take screenshots
  5. Close the session

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

-
security - not tested
A
license - permissive license
-
quality - not tested

local-only server

The server can only run on the client's local machine because it depends on local resources.

A server implementation that enables controlling web browsers programmatically through Claude's desktop application, providing comprehensive Selenium WebDriver operations for browser automation with Chrome and Firefox support.

  1. Features
    1. Prerequisites
      1. Installation
        1. Usage
          1. Available Tools
            1. Example Usage
              1. Contributing
                1. License
                  1. Acknowledgments

                    Related MCP Servers

                    • A
                      security
                      A
                      license
                      A
                      quality
                      Enables automation of browser tasks using Playwright by interacting via Claude Desktop for executing user-defined prompts and operations.
                      Last updated -
                      12
                      9
                      TypeScript
                      MIT License
                    • A
                      security
                      F
                      license
                      A
                      quality
                      Enables browser automation for LLMs on Linux display servers, supporting web interaction, screenshots, and JavaScript execution in a real browser.
                      Last updated -
                      7
                      0
                      6
                      JavaScript
                      • Linux
                    • -
                      security
                      F
                      license
                      -
                      quality
                      A web browser automation server that allows AI assistants to control Chrome with persistent state management, enabling complex browsing tasks through asynchronous browser operations.
                      Last updated -
                      2
                      Python
                      • Apple
                    • -
                      security
                      A
                      license
                      -
                      quality
                      A server that integrates with Claude Desktop to enable real-time web research capabilities, allowing users to search Google, extract webpage content, and capture screenshots directly from conversations.
                      Last updated -
                      525
                      MIT License
                      • Apple

                    View all related MCP servers

                    MCP directory API

                    We provide all the information about MCP servers via our MCP API.

                    curl -X GET 'https://glama.ai/api/mcp/v1/servers/Jyothishkumarav/selenium-mcp-server-python'

                    If you have feedback or need assistance with the MCP directory API, please join our Discord server