Skip to main content
Glama

Selenium MCP Server

MIT License
2

Selenium MCP Server

A Model Context Protocol (MCP) server implementation for Selenium WebDriver automation using Python. This server allows you to control web browsers programmatically through Claude's desktop application.

Features

  • Browser automation with Chrome and Firefox support
  • Comprehensive set of Selenium WebDriver operations
  • Easy integration with Claude desktop application
  • Support for various browser interactions:
    • Navigation
    • Element interaction (click, send keys, etc.)
    • Screenshot capture
    • Window and iframe handling
    • Local storage management
    • And much more!

Prerequisites

  • Python 3.7+
  • Chrome or Firefox browser installed
  • Claude desktop application

Installation

  1. Clone this repository:
git clone https://github.com/Jyothishkumarav/selenium-mcp-server-python.git cd selenium-mcp-server-python
  1. Install the required dependencies:
pip install -r requirements.txt
  1. Install the MCP server in Claude:
mcp install server.py

Usage

  1. Start the server:
python server.py
  1. In Claude desktop application:
    • Go to Settings > Developer Options
    • You should see "MCP-Selenium" listed as an available server
    • Enable the server to start using Selenium automation features

Available Tools

The server provides the following automation capabilities:

  • Browser Management:
    • Start browser sessions
    • Close sessions
    • Switch between windows
  • Navigation:
    • Navigate to URLs
    • Refresh pages
    • Wait for page loads
  • Element Interaction:
    • Find elements
    • Click elements
    • Send keys
    • Clear fields
    • Double click
    • Right click
  • Element Information:
    • Get element text
    • Get element attributes
    • Check element presence
    • Check element visibility
    • Check element selection
  • Page Operations:
    • Take screenshots
    • Get page content
    • Scroll pages
    • Manage local storage
  • Frame Handling:
    • Switch to iframes
    • Return to default content

Example Usage

Here's a simple example of how to use the server with Claude:

  1. Start a browser session
  2. Navigate to a website
  3. Find and interact with elements
  4. Take screenshots
  5. Close the session

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

-
security - not tested
A
license - permissive license
-
quality - not tested

A server implementation that enables controlling web browsers programmatically through Claude's desktop application, providing comprehensive Selenium WebDriver operations for browser automation with Chrome and Firefox support.

  1. Features
    1. Prerequisites
      1. Installation
        1. Usage
          1. Available Tools
            1. Example Usage
              1. Contributing
                1. License
                  1. Acknowledgments

                    Related MCP Servers

                    • A
                      security
                      A
                      license
                      A
                      quality
                      An advanced web browsing server enabling headless browser interactions via a secure API, providing features like navigation, content extraction, element interaction, and screenshot capture.
                      Last updated -
                      6
                      9
                      Python
                      MIT License
                    • A
                      security
                      F
                      license
                      A
                      quality
                      Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript.
                      Last updated -
                      4
                      18
                      Python
                      • Linux
                    • -
                      security
                      A
                      license
                      -
                      quality
                      Enables browser automation using the Selenium WebDriver through MCP, supporting browser management, element location, and both basic and advanced user interactions.
                      Last updated -
                      175
                      21
                      JavaScript
                      MIT License
                    • -
                      security
                      F
                      license
                      -
                      quality
                      A web browser automation server that allows AI assistants to control Chrome with persistent state management, enabling complex browsing tasks through asynchronous browser operations.
                      Last updated -
                      1
                      Python
                      • Apple

                    View all related MCP servers

                    MCP directory API

                    We provide all the information about MCP servers via our MCP API.

                    curl -X GET 'https://glama.ai/api/mcp/v1/servers/Jyothishkumarav/selenium-mcp-server-python'

                    If you have feedback or need assistance with the MCP directory API, please join our Discord server