Supports browser automation for Firefox, allowing navigation, element interaction, and other Selenium WebDriver operations
Provides browser automation capabilities for web scraping and testing, supporting navigation, element interaction, screenshots, iframe handling, and local storage management
Selenium MCP Server
A Model Context Protocol (MCP) server implementation for Selenium WebDriver automation using Python. This server allows you to control web browsers programmatically through Claude's desktop application.
Features
- Browser automation with Chrome and Firefox support
- Comprehensive set of Selenium WebDriver operations
- Easy integration with Claude desktop application
- Support for various browser interactions:
- Navigation
- Element interaction (click, send keys, etc.)
- Screenshot capture
- Window and iframe handling
- Local storage management
- And much more!
Prerequisites
- Python 3.7+
- Chrome or Firefox browser installed
- Claude desktop application
Installation
- Clone this repository:
- Install the required dependencies:
- Install the MCP server in Claude:
Usage
- Start the server:
- In Claude desktop application:
- Go to Settings > Developer Options
- You should see "MCP-Selenium" listed as an available server
- Enable the server to start using Selenium automation features
Available Tools
The server provides the following automation capabilities:
- Browser Management:
- Start browser sessions
- Close sessions
- Switch between windows
- Navigation:
- Navigate to URLs
- Refresh pages
- Wait for page loads
- Element Interaction:
- Find elements
- Click elements
- Send keys
- Clear fields
- Double click
- Right click
- Element Information:
- Get element text
- Get element attributes
- Check element presence
- Check element visibility
- Check element selection
- Page Operations:
- Take screenshots
- Get page content
- Scroll pages
- Manage local storage
- Frame Handling:
- Switch to iframes
- Return to default content
Example Usage
Here's a simple example of how to use the server with Claude:
- Start a browser session
- Navigate to a website
- Find and interact with elements
- Take screenshots
- Close the session
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
This server cannot be installed
local-only server
The server can only run on the client's local machine because it depends on local resources.
A server implementation that enables controlling web browsers programmatically through Claude's desktop application, providing comprehensive Selenium WebDriver operations for browser automation with Chrome and Firefox support.
Related MCP Servers
- AsecurityAlicenseAqualityEnables automation of browser tasks using Playwright by interacting via Claude Desktop for executing user-defined prompts and operations.Last updated -129TypeScriptMIT License
- AsecurityFlicenseAqualityEnables browser automation for LLMs on Linux display servers, supporting web interaction, screenshots, and JavaScript execution in a real browser.Last updated -706JavaScript
- -securityFlicense-qualityA web browser automation server that allows AI assistants to control Chrome with persistent state management, enabling complex browsing tasks through asynchronous browser operations.Last updated -2Python
- -securityAlicense-qualityA server that integrates with Claude Desktop to enable real-time web research capabilities, allowing users to search Google, extract webpage content, and capture screenshots directly from conversations.Last updated -525MIT License