Supports browser automation for Firefox, allowing navigation, element interaction, and other Selenium WebDriver operations
Provides browser automation capabilities for web scraping and testing, supporting navigation, element interaction, screenshots, iframe handling, and local storage management
Selenium MCP Server
A Model Context Protocol (MCP) server implementation for Selenium WebDriver automation using Python. This server allows you to control web browsers programmatically through Claude's desktop application.
Features
- Browser automation with Chrome and Firefox support
- Comprehensive set of Selenium WebDriver operations
- Easy integration with Claude desktop application
- Support for various browser interactions:
- Navigation
- Element interaction (click, send keys, etc.)
- Screenshot capture
- Window and iframe handling
- Local storage management
- And much more!
Prerequisites
- Python 3.7+
- Chrome or Firefox browser installed
- Claude desktop application
Installation
- Clone this repository:
- Install the required dependencies:
- Install the MCP server in Claude:
Usage
- Start the server:
- In Claude desktop application:
- Go to Settings > Developer Options
- You should see "MCP-Selenium" listed as an available server
- Enable the server to start using Selenium automation features
Available Tools
The server provides the following automation capabilities:
- Browser Management:
- Start browser sessions
- Close sessions
- Switch between windows
- Navigation:
- Navigate to URLs
- Refresh pages
- Wait for page loads
- Element Interaction:
- Find elements
- Click elements
- Send keys
- Clear fields
- Double click
- Right click
- Element Information:
- Get element text
- Get element attributes
- Check element presence
- Check element visibility
- Check element selection
- Page Operations:
- Take screenshots
- Get page content
- Scroll pages
- Manage local storage
- Frame Handling:
- Switch to iframes
- Return to default content
Example Usage
Here's a simple example of how to use the server with Claude:
- Start a browser session
- Navigate to a website
- Find and interact with elements
- Take screenshots
- Close the session
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
This server cannot be installed
A server implementation that enables controlling web browsers programmatically through Claude's desktop application, providing comprehensive Selenium WebDriver operations for browser automation with Chrome and Firefox support.
Related MCP Servers
- AsecurityAlicenseAqualityAn advanced web browsing server enabling headless browser interactions via a secure API, providing features like navigation, content extraction, element interaction, and screenshot capture.Last updated -69PythonMIT License
- AsecurityFlicenseAqualityEnables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript.Last updated -418Python
- -securityAlicense-qualityEnables browser automation using the Selenium WebDriver through MCP, supporting browser management, element location, and both basic and advanced user interactions.Last updated -17521JavaScriptMIT License
- -securityFlicense-qualityA web browser automation server that allows AI assistants to control Chrome with persistent state management, enabling complex browsing tasks through asynchronous browser operations.Last updated -1Python