Selenium MCP Server

A powerful Model Context Protocol (MCP) server that brings Selenium WebDriver automation to AI assistants. This server enables AI tools like Claude Desktop and Cursor AI to control web browsers programmatically, making web automation accessible through natural language commands.

📚 Table of Contents

🚀 Features

Multi-browser support: Chrome and Firefox (headless or full mode)
Session management: Start, list, switch, and close browser sessions
Navigation: Go to URLs, reload, and retrieve page info
Element interaction: Find, click, type, hover, drag-and-drop, double/right click, upload files
Advanced actions: Execute JavaScript, take screenshots, get/set element text
File operations: Upload files, download files, take full-page screenshots
Robust error handling
Easy integration with any MCP-compatible client (Cursor AI, Claude Desktop, Google Gemini, etc.)
PEP 517/518 compliant packaging

⚡ Quick Start

pip install selenium-mcp-server
python -m selenium_mcp_server

Add this to your MCP client config (e.g., Cursor AI):

{
  "mcpServers": {
    "selenium": {
      "command": "python",
      "args": ["-m", "selenium_mcp_server"]
    }
  }
}

🏁 Getting Started

Install
pip install selenium-mcp-server
Run the server
python -m selenium_mcp_server
Connect your MCP client (see config above)

Windows users: If you see a .exe.deleteme error, delete any selenium-mcp-server.exe or .exe.deleteme files in your Python Scripts directory, then retry the install. You can always run the server with python -m selenium_mcp_server.

🤖 Client Integration

Cursor AI: ~/.cursor/mcp_config.json
Claude Desktop: ~/.config/claude-desktop/config.json (Linux/macOS) or %APPDATA%\claude-desktop\config.json (Windows)
Other MCP Clients: See your client's documentation

🛠️ Available Tools & Examples

Browser Management

Start Browser
{ "browser": "chrome", "options": { "headless": true } }
List Sessions
{ "name": "list_sessions", "arguments": {} }
Switch Session
{ "session_id": "your-session-id" }
Close Session
{ "session_id": "your-session-id" }

Navigate
{ "url": "https://example.com", "wait_for_load": true }
Get Page Info
{ "include_title": true, "include_url": true, "include_source": false }

Element Interaction

Find Element
{ "by": "css", "value": "#my-element" }
Click Element
{ "by": "css", "value": "#my-button", "force_click": true }
Send Keys
{ "by": "css", "value": "#input", "text": "hello", "clear_first": true }
Get Element Text
{ "by": "css", "value": "#output" }
Hover
{ "by": "css", "value": "#hover-target" }
Drag and Drop
{ "by": "css", "value": "#source", "targetBy": "css", "targetValue": "#target" }
Double Click / Right Click
{ "by": "css", "value": "#element" }
Press Key
{ "key": "Enter" }
Upload File
{ "by": "css", "value": "input[type='file']", "filePath": "C:/Users/YourName/file.txt" }
Wait for Element
{ "by": "css", "value": "#wait-for-me", "wait_for_visible": true }

Advanced Actions

Take Screenshot
{ "full_page": true }
Execute Script
{ "script": "return document.title;" }

📊 Example Automation Flow

⚙️ Advanced Configuration

You can configure the Selenium MCP Server in several ways:

Option 1: Installed Package (Recommended)

{
  "mcpServers": {
    "selenium": {
      "command": "python",
      "args": ["-m", "selenium_mcp_server"],
      "env": {
        "PYTHONUNBUFFERED": "1"
      }
    }
  }
}

Option 2: Direct File Execution

Windows:
{ "mcpServers": { "selenium": { "command": "python", "args": ["C:\\path\\to\\selenium-mcp-server\\src\\selenium_mcp_server.py"], "env": { "PYTHONPATH": "C:\\path\\to\\selenium-mcp-server\\src", "PYTHONUNBUFFERED": "1" } } } }
macOS/Linux:
{ "mcpServers": { "selenium": { "command": "python", "args": ["/path/to/selenium-mcp-server/src/selenium_mcp_server.py"], "env": { "PYTHONPATH": "/path/to/selenium-mcp-server/src", "PYTHONUNBUFFERED": "1" } } } }

Option 3: Console Script

{
  "mcpServers": {
    "selenium": {
      "command": "selenium-mcp-server"
    }
  }
}

🌐 Environment Variables

PYTHONUNBUFFERED=1: Ensures Python output is not buffered
SELENIUM_LOG_LEVEL=INFO: Sets logging level (DEBUG, INFO, WARNING, ERROR)
PYTHONPATH: Points to the directory containing the Python modules (needed for direct file execution)

🧪 Testing Your Configuration

After configuring, test with:

{
  "name": "list_sessions",
  "arguments": {}
}

You should get an empty list if no sessions are active.

❓ FAQ / Troubleshooting

Q: I see "0 tools enabled" in Cursor AI.

Make sure the package is installed: pip install selenium-mcp-server (or pip install -e . for development)
Verify the module works:
python -c "import selenium_mcp_server; print('Module found!')"
Check if the entry point works:
selenium-mcp-server --help
Try using the console script entry point in your config.

Q: "Module not found" errors

Make sure you've installed the package: pip install selenium-mcp-server or pip install -e .
Check that the PYTHONPATH points to the correct directory if running from source
Verify the file paths are correct for your system

Q: "Command not found" errors

Ensure Python is in your system PATH
Try using the full path to Python: C:\Python312\python.exe (Windows) or /usr/bin/python3 (Linux/macOS)

Q: Permission errors

On Windows, try running your MCP client as administrator
On Linux/macOS, check file permissions: chmod +x src/selenium_mcp_server.py

Q: I get a .exe.deleteme error on Windows when upgrading.

Close all terminals, delete any selenium-mcp-server.exe or .exe.deleteme files in your Python Scripts directory, and retry the install. You can always run the server with python -m selenium_mcp_server.

Q: How do I check the server version?

The server prints its version on startup. You can also check with pip show selenium-mcp-server.

Q: Can I use this with headless browsers?

Yes! The server supports both headless and full browser modes.

Q: How do I contribute or report issues?

See the Contributing section below.

🤝 Contributing

Fork the repo and create a feature branch
Make your changes (see src/selenium_mcp_server/)
Add/modify tests if needed
Open a pull request with a clear description
For issues, use the GitHub Issues tab

🗒️ Changelog

1.1.6: Improved error handling, updated dependencies, enhanced Windows compatibility
1.1.5: Simplified packaging, removed legacy scripts, improved docs
1.1.4 and earlier: Initial releases, core MCP and Selenium features

Project Structure

All source code is in src/selenium_mcp_server/
No unnecessary files or scripts in the root or src directory
The package is fully PEP 517/518 compliant and ready for PyPI distribution

License

MIT License - feel free to use this in your own projects.

Contact & Support

For questions, open a GitHub Issue
For discussions, feature requests, or help, use the GitHub Discussions or Issues
Maintained by Krishna Pollu

Note: This server is designed for legitimate automation tasks. Please respect websites' terms of service and robots.txt files when using this tool.

This server cannot be installed

security - not tested

license - permissive license

quality - not tested

How are these scores calculated?

A Model Context Protocol server that enables AI assistants to control web browsers programmatically, allowing for web automation through natural language commands.

Related MCP Servers

browser-use MCP server
Deploya-labs
A
security
A
license
A
quality
AI-driven browser automation server that implements the Model Context Protocol to enable natural language control of web browsers for tasks like navigation, form filling, and visual interaction.
Last updated -
1
1
Python
MIT License
Cloudflare Playwright MCP
bmoir23
-
security
F
license
-
quality
A Model Control Protocol server that enables AI assistants to control a browser through tools for web automation tasks like navigation, typing, clicking, and taking screenshots.
Last updated -
TypeScript
Cloudflare Playwright MCP
kathayl
-
security
F
license
-
quality
A Model Control Protocol server that enables AI assistants to control a browser through tools for web automation tasks like navigation, typing, clicking, and taking screenshots.
Last updated -
TypeScript
Puppeteer Real Browser MCP Server
withLinda
-
security
F
license
-
quality
A Model Context Protocol server that enables AI assistants to control a real web browser with stealth capabilities, avoiding bot detection while performing tasks like clicking, filling forms, taking screenshots, and extracting data.
Last updated -
101
9
TypeScript

View all related MCP servers

Selenium MCP Server