Provides comprehensive browser automation capabilities including navigation, content extraction, form filling, screenshot capture, and JavaScript execution through Playwright's Firefox engine.
Browser MCP Server
A Model Context Protocol (MCP) server that provides comprehensive browser automation capabilities using Playwright. This server enables AI assistants to interact with web pages through standardized MCP tools for navigation, content extraction, form filling, and screenshot capture.
π Features
Core Browser Operations
Navigate to URLs with smart waiting strategies
Extract page content with customizable selectors
Take screenshots (full page, viewport, or specific elements)
Execute JavaScript with result capture
Click elements by CSS selectors
Fill forms automatically with validation
Advanced Capabilities
Multi-browser support (Chromium, Firefox, WebKit)
Request interception and monitoring
Viewport customization and responsive testing
Link extraction and URL processing
Error handling with detailed responses
Resource management and cleanup
Related MCP server: Playwright Server MCP
π¦ Installation
Prerequisites
Python 3.8 or higher
Node.js (for Playwright browser installation)
Install from Source
Install from PyPI (when available)
π Usage
As MCP Server
Start the server with stdio transport:
Configuration
Configure the browser through environment variables:
MCP Client Integration
Add to your MCP client configuration:
π§ Available Tools
navigate_to
Navigate to a specified URL with optional waiting.
get_page_content
Extract text content from the current page.
click_element
Click on elements by CSS selector.
fill_form
Fill form fields with data.
take_screenshot
Capture page screenshots.
execute_javascript
Run JavaScript in the browser context.
π Project Structure
π Architecture
Server (server.py)
MCP server implementation with tool registration
Request routing and response formatting
Error handling and logging
Async tool execution
Browser Manager (browser.py)
Playwright browser lifecycle management
Context creation and configuration
Resource cleanup and recovery
Multi-browser support
Actions (actions.py)
High-level browser automation methods
Content extraction and processing
Form interaction and validation
Screenshot and JavaScript execution
Utils (utils.py)
HTML sanitization and cleaning
URL validation and normalization
Image processing and encoding
Data formatting utilities
π Security Considerations
HTML sanitization removes dangerous scripts and attributes
URL validation prevents malicious redirects
Input validation for all user-provided data
Resource limits prevent excessive memory usage
Timeout controls prevent hanging operations
π³ Docker Deployment
Quick Start with Docker
Production Deployment
Development with Docker
Container Management
π¨ Error Handling
The server provides detailed error responses with:
Error categorization (timeout, validation, execution)
Context information (URL, selector, arguments)
Recovery suggestions where applicable
Logging for debugging and monitoring
π Response Format
All tools return standardized JSON responses:
Error responses include:
π‘ Environment Variables
Variable | Default | Description |
|
| Run browser in headless mode |
|
| Browser engine to use |
|
| Default timeout (ms) |
π€ Development
Setting up Development Environment
Adding New Tools
Define tool schema in
server.pyImplement action method in
actions.pyAdd utility functions in
utils.pyUpdate documentation and tests
π License
MIT License - see LICENSE file for details.
π Acknowledgments
Playwright for browser automation
MCP for the protocol specification
Anthropic for Claude and MCP development
π Support
Issues: Report bugs and request features on GitHub
Documentation: See inline code documentation
Community: Join MCP community discussions
Note: This is a foundational implementation. Additional features like request interception, advanced form handling, and performance optimizations can be added based on specific use cases.