Enables browser automation with CSS selector-based element targeting for web scraping, form automation, and page interaction workflows.
Provides multi-browser support including Firefox for web automation tasks through Playwright's browser automation capabilities.
Enables web automation interactions with GitHub through browser navigation, form filling, and page interaction capabilities.
Allows execution of custom JavaScript code within browser contexts for advanced web automation and data extraction tasks.
Supports taking screenshots in JPEG format for visual page capture and documentation of web automation workflows.
Provides multi-browser support including Safari for web automation tasks through Playwright's browser automation capabilities.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MCP Macaco Playwrighttake a full page screenshot of the current page and save it as homepage.png"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP Macaco Playwright
Enhanced Playwright Tools for Model Context Protocol (MCP) with Chrome DevTools Protocol (CDP) Support
Overview
MCP Macaco Playwright is a comprehensive browser automation server that provides AI agents with powerful web interaction capabilities through the Model Context Protocol. It combines Playwright's robust browser automation with CDP integration for advanced debugging and control scenarios.
Features
Complete Browser Automation: Navigate, interact, and extract data from web pages
Chrome DevTools Protocol (CDP) Support: Connect to existing browser instances
AI-Optimized: Designed specifically for AI agents and automated workflows
Comprehensive Tool Set: 50+ specialized functions for web automation
Multi-Browser Support: Chrome, Firefox, Safari, and Edge
Screenshot & Snapshot Capabilities: Visual and accessibility-based page capture
Form Automation: Complete form filling and submission workflows
Network Monitoring: Track requests, responses, and console messages
Installation
npm install mcp-macaco-playwrightQuick Start
import { createConnection } from "mcp-macaco-playwright";
// Create MCP server connection
const server = await createConnection();
// Use with MCP client
await client.callTool({
name: "browser_navigate",
arguments: { url: "https://example.com" },
});Function Reference
Navigation Functions
browser_navigate
Navigate to a specific URL.
Parameters:
url(string, required): The URL to navigate to
Example:
await client.callTool({
name: "browser_navigate",
arguments: { url: "https://github.com" },
});browser_navigate_back
Go back to the previous page in browser history.
Parameters: None
Example:
await client.callTool({
name: "browser_navigate_back",
arguments: {},
});browser_navigate_forward
Go forward to the next page in browser history.
Parameters: None
Example:
await client.callTool({
name: "browser_navigate_forward",
arguments: {},
});Page Analysis Functions
browser_snapshot
Capture an accessibility snapshot of the current page for analysis and interaction.
Parameters: None
Example:
await client.callTool({
name: "browser_snapshot",
arguments: {},
});browser_take_screenshot
Take a visual screenshot of the page or specific element.
Parameters:
type(string, optional): Image format ('png' or 'jpeg', default: 'png')filename(string, optional): Custom filename for the screenshotelement(string, optional): Human-readable element descriptionref(string, optional): Element reference from snapshotfullPage(boolean, optional): Capture full scrollable page
Example:
await client.callTool({
name: "browser_take_screenshot",
arguments: {
type: "png",
fullPage: true,
filename: "homepage.png",
},
});Element Interaction Functions
browser_click
Click on a specific element on the page.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_click",
arguments: {
element: "Sign in button",
ref: "button-signin-123",
},
});browser_double_click
Perform a double-click on an element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_double_click",
arguments: {
element: "File icon",
ref: "file-icon-456",
},
});browser_right_click
Perform a right-click to open context menu.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_right_click",
arguments: {
element: "Image thumbnail",
ref: "img-thumb-789",
},
});Text Input Functions
browser_type
Type text into an editable element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshottext(string, required): Text to typesubmit(boolean, optional): Press Enter after typingslowly(boolean, optional): Type character by character
Example:
await client.callTool({
name: "browser_type",
arguments: {
element: "Search input field",
ref: "search-input-123",
text: "playwright automation",
submit: true,
},
});browser_press_key
Press a specific key on the keyboard.
Parameters:
key(string, required): Key name (e.g., 'Enter', 'ArrowLeft', 'a')
Example:
await client.callTool({
name: "browser_press_key",
arguments: { key: "Escape" },
});Form Functions
browser_select_option
Select options in a dropdown menu.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotvalues(array, required): Array of values to select
Example:
await client.callTool({
name: "browser_select_option",
arguments: {
element: "Country dropdown",
ref: "country-select-456",
values: ["United States"],
},
});browser_check_checkbox
Check or uncheck a checkbox element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotchecked(boolean, required): Whether to check (true) or uncheck (false)
Example:
await client.callTool({
name: "browser_check_checkbox",
arguments: {
element: "Terms and conditions checkbox",
ref: "terms-checkbox-789",
checked: true,
},
});browser_select_radio
Select a radio button.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_select_radio",
arguments: {
element: "Payment method: Credit Card",
ref: "payment-radio-cc",
},
});browser_clear_input
Clear the content of an input field.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_clear_input",
arguments: {
element: "Email input field",
ref: "email-input-123",
},
});Data Extraction Functions
browser_get_text
Extract text content or attribute values from elements.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotattribute(string, optional): Specific attribute to extract (e.g., 'href', 'src')
Example:
await client.callTool({
name: "browser_get_text",
arguments: {
element: "Product price",
ref: "price-display-456",
},
});browser_get_elements
Get multiple elements matching a selector.
Parameters:
selector(string, required): CSS selector to find elementsattribute(string, optional): Attribute to extract from each element
Example:
await client.callTool({
name: "browser_get_elements",
arguments: {
selector: ".product-card h3",
attribute: "textContent",
},
});Scrolling and Focus Functions
browser_scroll_to
Scroll to a specific element or coordinate position.
Parameters:
element(string, optional): Human-readable element descriptionref(string, optional): Element reference from page snapshotx(number, optional): X coordinate to scroll toy(number, optional): Y coordinate to scroll tobehavior(string, optional): Scroll behavior ('auto' or 'smooth')
Example:
await client.callTool({
name: "browser_scroll_to",
arguments: {
element: "Footer section",
ref: "footer-section-123",
behavior: "smooth",
},
});browser_focus_element
Set focus on a specific element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_focus_element",
arguments: {
element: "Search input",
ref: "search-input-456",
},
});browser_blur_element
Remove focus from a specific element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: "browser_blur_element",
arguments: {
element: "Email input",
ref: "email-input-789",
},
});Wait Functions
browser_wait_for
Wait for specific conditions to be met.
Parameters:
time(number, optional): Time to wait in secondstext(string, optional): Text to wait for to appeartextGone(string, optional): Text to wait for to disappear
Example:
await client.callTool({
name: "browser_wait_for",
arguments: {
text: "Loading complete",
time: 5,
},
});Tab Management Functions
browser_tab_list
List all open browser tabs.
Parameters: None
Example:
await client.callTool({
name: "browser_tab_list",
arguments: {},
});browser_tab_new
Open a new browser tab.
Parameters:
url(string, optional): URL to navigate to in the new tab
Example:
await client.callTool({
name: "browser_tab_new",
arguments: { url: "https://example.com" },
});browser_tab_close
Close a browser tab.
Parameters:
index(number, optional): Index of tab to close (closes current if not specified)
Example:
await client.callTool({
name: "browser_tab_close",
arguments: { index: 1 },
});browser_tab_select
Switch to a specific tab by index.
Parameters:
index(number, required): Index of the tab to select
Example:
await client.callTool({
name: "browser_tab_select",
arguments: { index: 0 },
});Network and Console Functions
browser_network_requests
Get all network requests made since page load.
Parameters: None
Example:
await client.callTool({
name: "browser_network_requests",
arguments: {},
});browser_console_messages
Get all console messages from the page.
Parameters: None
Example:
await client.callTool({
name: "browser_console_messages",
arguments: {},
});Chrome DevTools Protocol (CDP) Functions
browser_connect_cdp
Connect to an existing browser instance via CDP.
Parameters:
endpoint(string, required): CDP endpoint URLtimeout(number, optional): Connection timeout in milliseconds (default: 30000)
Example:
await client.callTool({
name: "browser_connect_cdp",
arguments: {
endpoint: "ws://localhost:9222",
timeout: 30000,
},
});browser_get_cdp_endpoints
Discover available CDP endpoints from running browsers.
Parameters:
port(number, optional): CDP port to check (default: 9222)host(string, optional): Host to check (default: 'localhost')
Example:
await client.callTool({
name: "browser_get_cdp_endpoints",
arguments: {
port: 9222,
host: "localhost",
},
});browser_disconnect_cdp
Disconnect from the current CDP connection.
Parameters: None
Example:
await client.callTool({
name: "browser_disconnect_cdp",
arguments: {},
});JavaScript Evaluation Functions
browser_evaluate
Execute JavaScript code in the browser context.
Parameters:
script(string, required): JavaScript code to execute
Example:
await client.callTool({
name: "browser_evaluate",
arguments: {
script: "document.title",
},
});Dialog Handling Functions
browser_handle_dialog
Handle browser dialogs (alert, confirm, prompt).
Parameters:
action(string, required): Action to take ('accept' or 'dismiss')text(string, optional): Text to enter for prompt dialogs
Example:
await client.callTool({
name: "browser_handle_dialog",
arguments: {
action: "accept",
text: "User input",
},
});File Functions
browser_upload_file
Upload files to file input elements.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotfiles(array, required): Array of file paths to upload
Example:
await client.callTool({
name: "browser_upload_file",
arguments: {
element: "File upload input",
ref: "file-input-123",
files: ["/path/to/document.pdf"],
},
});PDF Functions
browser_save_pdf
Save the current page as a PDF.
Parameters:
filename(string, optional): Custom filename for the PDFformat(string, optional): Page format (e.g., 'A4', 'Letter')landscape(boolean, optional): Use landscape orientation
Example:
await client.callTool({
name: "browser_save_pdf",
arguments: {
filename: "report.pdf",
format: "A4",
landscape: false,
},
});Configuration
The MCP server can be configured with various options:
const config = {
browser: {
headless: false,
viewport: { width: 1280, height: 720 },
cdpEndpoint: "ws://localhost:9222", // Optional CDP connection
},
capabilities: ["core", "vision"], // Enable specific tool capabilities
};
const server = await createConnection(config);Common Usage Patterns
Web Scraping Workflow
// Navigate to page
await client.callTool({
name: "browser_navigate",
arguments: { url: "https://example.com" },
});
// Take snapshot to analyze page structure
await client.callTool({
name: "browser_snapshot",
arguments: {},
});
// Extract data from elements
await client.callTool({
name: "browser_get_text",
arguments: {
element: "Product title",
ref: "product-title-123",
},
});Form Automation Workflow
// Navigate to form page
await client.callTool({
name: "browser_navigate",
arguments: { url: "https://example.com/contact" },
});
// Fill form fields
await client.callTool({
name: "browser_type",
arguments: {
element: "Name input",
ref: "name-input-123",
text: "John Doe",
},
});
// Submit form
await client.callTool({
name: "browser_click",
arguments: {
element: "Submit button",
ref: "submit-btn-456",
},
});CDP Integration Workflow
// Connect to existing browser
await client.callTool({
name: "browser_connect_cdp",
arguments: { endpoint: "ws://localhost:9222" },
});
// Work with existing tabs
await client.callTool({
name: "browser_tab_list",
arguments: {},
});
// Disconnect when done
await client.callTool({
name: "browser_disconnect_cdp",
arguments: {},
});Troubleshooting
CDP Connection Issues
If you encounter CDP connection problems:
Start Chrome with debugging enabled:
google-chrome --remote-debugging-port=9222 --remote-debugging-address=0.0.0.0Verify the endpoint is accessible:
curl http://localhost:9222/json/versionCheck for firewall or network restrictions
Browser Launch Issues
Ensure Playwright browsers are installed:
npx playwright installCheck system dependencies:
npx playwright install-depsVerify sufficient system resources for browser instances
License
Apache License 2.0 - see LICENSE file for details.
Contributing
Contributions are welcome! Please see the contributing guidelines for more information.
Support
For issues and questions:
GitHub Issues: https://github.com/macacoai/mcp-playwright/issues
Email: gaston@macaco.ai