Question 1

What can you do with this server?

Accepted Answer

The WebDriverIO MCP Server enables AI assistants to automate web browsers and mobile applications through a unified interface.

Browser Automation

* Launch and manage sessions with Chrome, Firefox, Edge, or Safari (headed/headless, custom dimensions)
* Attach to existing Chrome instances via remote debugging port
* Navigate URLs, click elements, fill forms, scroll pages, and capture screenshots
* Retrieve visible/interactable elements and accessibility tree for page analysis
* Manage cookies (get, set, delete) with full attribute control
* Emulate mobile/tablet devices (iPhone 15, Pixel 7, etc.) with viewport, DPR, user-agent, and touch event emulation
* Execute arbitrary JavaScript in the browser context

Mobile App Automation (iOS/Android via Appium)

* Start and manage native app sessions (.app/.ipa/.apk) with configurable state preservation (noReset/fullReset)
* Perform touch gestures: tap, swipe, drag-and-drop
* Control app lifecycle and check app state (installed, running, background, foreground)
* Switch between native and webview contexts for hybrid app testing
* Control device orientation, keyboard visibility, and GPS geolocation
* Support diverse selectors: CSS, XPath, Accessibility ID, UiAutomator (Android), iOS Predicates
* Automatically grant permissions and handle system alerts
* Execute platform-specific mobile commands

Session Management & Recording

* Maintain one active session at a time; close or detach while preserving state
* All tool calls are automatically recorded and exportable as runnable WebDriverIO JavaScript scripts via MCP resources (wdio://sessions, wdio://session/current/steps)

Prerequisites for Mobile: A running Appium server with platform-specific drivers (XCUITest for iOS, UiAutomator2/Espresso for Android) and configured devices/emulators.

Question 2

Which integrations are available for this server?

Accepted Answer

Enables automated testing of Android applications (.apk) on emulators and physical devices with support for UiAutomator selectors, device-specific gestures, key codes, and system interactions like notifications and keyboard control.

Provides mobile app automation for iOS and Android applications with native app testing, touch gestures, app lifecycle management, context switching for hybrid apps, device control, and cross-platform element selection.

Enables automated testing of iOS applications (.app/.ipa) on simulators and physical devices with XCUITest support, iOS Predicate selectors, and iOS-specific features like device shake functionality.

Enables browser automation for Chrome with session management, navigation, element interaction, cookie management, screenshot capture, and accessibility tree analysis in both headless and headed modes.

Question 3

How do I use WebDriverIO MCP Server?

Accepted Answer

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@WebDriverIO MCP Server start a browser session and navigate to example.com"

That's it! The server will respond to your query, and you can continue using it as needed.

Here is a step-by-step guide with screenshots.