Control mobile devices (Android, iOS, Aurora OS) and desktop applications through natural language commands for automation, testing, and interaction.
Device & Platform Management: List connected devices across Android (via ADB), iOS Simulator (via simctl+WebDriverAgent), Desktop (Compose Multiplatform), and Aurora OS (via audb). Set active device and get system information.
UI Inspection & Interaction: Take screenshots (standard, auto-compressed, or annotated with numbered bounding boxes). Get UI hierarchy/accessibility tree. Tap, long press, and swipe by coordinates or element identification (text, resource ID, class name, index). Type text, press hardware buttons (BACK, HOME, VOLUME, etc.), and find/wait for elements.
App Management: Launch, stop, install apps (APK/RPM/.app bundles). Get current activity, list installed apps, manage permissions (grant/revoke/reset), and open URLs in device browsers.
System Operations: Execute shell commands, access device logs (logcat/system logs), inspect WebViews, and perform assertions for testing.
Desktop-Specific: Window management (resize, focus), clipboard operations, and performance metrics (CPU/memory).
Aurora OS-Specific: Push/pull files to/from device.
Efficiency: Batch commands for single round-trip execution. Available as Claude Code Plugin, npm package, or standalone native Rust CLI for CI/CD integration.
Enables mobile device automation for Android devices and emulators via ADB, including screenshot capture, UI interactions (tap, swipe, long press), text input, app control (launch, stop, install), hardware button presses, and UI hierarchy inspection.
Enables mobile device automation for iOS Simulator via simctl, including screenshot capture, UI interactions (tap, swipe, long press), text input, app control (launch, stop, install), and URL handling.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Claude Mobiletake a screenshot of my android phone"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Claude Mobile
MCP server for mobile and desktop automation — Android (via ADB), iOS Simulator (via simctl), Desktop (Compose Multiplatform), and Aurora OS (via audb). Like Claude in Chrome but for mobile devices and desktop apps.
Control your Android phone, emulator, iOS Simulator, Desktop applications, or Aurora OS device with natural language through Claude.
Features
Unified API — Same commands work for Android, iOS, Desktop, and Aurora OS
Smart screenshots — Auto-compressed for optimal LLM processing (no more oversized images!)
Annotated screenshots — Screenshots with colored bounding boxes and numbered element labels for visual UI understanding
Permission management — Grant, revoke, and reset app permissions programmatically (Android runtime permissions, iOS privacy services)
Device logs — Read logcat/system logs with filters for debugging
UI interactions — Tap, long press, swipe by coordinates or element text
Text input — Type into focused fields
App control — Launch, stop, and install apps
Platform selection — Explicitly target Android, iOS, Desktop, or Aurora OS
Desktop support — Test Compose Multiplatform desktop apps with window management, clipboard, and performance metrics
Installation
Claude Code CLI (recommended)
To add globally (available in all projects):
From npm
From source
Note: For Desktop support, you need to run
npm run build:desktop(orbuild:all) to compile the Desktop companion app.
Manual configuration
Add to your Claude Code settings (~/.claude.json or project settings):
Windows
Requirements
Android
ADB installed and in PATH
Connected Android device (USB debugging enabled) or emulator
iOS
macOS with Xcode installed
iOS Simulator (no physical device support yet)
WebDriverAgent for full UI inspection and element-based interaction:
npm install -g appium appium driver install xcuitestOr set
WDA_PATHenvironment variable to custom WebDriverAgent location
Desktop
macOS (Windows/Linux support planned)
JDK 17+ for building the Desktop companion
Compose Multiplatform desktop application to test
Aurora OS
audb CLI installed and in PATH (
cargo install audb-client)Connected Aurora OS device with SSH enabled
Python on device required for tap/swipe:
devel-su pkcon install python
Available Tools
Core Tools (All Platforms)
Tool | Android | iOS | Desktop | Aurora | Description |
| ✅ | ✅ | ✅ | ✅ | List all connected devices |
| ✅ | ✅ | ✅ | ✅ | Select active device |
| ✅ | ✅ | ✅ | ✅ | Take screenshot |
| ✅ | ✅ | ✅ | ⚠️ | Tap at coordinates or by text/label (iOS: WDA required for element tap) |
| ✅ | ✅ | ✅ | ✅ | Long press gesture |
| ✅ | ✅ | ✅ | ⚠️ | Swipe in direction or coordinates (requires Python on Aurora) |
| ✅ | ✅ | ✅ | ❌ | Type text |
| ✅ | ✅ | ✅ | ✅ | Press hardware buttons |
| ✅ | ✅ | ❌ | ✅ | Launch app |
| ✅ | ✅ | ❌ | ✅ | Stop app |
| ✅ | ✅ | ❌ | ✅ | Install APK/.app/.rpm |
| ❌ | ❌ | ❌ | ✅ | List installed apps (Aurora only) |
| ✅ | ✅ | ✅ | ❌ | Get UI hierarchy (iOS: requires WebDriverAgent) |
| ✅ | ✅ | ✅ | ❌ | Find elements by text/id/label (iOS: requires WebDriverAgent) |
| ✅ | ✅ | ❌ | ❌ | Screenshot with colored bounding boxes and numbered element labels |
| ✅ | ✅ | ❌ | ❌ | Grant app permission (Android: runtime, iOS: privacy service) |
| ✅ | ✅ | ❌ | ❌ | Revoke app permission |
| ✅ | ✅ | ❌ | ❌ | Reset all permissions for an app |
| ✅ | ❌ | ❌ | ❌ | Get foreground activity |
| ✅ | ✅ | ❌ | ❌ | Open URL in browser (not yet implemented on Aurora) |
| ✅ | ✅ | ❌ | ✅ | Run shell command |
| ✅ | ✅ | ✅ | ✅ | Wait for duration |
| ✅ | ✅ | ❌ | ✅ | Get device logs (logcat/system log) |
| ✅ | ⚠️ | ❌ | ✅ | Clear log buffer |
| ✅ | ❌ | ❌ | ✅ | Battery, memory info |
| ✅ | ✅ | ❌ | ❌ | Wait for element to appear (polling + timeout) |
| ✅ | ✅ | ❌ | ❌ | Assert element is visible (pass/fail) |
| ✅ | ✅ | ❌ | ❌ | Assert element does NOT exist (pass/fail) |
| ✅ | ✅ | ✅ | ✅ | Execute multiple commands in single round-trip |
| ✅ | ❌ | ❌ | ❌ | Inspect WebView via Chrome DevTools Protocol |
| ❌ | ❌ | ❌ | ✅ | Upload file (Aurora only) |
| ❌ | ❌ | ❌ | ✅ | Download file (Aurora only) |
Desktop-Specific Tools
Tool | Description |
| Set target platform (android/ios/desktop) |
| Get current target platform |
| Launch a Compose Desktop application |
| Stop the running desktop application |
| Get desktop window position and size |
| Bring desktop window to front |
| Resize desktop window |
| Get system clipboard content |
| Set system clipboard content |
| Get CPU/memory usage of desktop app |
For detailed Desktop API documentation, see Desktop Specification
Usage Examples
Just talk to Claude naturally:
Permission Management
Annotated Screenshots
Platform Selection
You can explicitly specify the platform:
Or set the active device:
Desktop Examples
Aurora Examples
Native CLI
A 2 MB native Rust binary with all the same commands. No Node.js, no dependencies.
Install CLI
Or download from Releases.
Advantages over MCP
Easy install —
brew installor copy a single 2 MB binaryNo dependencies — no Node.js, no npm, nothing
Use from terminal — run commands directly, no Claude Code or MCP client needed
Test automation — write universal
.shscripts for any platform without learning platform internalsToken-efficient — skill documentation loads only when used; MCP loads all tool schemas into every request, which adds up fast over a session
Fast — ~5ms command startup (Rust) vs ~500ms (Node.js MCP)
CI/CD ready — exit codes, stdout/stderr, runs anywhere
Test script example
Claude Code Plugin
After installing, Claude Code controls devices with natural language. The skill loads into context only on demand — no token overhead when not in use.
See cli/README.md for full CLI documentation.
iOS WebDriverAgent Setup
For full iOS UI inspection and element-based interaction, WebDriverAgent is required. It enables:
get_ui- JSON accessibility tree inspectiontapwithlabelortextparameters - Element-based tappingfind_element- Element discovery and queryingswipe- Improved gesture simulation
Installation
Automatic (via Appium):
Manual:
Set the WDA_PATH environment variable to your WebDriverAgent location:
First Use
On first use, WebDriverAgent will be automatically:
Discovered from Appium installation or
WDA_PATHBuilt with xcodebuild (one-time, ~2 minutes)
Launched on the iOS simulator
Connected via HTTP on port 8100+
Troubleshooting
Build fails:
Session fails:
Ensure simulator is booted:
xcrun simctl list | grep BootedCheck port availability:
lsof -i :8100Try restarting the simulator
Manual test:
How It Works
Claude sends commands through MCP protocol
Server routes to appropriate platform (ADB, simctl+WDA, Desktop companion, or audb)
Commands execute on your device or desktop app
Results (screenshots, UI data, metrics) return to Claude
License
MIT