MCP Android Agent
This project provides an MCP (Model Context Protocol) server for automating Android devices using uiautomator2. It's designed to be easily plugged into AI agents like GitHub Copilot Chat, Claude, or Open Interpreter to control Android devices through natural language.
Requirements
- Python 3.13 or higher
- Android Debug Bridge (adb) installed and in PATH
- Connected Android device with USB debugging enabled
- uiautomator2 compatible Android device
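The requirements above can be checked before starting the server. The following is a minimal preflight sketch, not part of the project. The helper names `parse_adb_devices` and `preflight` are hypothetical, and the parser assumes the usual `adb devices` output: a header line followed by one `serial<TAB>state` line per device.

```python
import shutil
import subprocess
import sys


def parse_adb_devices(output: str) -> dict[str, str]:
    """Map each serial found in `adb devices` output to its reported state."""
    devices = {}
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines, the header, and adb daemon notices ("* daemon ...").
        if not line or line.startswith("List of devices") or line.startswith("*"):
            continue
        parts = line.split()
        if len(parts) >= 2:
            devices[parts[0]] = parts[1]
    return devices


def preflight() -> list[str]:
    """Return a list of problems; an empty list means the basics look ready."""
    problems = []
    if sys.version_info < (3, 13):
        problems.append("Python 3.13 or higher is required")
    adb = shutil.which("adb")
    if adb is None:
        problems.append("adb was not found in PATH")
        return problems
    result = subprocess.run([adb, "devices"], capture_output=True, text=True, timeout=15)
    # A usable device reports the state "device"; "unauthorized" usually means
    # the USB debugging prompt has not been accepted on the phone.
    if "device" not in parse_adb_devices(result.stdout).values():
        problems.append("no authorized device found; check the cable and USB debugging")
    return problems
```

Calling `preflight()` and printing each returned string gives a quick readiness report. It does not verify uiautomator2 compatibility, which still has to be confirmed on the device itself.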
Features
- Start, stop, and manage apps by package name
- Retrieve installed apps and current foreground app
- Tap, swipe, scroll, drag, and perform UI interactions
- Get device info, screen resolution, battery status, and more
- Capture screenshots or last toast messages
- Programmatically unlock, wake, or sleep the screen
- Clear app data and wait for activities
- Includes a health check and an adb diagnostic tool
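To make the features above concrete, here is a small sketch of the kind of uiautomator2 calls such tools would typically rest on. It is an illustration, not the project's implementation. The function name `open_app_and_capture` and its parameters are hypothetical, and `device` is assumed to be a uiautomator2 device object (for example the result of `uiautomator2.connect()`).

```python
from typing import Any


def open_app_and_capture(
    device: Any,
    package: str,
    label: str,
    shot_path: str = "screen.png",
    timeout: float = 10.0,
) -> bool:
    """Start an app, tap the element whose text is `label`, save a screenshot.

    Returns True if the element appeared and was tapped, False otherwise.
    """
    # Launch the app by package name.
    device.app_start(package)
    # Build a selector for the element with the given visible text.
    element = device(text=label)
    # wait() returns False if the element does not appear within the timeout.
    if not element.wait(timeout=timeout):
        return False
    element.click()
    # Save the current screen to a local file.
    device.screenshot(shot_path)
    return True
```

Because the device is passed in rather than created inside the function, the same code can run against a real phone or against a stub object in tests.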
Use Cases
Perfect for:
- AI agents that need to interact with real devices
- Remote device control setups
- Automated QA tools
- Android bot frameworks
- UI testing and automation
- Device management and monitoring
Installation
1. Clone the repo
2. Create and activate virtual environment
3. Install dependencies
Running the Server
Option 1: Using uvicorn (Recommended)
Option 2: Using MCP stdio (For AI agent integration)
Usage
An MCP client is needed to use this server. The Claude Desktop app is an example of an MCP client. To use this server with Claude Desktop:
Locate your Claude Desktop configuration file:
- Windows: %APPDATA%\Claude\claude_desktop_config.json
- macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Add the Android MCP server configuration to the mcpServers section.
Replace /path/to/mcp-adb with the absolute path to where you cloned this repository. For example: /Users/username/Projects/mcp-adb
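If you prefer to script the configuration change, the sketch below merges one entry into the mcpServers section without touching other servers or settings. It assumes only the standard Claude Desktop entry shape (`command` plus `args`). The function names are hypothetical, and the entry name, command, and arguments you pass are placeholders: use the launch command from Running the Server.

```python
import json
from pathlib import Path


def add_mcp_server(config: dict, name: str, command: str, args: list[str]) -> dict:
    """Return a copy of `config` with one entry added under "mcpServers"."""
    updated = dict(config)
    # Copy the existing section so the caller's dict is left unmodified.
    servers = dict(updated.get("mcpServers", {}))
    servers[name] = {"command": command, "args": args}
    updated["mcpServers"] = servers
    return updated


def update_config_file(path: Path, name: str, command: str, args: list[str]) -> None:
    """Read the JSON config at `path` (if it exists), add the entry, write it back."""
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    merged = add_mcp_server(config, name, command, args)
    path.write_text(json.dumps(merged, indent=2), encoding="utf-8")
```

Back up the configuration file before running `update_config_file` against it, and restart Claude Desktop afterwards so it rereads the file.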
Using with VS Code
You can also use this MCP server with VS Code's agent mode (requires VS Code 1.99 or newer). To set up:
- Create a .vscode/mcp.json file in your workspace:
Replace /path/to/mcp-adb with the absolute path to where you cloned this repository.
After adding the configuration, you can manage the server using:
- Command Palette → MCP: List Servers to view and manage configured servers
- Command Palette → MCP: Start Server to start the server
- The server's tools will be available in VS Code's agent mode chat
UI Inspector
The project includes support for uiauto.dev, a powerful UI inspection tool for viewing and analyzing your device's interface structure.
- Install the UI inspector:
- Start the inspector:
- Open your browser and navigate to https://uiauto.dev
Available MCP Tools
Tool Name | Description |
---|---|
mcp_health | Check if the MCP server is running properly |
connect_device | Connect to an Android device and get basic info |
get_installed_apps | List all installed apps with version and package info |
get_current_app | Get info about the app currently in the foreground |
start_app | Start an app by its package name |
stop_app | Stop an app by its package name |
stop_all_apps | Stop all currently running apps |
screen_on | Turn on the screen |
screen_off | Turn off the screen |
get_device_info | Get detailed device info: serial, resolution, battery, etc. |
press_key | Simulate a hardware key press (e.g. home, back, menu) |
unlock_screen | Unlock the screen (turn on and swipe if necessary) |
check_adb | Check if ADB is installed and list connected devices |
wait_for_screen_on | Wait asynchronously until the screen is turned on |
click | Tap on an element by text, resourceId, or description |
long_click | Perform a long click on an element |
send_text | Input text into currently focused field (optionally clearing before) |
get_element_info | Get info on UI elements (text, bounds, clickable, etc.) |
swipe | Swipe from one coordinate to another |
wait_for_element | Wait for an element to appear on screen |
screenshot | Take and save a screenshot from the device |
scroll_to | Scroll until a given element becomes visible |
drag | Drag an element to a specific screen location |
get_toast | Get the last toast message shown on screen |
clear_app_data | Clear user data/cache of a specified app |
wait_activity | Wait until a specific activity appears |
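
The click tool in the table locates an element by text, resourceId, or description. One way such a lookup could map onto uiautomator2 selectors is sketched below. This is an assumption about the approach, not the server's actual code: the names `build_selector`, `click_element`, `selector`, and `selector_type` are hypothetical, while `text`, `resourceId`, and `description` are standard uiautomator2 selector keywords.

```python
from typing import Any

# Selector types named in the table; each is also a uiautomator2 selector keyword.
SUPPORTED_SELECTOR_TYPES = ("text", "resourceId", "description")


def build_selector(selector: str, selector_type: str = "text") -> dict[str, str]:
    """Translate a (value, type) pair into uiautomator2 selector kwargs."""
    if selector_type not in SUPPORTED_SELECTOR_TYPES:
        raise ValueError(
            f"unsupported selector_type {selector_type!r}; "
            f"expected one of {SUPPORTED_SELECTOR_TYPES}"
        )
    return {selector_type: selector}


def click_element(
    device: Any,
    selector: str,
    selector_type: str = "text",
    timeout: float = 10.0,
) -> bool:
    """Tap the first matching element; return False if it never appears."""
    element = device(**build_selector(selector, selector_type))
    # Wait for the element instead of failing immediately on slow screens.
    if not element.wait(timeout=timeout):
        return False
    element.click()
    return True
```

Rejecting unknown selector types up front gives an AI agent a clear error message to act on, rather than a silent miss on the device.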
License
This project is licensed under the MIT License - see the LICENSE file for details.