Automation MCP
Provides desktop automation on macOS, including mouse control, keyboard input, screen capture, window management, and UI element interaction.
🤖 Automation MCP
Automation MCP is a Model Context Protocol (MCP) server that provides AI models with complete desktop automation capabilities on macOS. It enables AI assistants to:
🖱️ Control your mouse (click, move, scroll, drag)
⌨️ Type and send keyboard input (including system shortcuts)
📸 Take screenshots and analyze screen content
🪟 Manage windows (focus, move, resize, minimize)
🎯 Interact with UI elements through coordinates
🎨 Analyze screen colors and highlight regions
🔍 Wait for images to appear on screen
🚀 Quick start
Make sure you have furi installed, and then run the following command:
furi add ashwwwin/automation-mcp
followed by:
furi start ashwwwin/automation-mcp
and you're done! (Or use the furi desktop app if you'd rather avoid the CLI.)
🥲 Normal start (without furi)
Prerequisites
Bun runtime - Install with:
curl -fsSL https://bun.sh/install | bash
1. Clone and Install
git clone https://github.com/ashwwwin/automation-mcp.git
cd automation-mcp
bun install
2. Start the Server
# Start with HTTP transport (recommended for web apps)
bun run index.ts
# Or start with stdio transport (for command line tools)
bun run index.ts --stdio
3. Grant Permissions
On first run, macOS will ask for permissions. You must grant these for full functionality:
Accessibility - Allows keyboard/mouse control
Screen Recording - Enables screenshots and screen analysis
Or manually enable in: System Settings → Privacy & Security → Accessibility/Screen Recording
🛠️ Available Tools
🖱️ Mouse Control
mouseClick - Click at coordinates with left/right/middle button
mouseDoubleClick - Double-click at coordinates
mouseMove - Move cursor to position
mouseGetPosition - Get current cursor location
mouseScroll - Scroll in any direction
mouseDrag - Drag from current position to target
mouseButtonControl - Press/release mouse buttons
mouseMovePath - Follow a smooth path with multiple points
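As an illustration of what "a smooth path with multiple points" could look like from the caller's side, here is a minimal sketch that linearly interpolates between a few waypoints. The `Point` shape and the `interpolatePath` helper are hypothetical and not part of the server; check the tool's actual input schema before passing points to mouseMovePath.

```typescript
// Hypothetical point shape; the server's real mouseMovePath schema may differ.
type Point = { x: number; y: number };

// Linearly interpolate `stepsPerSegment` points between each pair of waypoints.
// The first waypoint is kept as-is; every segment ends exactly on its waypoint.
function interpolatePath(waypoints: Point[], stepsPerSegment: number): Point[] {
  if (waypoints.length === 0) return [];
  if (stepsPerSegment < 1) throw new Error("stepsPerSegment must be >= 1");

  const result: Point[] = [waypoints[0]];
  for (let i = 1; i < waypoints.length; i++) {
    const from = waypoints[i - 1];
    const to = waypoints[i];
    for (let s = 1; s <= stepsPerSegment; s++) {
      const t = s / stepsPerSegment; // fraction of the segment covered so far
      result.push({
        x: Math.round(from.x + (to.x - from.x) * t),
        y: Math.round(from.y + (to.y - from.y) * t),
      });
    }
  }
  return result;
}

// Three waypoints, four steps per segment.
const demoPath = interpolatePath(
  [{ x: 0, y: 0 }, { x: 100, y: 50 }, { x: 100, y: 150 }],
  4
);
console.log(demoPath);
```

More steps per segment give a smoother motion at the cost of a longer point list.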
⌨️ Keyboard Input
type - Type text or press key combinations
keyControl - Advanced key press/release control
systemCommand - Common shortcuts (copy, paste, undo, save, etc.)
📸 Screen Capture & Analysis
screenshot - Capture full screen, regions, or specific windows
screenInfo - Get screen dimensions
screenHighlight - Highlight screen regions visually
colorAt - Get color of any pixel
waitForImage - Wait for images to appear (template matching)
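When requesting a region screenshot, it can help to first clamp the region to the dimensions reported by screenInfo. The sketch below shows one way a caller might do that; the `Region` and `ScreenSize` field names and the `clampRegion` helper are assumptions for illustration, not the server's actual schema.

```typescript
// Hypothetical shapes; the server's real screenshot/screenInfo fields may differ.
type Region = { x: number; y: number; width: number; height: number };
type ScreenSize = { width: number; height: number };

// Clip a requested region to the visible screen.
// Returns null when the region lies entirely off-screen.
function clampRegion(region: Region, screen: ScreenSize): Region | null {
  const left = Math.max(0, region.x);
  const top = Math.max(0, region.y);
  const right = Math.min(screen.width, region.x + region.width);
  const bottom = Math.min(screen.height, region.y + region.height);

  if (right <= left || bottom <= top) return null;
  return { x: left, y: top, width: right - left, height: bottom - top };
}

const demoScreen: ScreenSize = { width: 1920, height: 1080 };
console.log(clampRegion({ x: -10, y: 20, width: 100, height: 100 }, demoScreen));
console.log(clampRegion({ x: 2000, y: 0, width: 50, height: 50 }, demoScreen));
```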
🪟 Window Management
getWindows - List all open windows
getActiveWindow - Get current active window
windowControl - Focus, move, resize, minimize windows
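MCP clients invoke tools like the ones listed above through a JSON-RPC `tools/call` request. The following sketch builds such a request by hand to show its general shape. The `buildToolCall` helper is hypothetical, and the argument names (`x`, `y`, `button`) are assumptions; the real parameter names come from each tool's input schema, which a client can discover with `tools/list`.

```typescript
// Shape of an MCP tools/call request (JSON-RPC 2.0).
type ToolCallRequest = {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
};

let nextRequestId = 1;

// Hypothetical helper: wrap a tool name and its arguments in a tools/call request.
function buildToolCall(
  name: string,
  args: Record<string, unknown> = {}
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextRequestId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Argument names below are assumed for illustration only.
const clickRequest = buildToolCall("mouseClick", { x: 200, y: 300, button: "left" });
const windowsRequest = buildToolCall("getWindows");

console.log(JSON.stringify(clickRequest, null, 2));
console.log(JSON.stringify(windowsRequest, null, 2));
```

In practice an MCP client library handles this framing for you; the sketch is only meant to make the wire format concrete.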
🔒 Security & Permissions
Accessibility - Required for:
Mouse clicks and movement
Keyboard input simulation
Window management
Screen Recording - Required for:
Taking screenshots
Screen analysis
Color detection
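The permission requirements above can be expressed as a small lookup so that a client can tell the user which macOS permission to grant before calling a tool. This is a sketch based on one reading of the categories listed here: the README groups permissions by capability rather than by tool, so the per-tool assignment (in particular for screenInfo and screenHighlight) is an assumption, and `requiredPermission` and `permissionsFor` are hypothetical helpers.

```typescript
type Permission = "Accessibility" | "Screen Recording";

// Mouse, keyboard, and window tools: assumed to need Accessibility.
const ACCESSIBILITY_TOOLS: string[] = [
  "mouseClick", "mouseDoubleClick", "mouseMove", "mouseGetPosition",
  "mouseScroll", "mouseDrag", "mouseButtonControl", "mouseMovePath",
  "type", "keyControl", "systemCommand",
  "getWindows", "getActiveWindow", "windowControl",
];

// Screenshot and screen-analysis tools: assumed to need Screen Recording.
const SCREEN_RECORDING_TOOLS: string[] = [
  "screenshot", "screenInfo", "screenHighlight", "colorAt", "waitForImage",
];

// Look up the permission a single tool is assumed to need.
function requiredPermission(tool: string): Permission | undefined {
  if (ACCESSIBILITY_TOOLS.indexOf(tool) !== -1) return "Accessibility";
  if (SCREEN_RECORDING_TOOLS.indexOf(tool) !== -1) return "Screen Recording";
  return undefined; // unknown tool
}

// Collect the distinct permissions needed for a planned sequence of tool calls.
function permissionsFor(tools: string[]): Permission[] {
  const needed: Permission[] = [];
  for (const tool of tools) {
    const permission = requiredPermission(tool);
    if (permission !== undefined && needed.indexOf(permission) === -1) {
      needed.push(permission);
    }
  }
  return needed;
}

console.log(permissionsFor(["screenshot", "mouseClick", "type"]));
```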
🚀 Integration Examples
With Claude Desktop + furi
If you've already configured furi with Claude Desktop, you don't need to do anything.
Otherwise, add this to your MCP configuration:
{
  "mcpServers": {
    "furi": {
      "command": "furi",
      "args": ["connect"]
    }
  }
}
With Claude Desktop
Add to your MCP configuration:
{
  "mcpServers": {
    "automation": {
      "command": "bun",
      "args": ["run", "/path/to/automation-mcp/index.ts", "--stdio"]
    }
  }
}
🐛 Troubleshooting
Common Issues
Permission Denied Errors
Ensure Accessibility and Screen Recording permissions are granted
Ensure Xcode Command Line Tools are installed:
xcode-select --install
🙋‍♂️ Support
Having issues? Check the troubleshooting section above or open an issue with:
Your operating system and version
Error messages
Steps to reproduce