Enables full remote control of macOS systems via VNC, including screen capture, keyboard input, mouse operations (move, click, double-click, scroll, drag-and-drop), and application launching.
Uses OpenAI's GPT-4 for natural language understanding to translate user commands into macOS control actions through the web chat interface.
Provides low-latency real-time screen sharing through LiveKit integration as an alternative to traditional VNC connections.
MCP Remote macOS Control Server + AI Chat Web App
The first open-source MCP server that enables AI to fully control remote macOS systems, now with a web-based chat interface.
This project provides both:
MCP Server - Python-based server for remote macOS control via VNC
AI Chat Web App - Modern web interface for chatting with AI to control your Mac
š Quick Start for Web App
Prerequisites
Docker Desktop installed
Node.js 18+ installed
A Mac with Screen Sharing enabled (can be the same machine)
1. Clone and Setup
2. Configure Environment
3. Enable Screen Sharing (macOS)
Open System Preferences > Sharing
Enable "Screen Sharing"
Set a VNC password when prompted
4. Run the Application
5. Open and Chat!
Open http://localhost:3000 in your browser
Wait for connection to establish
Try commands like:
"Take a screenshot"
"Open Safari"
"Click on the Dock"
"Type hello world"
š Project Structure
š§ Development Commands
šÆ Architecture
š ļø How It Works
Frontend: Modern React app with real-time chat interface
Backend: Express.js server with Socket.IO for real-time communication
LLM Integration: OpenAI GPT-4 for natural language understanding
MCP Client: Communicates with Python MCP server via Docker
macOS Control: VNC-based control of local or remote Macs
š® Example Interactions
š Security Notes
Only use with Macs you own or have explicit permission to control
VNC passwords are transmitted securely
LLM API keys are stored server-side only
All actions are logged for debugging
š Original MCP Server Documentation
The original Python MCP server functionality remains fully intact. See below for the original documentation about using it directly with Claude Desktop.
Original MCP Server Documentation
The first open-source MCP server that enables AI to fully control remote macOS systems.
A direct alternative to OpenAI Operator, optimized specifically for autonomous AI agents with complete desktop capabilities, requiring no additional software installation.
Showcases
Research Twitter and Post Twitter(https://www.youtube.com/watch?v=--QHz2jcvcs)
Use CapCut to create short highlight video(https://www.youtube.com/watch?v=RKAqiNoU8ec)
AI Recruiter: Automated candidate information collection, qualifying applications and sending screening sessions using Mail App
AI Marketing Intern: LinkedIn engagement - automated following, liking, and commenting with relevant users
AI Marketing Intern: Twitter engagement - automated following, liking, and commenting with relevant users
To-Do List (Prioritized)
Performance Optimization - Match speed of Ubuntu desktop alternatives
Apple Scripts Generation - Reduce execution time while maintaining flexibility
VNC Cursor Visibility - Improve debugging and demo experience
We welcome contributions!
Features
No Extra API Costs: Free screen processing with your existing Claude Pro plan
Minimal Setup: Just enable Screen Sharing on the target Mac ā no additional software needed
Universal Compatibility: Works with all macOS versions, current and future
Why We Built This
Native macOS Experience Without Compromise
The macOS native ecosystem remains unmatched in user experience today and will continue to be the gold standard for years to come. This is where human capabilities truly thrive, and now your AI can operate in this environment with the same fluency.
Open Architecture By Design
Universal LLM Compatibility: Work with any MCP Client of your choice
Model Flexibility: Seamlessly integrate with OpenAI, Anthropic, or any other LLM provider
Future-Proof Integration: Designed to evolve with the MCP ecosystem
Effortless Deployment
Zero Setup on Target Machines: No background applications or agents needed on macOS
Screen Sharing is All You Need: Control any Mac with Screen Sharing enabled
Eliminate Backend Complexity: Unlike other solutions that require running Python applications or background services
Streamlined Bootstrap Process
Leverage Claude Desktop's Polished UI: No need for developer-style Python interfaces
Intuitive User Experience: Interact with your AI-controlled Mac through a familiar, user-friendly interface
Instant Productivity: Start working immediately without configuration hassles
Architecture
Installation
Enable Screen Sharing on MacOs If you rent a mac from macstadium.com, you can skip this step
Add this MCP server to Claude Desktop You can configure Claude Desktop to use the Docker image by adding the following to your Claude configuration:
WebRTC Support via LiveKit
This server now includes WebRTC support through LiveKit integration, enabling:
Low-latency real-time screen sharing
Improved performance and responsiveness
Better network efficiency compared to traditional VNC
Automatic quality adaptation based on network conditions
To use WebRTC features, you'll need to:
Set up a LiveKit server or use LiveKit Cloud
Configure the LiveKit environment variables as shown in the configuration example above
Developer Instruction
Clone the repo
Building the Docker Image
Cross-Platform Publishing
To publish the Docker image for multiple platforms, you can use the docker buildx command. Follow these steps:
Create a new builder instance (if you haven't already):
docker buildx create --useBuild and push the image for multiple platforms:
docker buildx build --platform linux/amd64,linux/arm64 -t buryhuang/mcp-remote-macos-use:latest --push .Verify the image is available for the specified platforms:
docker buildx imagetools inspect buryhuang/mcp-remote-macos-use:latest
Usage
The server provides Remote MacOs functionality through MCP tools.
Tools Specifications
The server provides the following tools for remote macOS control:
remote_macos_get_screen
Connect to a remote macOS machine and get a screenshot of the remote desktop. Uses environment variables for connection details.
remote_macos_send_keys
Send keyboard input to a remote macOS machine. Uses environment variables for connection details.
remote_macos_mouse_move
Move the mouse cursor to specified coordinates on a remote macOS machine, with automatic coordinate scaling. Uses environment variables for connection details.
remote_macos_mouse_click
Perform a mouse click at specified coordinates on a remote macOS machine, with automatic coordinate scaling. Uses environment variables for connection details.
remote_macos_mouse_double_click
Perform a mouse double-click at specified coordinates on a remote macOS machine, with automatic coordinate scaling. Uses environment variables for connection details.
remote_macos_mouse_scroll
Perform a mouse scroll at specified coordinates on a remote macOS machine, with automatic coordinate scaling. Uses environment variables for connection details.
remote_macos_open_application
Opens/activates an application and returns its PID for further interactions.
remote_macos_mouse_drag_n_drop
Perform a mouse drag operation from start point and drop to end point on a remote macOS machine, with automatic coordinate scaling.
All tools use the environment variables configured during setup instead of requiring connection parameters.
Limitations
Authentication Support:
Only Apple Authentication (protocol 30) is supported
Security Note
https://support.apple.com/guide/remote-desktop/encrypt-network-data-apdfe8e386b/mac https://cafbit.com/post/apple_remote_desktop_quirks/
We only support protocol 30, which uses the Diffie-Hellman key agreement protocol with a 512-bit prime. This protocol is used by macOS 11 to macOS 12 when communicating with OS X 10.11 or earlier clients.
Here's the information converted to a markdown table:
macOS version running Remote Desktop | macOS client version | Authentication | Control and Observe | Copy items or install package | All other tasks | Protocol Version |
macOS 13 | macOS 13 | 2048-bit RSA host keys | 2048-bit RSA host keys | 2048-bit RSA host keys to authenticate, then 128-bit AES | 2048-bit RSA host keys | 36 |
macOS 13 | macOS 10.12 | Secure Remote Password (SRP) protocol for local only. Diffie-Hellman (DH) if bound to LDAP or macOS server is version 10.11 or earlier | SRP or DH,128-bit AES | SRP or DH to authenticate, then 128-bit AES | 2048-bit RSA host keys | 35 |
macOS 11 to macOS 12 | macOS 10.12 to macOS 13 | Secure Remote Password (SRP) protocol for local only, Diffie-Hellman if bound to LDAP | SRP or DH 1024-bit, 128-bit AES | 2048-bit RSA host keys macOS 13 to macOS 10.13 | 2048-bit RSA host keys macOS 10.13 or later | 33 |
macOS 11 to macOS 12 | OS X 10.11 or earlier | DH 1024-bit | DH 1024-bit, 128-bit AES | Diffie-Hellman Key agreement protocol with a 512-bit prime | Diffie-Hellman Key agreement protocol with a 512-bit prime | 30 |
Always use secure, authenticated connections when accessing remote remote MacOs machines. This tool should only be used with servers you trust and have permission to access.
License
See the LICENSE file for details.