Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MCP Desktop ToolsOpen Chrome, go to news.ycombinator.com, and summarize the top story"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP Desktop Tools
An MCP server that gives Claude desktop automation capabilities — browser control, screenshots, mouse/keyboard input, window management, and clipboard access.
Built with TypeScript, Playwright, and native Windows APIs.
Tools
Browser
Tool | Description |
| Launch Chromium and navigate to a URL |
| Navigate to a URL with configurable wait conditions |
| Click elements by CSS selector |
| Type into input fields, optionally clear or press Enter |
| Read page content (text, HTML, title, URL, or specific elements) |
| Capture viewport or full-page screenshots |
| Close the browser |
Screenshots
Tool | Description |
| Capture entire screen (multi-monitor supported) |
| Capture a rectangular region by coordinates |
| Capture a specific window by title (partial match) |
Desktop
Tool | Description |
| Click at screen coordinates |
| Move cursor (instant or smooth animation) |
| Type text via simulated keystrokes |
| Press keyboard shortcuts (e.g. ctrl+c, alt+tab) |
| List all visible windows with positions and sizes |
| Focus a window by title |
| Move and/or resize a window |
| Launch apps by path, name, or URI |
| Read clipboard text |
| Write text to clipboard |
Setup
Claude Code Configuration
Add to your Claude Code MCP settings (~/.claude/settings.json):
Restart Claude Code to pick up the new server.
Requirements
Windows 10/11
Node.js 18+
PowerShell (used for native window/mouse/keyboard operations)
License
MIT