We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/CursorTouch/Windows-MCP'
If you have feedback or need assistance with the MCP directory API, please join our Discord server
manifest.json•6.52 KiB
{
"manifest_version": "0.2",
"name": "Windows-MCP",
"version": "0.6.2",
"description": "MCP Server that enables Claude to interact with Windows OS",
"long_description": "Windows MCP is an open-source project that enables seamless integration between AI agents and the Windows operating system. Acting as an MCP server, it bridges the gap between large language models (LLMs) and the Windows OS, allowing agents to perform tasks such as **file navigation, application control, UI interaction, QA testing, and more**.\n\n**KEY FEATURES**\n- **Seamless Windows Integration**: Interacts natively with Windows UI elements, opens applications, controls windows, simulates user input, and more.\n- **Use Any LLM (Vision Optional)**: Does not rely on traditional computer vision techniques or fine-tuned models. Works with any LLM, reducing complexity and setup time.\n- **Rich Toolset for UI Automation**: Includes tools for keyboard and mouse control, window management, and capturing window or UI state.\n- **Lightweight & Open-Source**: Minimal dependencies with full source code available under the MIT license.\n- **Customizable & Extendable**: Easily adapt or extend tools to suit custom automation workflows or AI integrations.\n- **Real-Time Interaction**: Typical latency between actions ranges from `0.2` to `0.9` seconds, depending on system load, active applications, and LLM inference speed.\n\n**MINIMUM REQUIREMENTS**\n- Python 3.13 or higher\n- UV Package Manager\nThis MCP server requires UV, a fast Python package manager.\nInstallation:\n`curl -LsSf https://astral.sh/uv/install.sh | sh`\nFor detailed installation instructions, [see the UV documentation](https://github.com/astral-sh/uv)",
"author": {
"name": "CursorTouch",
"url": "https://cursortouch.com/"
},
"homepage": "https://cursortouch.com/",
"documentation": "https://github.com/CursorTouch/Windows-MCP",
"icon": "assets/logo.png",
"screenshots": [
"assets/screenshots/screenshot_1.png",
"assets/screenshots/screenshot_2.png",
"assets/screenshots/screenshot_3.png"
],
"server": {
"type": "python",
"entry_point": "./src/windows_mcp/__main__.py",
"mcp_config": {
"command": "uv",
"args": [
"--directory",
"${__dirname}",
"run",
"windows-mcp"
],
"env": {
"ANONYMIZED_TELEMETRY": "${user_config.anonymized_telemetry}"
}
}
},
"user_config": {
"anonymized_telemetry": {
"type": "boolean",
"title": "Anonymized Telemetry",
"description": "Windows-MCP collects basic usage data to help improve the MCP server. No personal information, tool arguments, or tool outputs are tracked.",
"required": false,
"default": true
}
},
"tools": [
{
"name": "App",
"description": "Manages Windows applications with three modes: 'launch' (opens the prescibed application), 'resize' (adjusts active window size/position), 'switch' (brings specific window into focus)."
},
{
"name": "Shell",
"description": "A comprehensive system tool for executing any PowerShell commands. Use it to navigate the file system, manage files and processes, and execute system-level operations. Capable of accessing web content, interacting with network resources, and performing complex administrative tasks."
},
{
"name": "Snapshot",
"description": "Captures complete desktop state including: system language, focused/opened windows, interactive elements (buttons, text fields, links, menus with coordinates), and scrollable areas. Set use_vision=True to include screenshot. Set use_dom=True for browser content to get web page elements instead of browser UI. Always call this first to understand the current desktop state before taking actions."
},
{
"name": "Click",
"description": "Performs mouse clicks at specified coordinates [x, y]. Supports button types: 'left' for selection/activation, 'right' for context menus, 'middle'. Supports clicks: 0=hover only (no click), 1=single click (select/focus), 2=double click (open/activate)."
},
{
"name": "Type",
"description": "Types text at specified coordinates [x, y]. Set clear=True to clear existing text first, False to append. Set press_enter=True to submit after typing. Set caret_position to 'start' (beginning), 'end' (end), or 'idle' (default)."
},
{
"name": "Scroll",
"description": "Scrolls at coordinates [x, y] or current mouse position if loc=None. Type: vertical (default) or horizontal. Direction: up/down for vertical, left/right for horizontal. wheel_times controls amount (1 wheel ≈ 3-5 lines). Use for navigating long content, lists, and web pages."
},
{
"name": "Move",
"description": "Moves mouse cursor to coordinates [x, y]. Set drag=True to perform a drag-and-drop operation from the current mouse position to the target coordinates. Default (drag=False) is a simple cursor move (hover)."
},
{
"name": "Shortcut",
"description": "Executes keyboard shortcuts using key combinations separated by +. Examples: \"ctrl+c\" (copy), \"ctrl+v\" (paste), \"alt+tab\" (switch apps), \"win+r\" (Run dialog), \"win\" (Start menu), \"ctrl+shift+esc\" (Task Manager). Use for quick actions and system commands."
},
{
"name": "Wait",
"description": "Pauses execution for specified duration in seconds. Use when waiting for: applications to launch/load, UI animations to complete, page content to render, dialogs to appear, or between rapid actions. Helps ensure UI is ready before next interaction."
},
{
"name": "Scrape",
"description": "Fetch content from a URL or the active browser tab. By default, performs a lightweight HTTP request to the URL. If you need to extract text from the active tab's DOM within the viewport, ensure the page is open in a browser and use Snapshot with use_dom=True first, then the agent will handle extraction."
},
{
"name": "MultiSelect",
"description": "Selects multiple items such as files, folders, or checkboxes if press_ctrl=True, or performs multiple clicks if False."
},
{
"name": "MultiEdit",
"description": "Enters text into multiple input fields at specified coordinates [[x,y,text], ...]."
}
],
"compatibility": {
"platforms": [
"win32"
],
"runtimes": {
"python": ">=3.13"
}
},
"keywords": [
"windows",
"automation",
"ai",
"mcp",
"computer-use"
],
"license": "MIT",
"repository": {
"type": "git",
"url": "https://github.com/CursorTouch/Windows-MCP"
}
}