pywinauto-mcp
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@pywinauto-mcpOpen Notepad and type 'Hello world'."
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
windows-computer-use-mcp
A tool for agents, and an agent itself.
You | It |
Use it as an MCP server | Claude, Cursor, DeepSeek call |
Use it as an autonomous agent | Give it a goal: |
Use it as a webapp |
|
Use it as a desktop app | The NSIS installer bundles everything into one binary — no Python, no uv, no git needed |
Exhibit A: 100 Tauri/NSIS installers, one unattended run, $2 in LLM costs. Install, screenshot, verify, report — zero human intervention. That is what agentic Windows automation looks like at scale.
Built on pywinauto. Read docs/SAFETY.md before production use.
Quick Start
Method | Command / Config |
MCP stdio (Cursor, Claude Desktop) |
|
HTTP streamable (any MCP HTTP client) |
|
Web operator UI |
|
Desktop app (NSIS installer) | Download from Releases — zero deps |
See INSTALL.md for detailed setup. Run just demo for examples.
Related MCP server: ClawdCursor
Features
Window Management — find, activate, maximize, minimize, position, close
Mouse & Keyboard — click, drag, type, hotkeys, app shortcuts
UI Elements — inspect, click, read text, verify state via UIA / Win32
Visual Intelligence — screenshots, OCR, template matching
Autonomous Missions — give it a goal, it plans and executes with retry + verification
Macro Recording — record any UI sequence, replay, verify outcomes
Multi-App Workflows — chain actions across Notepad, Calc, Paint, or any Windows app
Telemetry — every action logged to SQLite; query failure patterns by tool
Adaptive Location — auto-cascades through title/auto_id/control_id/class/OCR to find elements
Face Recognition — optional, off by default
Documentation
Doc | Content |
Setup: desktop app, uv, MCP config | |
Full documentation hub | |
Python dependency deep dive | |
Browser automation with Playwright MCP | |
OCR system — Tesseract setup, limitations, competition | |
Dogfooding: using the tool to test its own NSIS installer | |
Improvement roadmap short/medium/long term | |
HITL, kill switch, opt-in features | |
Portmanteau tool reference | |
Test suite guide and e2e setup | |
Runnable demos | |
MCPB bundle packaging | |
Operator UI build/dev guide | |
Release history |
Ports
Port | Service |
10788 | Frontend — Vite operator UI |
10789 | Backend — FastAPI + FastMCP HTTP |
stdio | MCP transport (port-free) |
Related
Repo | What it does |
Raw input recording/replay via AHK | |
Playwright browser control — for webapps, HTML DOM, websites | |
Sandbox / VM isolation | |
Registry, services, accounts |
Browser vs desktop: This server drives Win32 / UI Automation. For HTML/DOM and websites, pair with browser-mcp (Playwright). Both MCPs can run side by side — use one profile that loads both and let the LLM pick the right tool for the target.
Fleet standards: mcp-central-docs.
License
MIT — Copyright (c) 2026 Sandra Schipal.
This server cannot be installed
Maintenance
Latest Blog Posts
- Your AI Chatbot Just Exposed Your CEO's Salary to an InternBy Om-Shree-0709 on .Agent IdentityMCP SecurityOAuth Delegation
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/sandraschi/windows-computer-use-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server