The Peekaboo MCP server provides macOS screen capture and AI-powered image analysis for integration with AI assistants like Claude Desktop and Cursor IDE.
Capture Screenshots: Take screenshots of entire screens, specific app windows, or all windows of an app, with control over output format (file or Base64 data).
AI Image Analysis: Analyze captured or existing images using AI models like GPT-4 Vision, Claude, or local models via Ollama.
System Information: List running applications, their open windows (including details like position, size, and IDs), and server status.
Integration: Seamlessly integrate with AI assistants for visual context and debugging workflows.
Privacy Options: Support for local AI analysis through Ollama for privacy-focused use cases.
Advanced Targeting: Fuzzy matching and precise targeting for apps, windows, and screens.
Enables comprehensive screen capture capabilities on macOS, including capturing entire screens, specific application windows, or all windows of an app with various formatting options.
Enables local AI image analysis of screenshots through Ollama, supporting models like LLaVA and Qwen2-VL for vision tasks without sending data to the cloud.
Provides integration with OpenAI's vision models (like GPT-4o) for analyzing captured screenshots through the OpenAI API.
Peekaboo MCP: Lightning-fast macOS Screenshots & GUI Automation π

π NEW in v3: Complete GUI automation framework with AI Agent! Click, type, scroll, and automate any macOS application using natural language. Plus comprehensive menu bar extraction without clicking! See the GUI Automation section and AI Agent section for details.
Peekaboo is a powerful macOS utility for capturing screenshots, analyzing them with AI vision models, and now automating GUI interactions. It works both as a standalone CLI tool (recommended) and as an MCP server for AI assistants like Claude Desktop and Cursor.
π― Choose Your Path
π₯οΈ CLI Tool (Recommended for Most Users)
Perfect for:
Command-line workflows and automation
Shell scripts and CI/CD pipelines
Quick screenshots and AI analysis
System administration tasks
π» MCP Server (For AI Assistants)
Perfect for:
Claude Desktop integration
Cursor IDE workflows
AI agents that need visual context
Interactive AI debugging sessions
What is Peekaboo?
Peekaboo bridges the gap between visual content on your screen and AI understanding. It provides:
Lightning-fast screenshots of screens, applications, or specific windows
AI-powered image analysis using GPT-4.1 Vision, Claude, Grok, or local models (Ollama)
Complete GUI automation (v3) - Click, type, scroll, and interact with any macOS app
Natural language automation (v3) - AI agent that understands tasks like "Open TextEdit and write a poem"
Smart UI element detection - Automatically identifies buttons, text fields, links, and more with precise coordinate mapping
Menu bar extraction (v3) - Discover all menus and keyboard shortcuts without clicking or opening menus
Automatic session resolution - Commands intelligently use the most recent session (no manual tracking!)
Window and application management with smart fuzzy matching
Multi-screen support - List which screen windows are on and move them between displays
Privacy-first operation with local AI options via Ollama
Non-intrusive capture without changing window focus
Automation scripting - Chain commands together for complex workflows
ποΈ Architecture
Peekaboo now composes three focused SwiftPM targets plus a thin umbrella:
PeekabooAutomation β Screen capture, permissions, accessibility services, menu/window helpers, and every strongly typed model shared across apps.
PeekabooAgentRuntime β Tool registry, MCP server tooling, streaming/agent glue. Nothing here touches AppKit directly; it consumes the automation protocols.
PeekabooVisualizer β Event serialization and UI feedback for the macOS app + CLI visual overlays.
PeekabooCore β
_exportedshim that re-exports the three modules so downstream targets can continue toimport PeekabooCorewhile opting into narrower modules when desired.CLI β Command-line interface that injects a
PeekabooServices()instance into commander commands and agents.Mac App β Native macOS GUI that keeps a long-lived
@State private var services = PeekabooServices()and passes it through SwiftUI scenes.MCP Server β Model Context Protocol server for AI assistants (Claude Desktop, Cursor, etc.) built entirely on
PeekabooAgentRuntime.Commander (in-repo) β Lightweight Swift 6 parsing helpers used by the CLI runtime (swift-tools-version 6.0, no Swift 6.2+ features).
The CLI command structs remain @MainActor so they run on the main thread, but the static commandDescription can just be a normal static let constantβno nonisolated(unsafe) or extra @MainActor wrappers are necessary when describing the command metadata.
All components share the same core services, ensuring consistent behavior and optimal performance. See Service API Reference for detailed documentation.
Embedding Peekaboo services
The legacy singleton is goneβif you construct PeekabooServices() yourself (e.g., tests, daemon, or a new host), call services.installAgentRuntimeDefaults() once after initialization so MCP tools, the ToolRegistry, and PeekabooAgentService share that instance:
Skipping the install step will cause MCP/ToolRegistry APIs to fatal with βdefault factory not configuredβ because thereβs no hidden global anymore.
Git Submodules
Peekaboo vendors three shared dependencies as top-level git submodules:
Path | Purpose |
| Accessibility automation primitives |
| Swift command parser used by the CLI |
| AI provider + MCP integrations |
Clone with git clone --recursive or run git submodule update --init --recursive after pulling to ensure all three are present.
Submodule Details (updated)
AXorcistis our accessibility engine: it wraps the macOS AX APIs with type-safe Swift helpers so every command (and agent) can discover UI elements, grant permissions, and drive buttons/text fields without reinventing accessibility plumbing. Because the underlying APIs are macOS-only we keep AXorcistβs CI and release targets scoped to macOS 14+.Commanderis the shared parser/runtime that replaces Swift Argument Parser across Peekaboo. It provides property-wrapper metadata, a central router, standard CLI flags, and binders that hydrate existing command structs while keeping the runtime @MainActor-friendly. Commanderβs CI spans macOS, Linux, Apple simulators (iOS/tvOS/watchOS/visionOS), and Android (via--swift-sdk android).Tachikomais the AI provider SDK plus MCP adapters. It now owns all credential/OAuth handling viaTKAuthManagerand exposes the standalonetk-configCLI (add/login/status) so hosts can reuse the same auth + validation flows. Peekabooβs CLI delegates provider auth to Tachikoma; setTachikomaConfiguration.profileDirectoryNameto.peekabooto share credentials.
π§© Platform Support
Component | Supported OS targets | Notes |
Peekaboo CLI + Mac app | macOS 14.0+ | Everything ships as a native macOS product; CI only runs on macOS runners so we can exercise ScreenCaptureKit and Accessibility APIs. |
AXorcist | macOS 14.0+ | Accessibility frameworks are macOS-only, so both the library and its CLI stay scoped to macOS. |
Commander | macOS, Linux, Apple simulators (iOS/tvOS/watchOS/visionOS), Android (aarch64) | CI installs Swift toolchains via Swiftly, runs
against each simulator platform, and cross-compiles with
so Commander stays portable on the platforms we actively test. |
Tachikoma | macOS, Linux, Apple simulators (iOS/tvOS/watchOS/visionOS), Android (aarch64) | Matches Commanderβs matrix and additionally runs the Android cross-compilation plus the Apple destination builds for provider SDK validation. |
π Quick Start: CLI Tool
Installation
Command Catalog
Peekaboo ships many focused commands. Each entry below links to a short doc in docs/commands/ with complete flag tables, workflows, and troubleshooting notes.
Vision & Capture
seeβ Capture annotated UI maps, produce session IDs, and optionally run inline analysis.imageβ Grab raw PNG/JPG screenshots (screens, windows, menu bar) and feed them into AI with--analyze.
Core Utilities
listβ Subcommands:apps,windows,screens,menubar,permissions.toolsβ Enumerate native + MCP tools; filter by server/source.configβ Subcommands:init,show(live validation),add <provider> <secret> [--timeout],login <provider>(OpenAI/Codex, Anthropic Max OAuth), legacyset-credential,models.permissionsβstatus(default) andgranthelpers for Screen Recording, Accessibility, etc.learnβ Emit the full agent guide/system prompt/Commander signature dump.runβ Execute.peekaboo.jsonautomation scripts (--output,--no-fail-fast).sleepβ Millisecond delays between scripted steps.cleanβ Prune session caches via--all-sessions,--older-than, or--session.
Interaction
clickβ Element IDs, fuzzy queries, or coordinates; built-in wait/focus helpers.typeβ Text + escape sequences,--clear, tab/return/escape/delete flags.pressβ Special key sequences with repeat counts.hotkeyβ Modifier combos such ascmd,shift,t(terminal-safe parsing).scrollβ Directional scrolls with optional element targets.swipeβ Smooth drags between IDs or coordinates (--duration,--steps).dragβ Drag-and-drop, modifiers, Dock/Trash targets.moveβ Cursor placement (coords, element IDs, queries, or screen center).
Windows, Menus, Apps, Spaces
windowβ Subcommands:close,minimize,maximize,move,resize,set-bounds,focus,list.menuβclick,click-extra,list,list-allfor app menus and menu extras.menubarβlist/clickstatus-bar items by name or index.appβlaunch,quit,relaunch,hide,unhide,switch,list.openβ macOS-styleopenwith Peekaboo focus/failure handling.dockβlaunch,right-click,hide,show,listfor Dock entries.dialogβclick,input,file,dismiss,listsystem dialogs.spaceβlist,switch,move-window(Spaces/virtual desktops).
Agents & Integrations
agentβ Natural-language automation (--dry-run,--resume,--model, audio options, session caching).mcpβserve,list,add,remove,enable,disable,info,test,call,inspect(stub) for Model Context Protocol workflows.
Each doc contains exhaustive flag descriptions and examples; the README only covers intent and grouping. Use peekaboo <command> --help for inline summaries.
Debugging with Verbose Mode
All Peekaboo commands support the --verbose or -v flag for detailed logging:
Verbose logs are written to stderr with timestamps:
This is invaluable for:
Debugging automation scripts
Understanding why elements aren't found
Performance optimization
Learning Peekaboo's internals
Configuration
Peekaboo uses a unified configuration directory at ~/.peekaboo/ for better discoverability:
Managing API Keys Securely
Example Configuration
~/.peekaboo/config.json:
~/.peekaboo/credentials (auto-created with proper permissions):
Common Workflows
π» MCP Server Setup
For AI assistants like Claude Desktop and Cursor, Peekaboo provides a Model Context Protocol (MCP) server.
For Claude Desktop
Open Claude Desktop Settings (from the menubar, not the in-app settings)
Navigate to Developer β Edit Config
Add the Peekaboo MCP server configuration:
Save and restart Claude Desktop
For Claude Code
Run the following command:
Alternatively, if you've already installed the server via Claude Desktop, you can import it:
Local Development
For local development, use the built MCP server directly:
For Cursor IDE
Add to your Cursor settings:
π MCP Client Integration
Peekaboo v3 now functions as both an MCP server (exposing its tools) and an MCP client (consuming external tools). This enables powerful workflows that combine Peekaboo's native automation with tools from the broader MCP ecosystem.
Default Integration: Chrome DevTools MCP
Peekaboo ships with the Chrome DevTools MCP enabled by default, providing Chromium automation via the DevTools protocol:
Managing External MCP Servers
Configuration
External servers are configured in ~/.peekaboo/config.json. To disable Chrome DevTools MCP:
Available External Tools
All external tools are prefixed with their server name:
chrome-devtools:navigate_page - Navigate to URL (Chrome DevTools MCP)
chrome-devtools:click - Click elements on webpage (Chrome DevTools MCP)
chrome-devtools:screenshot - Take webpage screenshot (Chrome DevTools MCP)
github:create_issue - Create GitHub issues (GitHub server)
files:read_file - Read files (Filesystem server)
The AI agent automatically uses the best combination of native and external tools for each task.
See docs/mcp-client.md for complete documentation.
MCP Tools Available
Core Tools
image- Capture screenshots (with optional AI analysis via question parameter)list- List applications, windows, or check server statusanalyze- Analyze existing images with AI vision models (MCP-only tool, usepeekaboo image --analyzein CLI)
UI Automation Tools
see- Capture screen and identify UI elementsclick- Click on UI elements or coordinatestype- Type text into UI elements (supports escape sequences)press- Press individual keys (return, tab, escape, arrows, etc.)scroll- Scroll content in any directionhotkey- Press keyboard shortcutsswipe- Perform swipe/drag gesturesmove- Move mouse cursor to specific position or elementdrag- Perform drag and drop operations
Application & Window Management
app- Launch, quit, focus, hide, and manage applicationswindow- Manipulate windows (close, minimize, maximize, move, resize, focus)menu- Interact with application menus and system menu extrasdock- Launch apps from dock and manage dock itemsdialog- Handle dialog windows (click buttons, input text)space- Manage macOS Spaces (virtual desktops)
Utility Tools
run- Execute automation scripts from .peekaboo.json filessleep- Pause execution for specified durationclean- Clean up session cache and temporary filespermissions- Check system permissions (screen recording, accessibility)agent- Execute complex automation tasks using AI
π CLI Command Reference
Full command descriptions now live in docs/cli-command-reference.md.
π GUI Automation with Peekaboo v3
Peekaboo v3 introduces powerful GUI automation capabilities, transforming it from a screenshot tool into a complete UI automation framework for macOS. This enables AI assistants to interact with any application through natural language commands.
How It Works
The v3 automation system uses a see-then-interact workflow:
See - Capture the screen and identify UI elements
Interact - Click, type, scroll, or perform other actions
Verify - Capture again to confirm the action succeeded
π― The see Tool - UI Element Discovery
The see tool is the foundation of GUI automation. It captures a screenshot and identifies all interactive UI elements, assigning them unique Peekaboo IDs.
Discovering Available Screens
Before capturing specific screens, you can list all connected displays:
This command shows:
Screen index: Use with
see --screen-indexorimage --screen-indexDisplay name: Built-in, External, or specific model names
Resolution: Full screen resolution
Position: Coordinates in the unified desktop space
Scale factor: Retina display information
Visible area: Usable area (excluding menu bar on primary screen)
Multi-Screen Capture
When capturing multiple screens, Peekaboo automatically saves each screen as a separate file:
Primary screen:
screenshot.pngAdditional screens:
screenshot_screen1.png,screenshot_screen2.png, etc.
Display information (name, resolution) is shown for each captured screen:
Note: Annotation is automatically disabled for full screen captures due to performance constraints.
Element ID Format
B1, B2... - Buttons
T1, T2... - Text fields/areas
L1, L2... - Links
G1, G2... - Groups/containers
I1, I2... - Images
S1, S2... - Sliders
C1, C2... - Checkboxes/toggles
M1, M2... - Menu items
π±οΈ The click Tool
Click on UI elements using various targeting methods:
β¨οΈ The type Tool
Type text with support for escape sequences:
Supported Escape Sequences
\n- Newline/return\t- Tab\b- Backspace/delete\e- Escape\\- Literal backslash
π The press Tool
Press individual keys or key sequences:
Available Keys
Navigation: up, down, left, right, home, end, pageup, pagedown
Editing: delete (backspace), forward_delete, clear
Control: return, enter, tab, escape, space
Function: f1-f12
Special: caps_lock, help
π The scroll Tool
Scroll content in any direction:
β¨οΈ The hotkey Tool
Press keyboard shortcuts:
π The swipe Tool
Perform swipe or drag gestures:
π±οΈ The move Tool
Move the mouse cursor to specific positions or UI elements:
π― The drag Tool
Perform drag and drop operations between UI elements or coordinates:
π The permissions Tool
Check macOS system permissions required for automation:
π The run Tool - Automation Scripts
Execute complex automation workflows from JSON script files:
Script Format (.peekaboo.json)
π― Automatic Window Focus Management
Peekaboo v3 includes intelligent window focus management that ensures your automation commands target the correct window, even across different macOS Spaces (virtual desktops).
How Focus Management Works
All interaction commands (click, type, scroll, menu, hotkey, drag) automatically:
Track window identity - Using stable window IDs that persist across interactions
Detect window location - Find which Space contains the target window
Switch Spaces if needed - Automatically switch to the window's Space
Focus the window - Ensure the window is frontmost before interaction
Verify focus - Confirm the window is ready before proceeding
Focus Options
All interaction commands support these focus-related flags:
Space Management Commands
Peekaboo provides dedicated commands for managing macOS Spaces:
Window Focus Command
For explicit window focus control:
Focus Behavior
By default, Peekaboo:
Automatically focuses windows before any interaction
Switches Spaces when the target window is on a different desktop
Waits for focus to ensure the window is ready
Retries if needed with exponential backoff
This ensures reliable automation across complex multi-window, multi-Space workflows without manual window management.
π» AI Agent Automation
Peekaboo v3 introduces an AI-powered agent that can understand and execute complex automation tasks using natural language. The agent uses OpenAI's Chat Completions API with streaming support to break down your instructions into specific Peekaboo commands.
Setting Up the Agent
Two Ways to Use the Agent
1. Direct Natural Language (Default)
When you provide a text argument without a subcommand, Peekaboo automatically uses the agent:
2. Explicit Agent Command
Use the agent subcommand for more control and options:
How the Agent Works
Understands Your Intent - The AI agent analyzes your natural language request
Plans the Steps - Breaks down the task into specific actions
Executes Commands - Uses Peekaboo's automation tools to perform each step
Verifies Results - Takes screenshots to confirm actions succeeded
Handles Errors - Can retry failed actions or adjust approach
Real-World Examples
Agent Options
--verbose- See the agent's reasoning and planning process--dry-run- Preview what the agent would do without executing--max-steps <n>- Limit the number of actions (default: 20)--model <model>- Choose OpenAI model (default: gpt-4-turbo)--json-output- Get structured JSON output--resume- Resume the latest unfinished agent session--resume <session-id>- Resume a specific session by ID
Agent Capabilities
The agent has access to all Peekaboo commands:
Visual Understanding - Can see and understand what's on screen
UI Interaction - Click buttons, fill forms, navigate menus
Text Entry - Type text, use keyboard shortcuts
Window Management - Open, close, minimize, arrange windows
Application Control - Launch apps, switch between them
File Operations - Save files, handle dialogs
Complex Workflows - Chain multiple actions together
Multiple AI Models - Supports OpenAI (GPT-4o, o3), Anthropic (Claude), and Grok (xAI)
Understanding Agent Execution
When you run an agent command, here's what happens behind the scenes:
Example Workflow
Debugging Agent Actions
Use --verbose to see exactly what the agent is doing:
Tips for Best Results
Be Specific - "Click the blue Submit button" works better than "submit"
One Task at a Time - Break complex workflows into smaller tasks
Verify State - The agent works best when it can see the current screen
Use Verbose Mode - Add
--verboseto understand what the agent is doingSet Reasonable Limits - Use
--max-stepsto prevent runaway automation
Resuming Agent Sessions
The agent supports resuming interrupted or incomplete sessions, maintaining full conversation context:
How Resume Works
Session Persistence - Each agent run creates a session with a unique ID
Thread Continuity - Uses OpenAI's thread persistence to maintain conversation history
Context Preservation - The AI remembers all previous interactions in the session
Smart Recovery - Can continue from any point, understanding what was already done
Resume Examples
βΈοΈ The sleep Tool
Pause execution between actions:
πͺ The window Tool
Comprehensive window manipulation for any application:
Window Actions
close - Close the window (animated if has close button)
minimize - Minimize to dock
maximize - Maximize/zoom window
move - Move to specific coordinates
resize - Change window dimensions
set-bounds - Set position and size in one operation
focus - Bring window to front and focus
Targeting Options
app - Target by application name (fuzzy matching supported)
title - Target by window title (substring matching)
index - Target by index (0-based, front to back order)
π₯οΈ Multi-Screen Support
Peekaboo v3 includes comprehensive multi-screen support for window management across multiple displays. When listing windows, Peekaboo shows which screen each window is on, and provides powerful options for moving windows between screens.
Screen Identification
When listing windows, each window shows its screen location:
Moving Windows Between Screens
Using Screen Index (0-based):
Using Screen Presets:
Combined Screen and Window Operations
You can combine screen movement with window positioning:
How It Works
Unified Coordinate System: macOS uses a single coordinate space across all screens
Smart Positioning: When moving windows between screens without explicit coordinates, windows maintain their relative position (e.g., a window at 25% from the left edge stays at 25% on the new screen)
Screen Detection: Windows are assigned to screens based on their center point
0-Based Indexing: Screens are indexed starting from 0, matching macOS's internal ordering
Multi-Screen with AI Agent
The AI agent understands multi-screen commands:
π The menu Tool
Interact with application menu bars and system menu extras:
Menu Subcommands
list - List all menus and their items (including keyboard shortcuts)
list-all - List menus for the frontmost application
click - Click a menu item (default if not specified)
click-extra - Click system menu extras in the status bar
Key Features
Pure Accessibility - Extracts menu structure without clicking or opening menus
Full Hierarchy - Discovers all submenus and nested items
Keyboard Shortcuts - Shows all available keyboard shortcuts
Smart Discovery - AI agents can use list to discover available options
π The app Tool
Control applications - launch, quit, focus, hide, and switch between apps:
π― The dock Tool
Interact with the macOS Dock:
π¬ The dialog Tool
Handle system dialogs and alerts:
π§Ή The clean Tool
Clean up session cache and temporary files:
Session Management
Peekaboo v3 uses sessions to maintain UI state across commands:
Sessions are created automatically by the
seetoolEach session stores screenshot data and element mappings
Sessions persist in
~/.peekaboo/session/<PID>/Element IDs remain consistent within a session
Sessions are automatically cleaned up on process exit
Best Practices
Always start with - Capture the current UI state before interacting
Use element IDs when possible - More reliable than coordinate clicking
Add delays for animations - Use
sleepafter actions that trigger animationsVerify actions - Call
seeagain to confirm actions succeededHandle errors gracefully - Check if elements exist before interacting
Clean up sessions - Use the
cleantool periodically
Example Workflows
Login Automation
Web Search
Form Filling
Troubleshooting
Elements not found - Ensure the UI is visible and not obscured
Clicks not working - Try increasing
wait_fortimeoutWrong element clicked - Use specific element IDs instead of queries
Session errors - Run
cleantool to clear corrupted sessionsPermissions denied - Grant Accessibility permission in System Settings
Debugging with Logs
Peekaboo uses macOS's unified logging system. Use pblog to monitor logs:
Note: macOS redacts log values by default, showing <private>.
See docs/pblog-guide.md and docs/logging-profiles/README.md for solutions.
π§ Configuration
For full configuration + environment variable tables, see docs/configuration.md.
π¨ Setting Up Local AI with Ollama
Need fully local models or Ultrathink experimentation? Follow the dedicated playbook in docs/ollama.md for installation, recommended models, environment variables, and troubleshooting tips. At runtime you can point Peekaboo at your Ollama server with PEEKABOO_AI_PROVIDERS="ollama/llama3.3" peekaboo agent "β¦".
π Requirements
Peekaboo needs macOS 14.0+, Screen Recording permission, and (ideally) Accessibility access. Follow docs/permissions.md for step-by-step instructions plus performance tips.
ποΈ Building from Source
See docs/building.md for prerequisites, pnpm build commands, and release script pointers.
π» Poltergeist
Use the watcher by following docs/poltergeist.md; it covers start/stop commands, tuning, and queue behavior.
π§ͺ Testing
Running Tests
Peekaboo uses Swift Testing framework (Swift 6.0+) for all test suites:
Testing the CLI
π Documentation
π Troubleshooting
Issue | Solution |
| Grant Screen Recording permission in System Settings |
| Try using fuzzy matching or list windows first |
| Check API keys and provider configuration |
| Ensure Peekaboo is in your PATH or use full path |
Enable debug logging for more details:
For step-by-step debugging, use the verbose flag:
π οΈ Development
Poltergeist - Automatic CLI Builder
Peekaboo includes Poltergeist, an automatic build system that watches Swift source files and rebuilds the CLI in the background. This ensures your CLI binary is always up-to-date during development.
Key features:
Watches all Swift source files automatically
Smart wrapper script (
./scripts/peekaboo-wait.sh) handles build coordinationExit code 42 indicates build failure - fix immediately
See Poltergeist repository for full documentation
Building from Source
π€ Contributing
Contributions are welcome! Please:
Fork the repository
Create a feature branch
Commit your changes
Push to the branch
Open a Pull Request
π License
MIT License - see LICENSE file for details.
π€ Author
Created by Peter Steinberger - @steipete
π Acknowledgments
Apple's ScreenCaptureKit for blazing-fast captures
The MCP team for the Model Context Protocol
The Swift and TypeScript communities
π Coverage
Date | Command | Scope | Line Coverage |
2025-11-13 |
| CLI + Core services under
(non-interactive subset) | 9.82β―% |
2025-11-12 |
| Entire workspace (Peekaboo + subrepos) | 8.38β―% |
Coverage generated via
xcrun llvm-cov report Apps/CLI/.build/arm64-apple-macosx/debug/peekabooPackageTests.xctest/Contents/MacOS/peekabooPackageTests -instr-profile Apps/CLI/.build/arm64-apple-macosx/debug/codecov/default.profdata. Because the CLI target depends on AXorcist, Commander, Tachikoma, and TauTUI, the figure reflects the aggregate workspace, even though automation-heavy suites remain disabled during headless runs.
Related MCP Servers
- AsecurityAlicenseAqualityProvides screenshot and OCR capabilities for macOS.Last updated -13221MIT License
- AsecurityAlicenseAqualityEnables capturing high-quality native macOS screenshots using Safari through a Node.js server, supporting various sizes, zoom levels, and load wait times.Last updated -10MIT License
- -security-license-qualityA Model Context Protocol server that provides AI vision capabilities for analyzing UI screenshots, offering tools for screen analysis, file operations, and UI/UX report generation.
- AsecurityFlicenseAqualityEnables AI tools to capture and process screenshots of a user's screen, allowing AI assistants to see and analyze what the user is looking at through a simple MCP interface.Last updated -121