image
Capture macOS screen content from entire displays, specific applications, or individual windows, then optionally analyze images with AI models to answer questions about visual content.
Instructions
Captures macOS screen content and optionally analyzes it. Targets can be entire screen, specific app window, or all windows of an app (via app_target). Supports foreground/background capture. Output via file path or inline Base64 data (format: "data"). If a question is provided, image is analyzed by an AI model (auto-selected from PEEPIT_AI_PROVIDERS). Window shadows/frames excluded. PeepIt MCP 1.0.0-beta.1 using openai/gpt-4o, ollama/llava:latest
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| app_target | No | Optional. Specifies the capture target. For example: Omit or use an empty string (e.g., `''`) for all screens. Use `'screen:INDEX'` (e.g., `'screen:0'`) for a specific display. Use `'frontmost'` for all windows of the current foreground application. Use `'AppName'` (e.g., `'Safari'`) for all windows of that application. Use `'PID:PROCESS_ID'` (e.g., `'PID:663'`) to target a specific process by its PID. Use `'AppName:WINDOW_TITLE:Title'` (e.g., `'TextEdit:WINDOW_TITLE:My Notes'`) for a window of 'AppName' matching that title. Use `'AppName:WINDOW_INDEX:Index'` (e.g., `'Preview:WINDOW_INDEX:0'`) for a window of 'AppName' at that index. Ensure components are correctly colon-separated. | |
| path | Yes | Optional. Base absolute path for saving the image. Relevant if `format` is `'png'`, `'jpg'`, or if `'data'` is used with the intention to also save the file. If a `question` is provided and `path` is omitted, a temporary path is used for image capture, and this temporary file is deleted after analysis. | |
| question | No | Optional. If provided, the captured image will be analyzed by an AI model. The server automatically selects an AI provider from the `PEEPIT_AI_PROVIDERS` environment variable. The analysis result (text) is included in the response. | |
| format | Yes | Optional. Output format. Can be `'png'`, `'jpg'`, `'jpeg'` (alias for jpg), or `'data'`. Format is case-insensitive (e.g., 'PNG', 'Png', 'png' are all valid). If `'png'` or `'jpg'`, saves the image to the specified `path`. If `'data'`, returns Base64 encoded PNG data inline in the response. If `path` is also provided when `format` is `'data'`, the image is saved (as PNG) AND Base64 data is returned. Defaults to `'data'` if `path` is not given. Invalid format values automatically fall back to 'png'. | |
| capture_focus | No | Optional. Focus behavior. 'auto' (default): bring target to front only if not already active. 'background': capture without altering window focus. 'foreground': always bring target to front before capture. | auto |