mcp-screenshot
by kazuph
local-only server
The server can only run on the client’s local machine because it depends on local resources.
Integrations
Supports outputting OCR results in Markdown format
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
- yomitoku
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
- Tesseract.js
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
Copy
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json
:
Copy
Environment Variables
Variable Name | Description | Default Value |
---|---|---|
OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Copy
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region
: Screenshot area ('left'/'right'/'full', default: 'left')format
: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
You must be authenticated.
Provides screenshot and OCR capabilities for macOS.