MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
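The three capture regions can be pictured as rectangles on the screen. The following is a minimal sketch, not the server's actual code: the names `Region`, `Rect`, and `regionRect` are hypothetical, and it assumes that "left half" and "right half" split the screen width at its midpoint.

```typescript
// Hypothetical names for illustration; the real server may compute regions differently.
type Region = "left" | "right" | "full";

interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Compute the capture rectangle for a screen of the given pixel size.
function regionRect(screenWidth: number, screenHeight: number, region: Region): Rect {
  const half = Math.floor(screenWidth / 2);
  if (region === "left") {
    return { x: 0, y: 0, width: half, height: screenHeight };
  }
  if (region === "right") {
    // The right half absorbs the leftover column when the width is odd.
    return { x: half, y: 0, width: screenWidth - half, height: screenHeight };
  }
  // "full": the whole screen.
  return { x: 0, y: 0, width: screenWidth, height: screenHeight };
}
```

With this split, the left and right rectangles never overlap and together cover the full width.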
OCR Engines
This server uses two OCR engines:
- yomitoku
  - Primary OCR engine
  - High-accuracy Japanese text recognition
  - Runs as an API server
- Tesseract.js
  - Fallback OCR engine
  - Used when yomitoku is unavailable
  - Supports both Japanese and English recognition
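The primary/fallback arrangement described above can be sketched as follows. This is an illustration under assumptions, not the server's implementation: `OcrEngine` and `recognizeWithFallback` are hypothetical names, each engine is modeled as an async function, and "unavailable" is assumed to mean that the primary engine's call throws.

```typescript
// Hypothetical model: an OCR engine is an async function from image bytes to text.
type OcrEngine = (image: Uint8Array) => Promise<string>;

interface OcrResult {
  text: string;
  engine: "primary" | "fallback";
}

// Try the primary engine first; if it throws (for example because the
// API server is not running), run the fallback engine instead.
async function recognizeWithFallback(
  image: Uint8Array,
  primary: OcrEngine,
  fallback: OcrEngine,
): Promise<OcrResult> {
  try {
    return { text: await primary(image), engine: "primary" };
  } catch {
    return { text: await fallback(image), engine: "fallback" };
  }
}
```

In a real setup, `primary` would wrap an HTTP request to the yomitoku API and `fallback` would wrap a Tesseract.js call; both are left abstract here because their exact request and response shapes are not given in this document.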
Installation
Claude Desktop Configuration
Add an entry for this server to the mcpServers section of your claude_desktop_config.json.
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
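One plausible way to resolve this variable is shown below. It is a sketch under assumptions: `resolveOcrApiUrl` is a hypothetical helper, and treating a blank value as unset and stripping trailing slashes are illustrative choices rather than documented behavior.

```typescript
// Default taken from the table above.
const DEFAULT_OCR_API_URL = "http://localhost:8000";

// Hypothetical helper: pick the yomitoku API base URL from an environment map.
function resolveOcrApiUrl(env: Record<string, string | undefined>): string {
  const value = env["OCR_API_URL"];
  // Assumption: an unset or blank variable means "use the default".
  const base = value !== undefined && value.trim() !== "" ? value.trim() : DEFAULT_OCR_API_URL;
  // Strip trailing slashes so request paths can be appended consistently.
  return base.replace(/\/+$/, "");
}
```

In Node.js the helper would typically be called as `resolveOcrApiUrl(process.env)`.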
Usage Example
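Once the server is configured, ask Claude in plain language to take a screenshot and read the text on it; Claude then calls the capture tool described below.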
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
- region: Screenshot area ('left', 'right', or 'full'; default: 'left')
- format: Output format ('json', 'markdown', 'vertical', or 'horizontal'; default: 'markdown')
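The option values and defaults listed above can be expressed as a small validation step. This is a minimal sketch, not the tool's actual schema: `parseCaptureOptions` and the type names are hypothetical, and rejecting unknown values with an error is an assumption about how invalid input might be handled.

```typescript
// Allowed values, as listed in the tool specification.
const REGIONS = ["left", "right", "full"] as const;
const FORMATS = ["json", "markdown", "vertical", "horizontal"] as const;

type CaptureRegion = (typeof REGIONS)[number];
type CaptureFormat = (typeof FORMATS)[number];

interface CaptureOptions {
  region: CaptureRegion;
  format: CaptureFormat;
}

// True when `value` is one of the strings in `allowed`.
function isOneOf(allowed: readonly string[], value: unknown): boolean {
  return typeof value === "string" && allowed.indexOf(value) !== -1;
}

// Hypothetical helper: apply the documented defaults and reject unknown values.
function parseCaptureOptions(input: { region?: unknown; format?: unknown } = {}): CaptureOptions {
  const region = input.region === undefined ? "left" : input.region;
  const format = input.format === undefined ? "markdown" : input.format;
  if (!isOneOf(REGIONS, region)) {
    throw new Error("Invalid region: " + String(region));
  }
  if (!isOneOf(FORMATS, format)) {
    throw new Error("Invalid format: " + String(format));
  }
  return { region: region as CaptureRegion, format: format as CaptureFormat };
}
```

Calling `parseCaptureOptions()` with no arguments yields the documented defaults, a left-half capture returned as Markdown.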
License
MIT
Author
kazuph
local-only server
The server can only run on the client's local machine because it depends on local resources.
Tools
Provides screenshot and OCR capabilities for macOS.