Supports outputting OCR results in Markdown format
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
Screenshot capture (left half, right half, full screen)
OCR text recognition (supports Japanese and English)
Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
Primary OCR engine
High-accuracy Japanese text recognition
Runs as an API server
Fallback OCR engine
Used when yomitoku is unavailable
Supports both Japanese and English recognition
Installation
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json
:
Environment Variables
Variable Name | Description | Default Value |
OCR_API_URL | yomitoku API base URL |
Usage Example
You can use it by instructing Claude like this:
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region
: Screenshot area ('left'/'right'/'full', default: 'left')format
: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
local-only server
The server can only run on the client's local machine because it depends on local resources.
Tools
Provides screenshot and OCR capabilities for macOS.
Related MCP Servers
- AsecurityAlicenseAqualityEnables capturing high-quality native macOS screenshots using Safari through a Node.js server, supporting various sizes, zoom levels, and load wait times.Last updated -10MIT License
- AsecurityFlicenseAqualityEnables AI tools to capture and process screenshots of a user's screen, allowing AI assistants to see and analyze what the user is looking at through a simple MCP interface.Last updated -118
Textin MCP Serverofficial
AsecurityAlicenseAqualityA server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.Last updated -34523MIT License- AsecurityAlicenseAqualityA macOS utility that captures screenshots and analyzes them with AI vision, enabling AI assistants to see and interpret what's on your screen.Last updated -35,759632MIT License
Appeared in Searches
- MCP server for screenshot viewing and automatic cropping
- How to connect to the internet, browse a webpage, and take screenshots
- A microcontroller with image processing capabilities
- A system or tool for enabling computer vision capabilities in an MCP (Microcontroller Platform)
- Tools and Methods for Image Generation