MCP Screenshot

An MCP server that captures screenshots and runs OCR on them.

Features

- Screenshot capture (left half, right half, full screen)
- OCR (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
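
The three capture regions can be read as rectangles on the screen. The sketch below is illustrative only: `regionToRect` and `Rect` are hypothetical names, and the server's actual cropping logic is not described in this README. It assumes the screen is split at the horizontal midpoint.

```typescript
type Region = "left" | "right" | "full";

interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Hypothetical helper: maps a region name to a pixel rectangle for a screen
// of the given size. Assumes a split at the horizontal midpoint.
function regionToRect(region: Region, screenWidth: number, screenHeight: number): Rect {
  const half = Math.floor(screenWidth / 2);
  switch (region) {
    case "left":
      return { x: 0, y: 0, width: half, height: screenHeight };
    case "right":
      // The right half takes the remaining columns, so odd widths lose no pixels.
      return { x: half, y: 0, width: screenWidth - half, height: screenHeight };
    case "full":
      return { x: 0, y: 0, width: screenWidth, height: screenHeight };
  }
}

// Left half of a 1920x1080 screen: x = 0, width = 960, height = 1080.
console.log(regionToRect("left", 1920, 1080));
```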

OCR Engines

This server uses two OCR engines:

yomitoku
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server

Tesseract.js
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
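
The primary/fallback arrangement can be sketched as a small wrapper. This is a minimal illustration rather than the server's code: `OcrEngine`, `OcrResult`, and `recognizeWithFallback` are hypothetical names, and each engine is modelled as a plain async function so that the yomitoku API call and the Tesseract.js call can be plugged in without this sketch assuming their signatures.

```typescript
// An OCR engine is modelled as any async function from image bytes to text.
type OcrEngine = (image: Uint8Array) => Promise<string>;

interface OcrResult {
  engine: "primary" | "fallback";
  text: string;
}

// Hypothetical wrapper: try the primary engine first and use the fallback
// only if the primary throws (for example, because its API server is down).
async function recognizeWithFallback(
  image: Uint8Array,
  primary: OcrEngine,
  fallback: OcrEngine
): Promise<OcrResult> {
  try {
    return { engine: "primary", text: await primary(image) };
  } catch {
    return { engine: "fallback", text: await fallback(image) };
  }
}

// Stub engines standing in for yomitoku (unreachable here) and Tesseract.js.
const unavailable: OcrEngine = async () => {
  throw new Error("connection refused");
};
const stub: OcrEngine = async () => "hello";

recognizeWithFallback(new Uint8Array(), unavailable, stub).then((result) => {
  console.log(result.engine, result.text); // fallback hello
});
```

Reporting which engine produced the text is a design choice in this sketch; it makes it easy to notice when results are coming from the fallback.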

Installation

npx -y @kazuph/mcp-screenshot

Claude Desktop Configuration

Add the following configuration to your claude_desktop_config.json:

{
  "mcpServers": {
    "screenshot": {
      "command": "npx",
      "args": ["-y", "@kazuph/mcp-screenshot"],
      "env": {
        "OCR_API_URL": "http://localhost:8000"
      }
    }
  }
}

Environment Variables

| Variable Name | Description | Default Value |
| --- | --- | --- |
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
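
One way to apply this default is shown below. It is a sketch under stated assumptions: `resolveOcrApiUrl` is a hypothetical helper, not taken from the server's source, and treating an empty value the same as an unset one is an assumption.

```typescript
const DEFAULT_OCR_API_URL = "http://localhost:8000";

// Hypothetical helper: returns OCR_API_URL from the given environment,
// or the documented default when the variable is unset or blank.
function resolveOcrApiUrl(env: Record<string, string | undefined>): string {
  const value = env["OCR_API_URL"];
  return value !== undefined && value.trim() !== "" ? value : DEFAULT_OCR_API_URL;
}

console.log(resolveOcrApiUrl({})); // http://localhost:8000
console.log(resolveOcrApiUrl({ OCR_API_URL: "http://ocr.local:9000" })); // http://ocr.local:9000
```

In a Node.js process, `process.env` could be passed as the argument.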

Usage Example

Ask Claude in natural language, for example:

Please take a screenshot of the left half of the screen and recognize the text in it.

Tool Specification

capture

Takes a screenshot and performs OCR.

Options:

- region: Screenshot area ('left'/'right'/'full', default: 'left')
- format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
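
The options and their defaults can be written down as types. The sketch below is illustrative: `normalizeCaptureOptions` and `buildCaptureCall` are hypothetical helpers, not part of the server. The request object follows the general MCP `tools/call` shape; an MCP client such as Claude Desktop normally builds it for you.

```typescript
const REGIONS = ["left", "right", "full"] as const;
const FORMATS = ["json", "markdown", "vertical", "horizontal"] as const;

type CaptureRegion = (typeof REGIONS)[number];
type CaptureFormat = (typeof FORMATS)[number];

interface CaptureOptions {
  region: CaptureRegion;
  format: CaptureFormat;
}

// Hypothetical helper: fills in the documented defaults and rejects
// values outside the documented sets.
function normalizeCaptureOptions(
  input: { region?: string; format?: string } = {}
): CaptureOptions {
  const region = input.region ?? "left";
  const format = input.format ?? "markdown";
  if ((REGIONS as readonly string[]).indexOf(region) === -1) {
    throw new Error(`unknown region: ${region}`);
  }
  if ((FORMATS as readonly string[]).indexOf(format) === -1) {
    throw new Error(`unknown format: ${format}`);
  }
  return { region: region as CaptureRegion, format: format as CaptureFormat };
}

// Hypothetical helper: wraps the options in a JSON-RPC request that
// calls the `capture` tool.
function buildCaptureCall(id: number, options: CaptureOptions) {
  return {
    jsonrpc: "2.0" as const,
    id,
    method: "tools/call" as const,
    params: { name: "capture", arguments: options },
  };
}

// With no options given, the defaults apply: region "left", format "markdown".
console.log(JSON.stringify(buildCaptureCall(1, normalizeCaptureOptions())));
```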

License

MIT

Author

kazuph

local-only server

The server can only run on the client's local machine because it depends on local resources.

Tools

capture

Provides screenshot and OCR capabilities for macOS.
