Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MCP OCR ServerExtract the text from https://example.com/invoice.png"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP OCR Server
A production-grade OCR server built using MCP (Model Context Protocol) that provides OCR capabilities through a simple interface.
Features
Extract text from images using Tesseract OCR
Support for multiple input types:
Local image files
Image URLs
Raw image bytes
Automatic Tesseract installation
Support for multiple languages
Production-ready error handling
Installation
Tesseract will be installed automatically on supported platforms:
macOS (via Homebrew)
Linux (via apt, dnf, or pacman)
Windows (manual installation instructions provided)
Usage
As an MCP Server
Start the server:
Configure Claude for Desktop: Add to
~/Library/Application Support/Claude/claude_desktop_config.json:
Available Tools
perform_ocr
Extract text from images:
get_supported_languages
List available OCR languages:
Development
Clone the repository:
Set up development environment:
Run tests:
Contributing
Fork the repository
Create your feature branch (
git checkout -b feature/amazing-feature)Commit your changes (
git commit -m 'Add amazing feature')Push to the branch (
git push origin feature/amazing-feature)Open a Pull Request
Security
Never commit API tokens or sensitive credentials
Use environment variables or secure credential storage
Follow GitHub's security best practices
License
This project is licensed under the MIT License - see the LICENSE file for details.