Why this server?
This server is specifically named 'OCR-MCP' and its description highlights advanced OCR capabilities with multiple state-of-the-art backends and multi-format output.
-securityAlicense-qualityProvides advanced OCR capabilities with multiple state-of-the-art backends (DeepSeek-OCR, Florence-2, DOTS.OCR, PP-OCRv5), supporting document processing, scanner integration, and multi-format output with layout preservation.Last updated 20 days ago9MITWhy this server?
This server directly provides 'OCR services' powered by Google's Gemini API for high-accuracy text recognition from images.
-securityFlicense-qualityProvides OCR services powered by Google's Gemini API to extract text from images via file paths or base64 strings. It enables high-accuracy text recognition and CAPTCHA processing through simple MCP tools.Last updated 9 months ago4Why this server?
This server explicitly states it provides 'OCR capabilities to extract text from PDF documents using Tesseract' and supports multiple languages.
-securityFlicense-qualityProvides OCR capabilities to extract text from PDF documents using Tesseract, with support for multiple languages including English and Simplified Chinese.Last updated 9 months ago2Why this server?
This server clearly enables 'OCR capabilities to recognize text from images, PDFs, and Word documents' and can convert them to Markdown.

Textin MCP Serverofficial
AsecurityAlicense-qualityA server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.Last updated 10 months ago32428MITWhy this server?
This server offers 'offline, high-accuracy OCR capabilities for images and PDFs' using macOS's Vision framework, with multi-language support.
-securityFlicense-qualityProvides offline, high-accuracy OCR capabilities for images and PDFs using macOS's built-in Vision framework. Supports multi-language text extraction with intelligent block aggregation for tables and paragraphs, outputting structured JSON data suitable for document reconstruction.Last updated 3 months ago1Why this server?
This server provides 'optical character recognition (OCR) capabilities' for screen capture, allowing text extraction from images.
AsecurityFlicense-qualityProvides screen capture and optical character recognition (OCR) capabilities for entire displays or specific application windows. It enables users to list running applications, take screenshots, and extract text from images using multi-language support.Last updated 9 months ago51Why this server?
This server integrates with a 'Handwriting OCR service' to process images and PDF documents and retrieve OCR results.
-securityFlicense-qualityEnables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.Last updated a year ago15Why this server?
This server is directly named 'mcp-mistral-ocr' and explicitly states its purpose is to 'OCR images or pdfs' using the Mistral OCR API.
-securityAlicense-qualityOCR images or pdfs, locally or by URLs by using Mistral OCR API (paid)Last updated 2 months ago36MITWhy this server?
This server is named 'MCP OCR Server' and focuses on extracting text from images using Tesseract OCR, supporting various inputs and multiple languages.
-securityAlicense-qualityExtracts text from images using Tesseract OCR with support for local files, URLs, and raw image bytes. It provides production-grade OCR capabilities and multi-language support through the Model Context Protocol.Last updated 3 months ago35MIT