ocr-mcp
Server Configuration
Describes the environment variables required to run the server.
| Name | Required | Description | Default |
|---|---|---|---|
| OCR_MCP_ALLOWED_DIRS | No | An os.pathsep-separated list of directories to restrict OCR file access to. |
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | {
"listChanged": false
} |
| prompts | {
"listChanged": false
} |
| resources | {
"subscribe": false,
"listChanged": false
} |
| experimental | {} |
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| list_enginesA | List OCR engines and whether each is currently usable on this machine. Returns JSON: for each engine -> {available, status}. Call this first to see which engines compare_engines will actually run. |
| ocr_imageA | OCR a single image file (PNG/JPG/TIFF/BMP). Args: path: absolute path to the image. engine: 'auto' (RapidOCR), 'rapidocr', 'tesseract', or 'finereader'. lang: ISO 639-1 code ('en','de','fr','ro',...). Mapped per-engine. preprocess: if true, apply grayscale/denoise/deskew first (needs opencv). Returns JSON: {engine, ok, text, mean_confidence, line_count, low_confidence_count, lines:[{text,confidence,bbox}], warnings}. |
| ocr_pdfA | OCR a PDF by rasterizing pages (PyMuPDF) then running an engine per page. Args: path: absolute path to the PDF. engine/lang: see ocr_image. pages: 'all' or a range like '1-3,5'. dpi: rasterization DPI (default 300; higher = slower, more accurate). Returns JSON: {page_count, pages:[{page, ...ocr_image result...}], full_text}. |
| batch_ocrA | OCR many images. |
| compare_enginesA | Run ALL available engines on one image and compare them — the core accuracy tool when you have no ground truth. Returns JSON: per-engine {text, mean_confidence, ok}, plus pairwise text similarity, average agreement, and a 'consensus_engine' (the one whose output best agrees with the others). |
| evaluate_accuracyA | Score OCR output against a ground-truth text file (CER/WER). Provide EITHER Returns JSON: {cer, wer, char_accuracy_pct, word_accuracy_pct, substitutions, deletions, insertions, hits}. |
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
No resources | |
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/Prekzursil/abbyy-finereader-ocr-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server