Information about olmOCR

Search for:

Information about olmOCR

View all MCP Servers

Why this server?
This server enables recording audio from a microphone and transcribing it using OpenAI's Whisper model, which can be useful for processing speech to text, although it does not directly perform OCR.
Voice Recorder MCP Server
Speech Processing Audio Processing Command Line
DefiBax
A
license
-
quality
D
maintenance
Enables recording audio from a microphone and transcribing it using OpenAI's Whisper model. Works as both a standalone MCP server and a Goose AI agent extension.
Last updated 2025-03-21
6
MIT
Why this server?
This MCP server extracts text content from local PDF files and supports OCR capabilities, which can be used with the Model Context Protocol (MCP).
PDF Extraction MCP Server
File Systems Text Summarization
xraywu
F
license
C
quality
D
maintenance
An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
Last updated 2025-05-31
1
31
Why this server?
This server provides image recognition capabilities and offers optional text extraction via Tesseract OCR, which can be useful for processing images to text.
MCP Image Recognition Server
Image & Video Processing App Automation
mario-andreschak
A
license
A
quality
D
maintenance
Provides image recognition capabilities using Anthropic Claude Vision and OpenAI GPT-4 Vision APIs, supporting multiple image formats and offering optional text extraction via Tesseract OCR.
Last updated 2025-04-12
3
39
MIT
Why this server?
This MCP service retrieves image dimensions and also can compress images which may indirectly help with OCR processing after retrieval.
image-tools-mcp
Image & Video Processing Multimedia Processing
kshern
A
license
C
quality
D
maintenance
Image Tools MCP is a Model Context Protocol (MCP) service that retrieves image dimensions and compresses images from URLs and local files using the TinyPNG API. It supports converting images to formats like webp, jpeg/jpg, and png, providing detailed information on width, height, type, and compressi
Last updated 2025-06-26
2
39
10
MIT
Why this server?
This server can fetch web content and transform it into various formats; this might be useful for retrieving images or documents from web sources for OCR.
MCP NPX Fetch
Web Scraping Browser Automation RAG Systems
tokenizin-agency
A
license
A
quality
D
maintenance
A powerful MCP server for fetching and transforming web content into various formats (HTML, JSON, Markdown, Plain Text) with ease.
Last updated 2024-12-26
4
2,395
41
MIT
Why this server?
This server converts various file types and web content to Markdown format; although it does not directly provide OCR, converting a document to Markdown might simplify the OCR process or its integration with LLMs.
Markdownify MCP Server
File Systems App Automation Content Management Systems
zcaceres
A
license
A
quality
C
maintenance
Converts various file types and web content to Markdown format. It provides a set of tools to transform PDFs, images, audio files, web pages, and more into easily readable and shareable Markdown text.
Last updated 2026-07-09
10
434
2,784
MIT
Why this server?
This server enables extraction and usage of content from unstructured documents across a variety of file formats, which could include preparation for OCR tasks.
Unstructured Document
Documentation Access Developer Tools Search
MKhalusova
F
license
B
quality
D
maintenance
A Model Context Protocol server that enables LLMs to extract and use content from unstructured documents across a wide variety of file formats.
Last updated 2025-03-20
1
11