ReadPDFx - OCR PDF MCP Server

CHANGELOG.md•5.18 KiB

# Changelog All notable changes to the OCR PDF MCP Server will be documented in this file. The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/), and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html). ## [1.0.1] - 2025-11-02 ### Added - **HTTP REST API Server** for ML Studio and HTTP client compatibility - **LM Studio SSE Support**: Server-Sent Events streaming resolving HTTP 405 errors ✨ - **ML Studio Integration**: Complete HTTP endpoint support resolving 404 errors - **HTTP Server Launcher**: Easy start script with `start_http_server.py` - **HTTP Testing Suite**: Automated testing with `test_http_server.py` and `test_lm_studio.py` - **Dual Mode Support**: Both MCP Protocol and HTTP REST API in same package - **File Upload Endpoint**: Support for direct PDF file uploads via HTTP - **Interactive API Documentation**: Swagger UI and ReDoc interfaces - **Health Monitoring**: Dedicated health check endpoints for monitoring - **Streaming Endpoints**: SSE events and streaming support for real-time clients - **CORS Enhancement**: Complete CORS support with preflight handling - **ML Studio Guide**: Comprehensive integration documentation - **LM Studio Guide**: HTTP 405 error fix documentation ### Enhanced - **Cross-Client Compatibility**: Supports both MCP clients and HTTP clients - **Production Dependencies**: Added FastAPI, Uvicorn, and Python-multipart - **Error Handling**: HTTP-specific error responses with proper status codes - **Documentation**: Updated README with dual-mode usage instructions ### Fixed - **LM Studio HTTP 405 Error**: Resolved by implementing SSE streaming support ✅ - **ML Studio 404 Issue**: Resolved by implementing proper HTTP endpoints ✅ - **CORS Preflight**: Added OPTIONS handler for all endpoints - **SSE Streaming**: Server-Sent Events for real-time communication - **Type Annotations**: Fixed Optional type hints for HTTP parameters - **File Validation**: Enhanced upload file validation and error handling ## [1.0.0] - 2024-12-19 ### Added - **Production-ready MCP server** with comprehensive error handling - **Four core MCP tools** for PDF processing: - `get_pdf_info`: Extract metadata and basic information from PDF files - `extract_pdf_text`: Extract text directly from PDF files (no OCR) - `ocr_pdf`: Perform OCR on scanned PDF files with customizable options - `process_pdf_smart`: Intelligent processing that chooses optimal method - **Automated installer** supporting multiple MCP clients: - Claude Desktop - Cline (VS Code Extension) - Continue.dev - **Cross-platform support** for Windows, macOS, and Linux - **Multi-language OCR** with Tesseract integration - **Parallel processing** for improved performance - **Comprehensive configuration management** with environment variables - **Production-grade logging** with configurable levels - **Complete documentation** including installation and usage guides - **Version management system** with centralized version tracking - **Security features** including file validation and size limits - **Performance optimizations** with smart caching and resource management ### Features - **OCR Engine**: Tesseract 5.4.0+ with multi-language support - **PDF Processing**: PyMuPDF-based text extraction and image conversion - **Configuration**: Environment-based configuration with smart defaults - **Error Handling**: Comprehensive error management with detailed logging - **Performance**: Configurable parallel processing and timeout controls - **Security**: File type validation, size limits, and path sanitization - **Compatibility**: MCP Protocol v1.20.0 compliance ### Documentation - Complete installation guide for all supported platforms - Client-specific configuration instructions - API documentation with examples - Troubleshooting guide - Performance tuning recommendations - Multi-language OCR setup instructions ### Testing - Comprehensive test suite with 8 PDF samples - 100% success rate in production testing - 20,790+ characters successfully extracted during testing - Cross-platform compatibility verified - Multi-client installation tested and verified ### Infrastructure - Git repository with proper .gitignore and .gitattributes - Docker support with .dockerignore - Environment template with comprehensive examples - Automated build and deployment scripts - Version tracking and changelog maintenance ### Technical Specifications - **Python**: 3.8+ (tested with 3.12) - **MCP Protocol**: v1.20.0 - **Dependencies**: 8 core packages with automatic installation - **Memory Usage**: Optimized for large PDF processing - **Performance**: Multi-threaded OCR processing - **Security**: Input validation and resource limits ### Installation Methods - Automated installer for immediate deployment - Manual installation with step-by-step guide - Docker deployment ready - Multiple MCP client support --- ## [Unreleased] ### Planned Features - REST API mode for HTTP access - Batch processing capabilities - Advanced OCR preprocessing options - Custom model integration - Performance monitoring dashboard - Plugin system for extended functionality --- **Note**: This is the initial production release. All features have been thoroughly tested and are considered stable for production use.

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/irev/mcp-readpdfx'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

CHANGELOG.md•5.18 KiB