MCP-MinerU
MCP server for document and image parsing via MinerU. Extract text, tables, and formulas from PDFs, screenshots, and scanned documents with MLX acceleration on Apple Silicon.
Installation
This command installs and configures the server for all your Claude Code projects using uvx (no manual installation required).
Alternative methods: See Installation Guide for PyPI, source installation, and Claude Desktop configuration.
Features
Multiple format support: PDF, JPEG, PNG, and other image formats
OCR capabilities: Built-in text extraction from screenshots and photos
Table recognition: Preserves structure when extracting tables
Formula extraction: Converts mathematical equations to LaTeX
MLX acceleration: Optimized for Apple Silicon (M1/M2/M3/M4)
Multiple backends: Trade speed against quality by choosing among pipeline, vlm-mlx-engine, and vlm-transformers
Quick Start
Parse a PDF document
Extract text from a screenshot
Check system capabilities
For more examples, see Usage Examples.
Tools
parse_pdf
Parse PDF and image files to extract structured content as Markdown.
Parameters:
file_path (required): Absolute path to file (PDF, JPEG, PNG, etc.)
backend (optional): pipeline | vlm-mlx-engine | vlm-transformers
formula_enable (optional): Enable formula recognition (default: true)
table_enable (optional): Enable table recognition (default: true)
start_page (optional): Starting page for PDFs (default: 0)
end_page (optional): Ending page for PDFs (default: -1)
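The parameters above can be assembled and sanity-checked before a tool call is made. The following is a minimal sketch, not part of MCP-MinerU: `build_parse_pdf_args` is a hypothetical helper, and it assumes that pages are zero-indexed and that `end_page=-1` means "through the last page", which the defaults suggest but this README does not state.

```python
from pathlib import PurePosixPath

# Backend names as listed in the parameter description above.
BACKENDS = {"pipeline", "vlm-mlx-engine", "vlm-transformers"}


def build_parse_pdf_args(file_path, backend=None, formula_enable=True,
                         table_enable=True, start_page=0, end_page=-1):
    """Hypothetical helper: build the arguments dict for parse_pdf."""
    # file_path is documented as an absolute path.
    if not PurePosixPath(file_path).is_absolute():
        raise ValueError(f"file_path must be absolute: {file_path!r}")
    if backend is not None and backend not in BACKENDS:
        raise ValueError(f"unknown backend: {backend!r}")
    # Assumption: -1 is a sentinel for "last page"; otherwise the range
    # must not be negative or reversed.
    if start_page < 0 or (end_page != -1 and end_page < start_page):
        raise ValueError("invalid page range")

    args = {
        "file_path": file_path,
        "formula_enable": formula_enable,
        "table_enable": table_enable,
        "start_page": start_page,
        "end_page": end_page,
    }
    # Omit backend when unset so the server can apply its own default.
    if backend is not None:
        args["backend"] = backend
    return args


if __name__ == "__main__":
    print(build_parse_pdf_args("/docs/paper.pdf", backend="pipeline", end_page=4))
```

Leaving `backend` out of the dict, rather than sending a guessed default, keeps the choice with the server.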
list_backends
Check system capabilities and get backend recommendations.
Returns: System information, available backends, and performance recommendations.
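An MCP client such as Claude Code normally issues this call for you. For readers curious about what goes over the wire, here is a rough sketch of the JSON-RPC `tools/call` message that the Model Context Protocol defines. The helper name `tools_call_request` is hypothetical, and the empty `arguments` object is an assumption based on this README listing no parameters for `list_backends`. The response format is not shown because it is not specified here.

```python
import json


def tools_call_request(tool_name, arguments=None, request_id=1):
    """Hypothetical helper: build an MCP tools/call JSON-RPC request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": tool_name,
            # Assumption: list_backends takes no arguments.
            "arguments": arguments or {},
        },
    }


if __name__ == "__main__":
    print(json.dumps(tools_call_request("list_backends"), indent=2))
```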
Supported Formats
PDF documents (.pdf)
JPEG images (.jpg, .jpeg)
PNG images (.png)
Other image formats (WebP, GIF, etc.)
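A quick extension check can catch unsupported files before they reach the server. This sketch is illustrative only: `looks_supported` is a hypothetical helper, and its extension set covers just the formats named above. Because the list ends in "etc.", the server may accept more than this.

```python
from pathlib import PurePosixPath

# Extensions named in this README; not necessarily exhaustive.
KNOWN_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png", ".webp", ".gif"}


def looks_supported(file_path):
    """Return True if the file extension is one named in the README."""
    # Compare case-insensitively so "scan.PDF" and "scan.pdf" match alike.
    return PurePosixPath(file_path).suffix.lower() in KNOWN_EXTENSIONS


if __name__ == "__main__":
    for path in ("/scans/Invoice.PDF", "/shots/screen.png", "/notes/readme.txt"):
        print(path, looks_supported(path))
```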
Performance
Benchmarked on Apple Silicon M4 (16GB RAM):
pipeline: ~32s/page, CPU-only, good quality
vlm-mlx-engine: ~38s/page, Apple Silicon optimized, excellent quality
vlm-transformers: ~148s/page, highest quality, slowest
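To get a feel for what these figures mean for a whole document, the per-page timings can be multiplied out. The sketch below simply scales the benchmark numbers above linearly. That is an assumption: real throughput depends on page content, hardware, and model warm-up, so treat the results as rough estimates.

```python
# Approximate seconds per page, taken from the M4 (16GB RAM) benchmark above.
SECONDS_PER_PAGE = {
    "pipeline": 32,
    "vlm-mlx-engine": 38,
    "vlm-transformers": 148,
}


def estimate_seconds(backend, pages):
    """Linear estimate of total parse time; ignores startup overhead."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return SECONDS_PER_PAGE[backend] * pages


if __name__ == "__main__":
    for backend in SECONDS_PER_PAGE:
        minutes = estimate_seconds(backend, 10) / 60
        print(f"{backend}: ~{minutes:.1f} min for 10 pages")
    # pipeline: ~5.3 min for 10 pages
    # vlm-mlx-engine: ~6.3 min for 10 pages
    # vlm-transformers: ~24.7 min for 10 pages
```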
Documentation
Installation Guide - Detailed installation options
Updating Guide - How to update to the latest version
Usage Examples - More use cases and API reference
MinerU Documentation - Underlying parsing engine
Development
License
Apache License 2.0 - see LICENSE file for details.
Acknowledgments
Built on top of MinerU by OpenDataLab.