The MinerU MCP Server enables document parsing and content extraction (text, tables, formulas) from various file formats using the MinerU API.
Core Capabilities:
Parse single documents (
mineru_parse): Submit PDFs, DOC/DOCX, PPT/PPTX, or images (PNG, JPG, JPEG) via URL, with options for model selection (pipelinefor speed orvlmfor 90%+ accuracy), page ranges, OCR (109 languages), formula/table recognition, and extra export formats (Markdown, DOCX, HTML, LaTeX)Batch processing (
mineru_batch): Submit up to 200 document URLs or local files in a single request with the same parsing optionsLocal file support: Upload files from disk for parsing, with original filenames preserved (spaces converted to underscores)
Monitor single task status (
mineru_status): Poll parsing progress by task ID and retrieve the download URL on completion; supports concise or detailed outputMonitor batch status (
mineru_batch_status): Track batch job results with pagination (limit/offset) and concise or detailed outputDownload results: Retrieve parsed content as named Markdown files
Limits: 200MB max file size, 600 pages max per file, up to 200 files per batch request.
Provides document parsing capabilities for JPEG images (along with PNG, PDF, DOC, DOCX, PPT, PPTX formats) through the MinerU API, with support for OCR, formula recognition, table recognition, and page range selection across 109 languages.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@MinerU MCP Serverparse this PDF from pages 1-5 using VLM model with OCR enabled"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
mineru-mcp
MCP server for MinerU document parsing API — extract text, tables, and formulas from PDFs, DOCs, and images.
Features
VLM model — 90%+ accuracy for complex documents
Pipeline model — Fast processing for simple documents
Local file upload — Upload files from disk for batch parsing
Batch processing — Parse up to 200 documents at once
Download & rename — Extract markdown with original filenames
Page ranges — Extract specific pages only
109 language OCR support
Optimized for Claude Code — 73% token reduction vs alternatives
Tools
Tool | Description |
| Parse a document URL |
| Check task progress, get download URL |
| Parse multiple URLs (max 200) |
| Get batch results with pagination |
| Upload local files for batch parsing |
| Download results as named markdown files |
Installation
Requires Node.js 18+ and a MinerU API key.
CLI Install (one-liner)
Claude Desktop
Add to your claude_desktop_config.json:
OS | Config path |
macOS |
|
Windows |
|
Linux |
|
VS Code
Add to .vscode/mcp.json (workspace) or open Command Palette > MCP: Open User Configuration (global):
Note: VS Code uses
"servers"as the top-level key, not"mcpServers". Other VS Code forks (Trae, Void, PearAI, etc.) typically use this same format.
Cursor
Add to ~/.cursor/mcp.json (global) or .cursor/mcp.json (project):
Windsurf
Add to ~/.codeium/windsurf/mcp_config.json (Windows: %USERPROFILE%\.codeium\windsurf\mcp_config.json):
Cline
Open MCP Servers icon in Cline panel > Configure > Advanced MCP Settings, then add:
Cherry Studio
In Settings > MCP Servers > Add Server, set Type to STDIO, Command to npx, Args to -y mineru-mcp, and add environment variable MINERU_API_KEY. Or paste in JSON/Code mode:
Witsy
In Settings > MCP Servers, add a new server with Type: stdio, Command: npx, Args: -y mineru-mcp, and set environment variable MINERU_API_KEY to your API key.
Codex CLI (TOML config)
Alternatively, edit ~/.codex/config.toml directly:
Gemini CLI (JSON config)
Alternatively, edit ~/.gemini/settings.json directly:
Windows
On Windows, npx requires a shell wrapper. Replace "command": "npx" with:
For CLI tools on Windows:
ChatGPT
ChatGPT only supports remote MCP servers over HTTPS — local stdio servers like this one are not directly supported. You would need to deploy behind a public URL with HTTP transport.
Configuration
Environment Variable | Default | Description |
| (required) | Your MinerU API Bearer token |
|
| API base URL |
|
| Default model: |
Get your API key at mineru.net
Usage
Parse a single URL
Check task progress
Concise output: done | abc-123 | https://cdn-mineru.../result.zip
Batch parse URLs
Check batch progress
Upload local files
Returns batch_id for tracking. Each file's original name is preserved via data_id (spaces become underscores).
Download results as markdown
Output filenames are derived from data_id (e.g., my_paper_title.md). Spaces in original filenames become underscores.
Typical local file workflow
Supported Formats
PDF, DOC, DOCX, PPT, PPTX
PNG, JPG, JPEG
Limits
Single file: 200MB max, 600 pages max
Daily quota: 2000 pages at high priority
Batch: max 200 files per request
License
MIT
Links
MinerU — Document parsing service
MinerU GitHub — Open source version
MCP Specification — Model Context Protocol