What can you do with this server?

MinerU Open MCP is a document parsing server that converts PDFs, Word docs, PowerPoint files, images, and web pages into Markdown (and optionally other formats) using the MinerU API. * Parse documents (parse_documents): Convert local files or remote URLs into Markdown. Supported formats include PDF, PNG, JPG/JPEG, JP2, WebP, GIF, BMP, DOC, DOCX, PPT, PPTX, and (in Flash mode) XLS/XLSX. * Parse web pages: Automatically detect and parse HTML pages by setting the model to html. * Extra output formats: Generate Word (.docx), HTML, or LaTeX alongside Markdown when a MinerU API token is provided. * Targeted page ranges: Specify per-file page ranges (e.g., pages 1–3, or specific pages like 2, 4–6) in multi-file batch operations. * OCR support: Enable OCR for scanned PDFs or images, with language specification (Chinese, English, Japanese, Korean, Arabic, Tamil, French, Macedonian, and more). * List OCR languages (get_ocr_languages): Retrieve the full list of supported OCR languages. * Choose parsing models: Use vlm (default), html (for web pages), or pipeline for specialized cases. * Custom output directory: Specify where parsed result files are saved locally. * Flash mode (no API key required): Free Markdown parsing for files up to 20 pages / 10 MB, no sign-up needed. * Flexible transport modes: Supports both stdio (desktop clients) and streamable-http (web-based clients).

Which integrations are available for this server?

Provides document parsing capabilities to convert various file formats (PDF, Word, PowerPoint, spreadsheets, images) into LaTeX format through MinerU's document processing service. Provides document parsing capabilities to convert various file formats (PDF, Word, PowerPoint, spreadsheets, images) into Markdown format through MinerU's document processing service, with free Flash mode supporting markdown-only conversion.

de en es ja ko ru zh

MinerU Open MCP (Official)

by opendatalab

Overview Schema Related Servers Score Discussions

Python

Hybrid

MinerU Open MCP

MinerU-Ecosystem MCP server

An Official Mineru MCP server that exposes MinerU's document parsing as MCP tools. Connect any MCP-compatible AI client to convert PDFs, Word docs, PowerPoint files, and images into Markdown.

No API key required — Flash mode works out of the box, free with no sign-up but lower limits are applied. Set MINERU_API_TOKEN to unlock higher limits and extra output formats.

Notes for MCP clients with sandbox - For some MCP clients, your files dragged into the input box are sandboxed into a temporary directory. If you need to upload and parse local files, please provide full paths of the target files in your prompt, in case files cannot be found by the server.

⚡ Quickest Way to Run — uvx (no install needed)

mineru-open-mcp is on PyPI. With uv installed, you can run it directly — no separate install step.

Configure your MCP client

stdio — Claude Desktop, Cursor, Windsurf

The MCP client launches mineru-open-mcp as a subprocess automatically.

Using uvx (recommended — always runs the latest version):

{
  "mcpServers": {
    "mineru": {
      "command": "uvx",
      "args": ["mineru-open-mcp"],
      "env": {
        "MINERU_API_TOKEN": "your_key_here"
      }
    }
  }
}

No API key? The server runs in Flash mode — free, markdown-only, learn more at Flash Mode Docs

mineru-open-mcp not on PATH? Use the full path: "/Users/you/.local/bin/mineru-open-mcp", or use the uvx approach above which handles this automatically.

Related MCP server: MinerU MCP Server

Usage Examples

Example 1: Parse a local PDF document with target page ranges

User prompt: "Parse the 3rd-5th pages of this PDF into markdown: <your_path_to_file>" What happens:

MinerU uploads and parses the PDF
Returns clean Markdown with tables (HTML) and formulas (Latex) preserved
Returns markdown texts in the chat if length permitted along with the output path, and the zip url if you prefer
MCP client summarizes the content

Example 2: Parse a remote url hosting a file

User prompt: "Extract contents from this paper: https://arxiv.org/pdf/2509.22186" What happens:

MinerU parses the paper into markdown
MCP client formats and explains the tables

Example 3: Parse local PDF files with independent page ranges

User prompt: "Parse <file1> page 1-5, <file2> page 2-9, <file3> page 3 into markdown" What happens:

MinerU uploads and parses the files separatedly
Returns target format ouputs, the zip url for you to download, markdown abstract, the directory you want to save the output to
MCP client uses the content for further analysis

Example 4: Advanced custom preferences

User prompt1: "use pipeline model to parse this Korean file your_path_here" User prompt2: "parse your_path_here and save the markdown to your_output_dir" What happends:

Pipeline model is another model provided by MinerU service (BTW, vlm model is the default choice)
You are allowed to specify a model, an ocr language, or even an independent output dir different from OUTPUT_DIR by structuring your prompt
Your requests are parameterized into parse_documents tool and MinerU will handle the rest.

streamable-http — web-based MCP clients

Start the server manually, then point your client at it:

MINERU_API_TOKEN=your_key mineru-open-mcp --transport streamable-http --port 8001

{
  "mcpServers": {
    "mineru": {
      "type": "streamableHttp",
      "url": "http://127.0.0.1:8001/mcp"
    }
  }
}

Features

parse_documents ? convert local files and/or remote URLs to Markdown; Input supports PDF, images（png/jpg/jpeg/jp2/webp/gif/bmp, Doc, Docx, Ppt, PPTx. Flash Mode also supports xlsx.
get_ocr_languages — list all OCR languages supported by MinerU
Flash mode — works without an API key (free, markdown output only, supports PDF/images/Docx/PPTx/xls/xlsx); For full features, please provide MINERU_API_TOKEN, which will disable flash mode.
Output behavior ? single-file parses return inline Markdown by default; batch parses save results to disk and return file metadata. Oversized inline content is also saved locally and returned via extract_path.
Two transport modes ? stdio, streamable-http

Environment Variables

Variable	Description	Default
`MINERU_API_TOKEN`	MinerU API token, apply on MinerU for full capability. If not provided, flash mode is enabled.	—
`OUTPUT_DIR`	Directory used when parsed results need to be saved locally, such as batch parsing or oversized inline content	`~/mineru-downloads`

Privacy Policy

mineru-open-mcp connects to the official MinerU API (mineru.net) to parse documents.

Data sent: Document content (files or URLs you provide for parsing)
Data storage: Parsed results are temporarily cached by MinerU servers; not used for training
Third-party: MinerU API (mineru.net) — see OpenDatalab Privacy Policy
Local data: Parsed results will be saved to target output directory. Log files (only when ENABLE_LOG=true), saved to MINERU_LOG_DIR;
Contact: OpenDataLab@pjlab.org.cn (or raise an issue at MinerU-Ecosystem )

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

3dResponse time

3wRelease cycle

2Releases (12mo)

Commit activity

Issues opened vs closed

Resources

GitHub Repository

Need Help?

Related Servers

Tools

Latest Blog Posts

Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
open source
OpenAI
Tool Definition Quality Score (TDQS)
By punkpeye on April 3, 2026.
mcp
The Hackers Who Tracked My Sleep Cycle
By punkpeye on March 26, 2026.
security

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/opendatalab/MinerU-Ecosystem'

If you have feedback or need assistance with the MCP directory API, please join our Discord server