Skip to main content
Glama

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
ENABLE_LOGNoSet to 'true' to write timestamped log filesdisabled
OUTPUT_DIRNoDirectory for saved Markdown (and extra format) output~/mineru-downloads
MINERU_LOG_DIRNoOverride directory for log files; defaults to workspace 'logs/' or '~/.mineru-open-mcp/logs/'
MINERU_API_TOKENNoMinerU cloud API token to unlock higher limits and extra output formats; optional for Flash mode (free, markdown-only, 20 pages / 10 MB per file)

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}
logging
{}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
parse_documents

Convert PDFs, Word (DOCX), PowerPoint (PPTX), Excel (XLSX in Flash mode), images, and public document URLs—or HTML page URLs—into Markdown using the MinerU cloud API.

World effects: Reads local paths you pass; fetches http(s) URLs. Uploads file or URL content to mineru.net for processing (do not use for data you must not send off-device). Does not modify or delete originals. May write Markdown (and sometimes images) under output_dir or the server default when results are saved or inline content is too large.

Auth & limits: Without MINERU_API_TOKEN, uses free Flash mode: Markdown-only output, service limits apply. With MINERU_API_TOKEN, higher per-file page limits and optional extra output formats per MinerU plans; token is read from env (or HTTP Bearer when using streamable HTTP).

Use this when: The user needs full-document extraction, tables/formulas as HTML/Latexa, batch conversion, or per-file PDF page ranges. Do not use for listing supported OCR script codes—call get_ocr_languages instead. Not a substitute for offline-only or strictly local parsers.

Parameters (intent): file_sources is a list of path/URL strings or {"source": "…", "pages": "1-5"} objects (PDF page ranges; Flash allows simple N or N-M). enable_ocr defaults to true. language is an OCR/script code (default ch); see get_ocr_languages for valid values. Set model to "html" only when every source is a web page URL; otherwise omit. output_dir overrides where large or batch results are written.

get_ocr_languages

Return the supported MinerU OCR / script language codes (e.g. ch, en, japan, latin). Read-only; no uploads. Use before setting language on parse_documents for scanned or multilingual documents. Do not use for converting files—call parse_documents instead.

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/opendatalab/MinerU-Ecosystem'

If you have feedback or need assistance with the MCP directory API, please join our Discord server