225,938 tools. Last updated 2026-06-22 23:22

"A tool for document extraction using MinerU" matching MCP tools:

get_extraction_statusA
sifter-mcp
Retrieve the extraction status for a document in a sift, indicating if processing is queued, running, completed, or failed.
MIT
run_extractionC
sifter-mcp
Initiate an extraction on a document by specifying its ID and the sift to use, converting unstructured content into structured records.
MIT
get_pdf_metadataA
pdfmux
Retrieve PDF metadata including page count, file size, document type, and table presence to decide which extraction tool to use next.
MIT
get_markdownA
MCP Server Fetch TypeScript
Convert web pages to structured Markdown while preserving tables, lists, and document hierarchy for clean content extraction.
MIT
request_scanA
mailbox-mcp
Request OCR scanning and structured data extraction for a package's document, label, envelope, or content. Receive extracted text, addresses, and dates.
MIT
queue_entity_extractionA
TDZ C64 Knowledge
Queue a document for asynchronous entity extraction. Returns a job ID to track progress while processing runs in the background.

Matching MCP Servers

MinerU Document Explorerofficial
RAG Systems Search
opendatalab
A
license
-
quality
B
maintenance
Enables AI agents to search, deep-read, and build knowledge bases from Markdown, PDF, DOCX, and PPTX documents via MCP tools for retrieval, document navigation, and ingestion.
Last updated 2026-04-26
64
587
MIT
MinerU MCP Server
Documentation Access App Automation Research & Data
linxule
A
license
A
quality
C
maintenance
Enables document parsing and extraction from PDFs and other formats using the MinerU API. Supports batch processing, page range selection, OCR in 109 languages, and VLM/pipeline models for high-accuracy content extraction.
Last updated 2026-05-07
4
108
6
MIT

Matching MCP Connectors

Document Integrity Validator
AI reasoning checks any document against known international standards before your agent acts on it.
Call For Me
Give your AI agent a phone. Place outbound calls to US businesses to ask, book, or confirm.

get_extraction_statusA
TDZ C64 Knowledge
Checks the entity extraction status of a document, including job state and error details, to confirm completion before querying entities.
saptiva_ocrB
MCP-Saptiva
Extract text from images for document processing, receipt scanning, and image text extraction using OCR technology. Supports both URLs and base64 encoded images.
parse_documentsA
MinerU Open MCP (Official)
Convert PDF, Office documents, images, and web pages to Markdown with OCR support and page range extraction.
Apache 2.0
lookup_documents_by_pathA
IBM Core Content Services MCP Server
Search for documents by their folder path using keywords at each level. Returns matching document filings.
Apache 2.0
talonic_save_schemaA
talonic-mcp
Save a reusable data extraction schema to your workspace for use across future document extractions. Define the schema with JSON Schema or field-type map.
MIT
generate_schemaA
MCP-Upstage-Server
Analyzes documents to automatically create JSON schemas for structured data extraction, enabling consistent field definitions across similar documents.
MIT
update_document_propertiesA
Core Content Services MCP Server
Update document properties in the content repository by providing the document identifier and new property values. Properties are updated without altering the document class.
Apache 2.0
talonic_request_uploadA
talonic-mcp
Generates a browser upload link for the user to add a file to their workspace. Returns a pre-allocated document ID for later extraction.
MIT
send_invite_from_templateA
SignNow MCP Server
Create a document from a template or template group and send a signing invite immediately, specifying recipient order and roles.
MIT
needle_add_fileA
Needle MCP Server
Add documents to a collection by providing a URL for download, processing them for text extraction, and indexing them for semantic search.
MIT
read-documentationA
Amazon Business Integrations MCP Server
Retrieve complete Amazon Business API documentation content using document references from search results. Access full API references, implementation guides, and detailed endpoint information for integration development.
Apache 2.0
get_class_property_descriptionsA
Core Content Services MCP Server
Obtain a complete list of all properties for a specified class, including system and hidden properties, to facilitate general document updates.
Apache 2.0
get_class_property_descriptionsA
IBM Core Content Services MCP Server
Retrieve all properties of a class, including system and hidden ones, to enable full document updates.
Apache 2.0
create_embedded_sending_from_templateA
SignNow MCP Server
Create a document from a template and generate an embedded sending link for immediate e-signature requests.
MIT

"A tool for document extraction using MinerU" matching MCP tools:

get_extraction_statusA

run_extractionC

get_pdf_metadataA

get_markdownA

request_scanA

queue_entity_extractionA

Matching MCP Servers

MinerU Document Explorerofficial

MinerU MCP Server

Matching MCP Connectors

get_extraction_statusA

saptiva_ocrB

parse_documentsA

lookup_documents_by_pathA

talonic_save_schemaA

generate_schemaA

update_document_propertiesA

talonic_request_uploadA

send_invite_from_templateA

needle_add_fileA

read-documentationA

get_class_property_descriptionsA

get_class_property_descriptionsA

create_embedded_sending_from_templateA