206,568 tools. Last updated 2026-06-17 14:27

"High-Quality PDF Extraction to Text with Tokenization and Accurate Processing of Complex Layouts" matching MCP tools:

parse_documentsA
MinerU Open MCP (Official)
Convert PDF, Office documents, images, and web pages to Markdown with OCR support and page range extraction.
Apache 2.0
extract_receipt_asyncA
Dokmatiq DocGen
Submit a receipt image or PDF for asynchronous AI extraction of receipt data. Receive a job ID to poll for status, with optional webhook notification. Returns extracted data for further processing.
MIT
get_receipt_jobA
Dokmatiq DocGen
Check the status of an asynchronous receipt extraction job. Provide the job ID to get the current status: pending, processing, completed, or failed.
MIT
needle_add_fileA
Needle MCP Server
Add documents to a collection by providing a URL for download, processing them for text extraction, and indexing them for semantic search.
MIT
get_technical_specsA
echo3s-io
Retrieve Echo3s technical specifications—supported formats, AI processing details, output quality, credit system, and platform info.
MIT
save_screenshotA
MobAI MCP Server
Save a full-quality PNG screenshot to disk for high-resolution reporting, debugging, or sharing.
Apache 2.0

Matching MCP Servers

PDF to Text MCP Server
xxx87
-
license
-
quality
-
maintenance
Converts PDF files to text for use with MCP-compatible applications like Cursor IDE.
Last updated 2025-09-19
PDF Extraction MCP Server
File Systems Text Summarization
xraywu
F
license
C
quality
D
maintenance
An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
Last updated 2025-05-31
1
30

Matching MCP Connectors

Complex Portal
EBI Complex Portal MCP.
Quality QR
Create and manage trackable QR codes with scan tracking, analytics, and dynamic URL updates.

receipt_to_documentA
Dokmatiq DocGen
Extract data from receipt images and generate expense reports as PDF, DOCX, or ODT in one step, using AI extraction and customizable templates.
MIT
vault_tokenizeA
ALTR MCP Server
Securely replaces sensitive plaintext values with vault tokens, enabling reversible detokenization while preserving data utility. Supports deterministic tokenization for consistent token mapping.
GPL 3.0
extract_text_from_pdfA
Dokmatiq DocGen
Extract all text content from a PDF. Input the PDF as a base64-encoded string and receive the extracted text.
MIT
pdf_to_csvB
PDF.co MCP Server
Convert PDF and scanned documents to CSV files, preserving layout and table structure for data extraction.
MIT
pdf_to_textB
PDF.co MCP Server
Convert PDF and scanned images to text while preserving layout. Supports URL input, optional page ranges, and OCR language selection.
MIT
pdf_to_xmlB
PDF.co MCP Server
Convert PDF files and scanned images to XML format for data extraction and further processing.
MIT
pdf_make_unsearchableB
PDF.co MCP Server
Remove the text layer from a PDF to make it non-searchable, preventing text extraction and search functionality.
MIT
pdf_to_jsonC
PDF.co MCP Server
Convert PDFs and scanned images to JSON while preserving text, fonts, images, vectors, and formatting using the /pdf/convert/to/json2 endpoint.
MIT
parse_generic_documentA
document-to-json-mcp
Parse any PDF to extract text and tables, returning structured JSON for flexible data processing.
MIT
zotero_get_pdf_outlineA
Zotero MCP
Extract the table of contents from a PDF attachment as a hierarchical markdown list with page numbers, helping you quickly orient in a paper before fetching full text.
MIT
read_pdf_structureA
pdf-modifier-mcp
Extract PDF page layout including text, coordinates, fonts, and colors to understand document structure before making edits.
MIT
pdf_to_markdownA
pdf-toolkit-mcp
Converts PDF to reading-order Markdown for LLM consumption. Reconstructs up to 2 content columns, infers headings from font size, detects lists; tables rendered as plain text.
MIT
generate_from_markdownA
pretext-pdf-mcp
Convert Markdown to PDF with support for headings, lists, tables, code blocks, and more. Returns a base64-encoded PDF file.
MIT
box_ai_extract_structured_enhanced_using_fields_toolA
MCP Server Box
Extract structured data from documents using custom field definitions, combining information from multiple files into a single record with enhanced AI accuracy for complex layouts and low-quality scans.
MIT

"High-Quality PDF Extraction to Text with Tokenization and Accurate Processing of Complex Layouts" matching MCP tools:

parse_documentsA

extract_receipt_asyncA

get_receipt_jobA

needle_add_fileA

get_technical_specsA

save_screenshotA

Matching MCP Servers

PDF to Text MCP Server

PDF Extraction MCP Server

Matching MCP Connectors

receipt_to_documentA

vault_tokenizeA

extract_text_from_pdfA

pdf_to_csvB

pdf_to_textB

pdf_to_xmlB

pdf_make_unsearchableB

pdf_to_jsonC

parse_generic_documentA

zotero_get_pdf_outlineA

read_pdf_structureA

pdf_to_markdownA

generate_from_markdownA

box_ai_extract_structured_enhanced_using_fields_toolA