MCP PDF Reader
A Model Context Protocol (MCP) server that provides tools for reading and processing PDF documents. Built with Docling for document conversion and text extraction.
Features
MCP Server with tools for PDF document processing
Document Text Extraction: Convert PDF content to clean Markdown format
Document Discovery: List and access available PDF files
Tools
The server provides two main tools:
get_document_list
: Returns a list of all available PDF files in the data directoryget_document_text
: Extracts and returns the full text content of a specified PDF file in Markdown format
Install
Make sure you have uv
.
Clone the repository:
Install Python:
Create and activate a virtual environment:
Install dependencies:
Usage
Add PDF Documents
Place your PDF files in the data/
directory. The server will automatically detect and make them available through the tools.
Run MCP Server
Start the MCP server:
The server runs using stdio transport and can be integrated with any MCP-compatible client.
Development and Testing
Use the MCP inspector to test the server:
This will open a web interface where you can test the available tools and inspect their responses.
Use in VSCode/Cursor
You can use the MCP integration in your editor. Tools & Integrations
-> New MCP Server
and edit the mcp.json
file to include the following:
License
See LICENSE file for details.
This server cannot be installed
local-only server
The server can only run on the client's local machine because it depends on local resources.
A server that provides tools for reading and processing PDF documents, allowing users to list available PDFs and extract their content in Markdown format.
Related MCP Servers
- AsecurityFlicenseAqualityAn MCP server for converting Markdown documents to PDF files.Last updated -1915
- AsecurityAlicenseAqualityA document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.Last updated -10110MIT License
- -securityAlicense-qualityA server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.Last updated -18MIT License
Textin MCP Serverofficial
AsecurityAlicenseAqualityA server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.Last updated -34523MIT License