Integrations
Offers containerized deployment of the PDF server, with volume mounting capabilities to access and manage PDF files between host and container
Provides a web interface for managing PDF files, allowing users to upload, download, and view PDF files through a FastAPI-based web UI
MCP PDF Server
MCP PDF Server is a Model Context Protocol (MCP) based server that can efficiently manage PDF files.
This project was created to make it easier for me, an embedded developer, to read and summarize PDF datasheet documents directly in AI coding tools such as Cursor, or to conduct Q&A, etc., to make development work more convenient. In other words, the main purpose is to support AI to quickly understand the contents of PDF datasheets and provide the necessary information right away.
This project consists of two main components:
- manager_server : A web page based on FastAPI that allows users to upload or download PDF files and view and manage file lists through a web UI. It also provides a RESTful API for linking with external systems.
- mcp_server : Provides file name search and text extraction functions based on PDF files managed by manager_server. The extracted text can be linked with external systems (e.g. Cursor, etc.) via the MCP protocol.
Key Features:
- Extract PDF text (supports local files and URLs)
- PDF search based on file name
- View and manage PDF lists
- Supports PDF file web upload/download
- Providing RESTful APIs and web services
- Connectivity to external systems (Curator, Cursor, etc.) via MCP protocol
It can be easily integrated with external systems through RESTful API and web UI, and can be easily deployed and operated in both Docker and local environments. It is suitable for automated management and search of various PDF documents such as data sheets, papers, and contracts.
Key Features
- Extract text from local PDF files and PDFs accessible via URL
- Provides a list of PDF files under
/app/datasheets
- Provides PDF search function by file name
- Reliable text extraction and exception handling based on PyPDF2
- Providing standardized MCP tools based on FastMCP
Running with Docker
- Build imageCopy
- Running a containerCopy
- If you put a PDF file in
/호스트/경로/데이터
you can access it from/app/datasheets
inside the container. - Ports 5050 and 5080 are used.
- If you put a PDF file in
- When using docker-composeCopy
Running it directly locally (Python)
- Installing dependenciesCopy
- Running the serverCopy
MCP Tool (API) Description
- read_local_pdf
Extract text by entering a local PDF file path. - read_url_pdf
Enter a URL of a PDF file and extract the text from it. - server_pdf_list
Returns a list of all PDF files under/app/datasheets
. - server_pdf_search
Enter a file name, search for a PDF file on the server, and extract the text from that PDF file.
Route guidance
- PDF data must be located in the
/app/datasheets
path (inside the Docker container). - When using Docker, mount the host's PDF folder to
/app/datasheets
. - The source code is located in
/app/mcp_server
(inside the container).
License
Apache License 2.0
Author: Dev91
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
A Model Context Protocol (MCP) based server that efficiently manages PDF files, allowing AI coding tools like Cursor to read, summarize, and extract information from PDF datasheets to assist embedded development work.
Related MCP Servers
- AsecurityFlicenseAqualityA Model Context Protocol server that converts PDF documents into PNG images through a simple MCP tool call.Last updated -12Python
- -securityFlicense-qualityImplements the Model Context Protocol (MCP) to provide AI models with a standardized interface for connecting to external data sources and tools like file systems, databases, or APIs.Last updated -90Python
- -securityAlicense-qualityA Model Context Protocol (MCP) server that allows AI models to safely access and interact with local file systems, enabling reading file contents, listing directories, and retrieving file metadata.Last updated -471JavaScriptMIT License
- -security-license-qualityA Model Context Protocol (MCP) server that allows interaction with PDF and EPUB documents, designed to work with Windsurf IDE by Codeium.Last updated -3PythonMIT License