The Simple Document Processing MCP Server provides comprehensive document processing capabilities for various file formats:
- Read content from PDF, DOCX, TXT, HTML, CSV, and Excel files
- Convert documents between formats (DOCX to PDF/HTML, HTML to TXT/Markdown, Excel to JSON, and between Markdown/HTML/XML/JSON)
- PDF manipulation: Merge multiple PDFs or split PDFs by page ranges
- Text processing: Split text by lines/delimiters, compare files, format text, and convert between encodings (UTF-8, Big5, GBK)
- HTML processing: Clean HTML, extract resources (images, links, videos), format HTML, and convert to plain text or Markdown
Simple Document Processing MCP Server
A powerful Model Context Protocol (MCP) server providing comprehensive document processing capabilities.
Features
Document Reader
- Read DOCX, PDF, TXT, HTML, and CSV files
Document Conversion
- DOCX to HTML/PDF conversion
- HTML to TXT/Markdown conversion
- PDF manipulation (merge, split)
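As an illustration of the HTML to TXT conversion listed above, here is a minimal sketch using only the Python standard library. It is not the server's actual implementation; the names `TextExtractor` and `html_to_text`, and the choice of which tags count as block-level, are assumptions made for this example.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> content."""

    SKIP = {"script", "style"}
    # Tags treated as line breaks (an assumption for this sketch).
    BLOCK = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            if self._skip_depth:
                self._skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        # Keep text only when we are outside script/style elements.
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(html: str) -> str:
    """Return the visible text of an HTML string, one block per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace and drop empty lines.
    lines = (" ".join(line.split()) for line in "".join(parser.parts).splitlines())
    return "\n".join(line for line in lines if line)
```

Calling `html_to_text("<h1>Title</h1><p>Hello</p>")` yields the two blocks on separate lines. A Markdown converter would need to map tags to Markdown syntax rather than discard them.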
Text Processing
- Conversion between text encodings (UTF-8, Big5, GBK)
- Text formatting and cleaning
- Text comparison and diff generation
- Text splitting by lines or delimiter
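The text processing features above can be sketched with the Python standard library. This is a hedged illustration, not the server's own code; the function names `convert_encoding`, `split_text`, and `diff_text` are hypothetical and do not correspond to the server's tool names.

```python
import difflib
from typing import List, Optional


def convert_encoding(data: bytes, source: str, target: str) -> bytes:
    """Decode bytes from the source encoding and re-encode them in the target."""
    return data.decode(source).encode(target)


def split_text(text: str, delimiter: Optional[str] = None) -> List[str]:
    """Split on a delimiter if one is given, otherwise split into lines."""
    return text.split(delimiter) if delimiter else text.splitlines()


def diff_text(old: str, new: str) -> str:
    """Return a unified diff of two strings, compared line by line."""
    lines = difflib.unified_diff(
        old.splitlines(),
        new.splitlines(),
        fromfile="old",
        tofile="new",
        lineterm="",
    )
    return "\n".join(lines)
```

For example, `convert_encoding(raw, "big5", "utf-8")` re-encodes Big5 bytes as UTF-8, and raises `UnicodeDecodeError` if `raw` is not valid Big5, so a real tool would need to decide how to report or replace undecodable bytes.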
HTML Processing
- HTML cleaning and formatting
- Resource extraction (images, links, videos)
- Structure-preserving conversion
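Resource extraction, as listed above, amounts to collecting URLs from specific tags. A minimal sketch using the standard library's `html.parser` follows; the class `ResourceExtractor`, the function `extract_resources`, and the shape of the returned dictionary are assumptions for illustration, not the server's actual output format.

```python
from html.parser import HTMLParser
from typing import Dict, List


class ResourceExtractor(HTMLParser):
    """Collects image, link, and video URLs from an HTML document."""

    def __init__(self):
        super().__init__()
        self.resources: Dict[str, List[str]] = {"images": [], "links": [], "videos": []}
        self._in_video = False

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "img" and attr.get("src"):
            self.resources["images"].append(attr["src"])
        elif tag == "a" and attr.get("href"):
            self.resources["links"].append(attr["href"])
        elif tag == "video":
            self._in_video = True
            if attr.get("src"):
                self.resources["videos"].append(attr["src"])
        elif tag == "source" and self._in_video and attr.get("src"):
            # Only count <source> elements nested inside a <video>.
            self.resources["videos"].append(attr["src"])

    def handle_endtag(self, tag):
        if tag == "video":
            self._in_video = False


def extract_resources(html: str) -> Dict[str, List[str]]:
    """Return the image, link, and video URLs found in the given HTML."""
    parser = ResourceExtractor()
    parser.feed(html)
    parser.close()
    return parser.resources
```

URLs are returned exactly as written in the markup; resolving relative paths against a base URL (for instance with `urllib.parse.urljoin`) is left out of this sketch.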
Installation
Installing via Smithery
To install Document Processing Server for Claude Desktop automatically via Smithery:
Manual Installation
Usage
CLI
With Dive Desktop
- Click "+ Add MCP Server" in Dive Desktop
- Copy and paste this configuration:
- Click "Save" to install the MCP server
License
MIT
Contributing
Community participation and contributions are welcome! Here are some ways to contribute:
- ⭐️ Star the project if you find it helpful
- 🐛 Submit Issues: Report problems or provide suggestions
- 🔧 Create Pull Requests: Submit code improvements
Contact
If you have any questions or suggestions, feel free to reach out:
- 📧 Email: reahtuoo310109@gmail.com
- 📧 GitHub: CabLate
- 🤝 Collaboration: Open to discussing project cooperation
- 📚 Technical Guidance: Suggestions and guidance are sincerely welcome