The Simple Document Processing MCP Server provides comprehensive document processing capabilities for various file formats:
Read content from PDF, DOCX, TXT, HTML, CSV, and Excel files
Convert documents between formats (DOCX to PDF/HTML, HTML to TXT/Markdown, Excel to JSON, and between Markdown/HTML/XML/JSON)
PDF manipulation: Merge multiple PDFs or split PDFs by page ranges
Text processing: Split text by lines/delimiters, compare files, format text, and convert between encodings (UTF-8, Big5, GBK)
HTML processing: Clean HTML, extract resources (images, links, videos), format HTML, and convert to plain text or Markdown
Simple Document Processing MCP Server
A powerful Model Context Protocol (MCP) server providing comprehensive document processing capabilities.
Features
Document Reader
Read DOCX, PDF, TXT, HTML, CSV
Document Conversion
DOCX to HTML/PDF conversion
HTML to TXT/Markdown conversion
PDF manipulation (merge, split)
Text Processing
Multi-encoding transfer support (UTF-8, Big5, GBK)
Text formatting and cleaning
Text comparison and diff generation
Text splitting by lines or delimiter
HTML Processing
HTML cleaning and formatting
Resource extraction (images, links, videos)
Structure-preserving conversion
Installation
Installing via Smithery
To install Document Processing Server for Claude Desktop automatically via Smithery:
Manual Installation
Usage
Cli
With Dive Desktop
Click "+ Add MCP Server" in Dive Desktop
Copy and paste this configuration:
Click "Save" to install the MCP server
License
MIT
Contributing
Welcome community participation and contributions! Here are ways to contribute:
⭐️ Star the project if you find it helpful
🐛 Submit Issues: Report problems or provide suggestions
🔧 Create Pull Requests: Submit code improvements
Contact
If you have any questions or suggestions, feel free to reach out:
📧 Email: reahtuoo310109@gmail.com
📧 GitHub: CabLate
🤝 Collaboration: Welcome to discuss project cooperation
📚 Technical Guidance: Sincere welcome for suggestions and guidance
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
Tools
Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities.
Related Resources
Related MCP Servers
- AsecurityAlicenseAqualityEnables text extraction from web pages and PDFs, and execution of predefined commands, enhancing content processing and automation capabilities.
- -securityFlicense-qualityProvides functionality to fetch and transform web content in various formats (HTML, JSON, plain text, and Markdown) through simple API calls.Last updated -105,4181
- -securityAlicense-qualityA TypeScript-based document processing server that supports various document formats (.docx, .pdf, .xlsx) and integrates with Model Context Protocol SDK for efficient document context management.Last updated -7241MIT License
- -securityAlicense-qualityProvides advanced document search and processing capabilities through vector stores, including PDF processing, semantic search, web search integration, and file operations. Enables users to create searchable document collections and retrieve relevant information using natural language queries.Last updated -MIT License