Simple Document Processing MCP Server

A Model Context Protocol (MCP) server for reading, converting, and processing documents, with additional text and HTML processing tools.

Features

Document Reader

  • Read DOCX, PDF, TXT, HTML, CSV

Document Conversion

  • DOCX to HTML/PDF conversion
  • HTML to TXT/Markdown conversion
  • PDF manipulation (merge, split)

Text Processing

  • Encoding conversion (UTF-8, Big5, GBK)
  • Text formatting and cleaning
  • Text comparison and diff generation
  • Text splitting by lines or delimiter
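
To make the text operations above concrete, here is a minimal sketch in Python using only the standard library. It is an illustration of the ideas (encoding conversion, splitting, diffing), not the server's own implementation, which ships as a Node package. The function names `convert_encoding`, `split_text`, and `unified_diff` are hypothetical.

```python
import difflib
from typing import List, Optional


def convert_encoding(data: bytes, source: str, target: str) -> bytes:
    """Decode bytes from one encoding and re-encode them in another."""
    return data.decode(source).encode(target)


def split_text(text: str, delimiter: Optional[str] = None) -> List[str]:
    """Split by a delimiter when one is given, otherwise by lines."""
    return text.split(delimiter) if delimiter else text.splitlines()


def unified_diff(old: str, new: str) -> str:
    """Return a unified diff between two texts as a single string."""
    lines = difflib.unified_diff(
        old.splitlines(), new.splitlines(),
        fromfile="old", tofile="new", lineterm="",
    )
    return "\n".join(lines)


# Hypothetical sample data: Big5 bytes converted to UTF-8.
big5_bytes = "文件處理".encode("big5")
utf8_bytes = convert_encoding(big5_bytes, "big5", "utf-8")
print(utf8_bytes.decode("utf-8"))

print(split_text("a,b,c", ","))
print(unified_diff("one\ntwo", "one\nthree"))
```

Conversion goes through a decoded string, so any character that the target encoding cannot represent raises `UnicodeEncodeError` rather than being silently dropped.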

HTML Processing

  • HTML cleaning and formatting
  • Resource extraction (images, links, videos)
  • Structure-preserving conversion
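
Resource extraction can be pictured as a single pass over the HTML that collects URLs by tag. The sketch below uses Python's standard `html.parser` and is only an illustration under that assumption; the class name `ResourceExtractor` and the result keys are hypothetical, and the server itself may work differently.

```python
from html.parser import HTMLParser


class ResourceExtractor(HTMLParser):
    """Collect image, link, and video URLs from an HTML document."""

    def __init__(self):
        super().__init__()
        self.resources = {"images": [], "links": [], "videos": []}
        self._in_video = False  # True while inside a <video> element

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "img" and attr.get("src"):
            self.resources["images"].append(attr["src"])
        elif tag == "a" and attr.get("href"):
            self.resources["links"].append(attr["href"])
        elif tag == "video":
            self._in_video = True
            if attr.get("src"):
                self.resources["videos"].append(attr["src"])
        elif tag == "source" and self._in_video and attr.get("src"):
            # <source> children carry the URL when <video> has no src.
            self.resources["videos"].append(attr["src"])

    def handle_endtag(self, tag):
        if tag == "video":
            self._in_video = False


# Hypothetical sample page.
html = '<a href="/docs">Docs</a><img src="logo.png"><video><source src="clip.mp4"></video>'
extractor = ResourceExtractor()
extractor.feed(html)
print(extractor.resources)
```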

Installation

Installing via Smithery

To install Document Processing Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @cablate/mcp-doc-forge --client claude

Manual Installation

npm install -g @cablate/mcp-doc-forge

Usage

CLI

mcp-doc-forge

With Dive Desktop

  1. Click "+ Add MCP Server" in Dive Desktop
  2. Copy and paste this configuration:
{
  "mcpServers": {
    "doc-forge": {
      "command": "npx",
      "args": ["-y", "@cablate/mcp-doc-forge"],
      "enabled": true
    }
  }
}
  3. Click "Save" to install the MCP server
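
If you keep several servers in one configuration, the entry can be merged in rather than pasted over the whole file. The following Python sketch shows one way to do that, assuming the `mcpServers` layout shown in the steps above; the helper `add_server` and the entry name `doc-forge` are hypothetical choices for illustration.

```python
import json


def add_server(config: dict, name: str, package: str) -> dict:
    """Return a copy of config with an npx-launched server entry added."""
    servers = dict(config.get("mcpServers", {}))
    servers[name] = {
        "command": "npx",
        "args": ["-y", package],
        "enabled": True,
    }
    return {**config, "mcpServers": servers}


# Start from an existing (here empty) configuration.
existing = {"mcpServers": {}}
updated = add_server(existing, "doc-forge", "@cablate/mcp-doc-forge")
print(json.dumps(updated, indent=2))
```

The function returns a new dictionary instead of mutating its input, so any servers already present are preserved and the original configuration stays unchanged.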

License

MIT

Contributing

Community participation and contributions are welcome! Here are ways to contribute:

  • ⭐️ Star the project if you find it helpful
  • 🐛 Submit Issues: Report problems or provide suggestions
  • 🔧 Create Pull Requests: Submit code improvements

Contact

If you have any questions or suggestions, feel free to reach out:

  • 📧 Email: reahtuoo310109@gmail.com
  • 📧 GitHub: CabLate
  • 🤝 Collaboration: Open to discussing project cooperation
  • 📚 Technical Guidance: Suggestions and guidance are sincerely welcome
