The Document Organizer MCP Server provides comprehensive document management capabilities, including PDF-to-Markdown conversion, automated organization, and Universal Project Documentation Standard implementation.
PDF-to-Markdown Conversion: Convert PDFs using the marker (recommended) or pymupdf4llm engines, with configurable options for table preservation, image extraction, memory efficiency, and auto-cleaning.
Document Discovery & Management: Recursively scan directories to find PDFs, audit conversion status, and efficiently convert only missing Markdown files through automated workflows.
Content Analysis & Organization: Automatically categorize documents (Research, Planning, Technical, Business, etc.) based on content analysis and organize them into structured directory hierarchies.
Full Automation Pipeline: Execute end-to-end workflows that discover, convert, categorize, and organize all documents from start to finish.
Universal Project Documentation Standard: Initialize standardized project structures with essential files like CURRENT_STATUS.md and ACTIVE_PLAN.md, validate compliance, manage plan lifecycles (ACTIVE, ARCHIVED, SUPERSEDED, BLOCKED), and generate automated progress reports and handoff documentation.
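The discovery and audit step described above can be sketched in a few lines of Python. This is a minimal illustration, not the server's implementation: the function name `find_unconverted_pdfs` is hypothetical, and it assumes a converted file sits next to its PDF with the same stem and a `.md` suffix.

```python
from pathlib import Path


def find_unconverted_pdfs(root: Path) -> list[Path]:
    """Return PDFs under root that have no sibling .md file with the same stem."""
    missing = []
    # rglob walks the tree recursively; sorting keeps the report stable.
    for pdf in sorted(root.rglob("*.pdf")):
        # Assumption: "paper.pdf" is considered converted if "paper.md" exists.
        if not pdf.with_suffix(".md").exists():
            missing.append(pdf)
    return missing
```

Only the paths returned here would need to be passed to a conversion engine, which is what makes a "convert only missing files" workflow cheap to re-run.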
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Document Organizer MCP Server convert my research paper PDF to Markdown for the project wiki".
That's it! The server will respond to your query, and you can continue using it as needed.
File Converter MCP
A Model Context Protocol (MCP) server that aggregates various file conversion tools for quick formatting and file type transformations.
Features
Supported Conversions
PDF to Markdown - Convert PDF documents to markdown format
Image Format Conversion - Transform between common image formats (PNG, JPG, WebP, etc.)
Document Conversion - Convert between document formats (DOCX, TXT, HTML, etc.)
Spreadsheet Conversion - Transform spreadsheet formats (CSV, XLSX, JSON, etc.)
Code Format Conversion - Convert between code formats and syntax highlighting
Archive Operations - Extract and create archive files (ZIP, TAR, etc.)
Conversion Engines
PDF Engine: marker (recommended) and pymupdf4llm support
Image Engine: Sharp and ImageMagick integration
Document Engine: Pandoc integration for broad format support
Archive Engine: Built-in Node.js compression libraries
Installation
Dependencies
Install conversion engines based on your needs:
Usage
MCP Configuration
Add to your MCP client configuration:
Available Tools
PDF Conversion
convert_pdf_to_markdown - Convert PDF files to Markdown
extract_pdf_text - Extract plain text from PDF files
extract_pdf_images - Extract images from PDF files
Image Conversion
convert_image_format - Convert between image formats
resize_image - Resize images with quality options
compress_image - Reduce image file size
Document Conversion
convert_document - Convert between document formats using Pandoc
extract_document_text - Extract text from various document formats
convert_markdown_to_html - Convert Markdown to HTML with styling
Spreadsheet Conversion
convert_csv_to_json - Convert CSV data to JSON format
convert_json_to_csv - Convert JSON data to CSV format
convert_xlsx_to_csv - Extract CSV data from Excel files
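To make the CSV-to-JSON direction concrete, here is a minimal standard-library sketch of what such a conversion amounts to. It is an illustration only, not the code behind convert_csv_to_json; it assumes the first CSV row is a header, and the helper name `csv_to_json` is hypothetical.

```python
import csv
import io
import json


def csv_to_json(csv_text: str) -> str:
    """Convert CSV text with a header row into a JSON array of objects."""
    # DictReader uses the first row as keys; every value stays a string.
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return json.dumps(rows, indent=2)
```

Note that no type inference happens here: a cell containing `12` becomes the JSON string `"12"`, so numeric columns would need an explicit conversion step.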
Archive Operations
create_archive - Create ZIP or TAR archives from files/folders
extract_archive - Extract contents from archive files
list_archive_contents - List files in archive without extracting
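Listing an archive without extracting it is possible because ZIP files carry a central directory of entries. The sketch below shows the idea with Python's `zipfile` module; it covers ZIP only, and the function name `list_zip_contents` is hypothetical rather than part of the server.

```python
import zipfile


def list_zip_contents(source) -> list[str]:
    """Return entry names from a ZIP archive without extracting any data.

    `source` may be a filesystem path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        # namelist() reads the central directory only.
        return archive.namelist()
```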
Utility Tools
detect_file_type - Identify file format and encoding
validate_conversion - Check if conversion is supported
batch_convert - Convert multiple files in one operation
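File type detection is commonly done by inspecting the first bytes of a file rather than trusting its extension. The following is a minimal sketch of that technique for a handful of well-known signatures; it is not the server's detect_file_type, and the `SIGNATURES` table is deliberately small.

```python
from typing import Optional

# Well-known leading byte signatures ("magic numbers").
SIGNATURES = [
    (b"%PDF-", "pdf"),
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"\xff\xd8\xff", "jpg"),
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    # DOCX and XLSX are ZIP containers, so they also match this entry.
    (b"PK\x03\x04", "zip"),
]


def detect_file_type(header: bytes) -> Optional[str]:
    """Guess a file type from its leading bytes; None if unrecognized."""
    for magic, kind in SIGNATURES:
        if header.startswith(magic):
            return kind
    return None
```

In practice the caller would read the first few bytes of a file, for example `open(path, "rb").read(16)`, and pass them in.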
Examples
Basic PDF Conversion
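As a rough illustration, an MCP client invokes a tool by sending a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a message in Python. The tool name comes from the list above, but the argument names `pdf_path` and `output_path` are hypothetical; the real parameters are defined by the tool's input schema.

```python
import json


def build_tool_call(request_id: int, tool: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(message)


# Argument names are assumptions; check the tool's input schema.
request = build_tool_call(
    1,
    "convert_pdf_to_markdown",
    {"pdf_path": "paper.pdf", "output_path": "paper.md"},
)
print(request)
```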
Image Format Conversion
Document Conversion
Batch Operations
Configuration Options
Conversion Settings
Supported File Types
Input Formats
Documents: PDF, DOCX, DOC, RTF, TXT, HTML, XML
Images: PNG, JPG, JPEG, WebP, GIF, BMP, TIFF, SVG
Spreadsheets: CSV, XLSX, XLS, JSON, TSV
Archives: ZIP, TAR, GZ, 7Z, RAR (extract only)
Code: Various programming language files
Output Formats
Text: Markdown, HTML, TXT, RTF
Images: PNG, JPG, WebP, GIF, BMP
Data: JSON, CSV, XML, YAML
Archives: ZIP, TAR, GZ
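The format lists above can be turned into a simple lookup for checking whether a conversion is plausible. This is a sketch under stated assumptions: the extensions are copied from the lists, but pairing each input category with one output category (documents to text, spreadsheets to data, and so on) is a guess made for illustration, the Code category is left out because its formats are not enumerated, and `is_supported` is a hypothetical name.

```python
# Extensions copied from the lists above. Pairing each input category
# with exactly one output category is an assumption for illustration.
INPUT_FORMATS = {
    "documents": {"pdf", "docx", "doc", "rtf", "txt", "html", "xml"},
    "images": {"png", "jpg", "jpeg", "webp", "gif", "bmp", "tiff", "svg"},
    "spreadsheets": {"csv", "xlsx", "xls", "json", "tsv"},
    "archives": {"zip", "tar", "gz", "7z", "rar"},
}
OUTPUT_FORMATS = {
    "documents": {"md", "html", "txt", "rtf"},
    "images": {"png", "jpg", "webp", "gif", "bmp"},
    "spreadsheets": {"json", "csv", "xml", "yaml"},
    "archives": {"zip", "tar", "gz"},
}


def is_supported(source_ext: str, target_ext: str) -> bool:
    """Check whether source -> target falls within one format category."""
    src = source_ext.lower().lstrip(".")
    dst = target_ext.lower().lstrip(".")
    return any(
        src in INPUT_FORMATS[category] and dst in OUTPUT_FORMATS[category]
        for category in INPUT_FORMATS
    )
```

With these tables, `is_supported("zip", "rar")` is false, which matches the "extract only" note on RAR.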
Performance Considerations
Memory Usage: Large files are processed in chunks to prevent memory issues
Processing Speed: Different engines have different speed/quality tradeoffs
Batch Processing: More efficient for multiple file conversions
Caching: Converted files can be cached to avoid re-processing
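Chunked processing and caching can be combined in one small pattern: hash the input in fixed-size chunks so memory use stays flat, then use the digest as the cache key. The sketch below is one possible approach, not a description of the server's internals; `content_digest`, `cached_output_path`, and the 1 MiB chunk size are all assumptions.

```python
import hashlib
from pathlib import Path

CHUNK_SIZE = 1 << 20  # 1 MiB per read; an arbitrary illustrative value


def content_digest(path: Path, chunk_size: int = CHUNK_SIZE) -> str:
    """Hash a file in fixed-size chunks so large inputs never load fully."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def cached_output_path(source: Path, cache_dir: Path, target_ext: str) -> Path:
    """Derive a cache location from file content rather than file name."""
    return cache_dir / f"{content_digest(source)}.{target_ext}"
```

If the path returned by `cached_output_path` already exists, the conversion can be skipped. Keying on content means a renamed file still hits the cache, while an edited file does not.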
Error Handling
The server provides comprehensive error handling:
Input file validation and format detection
Graceful fallback between conversion engines
Detailed error messages with suggested solutions
Progress tracking for long-running conversions
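Graceful fallback between engines can be pictured as trying each engine in order of preference and collecting the failures. The following is a minimal sketch with stand-in callables rather than real engine bindings; `ConversionError` and `convert_with_fallback` are hypothetical names.

```python
from typing import Callable, Sequence


class ConversionError(Exception):
    """Raised when every configured engine has failed."""


def convert_with_fallback(
    path: str,
    engines: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try engines in order; return (engine_name, output) from the first success."""
    errors = []
    for name, engine in engines:
        try:
            return name, engine(path)
        except Exception as exc:
            # Record the failure and move on to the next engine.
            errors.append(f"{name}: {exc}")
    raise ConversionError("all engines failed: " + "; ".join(errors))
```

A caller would pass something like `[("marker", run_marker), ("pymupdf4llm", run_pymupdf)]`, where both callables are placeholders for whatever wraps the actual engines. The collected messages give the final error enough detail to suggest a fix.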
Development
Contributing
Fork the repository
Create a feature branch
Add support for new file formats or conversion engines
Add tests for new functionality
Submit a pull request
License
MIT License - see LICENSE file for details.