The Document Organizer MCP Server provides comprehensive document management capabilities, including PDF-to-Markdown conversion, automated organization, and Universal Project Documentation Standard implementation.
PDF-to-Markdown Conversion: Convert PDFs using
marker(recommended) orpymupdf4llmengines with configurable options for table preservation, image extraction, memory efficiency, and auto-cleaning.Document Discovery & Management: Recursively scan directories to find PDFs, audit conversion status, and efficiently convert only missing Markdown files through automated workflows.
Content Analysis & Organization: Automatically categorize documents (Research, Planning, Technical, Business, etc.) based on content analysis and organize them into structured directory hierarchies.
Full Automation Pipeline: Execute end-to-end workflows that discover, convert, categorize, and organize all documents from start to finish.
Universal Project Documentation Standard: Initialize standardized project structures with essential files like
CURRENT_STATUS.mdandACTIVE_PLAN.md, validate compliance, manage plan lifecycles (ACTIVE, ARCHIVED, SUPERSEDED, BLOCKED), and generate automated progress reports and handoff documentation.
File Converter MCP
A Model Context Protocol (MCP) server that aggregates various file conversion tools for quick formatting and file type transformations.
Features
Supported Conversions
PDF to Markdown - Convert PDF documents to markdown format
Image Format Conversion - Transform between common image formats (PNG, JPG, WebP, etc.)
Document Conversion - Convert between document formats (DOCX, TXT, HTML, etc.)
Spreadsheet Conversion - Transform spreadsheet formats (CSV, XLSX, JSON, etc.)
Code Format Conversion - Convert between code formats and syntax highlighting
Archive Operations - Extract and create archive files (ZIP, TAR, etc.)
Conversion Engines
PDF Engine: marker (recommended) and pymupdf4llm support
Image Engine: Sharp and ImageMagick integration
Document Engine: Pandoc integration for broad format support
Archive Engine: Built-in Node.js compression libraries
Installation
Dependencies
Install conversion engines based on your needs:
Usage
MCP Configuration
Add to your MCP client configuration:
Available Tools
PDF Conversion
convert_pdf_to_markdown- Convert PDF files to Markdownextract_pdf_text- Extract plain text from PDF filesextract_pdf_images- Extract images from PDF files
Image Conversion
convert_image_format- Convert between image formatsresize_image- Resize images with quality optionscompress_image- Reduce image file size
Document Conversion
convert_document- Convert between document formats using Pandocextract_document_text- Extract text from various document formatsconvert_markdown_to_html- Convert Markdown to HTML with styling
Spreadsheet Conversion
convert_csv_to_json- Convert CSV data to JSON formatconvert_json_to_csv- Convert JSON data to CSV formatconvert_xlsx_to_csv- Extract CSV data from Excel files
Archive Operations
create_archive- Create ZIP or TAR archives from files/foldersextract_archive- Extract contents from archive fileslist_archive_contents- List files in archive without extracting
Utility Tools
detect_file_type- Identify file format and encodingvalidate_conversion- Check if conversion is supportedbatch_convert- Convert multiple files in one operation
Examples
Basic PDF Conversion
Image Format Conversion
Document Conversion
Batch Operations
Configuration Options
Conversion Settings
Supported File Types
Input Formats
Documents: PDF, DOCX, DOC, RTF, TXT, HTML, XML
Images: PNG, JPG, JPEG, WebP, GIF, BMP, TIFF, SVG
Spreadsheets: CSV, XLSX, XLS, JSON, TSV
Archives: ZIP, TAR, GZ, 7Z, RAR (extract only)
Code: Various programming language files
Output Formats
Text: Markdown, HTML, TXT, RTF
Images: PNG, JPG, WebP, GIF, BMP
Data: JSON, CSV, XML, YAML
Archives: ZIP, TAR, GZ
Performance Considerations
Memory Usage: Large files are processed in chunks to prevent memory issues
Processing Speed: Different engines have different speed/quality tradeoffs
Batch Processing: More efficient for multiple file conversions
Caching: Converted files can be cached to avoid re-processing
Error Handling
The server provides comprehensive error handling:
Input file validation and format detection
Graceful fallback between conversion engines
Detailed error messages with suggested solutions
Progress tracking for long-running conversions
Development
Contributing
Fork the repository
Create a feature branch
Add support for new file formats or conversion engines
Add tests for new functionality
Submit a pull request
License
MIT License - see LICENSE file for details.
Support
local-only server
The server can only run on the client's local machine because it depends on local resources.
Tools
Enables systematic document organization with PDF-to-Markdown conversion, intelligent categorization, and automated workflow management. Supports project documentation standards and provides complete end-to-end document processing pipelines.