Unstructured Document Processor MCP

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Integrations

  • Supports processing JPEG/JPG files to extract content for large language models

  • Supports processing Org files to extract content for large language models

  • Supports processing SVG files to extract content for large language models

提供非结构化文档处理功能的模型上下文协议 (MLM) 服务器。该服务器使 LLM 能够提取并使用非结构化文档中的内容。

此 repo 工作正在进行中,请谨慎操作:)

支持的文件类型:

{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot", ".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html", ".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages", ".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst", ".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1", ".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml", ".zabw"}

先决条件:您需要:

关于如何将此 MCP 添加到您的 Claude 桌面的快速 TLDR:

  1. 克隆 repo 并设置 UV 环境。
  2. 在根目录中创建一个.env文件并添加以下环境变量: UNSTRUCTURED_API_KEY
  3. 运行 MCP 服务器: uv run doc_processor.py
  4. 前往~/Library/Application Support/Claude/并创建claude_desktop_config.json文件。在该文件中添加:
{ "mcpServers": { "unstructured_doc_processor": { "command": "PATH/TO/YOUR/UV", "args": [ "--directory", "ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/", "run", "doc_processor.py" ], "disabled": false } } }
  1. 重启 Claude Desktop。现在您应该可以使用 MCP 了。
-
security - not tested
F
license - not found
-
quality - not tested

模型上下文协议服务器使 LLM 能够从各种文件格式的非结构化文档中提取和使用内容。

ID: 6ncg8w9ovd