Skip to main content
Glama

Unstructured Document Processor MCP

by MKhalusova

提供非结构化文档处理功能的模型上下文协议 (MLM) 服务器。该服务器使 LLM 能够提取并使用非结构化文档中的内容。

此 repo 工作正在进行中,请谨慎操作:)

支持的文件类型:

{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot", ".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html", ".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages", ".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst", ".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1", ".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml", ".zabw"}

先决条件:您需要:

关于如何将此 MCP 添加到您的 Claude 桌面的快速 TLDR:

  1. 克隆 repo 并设置 UV 环境。
  2. 在根目录中创建一个.env文件并添加以下环境变量: UNSTRUCTURED_API_KEY
  3. 运行 MCP 服务器: uv run doc_processor.py
  4. 前往~/Library/Application Support/Claude/并创建claude_desktop_config.json文件。在该文件中添加:
{ "mcpServers": { "unstructured_doc_processor": { "command": "PATH/TO/YOUR/UV", "args": [ "--directory", "ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/", "run", "doc_processor.py" ], "disabled": false } } }
  1. 重启 Claude Desktop。现在您应该可以使用 MCP 了。
-
security - not tested
F
license - not found
-
quality - not tested

local-only server

The server can only run on the client's local machine because it depends on local resources.

模型上下文协议服务器使 LLM 能够从各种文件格式的非结构化文档中提取和使用内容。

Related MCP Servers

  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that provides tools for analyzing text documents, including counting words and characters. This server helps LLMs perform text analysis tasks by exposing simple document statistics functionality.
    Last updated -
    1
    8
    7
    JavaScript
    Apache 2.0
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables LLMs to read, search, and analyze code files with advanced caching and real-time file watching capabilities.
    Last updated -
    2
    15
    JavaScript
    MIT License
    • Linux
    • Apple
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that allows LLMs to interact with web content through standardized tools, currently supporting web scraping functionality.
    Last updated -
    Python
    MIT License
    • Linux
    • Apple
  • -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.
    Last updated -
    TypeScript
    • Apple

View all related MCP servers

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/MKhalusova/unstructured-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server