Unstructured Document Processor MCP

A Model Context Protocol (MCP) server that provides unstructured document processing capabilities. It lets LLMs extract and use the content of an unstructured document.

This repo is a work in progress, so proceed with caution :)

Supported file types:

{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot",
 ".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html",
 ".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages",
 ".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst",
 ".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1",
 ".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml",
 ".zabw"}

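Before handing a file to the server, a client could check its extension against the list above. The following is only a minimal sketch: the names `SUPPORTED_EXTENSIONS` and `is_supported` are hypothetical and not taken from `doc_processor.py`, and the case-insensitive comparison is an assumption about how extensions should be matched.

```python
from pathlib import Path

# Extensions copied from the supported file types list above.
SUPPORTED_EXTENSIONS = {
    ".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot",
    ".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html",
    ".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages",
    ".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst",
    ".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1",
    ".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml",
    ".zabw",
}


def is_supported(path: str) -> bool:
    """Return True if the file's final extension is in the supported set.

    Hypothetical helper: the comparison is case-insensitive, which is an
    assumption and may not match what the server itself does.
    """
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

For example, `is_supported("report.PDF")` is true under this sketch, while `is_supported("archive.zip")` is false.
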
Prerequisites: you will need the following.

An Unstructured API key. See the Unstructured documentation for how to obtain one.
Claude Desktop installed locally

Quick TL;DR on how to add this MCP to your Claude Desktop:

Clone the repo and set up the uv environment.
Create a .env file in the root directory and add the following env var: UNSTRUCTURED_API_KEY.
Run the MCP server: uv run doc_processor.py
Go to ~/Library/Application Support/Claude/ and create a claude_desktop_config.json file. In that file, add:

{
    "mcpServers": {
        "unstructured_doc_processor": {
            "command": "PATH/TO/YOUR/UV",
            "args": [
                "--directory",
                "ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/",
                "run",
                "doc_processor.py"
            ],
            "disabled": false
        }
    }
}
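
If a claude_desktop_config.json already exists with other servers in it, the entry above can be merged in rather than pasted over the file. The following is a rough sketch using only the standard library; `add_server` is a hypothetical helper, not part of this repo, and the paths passed to it are placeholders you would replace with your own.

```python
import json
from pathlib import Path


def add_server(config_path: Path, uv_path: str, project_dir: str) -> dict:
    """Add the unstructured_doc_processor entry to a Claude Desktop config file.

    Hypothetical helper: keeps any servers already listed under "mcpServers"
    and creates the file (and its parent directory) if it does not exist.
    """
    text = config_path.read_text().strip() if config_path.exists() else ""
    config = json.loads(text) if text else {}

    servers = config.setdefault("mcpServers", {})
    servers["unstructured_doc_processor"] = {
        "command": uv_path,
        "args": ["--directory", project_dir, "run", "doc_processor.py"],
        "disabled": False,
    }

    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=4))
    return config
```

On macOS the target would be `Path.home() / "Library/Application Support/Claude/claude_desktop_config.json"`, matching the directory named in the steps above.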

Restart Claude Desktop. You should now be able to use the MCP.
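
If the server does not show up after the restart, one thing worth checking is whether the .env file actually defines UNSTRUCTURED_API_KEY. The sketch below is a deliberately simple stdlib-only parser written for illustration; `read_env_value` is a hypothetical name, it only understands plain `KEY=value` lines, and it is not how `doc_processor.py` itself is known to load the key.

```python
from pathlib import Path
from typing import Optional


def read_env_value(env_path: Path, name: str) -> Optional[str]:
    """Return the value of `name` from a simple KEY=value .env file, or None.

    Hypothetical helper: skips blank lines and '#' comments, strips one layer
    of surrounding quotes, and does not handle 'export' prefixes or multiline
    values.
    """
    if not env_path.is_file():
        return None
    for raw in env_path.read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == name:
            cleaned = value.strip().strip('"').strip("'")
            return cleaned or None
    return None
```

Calling `read_env_value(Path(".env"), "UNSTRUCTURED_API_KEY")` from the repo's root directory returns `None` when the file or the variable is missing, which points at the .env step rather than the Claude Desktop config.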
