Unstructured Document Processor MCP
by MKhalusova
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
Integrations
A Model Context Protocol server that provides unstructured document processing capabilities. This server enables LLMs to extract and use content from an unstructured document.
This repo is work in progress, proceed with caution :)
Supported file types:
Copy
Prerequisites: You'll need:
- Two S3 buckets, one will be used as a source of documents in a processing workflow, and the other one will be a destination since Unstructured Platform currently doesn't support local sources and destinations.
- Your AWS auth credentials: access key and secret key.
- Unstructured Platform API key.
Quick TLDR on how to add this MCP to your Claude Desktop:
- Clone the repo and set up the UV environment.
- Create a
.env
file in the root directory and add the following env variables:AWS_S3_SOURCE_BUCKET
,AWS_S3_DESTINATION_BUCKET
,AWS_KEY
,AWS_SECRET
,UNSTRUCTURED_API_KEY
. - Run the MCP server:
uv run doc_processor.py
- Go to
~/Library/Application Support/Claude/
and create aclaude_desktop_config.json
. In that file add:
Copy
- Restart Claude Desktop. You should now be able to use the MCP.
This server cannot be installed
A Model Context Protocol server that enables LLMs to extract and use content from unstructured documents across a wide variety of file formats.