document-intelligence-mcp
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@document-intelligence-mcpExtract text from quarterly_report.pdf"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
document-intelligence-mcp
Local document intelligence for AI agents — extract text, detect tables, read metadata, analyze structure, search keywords, and detect language from PDF and DOCX files. No cloud API required, no API key needed.
Features
10 MCP Tools for PDF and DOCX processing
Local processing — no data leaves your machine
No API key required
Supports PDF (via PyMuPDF + pdfplumber) and Microsoft Word DOCX (via python-docx)
Language detection via langdetect (55+ languages)
Related MCP server: PDF Document MCP Server
Tools
Tool | Description |
| Extract all text from a PDF, page by page |
| Detect and extract tables from PDF |
| Read PDF metadata: title, author, dates, outline |
| Detect headings, font sizes, section structure |
| Search for keywords with context in PDF |
| Extract all text from a Word DOCX file |
| Extract all tables from a DOCX file |
| Analyze headings, styles, and structure of DOCX |
| Word count, sentence count, reading time, top words |
| Detect language of PDF or DOCX (55+ languages) |
Installation
pip install document-intelligence-mcpClaude Desktop Configuration
Add to your claude_desktop_config.json:
{
"mcpServers": {
"document-intelligence": {
"command": "document-intelligence-mcp"
}
}
}Usage Examples
Extract text from a PDF:
Extract the text from /path/to/report.pdfFind tables in a PDF:
Find all tables in /path/to/financial_report.pdfSearch for a keyword:
Search for "revenue" in /path/to/annual_report.pdfGet document stats:
Count the words and estimate reading time for /path/to/document.docxDetect language:
What language is /path/to/document.pdf written in?Requirements
Python 3.10+
PyMuPDF >= 1.24.0
pdfplumber >= 0.11.0
python-docx >= 1.1.0
langdetect >= 1.0.9
License
MIT License — free to use, modify, and distribute.
Built by AiAgentKarl | Part of the AI Agent Economy toolkit
This server cannot be installed
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/AiAgentKarl/document-intelligence-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server