Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type @ followed by the MCP server name and your instructions, e.g., "@PDF Agent MCP Show me the outline and search for 'results' in /Users/data/report.pdf"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
PDF Agent MCP
Visit the Landing Page for an overview and easy download
⚠️ Important Setup Instructions
Before using this extension, you MUST configure Claude Desktop properly:
Required Configuration
Install Node.js LTS: Visit nodejs.org and download the LTS version
Configure Claude Desktop:
Go to Claude > Settings > Extensions > Advanced Settings
Disable "Use Built-in Node.js for MCP"
Restart Claude Desktop
This extension will NOT work with Claude's built-in Node.js. You must use your system's Node.js installation.
Troubleshooting
If you experience issues loading the extension:
Verify Node.js is installed: Run node --version in your terminal
Ensure "Use Built-in Node.js for MCP" is disabled in Claude Desktop settings
Restart Claude Desktop completely
Check the logs at ~/Library/Logs/Claude/mcp-server-PDF Agent MCP.log (macOS) or %LOCALAPPDATA%\Claude\Logs\mcp-server-PDF Agent MCP.log (Windows)
A Model Context Protocol server designed for agentic reading and selective PDF processing. Enables AI systems to efficiently navigate and extract content from PDFs without overwhelming context windows.
Features
Metadata Extraction: Get PDF properties, page count, and file information
Text Extraction: Native text extraction with hybrid processing for better results
Image Conversion: Convert PDF pages to optimized images for visual analysis
Content Search: Pattern/regex search with context snippets
Table of Contents: Extract bookmarks and document outline
Flexible Path Support: Use absolute paths or relative paths from ~/pdf-agent/
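The path rule in the last feature can be pictured with a short sketch. This is not the server's actual code: resolvePdfPath and its homeDir parameter are hypothetical names. The only behavior taken from the text above is that absolute paths are used as given, while relative paths are resolved from ~/pdf-agent/.

```typescript
import * as os from "node:os";
import * as path from "node:path";

// Hypothetical helper, not part of the PDF Agent MCP API.
// Absolute paths are returned as given; relative paths are
// resolved against the pdf-agent folder in the home directory.
function resolvePdfPath(input: string, homeDir: string = os.homedir()): string {
  if (path.isAbsolute(input)) {
    return input;
  }
  return path.resolve(homeDir, "pdf-agent", input);
}

// An absolute path is used unchanged.
console.log(resolvePdfPath("/Users/data/report.pdf"));
// A relative path lands under <home>/pdf-agent/.
console.log(resolvePdfPath("papers/thesis.pdf"));
```

Passing homeDir explicitly makes the helper easy to try out without touching the real home directory.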
Usage Guide
PDF Agent MCP solves the common problem of context window overflow when working with PDFs in AI tools.
Important: Do not drag PDFs into the chat. Doing so loads the entire PDF into the conversation the traditional way and bypasses the selective processing. Instead, provide file paths or URLs so that the PDF Agent tools are used.
How to Use
For Local PDFs:
Provide the absolute file path to your PDF
Quick tip: Right-click your PDF → "Open with Chrome" → copy the address bar URL for the absolute path
For Online PDFs:
Simply provide the PDF URL - the agent will download and process it locally
Key Benefits
Selective Reading: The AI first examines metadata and outline, then opens only relevant pages
Token Efficiency: Avoids images when possible, uses them only when necessary for visual analysis
Scalable: Works with large documents (1000+ page textbooks) and multiple PDFs simultaneously
Search Capability: Built-in pattern/regex search across PDF content
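To make "pattern/regex search with context snippets" concrete, here is a minimal sketch of the idea over page text that has already been extracted. It is an illustration under assumptions, not the server's implementation: searchPages, SearchHit, and the context parameter are hypothetical names, and case-insensitive matching is a choice made for this example.

```typescript
// Hypothetical shapes and names, for illustration only.
interface SearchHit {
  page: number;    // 1-based page number
  match: string;   // text that matched the pattern
  snippet: string; // the match plus surrounding characters
}

function searchPages(pages: string[], pattern: string, context: number = 40): SearchHit[] {
  // "g" lets exec() walk through every match; "i" ignores case.
  const regex = new RegExp(pattern, "gi");
  const hits: SearchHit[] = [];
  pages.forEach((text, index) => {
    regex.lastIndex = 0; // start each page from the beginning
    let m: RegExpExecArray | null;
    while ((m = regex.exec(text)) !== null) {
      const found = m[0];
      if (found.length === 0) {
        regex.lastIndex++; // step past empty matches to avoid looping forever
        continue;
      }
      const start = Math.max(0, m.index - context);
      const end = Math.min(text.length, m.index + found.length + context);
      hits.push({ page: index + 1, match: found, snippet: text.slice(start, end) });
    }
  });
  return hits;
}

const samplePages = ["Introduction and scope.", "The results show growth. Results vary by year."];
for (const hit of searchPages(samplePages, "results", 10)) {
  console.log(`page ${hit.page}: ...${hit.snippet}...`);
}
```

Returning short snippets with page numbers, rather than whole pages, is what keeps a search cheap in tokens: the assistant can decide which pages are worth opening in full.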
Approach
This MCP uses agentic search with simple tools rather than complex alternatives:
No embedding creation, chunking, or vector storage required
No multi-agent coordination or handoff complexity
Just clean, effective tools that modern AI systems can use intelligently
Perfect for researchers, students, and professionals working with extensive PDF libraries.
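The selective reading described under Key Benefits (look at the outline first, then open only the relevant pages) can be sketched as a small page-selection step. Everything here is hypothetical: OutlineEntry, PageRange, and pagesForTopic are names invented for this example, and matching section titles by keyword is only one simple way an assistant might narrow down pages.

```typescript
// Hypothetical shapes; the server's real outline format may differ.
interface OutlineEntry {
  title: string;
  page: number; // 1-based page where the section starts
}

interface PageRange {
  from: number;
  to: number;
}

// Assumes the outline is sorted by page.
function pagesForTopic(outline: OutlineEntry[], keyword: string, pageCount: number): PageRange[] {
  const needle = keyword.toLowerCase();
  const ranges: PageRange[] = [];
  outline.forEach((entry, i) => {
    if (entry.title.toLowerCase().indexOf(needle) === -1) {
      return; // section title does not mention the topic
    }
    const next = outline[i + 1];
    // A section runs until the page before the next entry, or to the last page.
    const lastPage = next ? Math.max(entry.page, next.page - 1) : pageCount;
    ranges.push({ from: entry.page, to: lastPage });
  });
  return ranges;
}

const sampleOutline: OutlineEntry[] = [
  { title: "Introduction", page: 1 },
  { title: "Methods", page: 5 },
  { title: "Results", page: 12 },
  { title: "Discussion", page: 20 },
];
console.log(pagesForTopic(sampleOutline, "results", 30));
```

With a 30-page document and this outline, a question about results needs only pages 12 to 19 instead of the whole file.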
AI Assistant Prompt for Optimal Usage
Copy this prompt into your AI assistant's custom instructions or context for best results:
Installation
Option 1: DXT Package (Recommended)
First, ensure you have completed the Required Configuration above
Download the latest pdf-agent-mcp.dxt file from the releases
Double-click the .dxt file to install it in Claude Desktop
Option 2: Manual Installation
First, ensure you have completed the Required Configuration above
Clone this repository
Build the project: npm install && npm run build
Find your Claude Desktop config file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Add the following:
Replace PATH_TO_REPO with the actual path to your cloned repository.
Development
Viewing Logs
To debug issues, you can view the MCP server logs at the locations listed under Troubleshooting.
License
MIT