liteparse-mcp
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@liteparse-mcpParse report.pdf and extract all text"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
liteparse-mcp
Fast, local PDF parsing as an MCP server — text extraction, bounding boxes, OCR, and visual citations. No cloud. No API key. Powered by LiteParse.
Tools
Tool | Description |
| Extract text + bounding boxes (x, y, width, height in PDF points) from a PDF |
| Parse every PDF in a folder; write JSON + screenshots per file |
| Render pages as base64 PNG images |
| Render a page with highlight boxes drawn over every text item |
| Find a phrase and return all matching positions with coordinates |
Bounding-box coordinates are in PDF points (1 pt = 1/72 in), origin top-left.
To convert to pixels: px = pt × (dpi / 72).
Related MCP server: pdf-reader-mcp
Install
pip install liteparse-mcpUsage
Claude Desktop
Add to ~/AppData/Roaming/Claude/claude_desktop_config.json (Windows) or
~/Library/Application Support/Claude/claude_desktop_config.json (macOS):
{
"mcpServers": {
"liteparse": {
"command": "liteparse-mcp"
}
}
}Or with the explicit Python path (if liteparse-mcp is not on PATH):
{
"mcpServers": {
"liteparse": {
"command": "python",
"args": ["-m", "liteparse_mcp"]
}
}
}Restart Claude Desktop — the five tools appear automatically.
Claude Code
claude mcp add liteparse -- python -m liteparse_mcpHTTP / SSE (for remote agents or testing)
liteparse-mcp --http
# Server listens on http://127.0.0.1:8765Example agent prompts
"Parse report.pdf and show me where 'efficacy' appears with bounding boxes"
"Get a cited screenshot of page 3 of study.pdf"
"Batch parse every PDF in my Downloads folder and save the output"
"Search safety_data.pdf for 'adverse event' and list the page numbers"
Outputs (batch mode)
For each PDF, batch_parse_pdfs writes:
<output_folder>/
<stem>/
pages.json # structured JSON: page text + TextItem bounding boxes
summary.txt # plain text of the whole document
page_1.png # raw page screenshot
page_1_cited.png # screenshot with bounding-box highlights
...
batch_report.json # overall success / error summaryRequirements
Python ≥ 3.10
liteparse≥ 2.0.0 (Rust-based; wheels available for Windows, macOS, Linux)fastmcp≥ 2.0.0
No Tesseract installation required for text-based PDFs.
For scanned PDFs with ocr_enabled=true, Tesseract is used automatically
if available on PATH.
License
MIT
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/lisabrennan1996/liteparse-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server