How do I use pdf-chart-parser?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@pdf-chart-parser Extract the usage chart from my electricity bill PDF." That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

pdf-chart-parser

by haoxinm

Overview Schema Related Servers Score Discussions

Python

Hybrid

pdf-chart-parser

An MCP server and Python library that extracts energy-usage charts from utility-bill PDFs. It locates the chart, calibrates the axes from the PDF's text layer, and returns structured time-series data alongside an annotated PNG for visual verification — entirely deterministic, no LLM required.

Features

Bar, line, and hybrid (bar + line, dual y-axis) chart types
Vector-first extraction via PyMuPDF get_drawings() / get_text("dict"); OpenCV raster fallback for scanned PDFs
Scanned-PDF support via an OCRmyPDF text-layer step (optional [ocr] extra): image-only pages get a searchable text layer so they flow through the same high-accuracy text-layer calibration as digital PDFs
Full page text returned as LLM-friendly Markdown (via pymupdf4llm)
MCP tool (extract_usage_chart) compatible with Claude and other MCP-aware LLMs
Supports stdio transport (local) and streamable-http (containerized deployment)
Returns structured JSON + annotated PNG; numeric data is always text content

Related MCP server: Docalyze

Installation

Prerequisites

Python 3.12+
uv package manager
Tesseract OCR (required only for the [raster] extra): apt-get install tesseract-ocr or brew install tesseract
OCRmyPDF system tools (required only for the [ocr] extra, which adds a searchable text layer to scanned PDFs): apt-get install ghostscript qpdf unpaper pngquant tesseract-ocr or brew install ocrmypdf

Quickstart

# Install (vector path only)
uv sync

# Install with raster fallback
uv sync --extra raster

# Install with the OCR text-layer step for scanned PDFs
uv sync --extra ocr

# Install everything
uv sync --extra raster --extra ocr

# Run the CLI
uv run pdf-chart-parser --help

# Run the MCP server (stdio)
uv run python -m pdf_chart_parser.server

Usage

Python library

from pdf_chart_parser.pipeline import extract_usage_chart

result = extract_usage_chart(pdf_path="bill.pdf", return_annotated_image=True)
print(result["chart_type"])   # "bar" | "line" | "hybrid"
for series in result["series"]:
    print(series["label"], series["points"])

CLI

uv run pdf-chart-parser extract bill.pdf --output result.json

MCP server

Add to your MCP config (~/.claude/claude_desktop_config.json or similar):

{
  "mcpServers": {
    "pdf-chart-parser": {
      "command": "uv",
      "args": ["run", "python", "-m", "pdf_chart_parser.server"],
      "cwd": "/path/to/pdf-chart-parser"
    }
  }
}

The server exposes the extract_usage_chart tool. It returns:

Page text — full page as Markdown
Chart reading — structured JSON (series, axes, confidence, warnings)
Annotated PNG — cropped chart with calibrated gridlines and data-point markers

Docker / ECR deployment

# Build and run locally
./scripts/run_local_server.sh

# Build and push to ECR (set ECR_REPO first)
export ECR_REPO=<account>.dkr.ecr.<region>.amazonaws.com/pdf-chart-parser
./scripts/build_and_push.sh

The container starts the server on streamable-http at port 8000. A local PDF directory can be bind-mounted to /data for ad-hoc testing (see docker/docker-compose.yml).

Manual testing (no LLM)

# In-process test against fixtures
uv run python scripts/run_manual_tests.py

# Against a running HTTP server
uv run python scripts/run_manual_tests.py --http http://localhost:8000

Output is written to manual_test_output/.

License

AGPL-3.0-or-later. This license is required because the project links against PyMuPDF, which is itself AGPL-3.0 licensed.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

extract_usage_chartA

Related MCP Servers

PDF Reader MCP Server
Documentation Access File Systems
averagejoeslab
F
license
-
quality
D
maintenance
An MCP server that provides comprehensive PDF processing capabilities including text extraction, image extraction, table detection, annotation extraction, metadata retrieval, page rendering, and document structure analysis.
Last updated 2026-02-07
Docalyze
File Systems Image & Video Processing
LunarPerovskite
A
license
B
quality
F
maintenance
An MCP server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.
Last updated 2026-04-28
4
68
MIT
DeepSeek PDF Reader
File Systems Search Documentation Access
flt1milize
A
license
-
quality
B
maintenance
MCP server for comprehensive PDF processing including text extraction with OCR, keyword search with regex, table extraction, and page preview as Base64 PNG images.
Last updated 2026-07-16
1
MIT
Xberg MCP Server
jamon8888
A
license
-
quality
F
maintenance
MCP server that extracts clean text, tables, and structured data from documents, images, code, and audio files, supporting 97 formats with OCR, transcription, and code intelligence.
Last updated 2026-07-27
MIT

View all related MCP servers

Related MCP Connectors

clianythingharness-mcp
A paid remote MCP for CLI tool MCP, built to return verdicts, receipts, usage logs, and audit-ready
PlanExe
MCP server for generating rough-draft project plans from natural-language prompts.
mcp
MCP server providing access to the Scorecard API to evaluate and optimize LLM systems.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/haoxinm/pdf-chart-parser'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

pdf-chart-parser

Features

Installation

Prerequisites

Quickstart

Usage

Python library

CLI

MCP server

Docker / ECR deployment

Manual testing (no LLM)

License

Maintenance

Resources

Looking for Admin?

Tools

Related MCP Servers

PDF Reader MCP Server

Docalyze

DeepSeek PDF Reader

Xberg MCP Server

Related MCP Connectors

Latest Blog Posts

MCP directory API