PDF Inline Image RAG MCP

by Joncallim

Python

Local

An MCP server and CLI for building local, searchable SQLite databases from PDFs where important content appears inside inline images, figures, diagrams, or scanned image blocks.

The key rules are simple:

Extract PDF text normally.
Extract only actual PDF image blocks, not whole-page screenshots.
Insert image placeholders into the page text stream at their page-flow location.
Store every extracted image with its exact PDF bounding box.

Example text_with_images marker:

[[IMAGE page=72 index=1 bbox=80.6,76.0,535.7,645.7 size=1896x2373 file=mtp-2_assets/images/page_0072_image_01.png]]

This gives an AI agent enough context to search normal text, notice where an image appeared, fetch the image asset, caption/OCR it, and save the caption back into the searchable index.
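As a rough illustration of how an agent might consume these markers, here is a minimal Python sketch that parses them out of page text. The regular expression is inferred from the single example marker above, so other marker variants may not match. The `ImageMarker` class and `find_image_markers` function are hypothetical names, not part of this project. The `effective_dpi` helper assumes the bbox is expressed in PDF points (1/72 inch), which the README does not state explicitly.

```python
import re
from dataclasses import dataclass

# Pattern inferred from the one example marker in this README (an assumption).
MARKER_RE = re.compile(
    r"\[\[IMAGE page=(?P<page>\d+) index=(?P<index>\d+) "
    r"bbox=(?P<bbox>[\d.,\-]+) size=(?P<w>\d+)x(?P<h>\d+) "
    r"file=(?P<file>[^\]]+)\]\]"
)


@dataclass
class ImageMarker:
    """Hypothetical container for the fields of one [[IMAGE ...]] marker."""
    page: int
    index: int
    bbox: tuple  # (x0, y0, x1, y1), assumed to be PDF points
    width_px: int
    height_px: int
    file: str

    def effective_dpi(self):
        # Pixels per point times 72 points per inch, per axis.
        x0, y0, x1, y1 = self.bbox
        return (
            self.width_px / (x1 - x0) * 72.0,
            self.height_px / (y1 - y0) * 72.0,
        )


def find_image_markers(text):
    """Return every image marker found in a page's text_with_images string."""
    markers = []
    for m in MARKER_RE.finditer(text):
        bbox = tuple(float(v) for v in m.group("bbox").split(","))
        if len(bbox) != 4:
            continue  # skip anything that does not look like x0,y0,x1,y1
        markers.append(
            ImageMarker(
                page=int(m.group("page")),
                index=int(m.group("index")),
                bbox=bbox,
                width_px=int(m.group("w")),
                height_px=int(m.group("h")),
                file=m.group("file"),
            )
        )
    return markers
```

Under the points assumption, the example marker works out to roughly 300 pixels per inch on both axes (1896 px over 455.1 pt, 2373 px over 569.7 pt).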

Install

pip install git+https://github.com/Joncallim/pdf-inline-image-rag-mcp.git

For local development:

git clone https://github.com/Joncallim/pdf-inline-image-rag-mcp.git
cd pdf-inline-image-rag-mcp
python -m venv .venv
. .venv/bin/activate
pip install -e ".[dev]"

MCP Usage

Add this server to your MCP client:

{
  "mcpServers": {
    "pdf-inline-image-rag": {
      "command": "pdf-inline-image-rag-mcp"
    }
  }
}

Available tools:

build_pdf_rag
search_pdf_rag
get_pdf_page
get_pdf_image
list_uncaptioned_pdf_images
save_pdf_image_caption
inspect_pdf_rag

Typical flow:

Call build_pdf_rag with a PDF path and output directory.
Call search_pdf_rag for normal text queries.
When a result includes [[IMAGE ...]], call get_pdf_page or get_pdf_image.
Caption or OCR the image with your preferred model.
Call save_pdf_image_caption so the caption is added to page text and FTS.

CLI Usage

Build a database:

pdf-inline-image-rag build \
  --input /path/to/file.pdf \
  --output-dir outputs/pdf_rag \
  --replace

Build only selected pages:

pdf-inline-image-rag build \
  --input /path/to/file.pdf \
  --output-dir outputs/pdf_rag \
  --pages 1-10,42
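To make the --pages format concrete, the sketch below expands a spec such as 1-10,42 into an explicit page list. It is not the project's parser. It assumes ranges are inclusive and that page numbers are 1-based, neither of which is stated above, and `expand_page_spec` is a hypothetical name.

```python
def expand_page_spec(spec):
    """Expand a spec like '1-10,42' into a sorted list of page numbers.

    Assumes inclusive ranges and 1-based page numbers (not confirmed
    by the README).
    """
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            start_s, end_s = part.split("-", 1)
            start, end = int(start_s), int(end_s)
            if start > end:
                raise ValueError(f"descending range: {part!r}")
            pages.update(range(start, end + 1))
        else:
            pages.add(int(part))
    return sorted(pages)
```

For the example above, this yields pages 1 through 10 followed by page 42.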

Search:

pdf-inline-image-rag search \
  --db outputs/pdf_rag/file_rag.sqlite \
  "sector method"

Inspect:

pdf-inline-image-rag inspect \
  --db outputs/pdf_rag/file_rag.sqlite
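If you want to drive the build command from a script, a thin wrapper around `subprocess` is one option. The sketch below uses only the flags shown above (--input, --output-dir, --pages, --replace). Whether --pages and --replace can be combined in one call is an assumption, and `build_command` and `run_build` are hypothetical helper names.

```python
import subprocess


def build_command(pdf, output_dir, pages=None, replace=False):
    """Assemble the argv list for 'pdf-inline-image-rag build'."""
    cmd = [
        "pdf-inline-image-rag", "build",
        "--input", str(pdf),
        "--output-dir", str(output_dir),
    ]
    if pages:
        cmd += ["--pages", pages]  # e.g. "1-10,42"
    if replace:
        cmd.append("--replace")
    return cmd


def run_build(pdf, output_dir, pages=None, replace=False):
    """Run the build and raise CalledProcessError on a non-zero exit."""
    return subprocess.run(
        build_command(pdf, output_dir, pages, replace),
        check=True,
    )
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.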

Output Layout

outputs/pdf_rag/
  file_rag.sqlite
  file_rag_export.md
  file_assets/
    images/page_0001_image_01.png
    visual_json/page_0001.visual.json

Whole-page PNG rendering is disabled by default. Use --render-pages only for debugging.
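The layout above suggests that output names are derived from the PDF's file stem. The sketch below encodes that pattern so a script can locate outputs after a build. The naming scheme and the zero-padding widths (four digits for the page, two for the image index) are inferred from the examples in this README, not from the project's code, and `expected_outputs` and `image_path` are hypothetical helpers.

```python
from pathlib import Path


def expected_outputs(pdf_path, output_dir):
    """Guess output locations for a PDF, based on the layout shown above."""
    stem = Path(pdf_path).stem          # "file" for /path/to/file.pdf
    out = Path(output_dir)
    assets = out / f"{stem}_assets"
    return {
        "database": out / f"{stem}_rag.sqlite",
        "export": out / f"{stem}_rag_export.md",
        "images_dir": assets / "images",
        "visual_json_dir": assets / "visual_json",
    }


def image_path(pdf_path, output_dir, page, index):
    """Guess the PNG path for one extracted image (padding is inferred)."""
    images_dir = expected_outputs(pdf_path, output_dir)["images_dir"]
    return images_dir / f"page_{page:04d}_image_{index:02d}.png"
```

Checking `expected_outputs(...)["database"].exists()` after a build is a cheap way to confirm whether the inferred naming holds for your files.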

SQLite Tables

pages:

text: normal embedded PDF text
text_with_images: text plus inline image placeholders
markdown: page-level retrieval document
image_count
needs_ocr

images:

file_path
bbox_x0, bbox_y0, bbox_x1, bbox_y1
width, height
block_number
placeholder
caption
caption_model
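With the images columns listed above, finding images that still need a caption can be done directly in SQL. The sketch below builds a small in-memory table so it runs on its own. The column types, and the idea that an uncaptioned image has a NULL or empty caption, are assumptions. The real table may have additional columns, and the function names are hypothetical.

```python
import sqlite3


def make_demo_images_db():
    """Create an in-memory stand-in for the images table (types assumed)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE images (
            file_path TEXT,
            bbox_x0 REAL, bbox_y0 REAL, bbox_x1 REAL, bbox_y1 REAL,
            width INTEGER, height INTEGER,
            block_number INTEGER,
            placeholder TEXT,
            caption TEXT,
            caption_model TEXT
        )
        """
    )
    conn.executemany(
        "INSERT INTO images VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
        [
            ("images/page_0001_image_01.png", 0, 0, 10, 10, 100, 100, 1,
             "[[IMAGE page=1 index=1]]", None, None),
            ("images/page_0002_image_01.png", 0, 0, 10, 10, 100, 100, 1,
             "[[IMAGE page=2 index=1]]", "A bar chart", "demo-model"),
        ],
    )
    return conn


def uncaptioned_images(conn):
    """Return (file_path, placeholder) for images without a caption."""
    return conn.execute(
        "SELECT file_path, placeholder FROM images "
        "WHERE caption IS NULL OR caption = '' "
        "ORDER BY file_path"
    ).fetchall()
```

Against a real database you would pass `sqlite3.connect("outputs/pdf_rag/file_rag.sqlite")` instead of the demo connection.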

pages_fts:

FTS5 index over text, image placeholders, markdown, and saved captions.
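To show how an FTS5 index of this kind behaves once captions are saved, here is a self-contained sketch using Python's built-in `sqlite3` module. The column names in the demo table are assumptions chosen to mirror the description above, not the project's actual schema, and the example requires an SQLite build with FTS5 enabled.

```python
import sqlite3


def make_demo_index():
    """Build a tiny in-memory FTS5 table (column names are assumed)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE VIRTUAL TABLE pages_fts USING fts5(text, markdown, captions)"
    )
    conn.executemany(
        "INSERT INTO pages_fts(rowid, text, markdown, captions) "
        "VALUES (?, ?, ?, ?)",
        [
            (1, "The sector method divides the area.", "", ""),
            (2, "See the figure below.", "", "Diagram of the sector method"),
            (3, "Unrelated notes about grids.", "", ""),
        ],
    )
    return conn


def search_pages(conn, query, limit=5):
    """Return matching page rowids, best match first."""
    rows = conn.execute(
        "SELECT rowid FROM pages_fts WHERE pages_fts MATCH ? "
        "ORDER BY rank LIMIT ?",
        (query, limit),
    ).fetchall()
    return [r[0] for r in rows]
```

In the demo data, page 2 mentions the sector method only in its saved caption, yet a query for `sector method` finds it along with page 1. That is the practical payoff of writing captions back into the index.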

Notes

This project does not invent image captions. It extracts image blocks and makes them discoverable. Use an OCR or vision model to caption the extracted images, then persist the caption with save_pdf_image_caption.



