PDF Extraction MCP Server

by xraywu

PDF Extraction MCP server

MCP server to extract contents from a PDF file

Components

Tools

The server implements one tool:

  • extract-pdf-contents: Extract contents from a local PDF file
    • Takes "pdf_path" as a required string argument, representing the local file path of the PDF file
    • Takes "pages" as an optional string argument, representing the page numbers to extract contents from the PDF file. Page numbers are separated in comma, and negative page numbers supported (e.g. '-1' means the last page)
    • Supports PDF file reader and OCR

Quickstart

Install

Claude Desktop

On MacOS: ~/Library/Application\ Support/Claude/claude_desktop_config.json On Windows: %APPDATA%/Claude/claude_desktop_config.json

You must be authenticated.

A
security – no known vulnerabilities
F
license - not found
A
quality - confirmed to work

local-only server

The server can only run on the client's local machine because it depends on local resources.

An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.

  1. Components
    1. Tools
  2. Quickstart
    1. Install

Related MCP Servers

  • A
    security
    F
    license
    A
    quality
    An MCP server for converting Markdown documents to PDF files.
    Last updated -
    1
    1
    JavaScript
  • -
    security
    F
    license
    -
    quality
    Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
    Last updated -
    3
    Python
  • -
    security
    F
    license
    -
    quality
    Enables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.
    Last updated -
    1
    JavaScript
    • Apple
    • Linux
  • -
    security
    F
    license
    -
    quality
    A Python MCP server for invoice and receipt processing that uses OCR technology to extract data from PDFs and images, offering AI assistants the ability to process, extract text from, and merge invoice documents.
    Last updated -
    Python
    • Apple

View all related MCP servers

ID: pqwezk0y02