What can you do with this server?

The oxidize-pdf MCP server provides comprehensive PDF capabilities for AI agents to read, create, manipulate, annotate, secure, and analyze PDF documents. * Read & Extract (read_pdf, extract_text): Retrieve metadata (page count, version, encryption status, title, author, dimensions) and extract text from entire PDFs or specific pages. * Extract Entities (extract_entities): Pull structured text chunks with position (x, y), font size, and font name per page. * Convert (convert_pdf): Transform PDFs into markdown, token-limited chunks, or RAG-optimized semantic chunks for LLM consumption. * Analyze (analyze_pdf): Validate structure, detect corruption, check PDF/A compliance, and compare two PDFs. * Manipulate (manipulate_pdf): Split, merge, rotate, extract pages, reverse order, or overlay PDFs. * Annotate (annotate_pdf): Add sticky note annotations or highlight rectangles at specified positions. * Manage Forms (manage_forms): Create, fill, read, and validate PDF form fields. * Secure (secure_pdf): Encrypt/decrypt with user/owner passwords, check permissions, and verify digital signatures. * Create & Save (create_pdf, add_pdf_content, save_pdf): Start a PDF session, add text at specific coordinates or new blank pages, then finalize and save with optional encryption. * Resources: Access built-in fonts, standard page sizes, server capabilities, workspace files, and session data via oxidize:// URIs. * Guided Workflows: Built-in prompts assist with summarization, data extraction, form filling, and other common tasks.

Which integrations are available for this server?

Integrates with GitHub Copilot's agent mode in VS Code, exposing PDF manipulation tools for AI-assisted PDF workflows. Integrates with the OpenAI Agents SDK, allowing AI agents to perform PDF operations such as reading, extracting, converting, and manipulating PDFs.

oxidize-pdf

by bzsanti

Overview Schema Related Servers Score Discussions

Python

Local

oxidize-pdf

PyPI version License: MIT Python Typed MCP

oxidize-python MCP server

Rust-powered PDF library for Python. Generate, parse, split, merge, and manipulate PDFs with native performance. Ships with a built-in MCP server so AI agents can work with PDFs out of the box.

No C dependencies. No Java. No subprocess calls.

Installation

pip install oxidize-pdf            # Core library
pip install "oxidize-pdf[mcp]"     # + MCP server for AI agents

Platforms: Linux (x86_64, aarch64) | macOS (x86_64, Apple Silicon) | Windows (x86_64) Requires: Python 3.10+

Related MCP server: PDFSizeAnalyzer-MCP

Why oxidize-pdf?

	oxidize-pdf	Pure-Python libs	C/Java wrappers
Performance	Native (compiled Rust)	Interpreted	Native but heavy
Dependencies	Zero	Varies	Poppler, Java, Ghostscript
Memory safety	Rust ownership model	GC-dependent	Manual / GC
Type stubs	Full (mypy/pyright)	Partial	Rare
AI-ready (MCP)	Built-in	No	No

MCP Server

Give your AI agent full PDF capabilities in one line:

oxidize-mcp

The built-in Model Context Protocol server exposes 12 tools, 6 resources, and 5 prompts — compatible with Claude, GPT, and any MCP client.

Claude Desktop integration

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "oxidize-pdf": {
      "command": "oxidize-mcp",
      "env": {
        "OXIDIZE_WORKSPACE": "/path/to/your/pdfs"
      }
    }
  }
}

GitHub Copilot (VS Code) integration

Copilot's agent mode speaks MCP. Add .vscode/mcp.json to your workspace:

{
  "servers": {
    "oxidize-pdf": {
      "command": "oxidize-mcp",
      "env": {
        "OXIDIZE_WORKSPACE": "/path/to/your/pdfs"
      }
    }
  }
}

Open the Chat view, switch to Agent mode, and the 12 PDF tools appear in the tool picker. (The same block also works under the mcp.servers key in your user settings.json if you prefer a global install.)

OpenAI Agents SDK integration

The OpenAI Agents SDK spawns the server over stdio and exposes its tools to an agent:

from agents import Agent, Runner
from agents.mcp import MCPServerStdio

async with MCPServerStdio(
    params={"command": "oxidize-mcp", "env": {"OXIDIZE_WORKSPACE": "/path/to/your/pdfs"}},
    cache_tools_list=True,
) as server:
    agent = Agent(
        name="PDF assistant",
        instructions="Use the oxidize-pdf tools to inspect and manipulate PDFs.",
        mcp_servers=[server],
    )
    result = await Runner.run(agent, "How many pages does report.pdf have?")
    print(result.final_output)

A runnable version is in examples/openai_agents_quickstart.py.

Both integrations run the server locally over stdio, so its tools operate on PDFs in the configured workspace directory. Remote/hosted use (e.g. the OpenAI Responses API hosted MCP tool) needs an HTTP transport and is not yet exposed.

Available tools

Tool	What it does
`read_pdf`	Read metadata — page count, version, encryption status, title, author
`extract_text`	Extract text from all pages or a specific page
`convert_pdf`	Convert to markdown, chunks, or RAG-optimized format
`create_pdf`	Create a new PDF with optional metadata
`save_pdf`	Save a session to disk, with optional encryption
`add_content`	Add pages, text, and graphics to a session
`annotate_pdf`	Add text annotations and highlights
`manipulate_pdf`	Split, merge, rotate, extract pages, reverse, overlay
`manage_forms`	Create, fill, read, and validate form fields
`secure_pdf`	Encrypt, check permissions, verify signatures
`extract_entities`	Extract structured entities from pages
`analyze_pdf`	Validate structure, detect corruption, check PDF/A compliance

The server also exposes resources (session data, capabilities, version info) and prompts (guided workflows for summarization, data extraction, form filling, and more).

Configuration

OXIDIZE_WORKSPACE=/path/to/pdfs oxidize-mcp

The server is configured entirely through environment variables:

Variable	Default	Purpose
`OXIDIZE_WORKSPACE`	`~/Documents/oxidize-mcp`	Sandbox root; all paths must resolve inside it.
`OXIDIZE_ALLOWED_PATHS`	(none)	Comma-separated extra directories allowed outside the workspace.
`OXIDIZE_MAX_FILE_SIZE_MB`	`100`	Reject input PDFs larger than this on disk.
`OXIDIZE_MAX_PAGES`	`10000`	Reject documents with more pages than this before any extraction work.
`OXIDIZE_MAX_OUTPUT_BYTES`	`10485760`	Cap the serialized size of a tool's JSON response (10 MB).
`OXIDIZE_MAX_SESSIONS`	`10`	Maximum concurrent stateful PDF-creation sessions.
`OXIDIZE_MAX_SESSION_BYTES`	`10485760`	Cap the content a single session may accumulate (10 MB).
`OXIDIZE_SESSION_TIMEOUT`	`3600`	Session expiry, in seconds.

Resource caps (OXIDIZE_MAX_*) protect the server from a large or malicious PDF: oversized documents are rejected up front and tool responses are bounded rather than serialized unbounded. Exceeding a cap returns an error with code RESOURCE_LIMIT.

Or start programmatically:

from oxidize_pdf.mcp.server import run
run()

Python API

Create a PDF

from oxidize_pdf import Document, Page, Font, Color

doc = Document()
doc.set_title("My Document")
doc.set_author("Jane Doe")

page = Page.a4()
page.set_font(Font.HELVETICA, 24.0)
page.set_text_color(Color.black())
page.text_at(72.0, 750.0, "Hello from oxidize-pdf!")

page.set_font(Font.TIMES_ROMAN, 12.0)
page.text_at(72.0, 700.0, "Generated with Python + Rust.")

doc.add_page(page)
doc.save("output.pdf")

Parse an existing PDF

from oxidize_pdf import PdfReader

reader = PdfReader.open("document.pdf")
print(f"Pages: {reader.page_count}, Version: {reader.version}")

for i, text in enumerate(reader.extract_text()):
    print(f"--- Page {i + 1} ---")
    print(text)

Operations

from oxidize_pdf import split_pdf, merge_pdfs, rotate_pdf, extract_pages

split_pdf("input.pdf", "output_dir/")                       # Split into individual pages
merge_pdfs(["part1.pdf", "part2.pdf"], "merged.pdf")         # Merge multiple PDFs
rotate_pdf("input.pdf", "rotated.pdf", 90)                   # Rotate all pages
extract_pages("input.pdf", "subset.pdf", [0, 2, 4])          # Extract specific pages

Graphics

from oxidize_pdf import Document, Page, Color

doc = Document()
page = Page.a4()

page.set_fill_color(Color.hex("#3498db"))
page.draw_rect(72.0, 700.0, 200.0, 100.0)
page.fill()

page.set_stroke_color(Color.red())
page.set_line_width(2.0)
page.draw_circle(300.0, 500.0, 50.0)
page.stroke()

doc.add_page(page)
doc.save("graphics.pdf")

Types

from oxidize_pdf import Color, Point, Rectangle, Margins, Font

# Colors
Color.rgb(1.0, 0.0, 0.0)          # RGB
Color.hex("#ff6600")               # Hex
Color.cmyk(0.0, 1.0, 1.0, 0.0)   # CMYK

# Geometry
Point(72.0, 720.0)
Rectangle.from_xywh(72.0, 72.0, 468.0, 648.0)
Margins.uniform(72.0)

# Fonts — all 14 standard PDF fonts
Font.HELVETICA    # Font.HELVETICA_BOLD
Font.TIMES_ROMAN  # Font.TIMES_BOLD
Font.COURIER      # Font.COURIER_BOLD

Error handling

from oxidize_pdf import PdfReader, PdfError, PdfIoError, PdfParseError

try:
    reader = PdfReader.open("missing.pdf")
except PdfIoError as e:
    print(f"I/O error: {e}")
except PdfParseError as e:
    print(f"Parse error: {e}")
except PdfError as e:
    print(f"PDF error: {e}")

Exception hierarchy: PdfError > PdfIoError, PdfParseError, PdfEncryptionError, PdfPermissionError

MCP Server

oxidize-pdf includes an MCP server that exposes PDF capabilities to AI assistants like Claude. Install with the mcp extra:

pip install oxidize-pdf[mcp]

Claude Desktop

Add this to your claude_desktop_config.json:

{
  "mcpServers": {
    "oxidize-pdf": {
      "command": "uvx",
      "args": ["--from", "oxidize-pdf[mcp]", "oxidize-mcp"]
    }
  }
}

Claude Code

claude mcp add oxidize-pdf -- uvx --from "oxidize-pdf[mcp]" oxidize-mcp

Available tools

Tool	Description
`read_pdf`	Open a PDF and get metadata (pages, version, encryption)
`extract_text`	Extract text content from PDF pages
`convert_pdf`	Convert between PDF versions
`analyze_pdf`	Analyze structure, fonts, images, and compliance
`extract_entities`	Extract images and digital signatures
`manipulate_pdf`	Split, merge, rotate, extract, and reorder pages
`annotate_pdf`	Add text annotations, highlights, and stamps
`manage_forms`	Create, fill, and read PDF form fields
`secure_pdf`	Encrypt, decrypt, and set document permissions
`create_pdf`	Create a new PDF document with pages
`add_pdf_content`	Add text, shapes, and images to pages
`save_pdf`	Save the document to file or bytes

Resources

oxidize://fonts — Available built-in PDF fonts
oxidize://page-sizes — Standard page sizes with dimensions
oxidize://capabilities — Server capabilities and tool listing
oxidize://version — Version information
oxidize://workspace — PDF files in the workspace directory
oxidize://session/{id} — Session data by ID

Known limitations

Encryption write support: Document.encrypt() configures encryption parameters but the underlying Rust library does not yet serialize the encryption dictionary to the PDF output. Reading encrypted PDFs works correctly.
Image extraction returns raw embedded streams: extract_images_from_pdf extracts each embedded image as-is (e.g. a DCTDecode JPEG is written byte-for-byte). Image preprocessing — auto rotation-correction, contrast enhancement, denoise, upscaling, force-grayscale — is not available, because the build excludes the upstream external-images feature (and its image-crate dependency). This keeps extraction faithful and lossless; it does not silently return empty or stub results.
CPython only: PyPy and GraalPy are not supported.

License

MIT — see LICENSE for details.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

1dResponse time

3dRelease cycle

25Releases (12mo)

Commit activity

Issues opened vs closed

Resources

GitHub Repository

Need Help?

Related Servers

Tools

View all tools

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/bzsanti/oxidize-python'

If you have feedback or need assistance with the MCP directory API, please join our Discord server