Skip to main content
Glama

PDFtotext MCP Server

by jpwebb
MIT License
8
  • Linux
  • Apple

read_pdf_text

Extract text from PDF files using the pdftotext utility. Specify pages, preserve layout formatting, or set encoding options. Ideal for processing single or multi-page documents.

Instructions

Extract text content from a PDF file using pdftotext from poppler-utils

Input Schema

NameRequiredDescriptionDefault
encodingNoText encoding for output (default: UTF-8)UTF-8
layoutNoPreserve original text layout formatting (default: false)
pageNoSpecific page number to extract (1-based indexing). If not specified, extracts all pages.
pathYesPath to the PDF file (relative to current working directory or absolute path)

Input Schema (JSON Schema)

{ "properties": { "encoding": { "default": "UTF-8", "description": "Text encoding for output (default: UTF-8)", "enum": [ "UTF-8", "Latin1", "ASCII" ], "type": "string" }, "layout": { "default": false, "description": "Preserve original text layout formatting (default: false)", "type": "boolean" }, "page": { "description": "Specific page number to extract (1-based indexing). If not specified, extracts all pages.", "minimum": 1, "type": "number" }, "path": { "description": "Path to the PDF file (relative to current working directory or absolute path)", "type": "string" } }, "required": [ "path" ], "type": "object" }
Install Server

Other Tools from PDFtotext MCP Server

Related Tools

    MCP directory API

    We provide all the information about MCP servers via our MCP API.

    curl -X GET 'https://glama.ai/api/mcp/v1/servers/jpwebb/pdftotext-mcp'

    If you have feedback or need assistance with the MCP directory API, please join our Discord server