Search for:
Why this server?
This server provides access to image URIs, metadata, and OCR data via the Gyazo API, enabling OCR processing for PDF's converted to images.
Why this server?
This server enables LLMs to extract and use content from unstructured documents across a wide variety of file formats, which would include PDF's.
Why this server?
This server retrieves and processes content from web pages, converting HTML to markdown, which would be helpful if the PDF is available online as a webpage.
Why this server?
Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
Why this server?
Provides a set of tools to manipulate PDF's including: extracting pages, merging, and searching, however it does not explicitly OCR.
Why this server?
Converts Markdown to styled PDFs, which isn't quite the user's request but is related, and could be part of a workflow.
Why this server?
OCR images or pdfs, locally or by URLs by using Mistral OCR API (paid)
Why this server?
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown which could include extracting from a PDF that's rendered as a webpage.