Why this server?
This server provides access to image URIs, metadata, and OCR data via the Gyazo API, enabling OCR processing for PDF's converted to images.
-securityAlicense-qualityA TypeScript-based MCP server that enables AI assistants to interact with Gyazo images using the Model Context Protocol, providing access to image URIs, metadata, and OCR data via the Gyazo API.Last updated1025MITWhy this server?
This server enables LLMs to extract and use content from unstructured documents across a wide variety of file formats, which would include PDF's.
AsecurityFlicenseBqualityA Model Context Protocol server that enables LLMs to extract and use content from unstructured documents across a wide variety of file formats.Last updated111Why this server?
This server retrieves and processes content from web pages, converting HTML to markdown, which would be helpful if the PDF is available online as a webpage.
Asecurity-licenseBqualityThis server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.Last updated203183,665Why this server?
Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
-securityFlicense-qualityProvides tools for reading and extracting text from PDF files, supporting both local files and URLs.Last updated43Why this server?
Provides a set of tools to manipulate PDF's including: extracting pages, merging, and searching, however it does not explicitly OCR.
AsecurityAlicenseBqualitymcp using PyPDF2 to: • merge-pdfs • extract-pages • search-pdfs • merge-pdfs-ordered (merge in user spec. order) • find-related-pdfs (regex extracted text for related PDF files)Last updated574The UnlicenseWhy this server?
Converts Markdown to styled PDFs, which isn't quite the user's request but is related, and could be part of a workflow.
-securityFlicense-qualityConverts Markdown to styled PDFs using VS Code's markdown styling and Python's ReportLab, providing a simple note storage system with custom URI scheme.Last updated15Why this server?
OCR images or pdfs, locally or by URLs by using Mistral OCR API (paid)
-securityAlicense-qualityOCR images or pdfs, locally or by URLs by using Mistral OCR API (paid)Last updated36MITWhy this server?
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown which could include extracting from a PDF that's rendered as a webpage.
-securityAlicense-qualityA Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.Last updated4MIT