Tools for OCR and generating JSON or Markdown

Glama

Search for:

Tools for OCR and generating JSON or Markdown

View all MCP Servers

Why this server?
This server extracts text content from local PDF files, supporting OCR capabilities, which is directly relevant to the user's request.
PDF Extraction MCP Server
xraywu
A
security
F
license
A
quality
An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
Last updated -
1
17
Python
Why this server?
This server extracts webpage content and transforms it into clean, LLM-optimized Markdown, fulfilling the 'generate markdown' part of the user's query after potentially using OCR.
Mozilla Readability Parser MCP Server
jmh108
-
security
A
license
-
quality
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
Last updated -
2
Python
MIT License
Why this server?
Same as above but without the mention of the ID.
Mozilla Readability Parser MCP Server
emzimmer
A
security
A
license
A
quality
Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
Last updated -
1
673
13
MIT License
Why this server?
Generates a JSON file tree from a specified directory's 'src' folder, which aligns with the 'generate json' part of the query.
MCP Source Tree Server
owayo
A
security
A
license
A
quality
Generates a JSON file tree from a specified directory's 'src' folder, adhering to '.gitignore' rules, allowing for quick project structure review in Claude.
Last updated -
1
Python
MIT License
Why this server?
Analyzes web content and adds it to your knowledge base, storing content as Markdown files, which fits the use case of extracting information and formatting it.
Memex
narphorium
-
security
A
license
-
quality
A tool for Model Context Protocol (MCP) that allows you to analyze web content and add it to your knowledge base, storing content as Markdown files for easy viewing with tools like Obsidian.
Last updated -
9
Python
MIT License
Why this server?
This server is capable of collecting code from files and directories into a single markdown document, a useful function when processing information.
MCP Server Neurolorap
aindreyway
A
security
A
license
A
quality
MCP server for collecting code from files and directories into a single markdown document.
Last updated -
2
8
Python
MIT License
Why this server?
A document conversion server that transforms various file formats to Markdown format.
Markdownify MCP Server - UTF-8 Enhanced
JDJR2024
A
security
A
license
A
quality
A document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.
Last updated -
10
0
9
TypeScript
MIT License
Why this server?
Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities which could be used in conjunction with OCR to then convert and format.
Simple Document Processing MCP Server
cablate
A
security
A
license
A
quality
Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities.
Last updated -
16
46
11
TypeScript
MIT License
Why this server?
A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing. This also supports OCR use cases.
MCP Development Framework
aigo666
A
security
A
license
A
quality
A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing.
Last updated -
8
12
Python
MIT License
Why this server?
offers multiple pdf tools including search, extract pages, and merge pdfs.
mcp-pdf-tools
hanweg
A
security
A
license
A
quality
mcp using PyPDF2 to: • merge-pdfs • extract-pages • search-pdfs • merge-pdfs-ordered (merge in user spec. order) • find-related-pdfs (regex extracted text for related PDF files)
Last updated -
5
58
Python
The Unlicense