Search for:

Tools for OCR and generating JSON or Markdown

  • Why this server?

    This server extracts text content from local PDF files, supporting OCR capabilities, which is directly relevant to the user's request.

    A
    security
    F
    license
    A
    quality
    An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
    Last updated -
    1
    2
    Python
    • Apple
  • Why this server?

    This server extracts webpage content and transforms it into clean, LLM-optimized Markdown, fulfilling the 'generate markdown' part of the user's query after potentially using OCR.

    -
    security
    A
    license
    -
    quality
    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
    Last updated -
    1
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    Same as above but without the mention of the ID.

    A
    security
    A
    license
    A
    quality
    Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
    Last updated -
    1
    4
    11
    MIT License
  • Why this server?

    Generates a JSON file tree from a specified directory's 'src' folder, which aligns with the 'generate json' part of the query.

    A
    security
    A
    license
    A
    quality
    Generates a JSON file tree from a specified directory's 'src' folder, adhering to '.gitignore' rules, allowing for quick project structure review in Claude.
    Last updated -
    1
    Python
    MIT License
  • Why this server?

    Analyzes web content and adds it to your knowledge base, storing content as Markdown files, which fits the use case of extracting information and formatting it.

    -
    security
    A
    license
    -
    quality
    A tool for Model Context Protocol (MCP) that allows you to analyze web content and add it to your knowledge base, storing content as Markdown files for easy viewing with tools like Obsidian.
    Last updated -
    6
    Python
    MIT License
  • Why this server?

    This server is capable of collecting code from files and directories into a single markdown document, a useful function when processing information.

    A
    security
    A
    license
    A
    quality
    MCP server for collecting code from files and directories into a single markdown document.
    Last updated -
    2
    4
    Python
    MIT License
  • Why this server?

    A document conversion server that transforms various file formats to Markdown format.

    A
    security
    A
    license
    A
    quality
    A document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.
    Last updated -
    10
    4
    TypeScript
    MIT License
    • Linux
    • Apple
  • Why this server?

    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities which could be used in conjunction with OCR to then convert and format.

    A
    security
    A
    license
    A
    quality
    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities.
    Last updated -
    16
    46
    11
    TypeScript
    MIT License
  • Why this server?

    A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing. This also supports OCR use cases.

    A
    security
    A
    license
    A
    quality
    A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing.
    Last updated -
    8
    8
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    offers multiple pdf tools including search, extract pages, and merge pdfs.

    A
    security
    A
    license
    A
    quality
    mcp using PyPDF2 to: • merge-pdfs • extract-pages • search-pdfs • merge-pdfs-ordered (merge in user spec. order) • find-related-pdfs (regex extracted text for related PDF files)
    Last updated -
    5
    19
    Python
    The Unlicense