Search for:

Tools for OCR and generating JSON or Markdown

  • Why this server?

    This server extracts text content from local PDF files, supporting OCR capabilities, which is directly relevant to the user's request.

    A
    security
    F
    license
    A
    quality
    An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
    1
    2
    Python
    • Apple
  • Why this server?

    This server extracts webpage content and transforms it into clean, LLM-optimized Markdown, fulfilling the 'generate markdown' part of the user's query after potentially using OCR.

    -
    security
    A
    license
    -
    quality
    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
    1
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    Same as above but without the mention of the ID.

    A
    security
    A
    license
    A
    quality
    Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
    1
    4
    11
    MIT License
  • Why this server?

    Generates a JSON file tree from a specified directory's 'src' folder, which aligns with the 'generate json' part of the query.

    A
    security
    A
    license
    A
    quality
    Generates a JSON file tree from a specified directory's 'src' folder, adhering to '.gitignore' rules, allowing for quick project structure review in Claude.
    1
    Python
    MIT License
  • Why this server?

    Analyzes web content and adds it to your knowledge base, storing content as Markdown files, which fits the use case of extracting information and formatting it.

    -
    security
    A
    license
    -
    quality
    A tool for Model Context Protocol (MCP) that allows you to analyze web content and add it to your knowledge base, storing content as Markdown files for easy viewing with tools like Obsidian.
    6
    Python
    MIT License
  • Why this server?

    This server is capable of collecting code from files and directories into a single markdown document, a useful function when processing information.

    A
    security
    A
    license
    A
    quality
    MCP server for collecting code from files and directories into a single markdown document.
    2
    4
    Python
    MIT License
  • Why this server?

    A document conversion server that transforms various file formats to Markdown format.

    A
    security
    A
    license
    A
    quality
    A document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.
    10
    4
    TypeScript
    MIT License
    • Linux
    • Apple
  • Why this server?

    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities which could be used in conjunction with OCR to then convert and format.

    A
    security
    A
    license
    A
    quality
    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities.
    16
    21
    5
    TypeScript
    MIT License
  • Why this server?

    offers multiple pdf tools including search, extract pages, and merge pdfs.

    A
    security
    A
    license
    A
    quality
    mcp using PyPDF2 to: • merge-pdfs • extract-pages • search-pdfs • merge-pdfs-ordered (merge in user spec. order) • find-related-pdfs (regex extracted text for related PDF files)
    5
    19
    Python
    The Unlicense