Search for:

A method to read an image and extract text

  • Why this server?

    Can fetch web content and process images, potentially including OCR if the image contains text.

    security: - | license: A | quality: -
    A Model Context Protocol server that enables Claude Desktop (or any MCP client) to fetch web content and process images appropriately.
    11
    MIT License
    • Apple
  • Why this server?

    Allows conversion of documents to markdown, extraction of tables, and *processing of document images*, suggesting potential OCR functionality.

    security: - | license: A | quality: -
    A server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.
    6
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    Can fetch web content, which may contain images. It doesn't explicitly mention OCR, but it is a useful tool for getting the image data.

    security: A | license: A | quality: A
    This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.
    1
    34,289
    JavaScript
    MIT License
  • Why this server?

    Enables browser automation, including taking screenshots of webpages, which may be needed for OCR processing of webpage text.

    security: A | license: F | quality: A
    Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript.
    4
    15
    Python
    • Linux
  • Why this server?

    Enables interaction with web pages, which could include taking screenshots from which text can then be extracted.

    security: A | license: A | quality: A
    Enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment.
    10
    327
    85
    JavaScript
    MIT License
    • Apple
  • Why this server?

    Directly addresses the request by offering OCR capabilities on images and PDFs using the Mistral OCR API.

    security: - | license: F | quality: -
    Runs OCR on images or PDFs, from local files or URLs, using the Mistral OCR API (paid).
    10
    Python
    • Linux
  • Why this server?

    If the image containing text is embedded in a PDF document, this tool can first convert the PDF to markdown.

    security: - | license: A | quality: -
    A PDF to Markdown conversion tool.
    1
    Python
    MIT License
    • Linux
    • Apple