Search for:

How to extract text from images

  • Why this server?

    Provides screenshot and OCR capabilities for macOS, which can be used to extract text from images.

    A
    security
    A
    license
    A
    quality
    Provides screenshot and OCR capabilities for macOS.
    Last updated -
    1
    35
    10
    JavaScript
    MIT License
    • Apple
  • Why this server?

    Enables AI assistants to interact with Obsidian vaults, providing tools for reading, creating, editing and managing notes and tags; potentially useful if the images are stored as notes in Obsidian.

    -
    security
    A
    license
    -
    quality
    Enables AI assistants to interact with Obsidian vaults, providing tools for reading, creating, editing and managing notes and tags.
    Last updated -
    598
    149
    TypeScript
    MIT License
    • Apple
  • Why this server?

    Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.

    A
    security
    A
    license
    A
    quality
    Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.
    Last updated -
    1
    278
    15
    JavaScript
    MIT License
    • Apple
  • Why this server?

    Enables capturing screenshots of web pages and local HTML files through a simple MCP tool interface using Puppeteer with configurable options for dimensions and output paths.

    -
    security
    F
    license
    -
    quality
    Enables capturing screenshots of web pages and local HTML files through a simple MCP tool interface using Puppeteer with configurable options for dimensions and output paths.
    Last updated -
    1
    0
    4
    JavaScript
  • Why this server?

    A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models, which could be used for extracting text.

    -
    security
    A
    license
    -
    quality
    A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
    Last updated -
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    A server providing text-to-speech and speech-to-text capabilities using Windows' native speech services without external dependencies. Could be used as an auxiliary service.

    -
    security
    F
    license
    -
    quality
    A server providing text-to-speech and speech-to-text functionalities using Windows' native speech services without external dependencies.
    Last updated -
    4
    JavaScript
  • Why this server?

    Helps refine AI-generated content to sound more natural and human-like. Built with advanced AI detection and text enhancement capabilities; could be used for post-processing the extracted text.

    A
    security
    A
    license
    A
    quality
    Helps refine AI-generated content to sound more natural and human-like. Built with advanced AI detection and text enhancement capabilities.
    Last updated -
    1
    47
    8
    JavaScript
    MIT License
  • Why this server?

    A Model Context Protocol server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.

    -
    security
    F
    license
    -
    quality
    A PDF processing server that extracts text via normal parsing or OCR, and retrieves images from PDF files through the MCP protocol with a built-in web debugger.
    Last updated -
    6
    Python