Search for:

A tool for converting images to text

  • Why this server?

    This server enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information, directly addressing the user's need.

    security: A, license: A, quality: A
    A server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.
    Last updated -
    3
    60
    5
    JavaScript
    MIT License
  • Why this server?

    This tool converts web-based documentation into Markdown format, allowing for image extraction and conversion to text.

    security: -, license: F, quality: -
    Converts web-based documentation into Markdown format using jina.ai's conversion service, allowing users to scrape documentation from any URL and save it as Markdown files.
    Last updated -
    6
    Python
    • Linux
    • Apple
  • Why this server?

    This server provides functionality to capture full-page screenshots of local HTML files, useful if the image containing text is embedded in a webpage.

    security: A, license: A, quality: A
    Provides HTML file preview and analysis capabilities. This server enables capturing full-page screenshots of local HTML files and analyzing their structure.
    Last updated -
    2
    8
    JavaScript
    MIT License
  • Why this server?

    Although not a picture-to-text converter, this server generates images and may be useful to users looking for other image-processing capabilities.

    security: -, license: F, quality: -
    Allows AI assistants to generate and transform high-quality images from text prompts using Google's Gemini model via the Model Context Protocol (MCP).
    Last updated -
    1
    Python
    • Apple
  • Why this server?

    This service extracts and transcribes audio content from videos and supports multiple transcription providers. It converts speech to text rather than reading on-screen images, so it is only loosely related to image-to-text conversion.

    security: -, license: A, quality: -
    A service that extracts and transcribes audio content from videos across 1000+ streaming websites including YouTube, Bilibili, TikTok, and Twitter, supporting multiple transcription providers like Deepgram, Gladia, Speechmatics, and AssemblyAI.
    Last updated -
    5
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    This server allows creating PowerPoint (PPT) files with text and images. It might be useful for generating documents that contain text extracted from images.

    security: -, license: F, quality: -
    A Model Context Protocol server that enables AI models to create and manipulate PowerPoint presentations with advanced features like financial charts, formatting, and template management.
    Last updated -
    1
    Python