Skip to main content
Glama

Tools for Reading Text from PDFs and Images MCP tools

Production-ready MCP servers that extend AI capabilities through file access, database connections, APIs, and contextual services.

71,685 tools. Last updated 2026-02-14 02:19
  • Extract targeted information from files without loading entire contents. Ask specific questions about text, code, images, or PDFs to get precise answers while minimizing context usage.
    TypeScript
    MIT
  • Extract text from images for document processing, receipt scanning, and text extraction using OCR technology. Supports both URLs and base64 encoded images.
  • Retrieve Spec3 racing documents with full text and visual content like diagrams and tables from PDFs stored in S3. Specify page ranges and include images to preserve formatting.
  • Extract text from images for document processing, receipt scanning, and image text extraction using OCR technology. Supports both URLs and base64 encoded images.

Interested in MCP?

Join the MCP community for support and updates.

RedditDiscord

Matching MCP servers

  • -
    security
    F
    license
    -
    quality
    Enables document conversion between PDF, DOCX, and Markdown formats to facilitate reading and editing complex files in AI tools like Claude Desktop or Cursor. It utilizes marker-pdf and pandoc to provide structured text versions of documents, helping to manage context and support unsupported file types.
    Last updated 8 months ago
    1
  • A
    security
    A
    license
    A
    quality
    Provides tools to fetch IIIF manifests and retrieve specific image regions or scaled images for analysis. This server enables detailed interaction with International Image Interoperability Framework resources, supporting tasks like image description and transcription.
    Last updated 7 months ago
    3
    4
    MIT
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    A Model Context Protocol server that enables AI-powered image generation through Stability AI and Black Forest Labs APIs, allowing users to create images from detailed text prompts with customizable settings and comprehensive metadata tracking.
    Last updated 7 months ago
    MIT