Skip to main content
Glama
134,441 tools. Last updated 2026-05-23 14:52

"Tools for OCR and Analyzing Text and Content in Images" matching MCP tools:

  • Search past screen capture OCR text to find when you saw specific content, error messages, or work references. Returns matching entries with timestamps and context.
    AGPL 3.0
  • Extract visible text from your screen using OCR. Returns text grouped by detected windows with bounding boxes. Use when you need code, terminal, chat, or document content without visual layout. Avoids sending data to the cloud.
    AGPL 3.0
  • Generate images from text prompts or edit existing images using AI models, supporting multiple aspect ratios and resolutions, with results saved as files.
    MIT
  • Generate and edit images using AI with text prompts and reference images, supporting multiple aspect ratios, resolutions, and style transfers.
    MIT

Matching MCP Servers

  • A
    license
    C
    quality
    C
    maintenance
    Enables access to Usage and Billing APIs for managing accounts, products, meters, plans, and usage reporting. Supports operations like creating products/plans, reporting usage, and retrieving billing information.
    Last updated
    18
    MIT
  • A
    license
    C
    quality
    C
    maintenance
    Enables interaction with Google Cloud services including billing cost analysis, log querying, and metrics monitoring through natural language commands. Provides comprehensive tools for managing GCP resources, analyzing costs, detecting anomalies, and retrieving operational insights.
    Last updated
    40
    1
    Apache 2.0

Matching MCP Connectors

  • MCP server for social media and content data including social profiles, engagement metrics, content trends, and influencer analytics for AI agents.

  • GOV.UK Content + Search APIs (every gov.uk page + full search)

  • Extract text from images using OCR technology. Convert image content into editable text by providing an image URL.
    MIT
  • Convert PDF and scanned images to text while preserving layout. Supports URL input, optional page ranges, and OCR language selection.
    MIT
  • Capture screenshots and extract text with OCR in one step, returning both the image and text with word coordinates for Hyprland desktop automation.
    MIT
  • Locate text on your screen using OCR to find matching coordinates for automation tasks. Specify target text and optional search areas like monitors, windows, or regions to get precise screen positions.
    MIT
  • Retrieve Spec3 racing documents with full text and visual content like diagrams and tables from PDFs stored in S3. Specify page ranges and include images to preserve formatting.
    MIT
  • Automatically locate text input fields using OCR, click to focus, type specified text, and optionally submit with Enter for desktop automation in Hyprland environments.
    MIT
  • Locate and click text on screen using OCR, screenshot, and mouse automation in Hyprland desktop environments.
    MIT
  • Analyze any video URL to extract transcript, key frames, metadata, comments, OCR text, and an annotated timeline. Supports Loom and direct video files. Get structured data with scene-change detection and configurable detail levels.
    MIT
  • Extract and convert PDF document text to markdown format for AI processing. This tool reads PDF content and returns clean text from all pages, simplifying document analysis for agents.
    MIT