Extract and process images from file paths for visual content analysis, OCR text extraction, and object recognition. Supports screenshots, photos, diagrams, and documents in PNG, JPG, GIF, and WebP formats.
Process images in a directory with operations such as enhancement, OCR, resizing, and deduplication, returning results as JSON for easier image management.
Extract content from images and videos using AI and convert it into Markdown for structured documentation. Ideal for transforming visual media into descriptive, text-based output.
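As a rough illustration of the deduplication step with JSON output, here is a minimal sketch. It is not the tool's actual implementation: the function name `find_duplicate_images`, the suffix list, and the output keys are all assumptions, and it only detects byte-identical files (not visually similar ones).

```python
import hashlib
import json
from pathlib import Path

# Assumed set of image extensions; the real tool may support others.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".webp"}


def find_duplicate_images(directory):
    """Group image files in `directory` by the SHA-256 of their bytes.

    Returns a JSON string listing each group of byte-identical files.
    """
    groups = {}
    for path in sorted(Path(directory).iterdir()):
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            groups.setdefault(digest, []).append(path.name)
    # Keep only hashes shared by more than one file.
    duplicates = [names for names in groups.values() if len(names) > 1]
    return json.dumps(
        {"directory": str(directory), "duplicates": duplicates}, indent=2
    )
```

Calling `find_duplicate_images("photos")` on a hypothetical `photos` folder would return a JSON document whose `duplicates` key holds lists of file names with identical content.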
Extract text from image files or URLs using optical character recognition (OCR) with the Florence-2 MCP Server. Process images to retrieve text content efficiently.
Extract Markdown text from PDF documents for AI agent processing, using Mistral AI's OCR capabilities via the Lizeur server.
Enables interaction with Google Cloud services including billing cost analysis, log querying, and metrics monitoring through natural language commands. Provides comprehensive tools for managing GCP resources, analyzing costs, detecting anomalies, and retrieving operational insights.
Enables AI agents to break down complex tasks into manageable pieces using a structured JSON format with task tracking, context preservation, and progress monitoring capabilities.
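To make the idea of a structured JSON task breakdown concrete, the sketch below models a plan with subtasks, preserved context, and a progress figure. The class names `Subtask` and `TaskPlan`, the field names, and the status values are hypothetical and chosen only for illustration; the actual server's schema may differ.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class Subtask:
    id: int
    description: str
    status: str = "pending"  # assumed states: pending, in_progress, done


@dataclass
class TaskPlan:
    goal: str
    context: dict = field(default_factory=dict)   # notes preserved across steps
    subtasks: list = field(default_factory=list)

    def add(self, description):
        """Append a new pending subtask and return it."""
        task = Subtask(id=len(self.subtasks) + 1, description=description)
        self.subtasks.append(task)
        return task

    def progress(self):
        """Fraction of subtasks marked done (0.0 when there are none)."""
        if not self.subtasks:
            return 0.0
        done = sum(1 for t in self.subtasks if t.status == "done")
        return done / len(self.subtasks)

    def to_json(self):
        """Serialize the plan, including the computed progress, as JSON."""
        return json.dumps({**asdict(self), "progress": self.progress()}, indent=2)
```

An agent could call `add` for each piece of a larger task, update `status` as work completes, and pass `to_json()` between steps so that context and progress survive across calls.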
Enables integration between MCP clients and the Handwriting OCR service, allowing users to upload images and PDF documents, check processing status, and retrieve OCR results as Markdown.