Extract text from images using OCR for document processing and receipt scanning. Supports both URLs and base64-encoded images.
Extract text from images for document processing, receipt scanning, and image text extraction using OCR technology. Supports both URLs and base64-encoded images.
Extract and process images from file paths for visual content analysis, OCR text extraction, and object recognition. Supports screenshots, photos, diagrams, and documents in PNG, JPG, GIF, and WebP formats.
Process images in a directory with operations like enhancement, OCR text extraction, resizing, and deduplication to organize and extract information from visual content.
Extract text from images using OCR technology. Convert screenshots and documents into editable text, with multi-language support.
Enables access to Usage and Billing APIs for managing accounts, products, meters, plans, and usage reporting. Supports operations like creating products/plans, reporting usage, and retrieving billing information.
Enables interaction with Google Cloud services including billing cost analysis, log querying, and metrics monitoring through natural language commands. Provides comprehensive tools for managing GCP resources, analyzing costs, detecting anomalies, and retrieving operational insights.
Enables AI agents to break down complex tasks into manageable pieces using a structured JSON format with task tracking, context preservation, and progress monitoring capabilities.
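
The last entry describes a structured JSON format with task tracking, context preservation, and progress monitoring, but does not specify the schema. The following is a minimal sketch of what such a structure might look like. Every name here (`TaskPlan`, `Subtask`, the `status` values, the `context` field) is a hypothetical illustration, not the format used by that tool.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class Subtask:
    # Hypothetical fields; the real tool's schema is not given in the description.
    id: str
    title: str
    status: str = "pending"  # assumed values: "pending", "in_progress", "done"


@dataclass
class TaskPlan:
    goal: str
    context: dict = field(default_factory=dict)   # notes preserved across steps
    subtasks: list = field(default_factory=list)  # list of Subtask

    def progress(self) -> float:
        """Return the fraction of subtasks marked done (0.0 if there are none)."""
        if not self.subtasks:
            return 0.0
        done = sum(1 for s in self.subtasks if s.status == "done")
        return done / len(self.subtasks)

    def to_json(self) -> str:
        """Serialize the whole plan, including nested subtasks, to JSON."""
        return json.dumps(asdict(self), indent=2)


plan = TaskPlan(
    goal="Digitize scanned receipts",
    context={"source_dir": "receipts/"},
    subtasks=[
        Subtask(id="1", title="Run OCR on each image"),
        Subtask(id="2", title="Parse totals from extracted text"),
    ],
)
plan.subtasks[0].status = "done"
print(plan.to_json())
print(plan.progress())
```

Keeping the plan as plain data that round-trips through JSON is one way an agent could persist context and progress between steps; an actual implementation may organize this differently.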