Extract targeted information from files without loading entire contents. Ask specific questions about text, code, images, or PDFs to get precise answers while minimizing context usage.
Extract text from images using OCR for document processing and receipt scanning. Supports both URLs and base64-encoded images.
Extract text from images using OCR technology for document processing, receipt scanning, and general image text extraction. Supports both URLs and base64-encoded images.
Extract text from PDFs and scanned images while preserving layout. Convert content from specified pages, regions, or languages. Supports password-protected and web-hosted files.
Extract readable text from PDF files to access document information, optionally including metadata for more comprehensive analysis.
Extract images from web pages with metadata including alt text and dimensions. Use this tool to gather visual content from URLs for analysis or documentation.
Enables downloading videos from platforms like YouTube and converting them to text using OpenAI Whisper and ffmpeg. It supports multiple output formats including TXT, JSON, SRT, and VTT for transcriptions.
Converts natural language queries into valid GraphQL queries and executes them against GraphQL APIs. Includes schema introspection, query validation, execution with authentication, and query history tracking.
Enables text-to-image generation using Zhipu AI's CogView-4 API. Supports generating images from text prompts with configurable size and quality parameters through MCP-compatible clients like Claude Desktop and Cline.