Generate images using text prompts and reference images. This tool processes multi-modal inputs to create visual content through the ImaginePro MCP Server.
Extract text from images for document processing, receipt scanning, and text extraction using OCR technology. Supports both URLs and base64 encoded images.
Extract text from images for document processing, receipt scanning, and image text extraction using OCR technology. Supports both URLs and base64 encoded images.
Convert speech recordings to text transcripts using ElevenLabs MCP Enhanced. Transcribe audio files to written text for documentation, analysis, or accessibility purposes.
Enables downloading videos from platforms like YouTube and converting them to text using OpenAI Whisper and ffmpeg. It supports multiple output formats including TXT, JSON, SRT, and VTT for transcriptions.
Provides tools to fetch IIIF manifests and retrieve specific image regions or scaled images for analysis. This server enables detailed interaction with International Image Interoperability Framework resources, supporting tasks like image description and transcription.
Converts natural language queries into valid GraphQL queries and executes them against GraphQL APIs. Includes schema introspection, query validation, execution with authentication, and query history tracking.