Analyze images with Gemini AI to answer questions about visual content, identify objects, or extract information from photos using vision capabilities.
Analyze images using Gemini's vision capabilities to extract summaries, identify objects, read text, or provide detailed insights based on user preferences and context.
Analyze images to extract summaries, objects, text, or detailed insights using Gemini's multimodal vision capabilities. Supports JPEG, PNG, WebP, and other formats with optional context for enhanced results.
Set up and install an MCP server from locally cloned code on your computer by specifying the path, arguments, and environment variables. Requires npx or uv for node and Python servers.
Provides an intelligent, graph-based memory system for LLM agents using the Zettelkasten principle, enabling automatic note construction, semantic linking, memory evolution, and autonomous graph maintenance with background optimization processes.
MCP server that provides computer control capabilities including mouse movements, keyboard actions, screenshot capture with OCR, and window management through a unified API.