Skip to main content
Glama
180,147 tools. Last updated 2026-06-06 01:32

"vision" matching MCP tools:

  • Retrieve available AI vision models for analyzing images from URLs, files, or base64 data through the Vision MCP Server.
    MIT
  • Retrieves enriched metadata for a fired Pythia Vision by ID: pattern ranges, failure profile, cooldown context, concurrent fires. Enables AI agents to size positions and compare historical failure modes.
    MIT
  • Run the pdfhell adversarial-PDF benchmark to evaluate a vision model's robustness against malicious PDFs. Returns pass rate and per-family statistics.
    Apache 2.0
  • Retrieve walk-forward validated pattern detections with accuracy stats, contract details, and live token sets for on-chain market intelligence.
    MIT
  • Fetch the latest decoded video frame from a live Luxxon session as a JPEG, ready for direct input to a vision model. Retry if a 404 is returned during initial keyframe arrival.
    MIT

Matching MCP Servers

Matching MCP Connectors

  • AI-powered codebase analysis — call graphs, security, dead code, complexity. 150+ tools.

  • BJJ video analysis — YOLO pose detection, AI technique analysis, and highlight reels.

  • Retrieve workspace details: strategy (vision, non-negotiables, architecture principles), active products, and constitution rules to align teams on structured intent.
    MIT
  • Cross-reference assembled board photos against a BOM file to verify component presence, polarity, orientation, and value. Review flagged components manually before accepting the first article.
    Apache 2.0
  • Query ODEI's constitutional knowledge graph to explore structured domains, find specific entities, and understand connections between goals and execution.
  • Retrieve recent Pythia Visions for any token with pattern breakdowns and statistics. Analyze historical vision data to inform trading decisions.
    MIT
  • Returns the full game state visible to your team, including board dimensions, terrain, visible units with status, turn number, active player, and win-condition progress. Use at turn start to orient before making specific decisions.
    Apache 2.0
  • Run a design system audit to check spec compliance for WCAG accessibility, token coverage, and naming conventions. Returns structured findings with pass/warn/fail status.
    MIT
  • Get guidance on writing effective prompts for Veo video generation. Structure prompts for improved results using examples and tips.
    MIT
  • Analyze image content with quick summaries or structured 7-dimension analysis. Ask specific questions to extract text, objects, charts, and evidence from images.
    MIT
  • Split large images or web page captures into optimally sized tiles for LLM vision analysis, reducing token costs by fetching only non-blank, relevant tiles.
    MIT
  • Generate detailed descriptions of images using base64-encoded data. Ideal for uploaded images in chat conversations, providing accurate analysis via advanced vision APIs.
    MIT
  • Render a preview of a TouchDesigner operator and generate a plain-text caption with color stats to verify the network is rendering correctly, using vision or histogram analysis.
    MIT