License: A
Quality: A
Maintenance: C
Enables AI agents to analyze videos and images using Qwen3-VL deployed on Modal. Supports hours-long videos with timestamp grounding, text extraction, video summarization, and Q&A, with a 256K context window.
Last updated
8
2
MIT