-
securityA
license-
qualityEnables AI coding agents to locate UI elements on screen with exact pixel coordinates using a local UI-TARS-2B vision model. Provides offline GUI grounding with ~1.2s latency on 4.1GB VRAM, eliminating cloud dependencies for computer-use automation.
Last updated
MIT