XBOUND Introduces State-Level Evaluation for Device-Control Agents
XBOUND evaluates device‑control agents per UI state; the 7B UI‑TARS model achieved the highest state‑level accuracy while models under 7B lag behind in tests. Read more: getnews.me/xbound-introduces-state-... #xbound #devicecontrol #uitars