NEW Browse AI tools across categories — updated daily. See what's new →

ByteDance: UI-TARS 7B Multimodal

By Bytedance

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

OpenRouter rank
#283
0.00% share
Context window
128K tokens
Input price
$0.100 / 1M tok
Output price
$0.20 / 1M tok
Modalities
imagetext
External ID
bytedance/ui-tars-1.5-7b

No ranking history yet

This model is in our catalog but hasn't appeared on a tracked leaderboard yet. Trend lines appear after the next ingestion cycle (within 6 hours).

Back to all rankings