Back to all models
bytedance
ByteDance: UI-TARS 7B
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
bytedance/ui-tars-1.5-7b
Context Size
128K
Input Price
15,000 pts/M
Output Price
30,000 pts/M
Architecture
Image
Text
Supported Parameters
frequency_penaltylogit_biasmax_tokenspresence_penaltyrepetition_penaltyseedstoptemperaturetop_ktop_p
Details
TokenizerOther
Max Completion2,048 tokens
Provider Context128K tokens
ModeratedNo
CreatedJul 22, 2025