modelgrep
B

ByteDance: UI-TARS 7B

bytedance/ui-tars-1.5-7b

Cheaper than 81% of paidVision
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
5
299th fastest
Latency
1.9s
first token
Input price
$0.100
56th cheapest
Context
128K
2K max out

How it compares

Faster than3%
of all ranked models
Cheaper than81%
of all ranked models

Overview

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Parasailbf16$0.100$0.200100%

Specifications

Context window128K
Max output2K
Knowledge cutoffJan 2025
Input modalitiesimage, text
Output modalitiestext
Prompt caching
Cache read price$0.100/M
ModeratedNo

UI-TARS 7B FAQ

How much does UI-TARS 7B cost?

UI-TARS 7B costs $0.100 per million input tokens and $0.200 per million output tokens via OpenRouter, making it 56th cheapest of 298 paid models.

How fast is UI-TARS 7B ?

UI-TARS 7B generates around 5 tokens per second with 1.9s time-to-first-token (p50), the 299th fastest tracked model.

What is UI-TARS 7B 's context window?

UI-TARS 7B supports a 128K-token context window and can output up to 2K tokens. It accepts image, text input.