bytedance/ui-tars-1.5-7b
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Parasailbf16 | $0.100 | $0.200 | 128K | 100% |
UI-TARS 7B costs $0.100 per million input tokens and $0.200 per million output tokens via OpenRouter, making it 56th cheapest of 298 paid models.
UI-TARS 7B generates around 5 tokens per second with 1.9s time-to-first-token (p50), the 299th fastest tracked model.
UI-TARS 7B supports a 128K-token context window and can output up to 2K tokens. It accepts image, text input.