modelgrep

Cheapest Qwen Models

Quick answer · Updated June 2026

The cheapest Qwen model is Qwen2.5 7B Instruct at $0.040 per million input tokens. Qwen3 30B A3B Instruct 2507 ($0.048) and Qwen3 8B ($0.050) round out the top three.

$0.040 Input /M
73 t/s Speed
131K Context
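To make the per-million figure concrete, here is a minimal sketch that converts a token count into an input cost. The helper name `input_cost_usd` and the 250-million-token workload are illustrative assumptions, not part of modelgrep; the prices are the three quoted in the quick answer, and output-token charges are not included.

```python
# Illustrative sketch: the helper name and workload size are assumptions.
# Prices are USD per million input tokens, taken from the quick answer.
PRICE_PER_M_INPUT = {
    "qwen-2.5-7b-instruct": 0.040,
    "qwen3-30b-a3b-instruct-2507": 0.048,
    "qwen3-8b": 0.050,
}


def input_cost_usd(model: str, input_tokens: int) -> float:
    """Return the input-token cost in USD for a model and token count."""
    return PRICE_PER_M_INPUT[model] * input_tokens / 1_000_000


if __name__ == "__main__":
    tokens = 250_000_000  # hypothetical monthly input volume
    for model in PRICE_PER_M_INPUT:
        print(f"{model}: ${input_cost_usd(model, tokens):.2f}")
```

At that hypothetical volume the three models come to roughly $10.00, $12.00 and $12.50 in input charges, so the gap between first and third place is about $2.50 per 250 million input tokens.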

Models are ranked by input token price, lowest first: the most affordable large language model APIs, from budget open-weight models to discounted frontier models.

  1. qwen-2.5-7b-instruct
    73 t/s · 405ms ttft · 131K ctx
    $0.040 Input /M
  2. qwen3-30b-a3b-instruct-2507
    Tools · JSON · 15.0 intel · 91 t/s · 274ms ttft
    $0.048 Input /M
  3. qwen3-8b
    Reasoning · Tools · JSON · 10.6 intel · 131K ctx
    $0.050 Input /M
  4. qwen3.5-flash-02-23
    Reasoning · Tools · JSON · +1 more · 77 t/s · 642ms ttft · 1M ctx
    $0.065 Input /M
  5. qwen3-coder-30b-a3b-instruct
    Tools · JSON · 20.0 intel · 69 t/s · 983ms ttft
    $0.070 Input /M
  6. qwen3-vl-8b-instruct
    Tools · JSON · Vision · 14.3 intel · 60 t/s · 455ms ttft
    $0.080 Input /M
  7. qwen3-30b-a3b-thinking-2507
    Reasoning · Tools · JSON · 22.4 intel · 95 t/s · 374ms ttft
    $0.080 Input /M
  8. qwen3-32b
    Reasoning · Tools · JSON · 328 t/s · 321ms ttft · 131K ctx
    $0.080 Input /M
  9. qwen3-next-80b-a3b-instruct
    Tools · JSON · 20.1 intel · 76 t/s · 583ms ttft
    $0.090 Input /M
  10. qwen3-235b-a22b-2507
    Tools · JSON · 25.0 intel · 84 t/s · 298ms ttft
    $0.090 Input /M
  11. qwen3-next-80b-a3b-thinking
    Reasoning · Tools · JSON · 26.7 intel · 172 t/s · 252ms ttft
    $0.098 Input /M
  12. qwen3.5-9b
    Reasoning · Tools · JSON · +1 more · 32.4 intel · 75 t/s · 370ms ttft
    $0.100 Input /M
  13. qwen3-235b-a22b-thinking-2507
    Reasoning · Tools · JSON · 29.5 intel · 262K ctx
    $0.100 Input /M
  14. qwen3-14b
    Reasoning · Tools · JSON · 16.2 intel · 66 t/s · 349ms ttft
    $0.100 Input /M
  15. qwen3-vl-32b-instruct
    Tools · JSON · Vision · 24.7 intel · 262K ctx
    $0.104 Input /M
  16. qwen3-coder-next
    Tools · JSON · 28.3 intel · 111 t/s · 636ms ttft
    $0.110 Input /M
  17. qwen3-vl-8b-thinking
    Reasoning · Tools · JSON · +1 more · 16.7 intel · 139 t/s · 508ms ttft
    $0.117 Input /M
  18. qwen3-30b-a3b
    Reasoning · Tools · JSON · 15.3 intel · 91 t/s · 279ms ttft
    $0.120 Input /M
  19. qwen3-vl-30b-a3b-thinking
    Reasoning · Tools · JSON · +1 more · 19.7 intel · 69 t/s · 480ms ttft
    $0.130 Input /M
  20. qwen3-vl-30b-a3b-instruct
    Tools · JSON · Vision · 16.0 intel · 46 t/s · 361ms ttft
    $0.130 Input /M
  21. qwen3.5-35b-a3b
    Reasoning · Tools · JSON · +1 more · 30.7 intel · 153 t/s · 150ms ttft
    $0.140 Input /M
  22. qwen3.6-35b-a3b
    Reasoning · Tools · JSON · +1 more · 31.5 intel · 172 t/s · 180ms ttft
    $0.150 Input /M
  23. qwen3.6-flash
    Reasoning · Tools · JSON · +1 more · 109 t/s · 872ms ttft · 1M ctx
    $0.188 Input /M
  24. qwen3.5-27b
    Reasoning · Tools · JSON · +1 more · 37.2 intel · 54 t/s · 868ms ttft
    $0.195 Input /M
  25. qwen3-coder-flash
    Tools · JSON · 40 t/s · 1.4s ttft · 1M ctx
    $0.195 Input /M
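
The ordering above can be reproduced with a simple ascending sort on input price. The sketch below uses five rows from the list; the `Entry` type and `rank_by_input_price` function are hypothetical names chosen for illustration. The page does not state how equal prices are ordered, so this sketch simply keeps tied rows in their input order.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    input_price: float  # USD per million input tokens


# A handful of rows from the list above, deliberately out of order.
entries = [
    Entry("qwen3-8b", 0.050),
    Entry("qwen3-32b", 0.080),
    Entry("qwen-2.5-7b-instruct", 0.040),
    Entry("qwen3-30b-a3b-instruct-2507", 0.048),
    Entry("qwen3.5-flash-02-23", 0.065),
]


def rank_by_input_price(rows):
    """Sort ascending by price; sorted() is stable, so ties keep input order."""
    return sorted(rows, key=lambda e: e.input_price)


ranked = rank_by_input_price(entries)
for position, entry in enumerate(ranked, start=1):
    print(f"{position}. {entry.model}  ${entry.input_price:.3f}")
```

The first element of `ranked` is the cheapest model and the second is the closest alternative on price alone; speed, latency and context length would need their own sort keys.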

Frequently asked

What is the cheapest Qwen model?

The cheapest Qwen model is Qwen2.5 7B Instruct at $0.040 per million input tokens. Qwen3 30B A3B Instruct 2507 ($0.048) and Qwen3 8B ($0.050) round out the top three.

What's a good alternative to Qwen2.5 7B Instruct?

Qwen3 30B A3B Instruct 2507 ($0.048) is the closest alternative on this metric, followed by Qwen3 8B ($0.050). See the full ranking above for the tradeoffs.

How many Qwen models are there?

modelgrep tracks 49 Qwen models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Qwen3.7 Max. 25 of them qualify for this ranking.
