modelgrep

Longest-Context Qwen Models

Quick answer · Updated June 2026

Qwen3 Coder 480B A35B (free) has the largest context window of any Qwen model, at 1.0M tokens. Qwen3 Coder 480B A35B (1.0M) and Qwen3.7 Plus (1M) round out the top three.

1.0MContext
24.8Intelligence
29 t/sSpeed
FreeInput /M

AI models with the largest context windows, ranked by token capacity. The best large language models for long documents, codebases and extended conversations.

  1. 1Q
    qwen3-coder:free
    Tools24.8 intel · Free/M · 29 t/s
    1.0M
    Context
  2. 2Q
    qwen3-coder
    ToolsJSON24.8 intel · $0.220/M · 29 t/s
    1.0M
    Context
  3. 3Q
    qwen3.7-plus
    ReasoningToolsJSON+153.3 intel · $0.320/M · 26 t/s
    1M
    Context
  4. 4Q
    qwen3.7-max
    ReasoningToolsJSON56.6 intel · $1.25/M · 48 t/s
    1M
    Context
  5. 5Q
    qwen3.5-plus-20260420
    ReasoningToolsJSON+1$0.300/M · 52 t/s · 1.5s ttft
    1M
    Context
  6. 6Q
    qwen3.6-flash
    ReasoningToolsJSON+1$0.188/M · 109 t/s · 872ms ttft
    1M
    Context
  7. 7Q
    qwen3.6-plus
    ReasoningToolsJSON+150.0 intel · $0.325/M · 37 t/s
    1M
    Context
  8. 8Q
    qwen3.5-flash-02-23
    ReasoningToolsJSON+1$0.065/M · 77 t/s · 642ms ttft
    1M
    Context
  9. 9Q
    qwen3.5-plus-02-15
    ReasoningToolsJSON+1$0.260/M · 36 t/s · 1.8s ttft
    1M
    Context
  10. 10Q
    qwen3-coder-plus
    ToolsJSON$0.650/M
    1M
    Context
  11. 11Q
    qwen3-coder-flash
    ToolsJSON$0.195/M · 40 t/s · 1.4s ttft
    1M
    Context
  12. 12Q
    qwen-plus-2025-07-28:thinking
    ReasoningToolsJSON$0.260/M · 63 t/s · 504ms ttft
    1M
    Context
  13. 13Q
    qwen-plus-2025-07-28
    ToolsJSON$0.260/M · 63 t/s · 504ms ttft
    1M
    Context
  14. 14Q
    qwen-plus
    ToolsJSON$0.260/M · 51 t/s · 433ms ttft
    1M
    Context
  15. 15Q
    qwen3.6-35b-a3b
    ReasoningToolsJSON+131.5 intel · $0.150/M · 172 t/s
    262K
    Context
  16. 16Q
    qwen3.6-max-preview
    ReasoningToolsJSON51.8 intel · $1.04/M · 47 t/s
    262K
    Context
  17. 17Q
    qwen3.6-27b
    ReasoningToolsJSON+137.1 intel · $0.288/M · 76 t/s
    262K
    Context
  18. 18Q
    qwen3.5-9b
    ReasoningToolsJSON+132.4 intel · $0.100/M · 75 t/s
    262K
    Context
  19. 19Q
    qwen3.5-35b-a3b
    ReasoningToolsJSON+130.7 intel · $0.140/M · 153 t/s
    262K
    Context
  20. 20Q
    qwen3.5-27b
    ReasoningToolsJSON+137.2 intel · $0.195/M · 54 t/s
    262K
    Context
  21. 21Q
    qwen3.5-122b-a10b
    ReasoningToolsJSON+135.9 intel · $0.260/M · 40 t/s
    262K
    Context
  22. 22Q
    qwen3.5-397b-a17b
    ReasoningToolsJSON+140.1 intel · $0.390/M · 83 t/s
    262K
    Context
  23. 23Q
    qwen3-max-thinking
    ReasoningToolsJSON39.8 intel · $0.780/M · 26 t/s
    262K
    Context
  24. 24Q
    qwen3-coder-next
    ToolsJSON28.3 intel · $0.110/M · 111 t/s
    262K
    Context
  25. 25Q
    qwen3-vl-32b-instruct
    ToolsJSONVision24.7 intel · $0.104/M
    262K
    Context

Frequently asked

Which Qwen model has the largest context window?

Qwen3 Coder 480B A35B (free) has the largest context window of any Qwen model, at 1.0M tokens. Qwen3 Coder 480B A35B (1.0M) and Qwen3.7 Plus (1M) round out the top three.

What's a good alternative to Qwen3 Coder 480B A35B (free)?

Qwen3 Coder 480B A35B (1.0M) is the closest alternative on this metric, followed by Qwen3.7 Plus (1M). See the full ranking above for the tradeoffs.

How many Qwen models are there?

modelgrep tracks 49 Qwen models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Qwen3.7 Max. 25 of them qualify for this ranking.

More Qwen rankings

All rankings