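Qwen: Qwen3.5-9B leads on 5 of the 8 metrics that have a winner (Intelligence Index, GPQA Diamond, latency, and both token prices), while Arcee AI: Trinity Large Thinking leads on Coding Index, speed, and Design Arena Elo, where Qwen has no listed score. The right pick depends on what you optimize for; see the breakdown below.
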
| Metric | Arcee AI: Trinity Large Thinking | Qwen: Qwen3.5-9B |
|---|---|---|
| Intelligence Index | 31.9 | 32.4✓ |
| Coding Index | 27.2✓ | 25.3 |
| GPQA Diamond | 75% | 81%✓ |
| Design Arena Elo | 1180✓ | — |
| Speed (tokens/sec) | 119✓ | 68 |
| Latency | 731ms | 362ms✓ |
| Input price (per 1M tokens) | $0.220 | $0.100✓ |
| Output price (per 1M tokens) | $0.850 | $0.150✓ |
| Context window | 262K | 262K |
| Capabilities | Reasoning, Tools, JSON | Reasoning, Tools, JSON, Vision |
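
As a rough illustration of how the trade-off plays out, the sketch below plugs the table's prices, speed, and latency into a per-request estimate. The workload size (2,000 input tokens, 500 output tokens), the `ModelStats` class, and both helper functions are hypothetical. The timing formula (latency plus output tokens divided by speed) is a simplifying assumption, not necessarily how these figures were measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """Figures copied from the comparison table; field names are illustrative."""
    name: str
    input_price_per_m: float   # USD per 1M input tokens
    output_price_per_m: float  # USD per 1M output tokens
    tokens_per_sec: float      # output speed
    latency_ms: float          # latency before output starts


def request_cost_usd(m: ModelStats, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request at the listed per-million-token prices."""
    total = input_tokens * m.input_price_per_m + output_tokens * m.output_price_per_m
    return total / 1_000_000


def request_seconds(m: ModelStats, output_tokens: int) -> float:
    """Assumed model: latency, then output streamed at the listed speed."""
    return m.latency_ms / 1000 + output_tokens / m.tokens_per_sec


TRINITY = ModelStats("Arcee AI: Trinity Large Thinking", 0.220, 0.850, 119, 731)
QWEN = ModelStats("Qwen: Qwen3.5-9B", 0.100, 0.150, 68, 362)

# Hypothetical workload: 2,000 input tokens and 500 output tokens per request.
INPUT_TOKENS, OUTPUT_TOKENS = 2_000, 500

for model in (TRINITY, QWEN):
    cost = request_cost_usd(model, INPUT_TOKENS, OUTPUT_TOKENS)
    seconds = request_seconds(model, OUTPUT_TOKENS)
    print(f"{model.name}: ${cost:.6f} per request, ~{seconds:.1f} s")
```

Under these assumptions, Qwen3.5-9B comes out roughly three times cheaper per request, while Trinity Large Thinking finishes sooner once the output runs to a few hundred tokens. For very short outputs, Qwen's lower latency puts it ahead on time as well.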