Meta: Llama 3.1 8B Instruct wins on more metrics (5 of 8), but the right pick depends on what you optimize for — see the breakdown below.
| Metric | AllenAI: Olmo 3 32B Think | Meta: Llama 3.1 8B Instruct |
|---|---|---|
| Intelligence Index | 12.1✓ | 11.8 |
| Coding Index | 10.5✓ | 4.9 |
| GPQA Diamond | 61%✓ | 26% |
| Design Arena Elo | — | — |
| Speed (tokens/sec) | — | 161✓ |
| Latency | — | 146ms✓ |
| Input price /M | $0.150 | $0.020✓ |
| Output price /M | $0.500 | $0.030✓ |
| Context window | 66K | 131K✓ |
| Capabilities | ReasoningJSON | ToolsJSON |