13models tracked · ranked by intelligence, speed & price
Meta's smartest model is Llama 4 Maverick (18.4 on the Intelligence Index), its fastest is Llama 3.1 8B Instruct at 147 tokens/sec, and its cheapest is Llama 3.1 8B Instruct at $0.020 per million input tokens. All 13 Meta models are compared below by intelligence, speed, latency, context and price.
| Model | Intel | Speed | Latency | In $/M | Context |
|---|---|---|---|---|---|
| llama-4-maverick ToolsJSONVision | 18.4 | 83 | 295ms | $0.150 | 1.0M |
| llama-3.3-70b-instruct:free Tools | 14.5 | 98 | 205ms | Free | 131K |
| llama-3.3-70b-instruct ToolsJSON | 14.5 | 98 | 205ms | $0.100 | 131K |
| llama-4-scout ToolsJSONVision | 13.5 | 112 | 333ms | $0.100 | 10M |
| llama-3.1-70b-instruct ToolsJSON | 12.5 | 28 | 300ms | $0.400 | 131K |
| llama-3.1-8b-instruct ToolsJSON | 11.8 | 147 | 141ms | $0.020 | 131K |
| llama-3-70b-instruct JSON | 8.9 | 18 | 1.3s | $0.510 | 8K |
| llama-3.2-11b-vision-instruct JSONVision | 8.7 | 36 | 203ms | $0.345 | 131K |
| llama-3-8b-instruct | 6.4 | 66 | 713ms | $0.140 | 8K |
| llama-3.2-1b-instruct | 6.3 | 83 | 312ms | $0.027 | 131K |
| llama-guard-4-12b JSONVision | — | 18 | 118ms | $0.180 | 164K |
| llama-3.2-3b-instruct:free | — | 72 | 262ms | Free | 131K |
| llama-3.2-3b-instruct | — | 72 | 262ms | $0.051 | 131K |