4models tracked · ranked by intelligence, speed & price
its fastest is Llama 3 8B Lunaris at 79 tokens/sec, and its cheapest is Llama 3 8B Lunaris at $0.040 per million input tokens. All 4 Sao10K models are compared below by intelligence, speed, latency, context and price.
| Model | Intel | Speed | Latency | In $/M | Context |
|---|---|---|---|---|---|
| l3.1-70b-hanami-x1 | — | 11 | 458ms | $3.00 | 16K |
| l3.3-euryale-70b JSON | — | 10 | 1.1s | $0.650 | 131K |
| l3.1-euryale-70b ToolsJSON | — | 47 | 356ms | $0.850 | 131K |
| l3-lunaris-8b JSON | — | 79 | 149ms | $0.040 | 8K |