2 models tracked · ranked by intelligence, speed & price
IBM's smartest model is Granite 4.1 8B (12.4 on the Intelligence Index), which is also its fastest at 94 tokens/sec; its cheapest is Granite 4.0 Micro at $0.017 per million input tokens. Both IBM models are compared below by intelligence, speed, latency, context and price.
| Model | Intelligence Index | Speed (tokens/sec) | Latency | Input price ($/M tokens) | Context (tokens) |
|---|---|---|---|---|---|
| granite-4.1-8b (Tools, JSON) | 12.4 | 94 | 195ms | $0.050 | 131K |
| granite-4.0-h-micro | 7.7 | 29 | 445ms | $0.017 | 131K |
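
The "smartest", "fastest" and "cheapest" picks in the summary can be reproduced from the table rows. The following is a minimal sketch, not the site's actual ranking code: the field names (`intelligence`, `speed_tps`, `input_usd_per_m`) and the `pick` helper are hypothetical, and the values are copied from the two rows above.

```python
# Rows copied from the table above. Field names are illustrative assumptions.
models = [
    {"name": "granite-4.1-8b", "intelligence": 12.4, "speed_tps": 94,
     "latency_ms": 195, "input_usd_per_m": 0.050},
    {"name": "granite-4.0-h-micro", "intelligence": 7.7, "speed_tps": 29,
     "latency_ms": 445, "input_usd_per_m": 0.017},
]


def pick(rows, key, best=max):
    """Return the name of the row with the best value for `key`.

    `best` is max for metrics where higher is better (intelligence, speed)
    and min where lower is better (price, latency).
    """
    return best(rows, key=lambda row: row[key])["name"]


smartest = pick(models, "intelligence")
fastest = pick(models, "speed_tps")
cheapest = pick(models, "input_usd_per_m", best=min)

print(f"smartest: {smartest}, fastest: {fastest}, cheapest: {cheapest}")
```

Passing `best=min` for price (and, if needed, latency) keeps one helper for both directions of "better" instead of negating values.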