ibm-granite/granite-4.0-h-micro
Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Cloudflare | $0.017 | $0.112 | 131K | 100% |
Granite 4.0 Micro costs $0.017 per million input tokens and $0.112 per million output tokens via OpenRouter, making it 2nd cheapest of 298 paid models.
Granite 4.0 Micro scores 7.7 on the Artificial Analysis Intelligence Index, ranking 175th of 178 benchmarked models, with a GPQA Diamond score of 34%.
Granite 4.0 Micro generates around 29 tokens per second with 433ms time-to-first-token (p50), the 239th fastest tracked model.
Granite 4.0 Micro supports a 131K-token context window and can output up to 131K tokens. It accepts text input.