IBM: Granite 4.0 Micro

ibm-granite/granite-4.0-h-micro

Cheaper than 99% of paidJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.017

2nd cheapest

Context

131K

131K max out

How it compares

Cheaper than99%

of all ranked models

Overview

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Cloudflare	$0.017	$0.112	131K	100%

Specifications

Context window131K

Max output131K

Knowledge cutoff—

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsibm-granite/granite-4.0-h-micro ↗

Granite 4.0 Micro FAQ

How much does Granite 4.0 Micro cost?

Granite 4.0 Micro costs $0.017 per million input tokens and $0.112 per million output tokens via OpenRouter, making it 2nd cheapest of 332 paid models.

What is Granite 4.0 Micro's context window?

Granite 4.0 Micro supports a 131K-token context window and can output up to 131K tokens. It accepts text input.

More from ibm-granite

All Granite 4.0 Micro alternatives →

ibm-granite/granite-4.1-8b

Intel 6.7$0.050/M

Compare head-to-head

IBM: Granite 4.0 Micro vs IBM: Granite 4.1 8B