modelgrep

Cheapest IBM Models

Quick answer · Updated June 2026

The cheapest IBM model is Granite 4.0 Micro at $0.017 per million input tokens. Granite 4.1 8B ($0.050) is next.

$0.017Input /M
7.7Intelligence
27 t/sSpeed
131KContext

AI models ranked by input token price. The most affordable large language model APIs, from budget open-weight models to discounted frontier models.

  1. 1I
    granite-4.0-h-micro
    7.7 intel · 27 t/s · 301ms ttft
    $0.017
    Input /M
  2. 2I
    granite-4.1-8b
    ToolsJSON12.4 intel · 118 t/s · 144ms ttft
    $0.050
    Input /M

Frequently asked

What is the cheapest IBM model?

The cheapest IBM model is Granite 4.0 Micro at $0.017 per million input tokens. Granite 4.1 8B ($0.050) is next.

What's a good alternative to Granite 4.0 Micro?

Granite 4.1 8B ($0.050) is the closest alternative on this metric. See the full ranking above for the tradeoffs.

How many IBM models are there?

modelgrep tracks 2 IBM models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Granite 4.1 8B. 2 of them qualify for this ranking.

More IBM rankings

All rankings