modelgrep

Cheapest Microsoft Models

Quick answer · Updated June 2026

The cheapest Microsoft model is Phi 4 at $0.065 per million input tokens. Phi 4 Mini Instruct ($0.080) and WizardLM-2 8x22B ($0.620) round out the top three.

$0.065Input /M
10.4Intelligence
79 t/sSpeed
16KContext

AI models ranked by input token price. The most affordable large language model APIs, from budget open-weight models to discounted frontier models.

  1. 1M
    phi-4
    JSON10.4 intel · 79 t/s · 138ms ttft
    $0.065
    Input /M
  2. 2M
    phi-4-mini-instruct
    JSON8.4 intel · 7 t/s · 483ms ttft
    $0.080
    Input /M
  3. 3M
    wizardlm-2-8x22b
    JSON18 t/s · 866ms ttft · 66K ctx
    $0.620
    Input /M

Frequently asked

What is the cheapest Microsoft model?

The cheapest Microsoft model is Phi 4 at $0.065 per million input tokens. Phi 4 Mini Instruct ($0.080) and WizardLM-2 8x22B ($0.620) round out the top three.

What's a good alternative to Phi 4?

Phi 4 Mini Instruct ($0.080) is the closest alternative on this metric, followed by WizardLM-2 8x22B ($0.620). See the full ranking above for the tradeoffs.

How many Microsoft models are there?

modelgrep tracks 3 Microsoft models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Phi 4. 3 of them qualify for this ranking.

More Microsoft rankings

All rankings