microsoft/phi-4
[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| NextBitint4 | $0.065 | $0.140 | 16K | 100% |
| DeepInfrabf16 | $0.070 | $0.140 | 16K | 100% |
Phi 4 costs $0.065 per million input tokens and $0.140 per million output tokens via OpenRouter, making it 30th cheapest of 298 paid models.
Phi 4 scores 10.4 on the Artificial Analysis Intelligence Index, ranking 166th of 178 benchmarked models, with a GPQA Diamond score of 57%.
Phi 4 generates around 72 tokens per second with 224ms time-to-first-token (p50), the 109th fastest tracked model.
Phi 4 supports a 16K-token context window and can output up to 16K tokens. It accepts text input.