openai/gpt-4.1-nano
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...
| Provider | Input ($/M tokens) | Output ($/M tokens) | Context (tokens) | Uptime |
|---|---|---|---|---|
| OpenAI | $0.100 | $0.400 | 1.0M | 94.6% |
| Azurecache | $0.100 | $0.400 | 1.0M | — |
| Azurecache | $0.100 | $0.400 | 1.0M | 100% |
GPT-4.1 Nano costs $0.100 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 59th cheapest of 298 paid models.
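To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a single request from the rates quoted above. The helper name `estimate_cost_usd` and the example token counts are illustrative assumptions, not part of any provider API, and actual billing may differ.

```python
# Rates quoted in the listing above (USD per one million tokens).
INPUT_USD_PER_M = 0.100
OUTPUT_USD_PER_M = 0.400


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example (assumed workload): a 20,000-token prompt with a 1,000-token reply.
print(f"${estimate_cost_usd(20_000, 1_000):.4f}")  # prints $0.0024
```

Because output tokens cost four times as much as input tokens at these rates, long replies dominate the bill sooner than long prompts do.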
GPT-4.1 Nano scores 13.0 on the Artificial Analysis Intelligence Index, ranking 155th of 178 benchmarked models, with a GPQA Diamond score of 51%.
GPT-4.1 Nano generates around 91 tokens per second with a median (p50) time-to-first-token of 649 ms, making it the 78th fastest tracked model.
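As a rough illustration of what these two figures imply together, the sketch below estimates how long a streamed reply takes: the time-to-first-token plus the time to generate the remaining tokens at the quoted throughput. This additive model is an assumption for illustration only. Real latency varies with provider, prompt length, and network conditions, and `estimate_response_seconds` is a hypothetical helper.

```python
# Median figures quoted in the listing above.
TTFT_SECONDS = 0.649       # time-to-first-token (p50)
TOKENS_PER_SECOND = 91.0   # approximate generation throughput


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough end-to-end time: wait for the first token, then stream the rest."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Example (assumed workload): a 500-token reply.
print(f"{estimate_response_seconds(500):.1f} s")  # prints 6.1 s
```

For short replies the time-to-first-token dominates, while for long replies throughput matters more.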
GPT-4.1 Nano supports a 1.0M-token context window and can output up to 33K tokens. It accepts image, text, and file input.