modelgrep
O

OpenAI: GPT-4o-mini

openai/gpt-4o-mini

ToolsJSONVision
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
39
212th fastest
Latency
480ms
first token
Input price
$0.150
88th cheapest
Context
128K
16K max out

How it compares

Faster than31%
of all ranked models
Cheaper than70%
of all ranked models

Overview

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

Benchmarks

independent · via OpenRouter
Artificial Analysis
GPQA Diamond
43%
Humanity's Last Exam
4%
SciCode
23%

Providers & pricing (3)

ProviderIn $/MOut $/MUptime
Azure$0.150$0.600
Azure$0.150$0.60099.8%
OpenAI$0.150$0.60099.9%

Specifications

Context window128K
Max output16K
Knowledge cutoffOct 2023
Input modalitiestext, image, file
Output modalitiestext
Prompt caching
Cache read price$0.075/M
ModeratedNo

GPT-4o-mini FAQ

How much does GPT-4o-mini cost?

GPT-4o-mini costs $0.150 per million input tokens and $0.600 per million output tokens via OpenRouter, making it 88th cheapest of 298 paid models.

How fast is GPT-4o-mini?

GPT-4o-mini generates around 39 tokens per second with 480ms time-to-first-token (p50), the 212th fastest tracked model.

What is GPT-4o-mini's context window?

GPT-4o-mini supports a 128K-token context window and can output up to 16K tokens. It accepts text, image, file input.

Compare head-to-head