modelgrep
O

OpenAI: GPT-4.1 Mini

openai/gpt-4.1-mini

108th smartest of 178ToolsJSONVision
Use via OpenRouter ↗
Intelligence
22.9
108th of 178
Design Elo
1140
Game Dev
Speed
45
188th fastest
Latency
689ms
first token
Input price
$0.400
151st cheapest
Context
1.0M
33K max out

How it compares

Smarter than39%
of all ranked models
Faster than36%
of all ranked models
Cheaper than49%
of all ranked models

Overview

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

Benchmarks

independent · via OpenRouter
Artificial Analysis38th percentile
Intelligence Index
22.9
Coding Index
18.5
Agentic Index
25.2
GPQA Diamond
66%
Humanity's Last Exam
5%
SciCode
40%
Tau²-Bench (agentic)
53%
Design Arena · Elo687 tournaments
Game Dev
1140
Data Viz
1077
codecategories
1045
Website
1042
UI Component
1018
3D
917

Providers & pricing (3)

ProviderIn $/MOut $/MUptime
OpenAI$0.400$1.6099.2%
Azurecache$0.400$1.60
Azurecache$0.400$1.60100%

Specifications

Context window1.0M
Max output33K
Knowledge cutoffJun 2024
Input modalitiesimage, text, file
Output modalitiestext
Prompt cachingSupported
Cache read price$0.100/M
ModeratedYes

GPT-4.1 Mini FAQ

How much does GPT-4.1 Mini cost?

GPT-4.1 Mini costs $0.400 per million input tokens and $1.60 per million output tokens via OpenRouter, making it 151st cheapest of 298 paid models.

How smart is GPT-4.1 Mini?

GPT-4.1 Mini scores 22.9 on the Artificial Analysis Intelligence Index, ranking 108th of 178 benchmarked models, with a GPQA Diamond score of 66%.

How fast is GPT-4.1 Mini?

GPT-4.1 Mini generates around 45 tokens per second with 689ms time-to-first-token (p50), the 188th fastest tracked model.

What is GPT-4.1 Mini's context window?

GPT-4.1 Mini supports a 1.0M-token context window and can output up to 33K tokens. It accepts image, text, file input.

Compare head-to-head