modelgrep

OpenAI: GPT-4.1

openai/gpt-4.1

91st smartest of 178 · Tools · JSON · Vision
Intelligence: 26.3 (91st of 178)
Design Elo: 1149 (Game Dev)
Speed: 54 tokens per second (155th fastest)
Latency: 696ms to first token
Input price: $2.00 per million tokens (246th cheapest)
Context: 1.0M tokens

How it compares

Smarter than 49% of all ranked models
Faster than 47% of all ranked models
Cheaper than 17% of all ranked models
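
The percentages above are consistent with a simple rank-to-percentage conversion. The sketch below is an assumption about how such a figure could be derived, not the site's documented method; the function name `percent_outranked` is hypothetical. It reproduces the "smarter than" figure from the rank 91st of 178 and the "cheaper than" figure from the rank 246th of 298 quoted elsewhere on this page.

```python
def percent_outranked(rank: int, total: int) -> int:
    """Share of ranked models placed below `rank`, as a whole percentage.

    Assumption: rank 1 is best and the percentage is (total - rank) / total.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return round(100 * (total - rank) / total)


# Intelligence: 91st of 178 benchmarked models.
print(percent_outranked(91, 178))   # 49
# Price: 246th cheapest of 298 paid models.
print(percent_outranked(246, 298))  # 17
```

The speed figure cannot be checked the same way because the page does not state how many models are ranked by speed.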

Overview

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

Benchmarks

independent · via OpenRouter
Artificial Analysis (46th percentile)
Intelligence Index: 26.3
Coding Index: 21.8
Agentic Index: 27.3
GPQA Diamond: 67%
Humanity's Last Exam: 5%
SciCode: 38%
Tau²-Bench (agentic): 47%
Design Arena · Elo (766 tournaments)
Game Dev: 1149
Data Viz: 1147
Website: 1083
codecategories: 1077
UI Component: 1058
3D: 930

Providers & pricing (3)

Provider         In $/M    Out $/M    Uptime
Azure (cache)    $2.00     $8.00      (not listed)
Azure (cache)    $2.00     $8.00      100%
OpenAI           $2.00     $8.00      100%

Specifications

Context window: 1.0M
Max output: (not listed)
Knowledge cutoff: Jun 2024
Input modalities: image, text, file
Output modalities: text
Prompt caching: Supported
Cache read price: $0.500/M
Moderated: No

GPT-4.1 FAQ

How much does GPT-4.1 cost?

GPT-4.1 costs $2.00 per million input tokens and $8.00 per million output tokens via OpenRouter, making it the 246th cheapest of 298 paid models.
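
As a rough illustration of what these prices mean per request, here is a minimal cost estimator. It uses the input, output, and cache read prices listed on this page. The function name `estimate_cost` is hypothetical, and the sketch assumes that cached input tokens are billed at the cache read price in place of the regular input price, which may not match every provider's billing rules.

```python
# Prices quoted on this page, in USD per million tokens.
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 8.00
CACHE_READ_PRICE_PER_M = 0.50


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached input tokens are charged at the cache read
    price instead of the regular input price.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed input tokens")
    fresh_tokens = input_tokens - cached_input_tokens
    total = (fresh_tokens * INPUT_PRICE_PER_M
             + cached_input_tokens * CACHE_READ_PRICE_PER_M
             + output_tokens * OUTPUT_PRICE_PER_M)
    return total / 1_000_000


# 10,000 input tokens and 1,000 output tokens, nothing cached.
print(f"${estimate_cost(10_000, 1_000):.4f}")         # $0.0280
# Same request with 8,000 of the input tokens read from cache.
print(f"${estimate_cost(10_000, 1_000, 8_000):.4f}")  # $0.0160
```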

How smart is GPT-4.1?

GPT-4.1 scores 26.3 on the Artificial Analysis Intelligence Index, ranking 91st of 178 benchmarked models, with a GPQA Diamond score of 67%.

How fast is GPT-4.1?

GPT-4.1 generates around 54 tokens per second with a 696ms time-to-first-token (p50), making it the 155th fastest tracked model.
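
To translate these two figures into an approximate wait time, the sketch below adds the time-to-first-token to the time needed to stream the remaining output. It assumes a constant generation rate at the median values quoted here; real requests vary with provider, load, and prompt length. The function name `estimate_response_seconds` is hypothetical.

```python
# Median figures quoted on this page.
TIME_TO_FIRST_TOKEN_S = 0.696   # 696ms
TOKENS_PER_SECOND = 54.0


def estimate_response_seconds(output_tokens: int) -> float:
    """Approximate wall-clock time to receive `output_tokens` tokens.

    Assumption: a constant generation rate after the first token arrives.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


# A 500-token answer takes roughly ten seconds under these assumptions.
print(f"{estimate_response_seconds(500):.1f}s")
```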

What is GPT-4.1's context window?

GPT-4.1 supports a 1.0M-token context window. It accepts image, text, and file input.
