OpenAI: GPT-4.1

openai/gpt-4.1

91st smartest of 178 · Tools · JSON · Vision


Intelligence: 26.3 (91st of 178)
Design Elo: 1149 (Game Dev)
Speed: 155th fastest
Latency: 696ms to first token
Input price: $2.00/M (246th cheapest)
Context: 1.0M

How it compares

Smarter than 49% of all ranked models
Faster than 47% of all ranked models
Cheaper than 17% of all ranked models

Overview

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

Benchmarks

independent · via OpenRouter

Artificial Analysis · 46th percentile

Intelligence Index: 26.3
Coding Index: 21.8
Agentic Index: 27.3
GPQA Diamond: 67%
Humanity's Last Exam: (no score shown)
SciCode: 38%
Tau²-Bench (agentic): 47%

Design Arena · Elo · 766 tournaments

Game Dev: 1149
Data Viz: 1147
Website: 1083
codecategories: 1077
UI Component: 1058
(unlabeled): 930

Providers & pricing (3)

Provider	In $/M	Out $/M	Context	Uptime
Azure (cache)	$2.00	$8.00	1.0M	—
Azure (cache)	$2.00	$8.00	1.0M	100%
OpenAI	$2.00	$8.00	1.0M	100%

Specifications

Context window: 1.0M
Max output: —
Knowledge cutoff: Jun 2024
Input modalities: image, text, file
Output modalities: text
Prompt caching: Supported
Cache read price: $0.500/M
Moderated: No
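To illustrate how the cache read price could affect input cost, here is a minimal sketch. It assumes that cached prompt tokens are billed at the listed $0.500/M rate and all other input tokens at the listed $2.00/M rate, with no separate cache-write charge. That billing model, the helper name `blended_input_cost_usd`, and the example numbers are assumptions for illustration, not something this page states.

```python
# Rates listed on this page, in USD per million input tokens.
INPUT_USD_PER_M = 2.00
CACHE_READ_USD_PER_M = 0.50

def blended_input_cost_usd(input_tokens: int, cached_fraction: float) -> float:
    """Hypothetical helper: input cost when part of the prompt is a cache hit.

    Assumes cached tokens are billed at the cache read price and the rest
    at the normal input price, with no extra cache-write fee.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = input_tokens * cached_fraction
    uncached = input_tokens - cached
    return (cached * CACHE_READ_USD_PER_M + uncached * INPUT_USD_PER_M) / 1_000_000

# Example: a 100,000-token prompt where 80% is served from cache.
print(f"${blended_input_cost_usd(100_000, 0.8):.3f}")  # $0.080
```

Under these assumptions the same prompt with no cache hits would cost $0.200, so the saving depends entirely on how much of the prompt is reused.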

GPT-4.1 FAQ

How much does GPT-4.1 cost?

GPT-4.1 costs $2.00 per million input tokens and $8.00 per million output tokens via OpenRouter, making it the 246th cheapest of 298 paid models.
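As a rough illustration of the per-token pricing above, the following sketch estimates the cost of a single request. The function name `estimate_cost_usd` and the example token counts are hypothetical; the rates are the listed $2.00 and $8.00 per million tokens, and cache discounts are ignored.

```python
# Listed rates from this page, in USD per million tokens.
INPUT_USD_PER_M = 2.00
OUTPUT_USD_PER_M = 8.00

def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: cost of one request, ignoring prompt caching."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000

# Example: a 10,000-token prompt with a 1,000-token reply.
print(f"${estimate_cost_usd(10_000, 1_000):.3f}")  # $0.028
```

Output tokens cost four times as much as input tokens at these rates, so long replies tend to dominate the bill even when prompts are large.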

How smart is GPT-4.1?

GPT-4.1 scores 26.3 on the Artificial Analysis Intelligence Index, ranking 91st of 178 benchmarked models, with a GPQA Diamond score of 67%.

How fast is GPT-4.1?

GPT-4.1 generates around 54 tokens per second with 696ms time-to-first-token (p50), the 155th fastest tracked model.
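The two figures above can be combined into a back-of-the-envelope response time. This sketch assumes throughput stays constant at the listed 54 tokens per second after the first token arrives and uses the p50 latency as a fixed delay. Real requests will vary with load and prompt size, and the helper name `estimate_response_seconds` is hypothetical.

```python
# Figures listed on this page (p50 values).
TOKENS_PER_SECOND = 54.0
TIME_TO_FIRST_TOKEN_S = 0.696

def estimate_response_seconds(output_tokens: int) -> float:
    """Hypothetical helper: time to stream a full reply.

    Assumes a fixed time-to-first-token followed by constant throughput.
    """
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND

# Example: a 540-token reply.
print(f"{estimate_response_seconds(540):.1f} s")  # 10.7 s
```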

What is GPT-4.1's context window?

GPT-4.1 supports a 1.0M-token context window. It accepts image, text, and file input.

Similar models


z-ai/glm-4.5 · Intelligence 26.4 · $0.600/M
inclusionai/ling-2.6-flash · Intelligence 26.2 · $0.010/M
qwen/qwen3-max · Intelligence 26.1 · $0.780/M
qwen/qwen3-next-80b-a3b-thinking

Compare head-to-head

OpenAI: GPT-4.1 vs Z.ai: GLM 4.5
OpenAI: GPT-4.1 vs inclusionAI: Ling-2.6-flash
OpenAI: GPT-4.1 vs Qwen: Qwen3 Max
OpenAI: GPT-4.1 vs Qwen: Qwen3 Next 80B A3B Thinking
OpenAI: GPT-4.1 vs Upstage: Solar Pro 3
OpenAI: GPT-4.1 vs OpenAI: GPT-5 Nano