modelgrep

OpenAI: GPT-5.2 Chat

openai/gpt-5.2-chat

Tools, JSON, Vision
Intelligence
Design Elo
Speed: 55 tokens per second (151st fastest)
Latency: 940ms to first token
Input price: $1.75 per million tokens (237th cheapest)
Context: 128K (16K max output)

How it compares

Faster than 49% of all ranked models
Cheaper than 20% of all ranked models

Overview

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

Providers & pricing (2)

Provider         In $/M   Out $/M   Uptime
OpenAI (cache)   $1.75    $14.00    100%
Azure (cache)    $1.75    $14.00    100%

Specifications

Context window: 128K
Max output: 16K
Knowledge cutoff
Input modalities: file, image, text
Output modalities: text
Prompt caching: Supported
Cache read price: $0.175/M
Moderated: Yes

GPT-5.2 Chat FAQ

How much does GPT-5.2 Chat cost?

GPT-5.2 Chat costs $1.75 per million input tokens and $14.00 per million output tokens via OpenRouter, making it the 237th cheapest of 298 paid models.
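
As a rough illustration of how these per-million prices translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the `cached_tokens` parameter are hypothetical, and the sketch assumes that cached input tokens are billed at the $0.175/M cache read price from the specifications instead of the regular input price. Actual billing may differ by provider.

```python
# Prices in USD per million tokens, copied from this listing.
INPUT_PRICE_PER_M = 1.75
OUTPUT_PRICE_PER_M = 14.00
CACHE_READ_PRICE_PER_M = 0.175


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of a single request.

    Assumption: cached_tokens is the portion of input_tokens served from the
    prompt cache, billed at the cache read price instead of the input price.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total_per_million = (
        uncached_tokens * INPUT_PRICE_PER_M
        + cached_tokens * CACHE_READ_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total_per_million / 1_000_000


if __name__ == "__main__":
    # A 10,000-token prompt with a 1,000-token reply.
    print(f"${estimate_cost(10_000, 1_000):.4f}")
```

Under these assumptions, a 10,000-token prompt with a 1,000-token reply comes to about three cents, and output tokens make up close to half of that despite being a tenth of the volume.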

How fast is GPT-5.2 Chat?

GPT-5.2 Chat generates around 55 tokens per second with a median (p50) time to first token of 940ms, making it the 151st fastest tracked model.
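
To get a feel for what these figures mean for a whole response, the sketch below adds the time to first token to the generation time at the listed throughput. The function name `estimate_response_seconds` is hypothetical, and the model is deliberately simple: it assumes a constant 55 tokens per second after the first token and ignores network overhead, queueing, and any variation around the reported medians.

```python
# Figures copied from this listing; both are approximate, typical values.
TOKENS_PER_SECOND = 55.0
TIME_TO_FIRST_TOKEN_SECONDS = 0.940


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate for streaming a full response.

    Assumption: throughput stays constant once the first token arrives.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TIME_TO_FIRST_TOKEN_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    for n in (100, 550, 2_000):
        print(f"{n} tokens: about {estimate_response_seconds(n):.1f}s")
```

By this estimate, a 550-token answer takes roughly 11 seconds, with less than one second of that spent waiting for the first token.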

What is GPT-5.2 Chat's context window?

GPT-5.2 Chat supports a 128K-token context window and can output up to 16K tokens. It accepts file, image, and text input.
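
When budgeting a prompt against these limits, a small check like the following can help. It is a sketch under two assumptions that the listing does not confirm: that "128K" and "16K" mean 128,000 and 16,000 tokens exactly, and that the reserved output counts against the same context window as the prompt. The names `max_prompt_tokens` and `fits_in_context` are hypothetical.

```python
# Assumption: the listing's "128K" and "16K" are read as round thousands.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 16_000


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Largest prompt that still leaves room for the reserved output.

    Assumption: prompt and output share one context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT_TOKENS:
        raise ValueError("reserved_output must be between 0 and the max output")
    return CONTEXT_WINDOW_TOKENS - reserved_output


def fits_in_context(prompt_tokens: int, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Return True if the prompt plus the reserved output fits in the window."""
    return 0 <= prompt_tokens <= max_prompt_tokens(reserved_output)


if __name__ == "__main__":
    print(max_prompt_tokens())       # prompt budget with full output reserved
    print(fits_in_context(120_000))  # too large if 16K is reserved for output
```

Reserving less output, for example `max_prompt_tokens(1_000)`, frees more of the window for the prompt, which suits short-answer tasks over long documents.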
