modelgrep

OpenAI: GPT-5.1 Chat

openai/gpt-5.1-chat

Tools, JSON, Vision
Intelligence
Design Elo
Speed: 61 tokens/s (140th fastest)
Latency: 1.1s to first token
Input price: $1.25 per million tokens (224th cheapest)
Context: 128K (32K max output)

How it compares

Faster than 55% of all ranked models
Cheaper than 25% of all ranked models

Overview

GPT-5.1 Chat (AKA Instant) is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

Providers & pricing (2)

Provider | In $/M | Out $/M | Uptime
Azure (cache) | $1.25 | $10.00 |
OpenAI (cache) | $1.25 | $10.00 | 100%

Specifications

Context window: 128K
Max output: 32K
Knowledge cutoff: not listed
Input modalities: file, image, text
Output modalities: text
Prompt caching: Supported
Cache read price: $0.130/M
Moderated: No

GPT-5.1 Chat FAQ

How much does GPT-5.1 Chat cost?

GPT-5.1 Chat costs $1.25 per million input tokens and $10.00 per million output tokens via OpenRouter, making it the 224th cheapest of 298 paid models.
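
As a rough illustration of what these per-million prices mean for a single request, the sketch below estimates cost from token counts. The function name and the `cached_input_tokens` parameter are hypothetical, and it assumes cached input tokens are billed at the listed $0.130/M cache read price instead of the regular input price; actual billing may differ by provider.

```python
# Prices listed on this page, in USD per million tokens.
INPUT_PRICE_PER_M = 1.25
OUTPUT_PRICE_PER_M = 10.00
CACHE_READ_PRICE_PER_M = 0.13  # assumption: applies to cached input tokens only


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    cached_input_tokens is the portion of input_tokens served from the
    prompt cache; it is assumed to be billed at the cache read price.
    """
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * INPUT_PRICE_PER_M
             + cached_input_tokens * CACHE_READ_PRICE_PER_M
             + output_tokens * OUTPUT_PRICE_PER_M)
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 input tokens and 1,000 output tokens, nothing cached.
    print(f"${estimate_cost(10_000, 1_000):.4f}")
```

At these prices, output tokens cost eight times as much as input tokens, so long responses tend to dominate the bill.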

How fast is GPT-5.1 Chat?

GPT-5.1 Chat generates around 61 tokens per second with a 1.1s time-to-first-token (p50), making it the 140th fastest tracked model.
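
To put those two figures together, a back-of-the-envelope estimate of total response time is time-to-first-token plus output length divided by throughput. The sketch below is only that: it treats the p50 figures as constants, assumes the first token arrives at the time-to-first-token mark and the rest stream at the listed rate, and uses a hypothetical function name. Real latency varies with load, provider, and prompt size.

```python
# Median (p50) figures listed on this page.
TOKENS_PER_SECOND = 61.0
TIME_TO_FIRST_TOKEN_S = 1.1


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate for a streamed response (hypothetical helper)."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # A 500-token answer: 1.1s + 500/61s.
    print(f"{estimate_response_seconds(500):.1f}s")
```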

What is GPT-5.1 Chat's context window?

GPT-5.1 Chat supports a 128K-token context window and can output up to 32K tokens. It accepts file, image, and text input.
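
A simple pre-flight check against these limits might look like the sketch below. It rests on two assumptions that the page does not confirm: that "128K" and "32K" mean 128,000 and 32,000 tokens, and that prompt and output tokens share the same context window. The function name is hypothetical.

```python
# Limits listed on this page; the exact token values are assumptions.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_000


def fits_in_context(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if the request stays within both limits (hypothetical helper)."""
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = requested_output_tokens <= MAX_OUTPUT
    within_window = prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_window


if __name__ == "__main__":
    print(fits_in_context(100_000, 20_000))  # 120,000 total, under both limits
    print(fits_in_context(100_000, 32_000))  # 132,000 total, over the window
```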
